Changelog¶
All notable changes to GPUX will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
Unreleased¶
[0.2.0] - 2025-01-XX¶
Changed¶
- Documentation reorganization: Focus on `gpux pull` as primary workflow
- Updated all examples to use modern 2025 models (text, audio, image)
- Moved `gpux.yml` configuration to advanced section
- Reorganized tutorial order: Configuration moved to end as advanced feature
Added¶
- Examples for text, audio, and image models throughout documentation
- Clear distinction between basic (`gpux pull`) and advanced (`gpux.yml`) workflows
[0.1.0] - 2024-10-05¶
Added¶
- Initial release of GPUX runtime
- Universal GPU compatibility (NVIDIA, AMD, Apple Silicon, Intel)
- Docker-like CLI (`build`, `run`, `serve`, `inspect`)
- ONNX Runtime with execution providers
- Automatic provider selection
- Configuration via `gpux.yml`
- HTTP serving with FastAPI
- Model introspection and validation
- Benchmarking capabilities
- Python API for programmatic usage
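The "Automatic provider selection" entry can be pictured as a priority-ordered lookup. The following is a minimal sketch, not GPUX's actual implementation: the function name `pick_provider` and the priority order are assumptions made for illustration, while the strings are the execution provider names that ONNX Runtime uses.

```python
from typing import Iterable, Sequence

# Assumed priority order (most specialized first). This ordering is an
# illustration and is not taken from GPUX itself.
DEFAULT_PRIORITY = (
    "TensorrtExecutionProvider",
    "CUDAExecutionProvider",
    "ROCMExecutionProvider",
    "CoreMLExecutionProvider",
    "DmlExecutionProvider",
    "OpenVINOExecutionProvider",
    "CPUExecutionProvider",
)


def pick_provider(
    available: Iterable[str],
    priority: Sequence[str] = DEFAULT_PRIORITY,
) -> str:
    """Return the first provider in `priority` that is also in `available`."""
    available_set = set(available)
    for name in priority:
        if name in available_set:
            return name
    raise RuntimeError("no supported execution provider is available")


# With ONNX Runtime installed, `available` could come from
# onnxruntime.get_available_providers().
print(pick_provider(["CPUExecutionProvider", "CUDAExecutionProvider"]))
```

Keeping the selection a pure function of the available list makes the fallback to CPU easy to test on machines without a GPU.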
Features¶
- CLI Commands:
    - `gpux build` - Build and validate models
    - `gpux run` - Run inference
    - `gpux serve` - Start HTTP server
    - `gpux inspect` - Inspect models
- Execution Providers:
    - TensorRT (NVIDIA)
    - CUDA (NVIDIA)
    - ROCm (AMD)
    - CoreML (Apple Silicon)
    - DirectML (Windows)
    - OpenVINO (Intel)
    - CPU (Universal)
- Configuration:
    - YAML-based configuration
    - Input/output specifications
    - Runtime settings
    - Serving configuration
- HTTP API:
    - `/predict` - Run inference
    - `/health` - Health check
    - `/info` - Model information
    - `/metrics` - Performance metrics
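As a rough illustration of how a client might talk to the HTTP API endpoints listed above, here is a standard-library sketch. Several details are assumptions and not confirmed by this changelog: the host and port (`localhost:8080`), that `/predict` accepts a JSON body via POST, that the other endpoints respond to GET, and that responses are JSON. The helper names `build_request` and `call` are hypothetical.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8080"  # assumed host and port


def build_request(endpoint, payload=None, base_url=BASE_URL):
    """Build a GET request, or a JSON POST request when a payload is given."""
    url = base_url.rstrip("/") + endpoint
    if payload is None:
        return urllib.request.Request(url, method="GET")
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def call(request, timeout=5.0):
    """Send the request and decode the response, assuming it is JSON."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Requests are only constructed here; nothing is sent until call() is used.
health_request = build_request("/health")
predict_request = build_request("/predict", {"text": "hello"})  # assumed payload shape
```

With a server started by `gpux serve`, `call(health_request)` would perform the health check; the payload shape for `/predict` depends on the model's input specification.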
Performance¶
- Sub-millisecond inference on modern GPUs
- RTX 3080: 2,400 FPS (BERT with TensorRT)
- M2 Pro: 450 FPS (BERT with CoreML)
- RX 6800 XT: 600 FPS (BERT with ROCm)
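To relate the throughput figures above to per-inference latency, a small conversion helps. This sketch assumes FPS means inferences completed per second, processed one at a time (no batching or pipelining); under that assumption latency is simply the reciprocal of throughput. The function name `latency_ms` is illustrative.

```python
def latency_ms(fps: float) -> float:
    """Convert throughput (inferences per second) to milliseconds per inference.

    Assumes sequential, unbatched inference, so latency = 1 / throughput.
    """
    if fps <= 0:
        raise ValueError("fps must be positive")
    return 1000.0 / fps


# Throughput figures taken from the list above.
benchmarks = [("RTX 3080", 2400), ("M2 Pro", 450), ("RX 6800 XT", 600)]

for device, fps in benchmarks:
    print(f"{device}: {latency_ms(fps):.2f} ms per inference")

# Prints:
# RTX 3080: 0.42 ms per inference
# M2 Pro: 2.22 ms per inference
# RX 6800 XT: 1.67 ms per inference
```

Under this reading, the sub-millisecond figure corresponds to the RTX 3080 result; with batching, throughput and single-request latency no longer map onto each other this directly.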
Release Types¶
- Major (X.0.0): Breaking changes
- Minor (0.X.0): New features, backwards compatible
- Patch (0.0.X): Bug fixes, backwards compatible
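The release types above translate into a simple rule for computing the next version number. The following is a minimal sketch for plain `MAJOR.MINOR.PATCH` strings; the function name `bump` is illustrative, and pre-release or build metadata suffixes are not handled.

```python
def bump(version: str, release_type: str) -> str:
    """Return the next version for a 'major', 'minor', or 'patch' release."""
    major, minor, patch = (int(part) for part in version.split("."))
    if release_type == "major":
        return f"{major + 1}.0.0"  # breaking changes reset minor and patch
    if release_type == "minor":
        return f"{major}.{minor + 1}.0"  # new features reset patch
    if release_type == "patch":
        return f"{major}.{minor}.{patch + 1}"  # bug fixes only
    raise ValueError(f"unknown release type: {release_type!r}")


print(bump("0.1.0", "minor"))  # 0.2.0
```

For example, moving from 0.1.0 to 0.2.0 is a minor bump, which matches the two releases recorded in this changelog.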
Categories¶
- Added: New features
- Changed: Changes to existing functionality
- Deprecated: Soon-to-be removed features
- Removed: Removed features
- Fixed: Bug fixes
- Security: Security improvements