Skip to content

API Reference

Complete API reference for GPUX.


Command-Line Interface

Documentation for all CLI commands:


Configuration Reference

Complete gpux.yml configuration reference:


Python API

Python API reference for programmatic usage:


HTTP API

REST API reference for serving:


Quick Reference

Common Commands

# Build model
gpux build .

# Run inference
gpux run model-name --input '{"data": [1,2,3]}'

# Start server
gpux serve model-name --port 8080

# Inspect model
gpux inspect model-name

Configuration Template

name: model-name
version: 1.0.0

model:
  source: ./model.onnx

inputs:
  - name: input
    type: float32
    shape: [1, 10]

outputs:
  - name: output
    type: float32
    shape: [1, 2]

runtime:
  gpu:
    backend: auto
    memory: 2GB

Python API Example

from gpux import GPUXRuntime

runtime = GPUXRuntime(model_path="model.onnx")
result = runtime.infer({"input": data})
print(result["output"])

See Also