# gpux build
Build and optimize models for GPU inference.
## Overview
The gpux build command validates your GPUX configuration, inspects your model, checks provider compatibility, and prepares everything for optimal GPU inference. It creates build artifacts in the .gpux directory.
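For example, after a successful build from a project directory you should find the artifacts described later on this page:

```bash
gpux build     # validate config, inspect model, select a provider
ls .gpux/      # model_info.json  provider_info.json
```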
## Arguments

### PATH
Path to the GPUX project directory.
- Type: string
- Default: `.` (current directory)
- Required: No
Examples:
```bash
gpux build               # Build from current directory
gpux build ./my-model    # Build from specific directory
gpux build ../sentiment  # Build from parent directory
```
## Options

### --config, -c
Configuration file name.
- Type: string
- Default: `gpux.yml`
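For example, to build against a differently named configuration file (the `gpux.prod.yml` name here is only an illustration):

```bash
gpux build --config gpux.prod.yml
```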
### --provider, -p
Preferred execution provider (cuda, coreml, rocm, etc.).
- Type: string
- Default: Auto-detected
- Choices: `cuda`, `coreml`, `rocm`, `directml`, `openvino`, `tensorrt`, `cpu`
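For example, to request Apple's CoreML provider on macOS, or to force CPU execution:

```bash
gpux build --provider coreml   # Apple Silicon / macOS
gpux build --provider cpu      # force CPU execution
```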
### --optimize / --no-optimize
Enable or disable model optimization.
- Type: boolean
- Default: `true`
```bash
gpux build --optimize      # Enable optimization (default)
gpux build --no-optimize   # Disable optimization
```
### --verbose
Enable verbose output for debugging.
- Type: boolean
- Default: `false`
## Build Process
The build command performs the following steps:
1. Parse Configuration: Reads and validates `gpux.yml`
2. Validate Model Path: Checks that the model file exists
3. Inspect Model: Extracts input/output specifications and metadata
4. Check Provider Compatibility: Determines the best execution provider
5. Optimize Model: (Optional) Applies model optimizations
6. Save Build Artifacts: Stores model info and provider info in `.gpux/`
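If builds are scripted (for example in CI), a small follow-up check that the expected artifacts were written catches silent failures. A minimal sketch, assuming the build runs from the project directory and that `cuda` is the intended provider:

```bash
#!/usr/bin/env bash
# Minimal CI sketch: build, then confirm the expected artifacts exist.
set -euo pipefail

gpux build --provider cuda

for artifact in model_info.json provider_info.json; do
  if [ ! -f ".gpux/$artifact" ]; then
    echo "missing build artifact: $artifact" >&2
    exit 1
  fi
done
```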
## Build Artifacts
The build process creates the following files in .gpux/:
```
.gpux/
├── model_info.json      # Model specifications (inputs, outputs, size)
└── provider_info.json   # Provider information (platform, availability)
```
### model_info.json
```json
{
  "name": "sentiment-analysis",
  "version": "1.0.0",
  "format": "onnx",
  "size_bytes": 268435456,
  "inputs": [
    {
      "name": "input_ids",
      "type": "int64",
      "shape": [1, 128],
      "required": true
    },
    {
      "name": "attention_mask",
      "type": "int64",
      "shape": [1, 128],
      "required": true
    }
  ],
  "outputs": [
    {
      "name": "logits",
      "type": "float32",
      "shape": [1, 2]
    }
  ]
}
```
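To read the recorded specifications from a script, any JSON tool works; for example, with `jq` (a separate tool, not part of GPUX):

```bash
# List each input's name, type, and shape from the build artifact.
jq '.inputs[] | {name, type, shape}' .gpux/model_info.json
```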
### provider_info.json
```json
{
  "provider": "CUDAExecutionProvider",
  "platform": "NVIDIA CUDA",
  "available": true,
  "description": "NVIDIA CUDA GPU acceleration"
}
```
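A deployment script can use this file to refuse to ship a build whose provider is not actually available; a minimal sketch, again using `jq`:

```bash
# Abort if the selected execution provider is unavailable on this machine.
available=$(jq -r '.available' .gpux/provider_info.json)
if [ "$available" != "true" ]; then
  echo "execution provider is not available on this machine" >&2
  exit 1
fi
```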
## Examples

### Basic Build
Build from the current directory:
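```bash
gpux build
```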
The command prints the model information, execution provider, and input/output specification tables described in the Output section below.
### Build with Specific Provider
Build with CUDA provider:
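```bash
gpux build --provider cuda
```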
### Build from Another Directory
Build a model in a different directory:
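```bash
gpux build ../sentiment
```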
### Build Without Optimization
Skip model optimization:
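```bash
gpux build --no-optimize
```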
### Verbose Build
Show detailed build information:
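```bash
gpux build --verbose
```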
## Output
The build command displays the following information:
### Model Information
| Property | Value |
|---|---|
| Name | sentiment-analysis |
| Version | 1.0.0 |
| Format | onnx |
| Size | 256.0 MB |
| Inputs | 2 |
| Outputs | 1 |
### Execution Provider
| Property | Value |
|---|---|
| Provider | CUDAExecutionProvider |
| Platform | NVIDIA CUDA |
| Available | ✅ Yes |
| Description | NVIDIA CUDA GPU acceleration |
### Input Specifications
| Name | Type | Shape | Required |
|---|---|---|---|
| input_ids | int64 | [1, 128] | ✅ |
| attention_mask | int64 | [1, 128] | ✅ |
### Output Specifications
| Name | Type | Shape |
|---|---|---|
| logits | float32 | [1, 2] |
## Error Handling

### Configuration File Not Found
Solution: Ensure `gpux.yml` exists in the project directory.
### Model File Not Found
Solution: Check the `model.source` path in `gpux.yml`.
### Invalid Configuration
Solution: Validate your `gpux.yml` against the schema.
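For orientation, a minimal sketch of what a `gpux.yml` might contain; apart from `model.source` and the name/version values echoed in `model_info.json`, the field layout shown here is an assumption rather than the authoritative schema:

```yaml
# Hypothetical layout; only model.source is referenced elsewhere on this page.
name: sentiment-analysis
version: 1.0.0
model:
  source: ./model.onnx   # must point at an existing model file
```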
## Best Practices

**Always Build Before Deploying**

Run `gpux build` before deploying to production to catch configuration errors early.

**Use Specific Providers in Production**

Specify the provider explicitly in production to ensure consistent behavior:
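```bash
gpux build --provider cuda
```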
**Enable Optimization**

Keep optimization enabled (default) for better inference performance.

**Check Provider Availability**

The build command will show provider availability. Ensure your target provider is available before deploying.
## Related Commands

- `gpux run` - Run inference on models
- `gpux serve` - Start HTTP server
- `gpux inspect` - Inspect model details