Feature gates
Feature gates control experimental and beta functionality in the DRA Driver for NVIDIA GPUs. They follow Kubernetes feature gate conventions.
Set feature gates
Set feature gates in your Helm values file:
featureGates:
  TimeSlicingSettings: true
  MPSSupport: true
Or pass them at install time:
helm install nvidia-dra-driver-gpu nvidia/nvidia-dra-driver-gpu \
  --set "featureGates.TimeSlicingSettings=true"
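When several gates need to be enabled from the command line, the `--set` flag can be repeated once per gate. The following is a minimal sketch of assembling those flags in POSIX shell. `feature_gate_flags` is a hypothetical helper name, not part of Helm or the driver, and the script only prints the resulting command so it can be reviewed before it is run.

```shell
# Hypothetical helper (not part of Helm or the driver): print one
# "--set featureGates.<name>=true" flag for each gate name given.
feature_gate_flags() {
  flags=""
  for gate in "$@"; do
    # Add a separating space only when flags is already non-empty.
    flags="${flags}${flags:+ }--set featureGates.${gate}=true"
  done
  printf '%s\n' "$flags"
}

# Print (do not run) the install command for two example gates.
echo "helm install nvidia-dra-driver-gpu nvidia/nvidia-dra-driver-gpu $(feature_gate_flags TimeSlicingSettings MPSSupport)"
```

For more than a couple of gates, a values file is usually easier to review and keep under version control than a long list of flags.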
Available feature gates
| Feature gate | Stage | Default | Description |
|---|---|---|---|
| TimeSlicingSettings | Alpha | false | Enables customization of CUDA time-slicing settings in GpuConfig. |
| MPSSupport | Alpha | false | Enables Multi-Process Service (MPS) sharing strategy in GpuConfig and MigDeviceConfig. |
| IMEXDaemonsWithDNSNames | Beta | true | IMEX daemons use DNS names instead of raw IP addresses for peer communication. Required by ComputeDomainCliques. |
| PassthroughSupport | Alpha | false | Enables VFIO passthrough device allocation using VfioDeviceConfig. |
| DynamicMIG | Alpha | false | Enables dynamic MIG device allocation and reconfiguration. |
| NVMLDeviceHealthCheck | Alpha | false | Enables GPU health checking using NVML. |
| ComputeDomainCliques | Beta | true | Uses ComputeDomainClique CRD objects to track IMEX daemon membership. Requires IMEXDaemonsWithDNSNames. |
| CrashOnNVLinkFabricErrors | Beta | true | Causes the kubelet plugin to crash rather than fall back to non-fabric mode when NVLink fabric errors are detected. |
Constraints
The following feature gate combinations are mutually exclusive and will cause a startup error:
| Combination | Reason |
|---|---|
| DynamicMIG + PassthroughSupport | Mutually exclusive |
| DynamicMIG + NVMLDeviceHealthCheck | Mutually exclusive |
| DynamicMIG + MPSSupport | Mutually exclusive |
The following feature gates have hard dependencies:
| Feature gate | Requires |
|---|---|
| ComputeDomainCliques | IMEXDaemonsWithDNSNames |
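The constraints above can be checked before deploying. The following is a minimal sketch of such a pre-flight check in POSIX shell. `has_gate` and `check_feature_gates` are hypothetical helper names and the messages are illustrative, not the driver's own error text. The function expects the full set of gates that will be enabled, including any that are enabled by default, and the driver's own startup validation remains authoritative.

```shell
# Hypothetical helper: succeed if gate $1 appears among the remaining arguments.
has_gate() {
  wanted=$1
  shift
  for g in "$@"; do
    if [ "$g" = "$wanted" ]; then
      return 0
    fi
  done
  return 1
}

# Hypothetical pre-flight check: arguments are the names of all gates that
# will be enabled. Prints one line per violated constraint and returns 1
# if any constraint from the tables above is violated, 0 otherwise.
check_feature_gates() {
  status=0
  # DynamicMIG is mutually exclusive with each of these gates.
  if has_gate DynamicMIG "$@"; then
    for other in PassthroughSupport NVMLDeviceHealthCheck MPSSupport; do
      if has_gate "$other" "$@"; then
        echo "conflict: DynamicMIG + $other"
        status=1
      fi
    done
  fi
  # ComputeDomainCliques requires IMEXDaemonsWithDNSNames.
  if has_gate ComputeDomainCliques "$@" && ! has_gate IMEXDaemonsWithDNSNames "$@"; then
    echo "missing dependency: ComputeDomainCliques requires IMEXDaemonsWithDNSNames"
    status=1
  fi
  return "$status"
}
```

For example, `check_feature_gates DynamicMIG MPSSupport` prints `conflict: DynamicMIG + MPSSupport` and returns 1, while `check_feature_gates TimeSlicingSettings MPSSupport` prints nothing and returns 0.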