Error CUDA 12.8 GPU detection

Environment:
  • RunPod GPU Instance (RTX A5000, 24GB VRAM)
  • Ray Serve deployment
  • Piper TTS model via ONNX Runtime
  • CUDA 12.8
Issue:
Since today, experiencing GPU detection failure when deploying a Ray Serve application with Piper TTS model. The ONNX Runtime correctly detects CUDAExecutionProvider but fails during model initialization.

Error Details:
CUDA failure 100: no CUDA-capable device is detected ; GPU=-1 when using [('CUDAExecutionProvider', {'cudnn_conv_algo_search': 'HEURISTIC'})]
Falling back to ['CPUExecutionProvider'] and retrying.

Has anyone encountered similar GPU detection issues ?

Any guidance would be appreciated! 🙏
Screenshot_2025-08-03_at_23.20.18.png
Screenshot_2025-08-03_at_23.21.23.png
Screenshot_2025-08-03_at_23.30.54.png
Was this page helpful?