Environment: - RunPod GPU Instance (RTX A5000, 24GB VRAM) - Ray Serve deployment - Piper TTS model via ONNX Runtime - CUDA 12.8
Issue: Since today, experiencing GPU detection failure when deploying a Ray Serve application with Piper TTS model. The ONNX Runtime correctly detects CUDAExecutionProvider but fails during model initialization.
Error Details: CUDA failure 100: no CUDA-capable device is detected ; GPU=-1 when using [('CUDAExecutionProvider', {'cudnn_conv_algo_search': 'HEURISTIC'})] Falling back to ['CPUExecutionProvider'] and retrying.
Has anyone encountered similar GPU detection issues ?