Why is only 1 GPU available?
Faster-Whisper worker template is not fully up-to-date
1.0.2, whereas the Runpod template is still on 0.10.0.
Faster-Whisper (now using CUDA 12) has introduced a few changes since then that we would like to benefit from, especially the language_detection_threshold setting. Most of our transcriptions of speakers with British accents are being transcribed into Welsh (with a language detection confidence of around 0.51 to 0.55), which could be circumvented by increasing the threshold...
Slow IO speeds on serverless
How to download models for Stable Diffusion XL on serverless?
2) I created a Stable Diffusion XL endpoint on serverless, but couldn't attach the network storage.
3) After the deployment succeeded, I clicked on edit endpoint and attached that network storage to it. So far so good, I believe. But how exactly do I download various SDXL models into my network storage, so that I can use them via Postman?...
0% GPU utilization and 100% CPU utilization on Faster Whisper quick deploy endpoint

Loading models from network volume cache is taking too long.
Are webhooks fired from Digital Ocean?
AWS#AWSManagedRulesBotControlRuleSet#SignalKnownBotDataCenter. The IP address in these requests seems to belong to a Digital Ocean data center. I have temporarily disabled the WAF on my ALB for my RunPod webhooks, but I am hoping that someone can confirm whether these are legitimate requests or not, because I was under the impression that RunPod uses AWS and not Digital Ocean.
best architecture opinion
Cancelling job resets flashboot
RUNPOD_API_KEY and MAX_CONTEXT_LEN_TO_CAPTURE
Do I need to allocate extra container space for Flashboot?
Thanks...
When serverless is used, does the machine reboot if it is executed consecutively? Currently seeing iss
Slow I/O
Problem with RunPod CUDA base image. Jobs stuck in queue forever
FROM runpod/base:0.4.0-cuda11.8.0
I want the server side to run the input_fn function when I make the request. This is part of the server-side code:
`model = model_fn('/app/src/tapnet/checkpoints/')`...
runpod-worker-a1111 and LoRAs
Intermittent connection timeouts to api.runpod.ai
vLLM streaming ends prematurely
Why are there no GPUs in the Canada data center today?
