Help: Serverless Mixtral OutOfMemory Error
48GB (also tried 80GB)
Container Image: runpod/worker-vllm:0.3.0-cuda11.8.0...Can we add minimum GPU configs required for running the popular models like Mistral, Mixtral?
Severless 404
Unacceptably high failed jobs suddenly
Two Network Volumes
container start command troubleshooting
Active worker keeps downloading images and Im being charged for it

Webhook problem
optimize ComfyUI on serverless
Probleme when writing a multi processing handler
Idle time: High Idle time on server but not getting tasks from queue
Is there a programatic way to activate servers on high demand / peak hours load?
Increasing costs?

[URGENT] EU-RO region endpoint currently only processing one request at a time

Unable to Add Container Registry Auth due to Next.js Crashes

Returning error, but request has status "Completed"
Can I emulate hitting serverless endpoints locally?
python -u handler.py
python -u handler.py
All 27 workers throttled
I'm using SDXL serverless endpoint and sometimes I get an error.
RuntimeError: expected scalar type Float but found Half, Stack Trace: <traceback object at 0x7f779ace2a00>
RuntimeError: expected scalar type Float but found Half, Stack Trace: <traceback object at 0x7f779ace2a00>
API Wrapper