© 2026 Hedgehog Software, LLC

Twitter GitHub Discord

More

Communities Docs About Terms Privacy

GGUF vllm - Runpod

Runpod•2y ago•

15 replies

GGUF vllm

It seems that the newest version of vllm's supports gguf models, have anyone figured out how to make this work in runpod serverless? Seems like need to set some custom ENV vars, or maybe anyone knows a way to convert gguf back to safetensors?

We're a community of enthusiasts, engineers, and enterprises, all sharing insights on AI, Machine Learning and GPUs!

21,202Members

Resources

Recent Announcements

Similar Threads

Was this page helpful?

Similar Threads

GGUF in serverless vLLM

RRunpod / ⚡｜serverless

RRunpod / ⚡｜serverless

vllm +openwebui

RRunpod / ⚡｜serverless

RRunpod / ⚡｜serverless