RunpodR
Runpod2y ago
57 replies
Armyk

GGUF in serverless vLLM

How do I run a GGUF quantized model?
I need to run this LLM: https://huggingface.co/mradermacher/OpenBioLLM-Llama3-70B-GGUF

What parameters should I specify?

Thank you
Was this page helpful?