Selecting a hf quant - Runpod
Runpod • 9mo ago • 3 replies
Bj9000
Hi, I'm using vLLM serverless. Is there a way to specify which quant to use when the model is given as an HF GGUF model directory URL?
Similar Threads
HF Cache (Runpod / ⚡|serverless, 15mo ago)
HF_TOKEN question (Runpod / ⚡|serverless, 2y ago)
Serveless quants (Runpod / ⚡|serverless, 13mo ago)
how to pass hf_overrides to worker-vllm ? (Runpod / ⚡|serverless, 9mo ago)