Search
Star
Feedback
Setup for Free
© 2026 Hedgehog Software, LLC
Twitter
GitHub
Discord
System
Light
Dark
More
Communities
Docs
About
Terms
Privacy
Serveless quants - Runpod
R
Runpod
•
13mo ago
•
9 replies
Bj9000
Serveless quants
Hi
, how do you specify a specific gguf quant file from a hf repo when configuring a vllm serveless endpoint
? Only seems to let you specify the repo level
.
Runpod
Join
We're a community of enthusiasts, engineers, and enterprises, all sharing insights on AI, Machine Learning and GPUs!
21,202
Members
View on Discord
Resources
ModelContextProtocol
ModelContextProtocol
MCP Server
Recent Announcements
Similar Threads
Was this page helpful?
Yes
No
Similar Threads
Streaming responses Serveless Endpoint
R
Runpod / ⚡|serverless
8mo ago
Llama 3.1 + Serveless
R
Runpod / ⚡|serverless
2y ago
Timeout prior to serveless start
R
Runpod / ⚡|serverless
3w ago
Serveless prices and technical questions
R
Runpod / ⚡|serverless
4mo ago