Search
Star
Feedback
Setup for Free
© 2026 Hedgehog Software, LLC
Twitter
GitHub
Discord
System
Light
Dark
More
Communities
Docs
About
Terms
Privacy
Why is 125M from facebook loading into VLLM quickdeploy even though another model is specified? - Runpod
R
Runpod
•
16mo ago
•
1 reply
bradfox2
Why is 125M from facebook loading into VLLM quickdeploy even though another model is specified?
Specified a qwen variant
- get facebook opt125m deployed instead
.
Runpod
Join
We're a community of enthusiasts, engineers, and enterprises, all sharing insights on AI, Machine Learning and GPUs!
21,202
Members
View on Discord
Resources
ModelContextProtocol
ModelContextProtocol
MCP Server
Similar Threads
Was this page helpful?
Yes
No
Similar Threads
slow model loading times with vllm
R
Runpod / ⚡|serverless
9mo ago
VLLM model loading, TTFT unhappy path
R
Runpod / ⚡|serverless
15mo ago
worker-vllm cannot download private model
R
Runpod / ⚡|serverless
3y ago
Model loading time
R
Runpod / ⚡|serverless
4mo ago