Search
Star
Feedback
Setup for Free
© 2026 Hedgehog Software, LLC
Twitter
GitHub
Discord
System
Light
Dark
More
Communities
Docs
About
Terms
Privacy
Serverless vLLM changing engine arguments - Runpod
R
Runpod
•
10mo ago
•
9 replies
Ale
Serverless vLLM changing engine arguments
Hi
, I got vLLM Serverless worker up and running
, but want to change one engine argument
(which is not overridable through environment variables
)
, specifically
--limit-mm-per-prompt
--limit-mm-per-prompt
, how could I do that with your custom image
runpod/worker-v1-vllm:v2.3.0stable-cuda12.1.0
runpod/worker-v1-vllm:v2.3.0stable-cuda12.1.0
that endpoints use
? Thanks
Runpod
Join
We're a community of enthusiasts, engineers, and enterprises, all sharing insights on AI, Machine Learning and GPUs!
21,202
Members
View on Discord
Resources
ModelContextProtocol
ModelContextProtocol
MCP Server
Recent Announcements
Similar Threads
Was this page helpful?
Yes
No
Similar Threads
Serverless VLLM batching
R
Runpod / ⚡|serverless
9mo ago
Serverless vllm - lora
R
Runpod / ⚡|serverless
17mo ago
vLLM Serverless error
R
Runpod / ⚡|serverless
2y ago
Veryyyyyy slow serverless VLLM
R
Runpod / ⚡|serverless
10mo ago