© 2026 Hedgehog Software, LLC
Twitter
GitHub
Discord
System
Light
Dark
More
Communities
Docs
About
Terms
Privacy
Search
Star
Feedback
Setup for Free
Serverless vLLM changing engine arguments - Runpod
R
Runpod
•
12mo ago
•
9 replies
Ale
Serverless vLLM changing engine arguments
Hi
, I got vLLM Serverless worker up and running
, but want to change one engine argument
(which is not overridable through environment variables
)
, specifically
--limit-mm-per-prompt
--limit-mm-per-prompt
, how could I do that with your custom image
runpod/worker-v1-vllm:v2.3.0stable-cuda12.1.0
runpod/worker-v1-vllm:v2.3.0stable-cuda12.1.0
that endpoints use
? Thanks
Similar Threads
Serverless VLLM batching
R
Runpod / ⚡|serverless
10mo ago
Serverless vllm - lora
R
Runpod / ⚡|serverless
2y ago
vLLM Serverless error
R
Runpod / ⚡|serverless
2y ago
Veryyyyyy slow serverless VLLM
R
Runpod / ⚡|serverless
11mo ago