Is it possible to deploy VL models, like the Qwen3-VL series, on Runpod serverless endpoints?
Hey guys, I was wondering if anyone has deployed a mixed or multimodal model on Runpod. I keep running into issues whenever I try to load the model on Runpod using the serverless vLLM repos.