Can you now run Gemma 3 in the vLLM container?
Runpod • 10mo ago

Yobin (OP)
In the serverless, it seems I'm getting an error. Any help on this?
Jason • 3/17/25, 8:45 AM
can you send the error?
Yobin (OP) • 3/17/25, 9:23 AM
I deleted it, but it seems that because Gemma 3 is a new model, the transformers library is relatively outdated, afaik?
Jason • 3/17/25, 9:32 AM
hmm, did you use vLLM? or only transformers?
Jason • 3/17/25, 9:32 AM
yeah, maybe it's a good thing to check compatibility first
Yobin (OP) • 3/17/25, 10:22 AM
I used the preset vLLM; Llama 3.2b worked, but the new Gemma 3 didn't
Dj • 3/17/25, 4:52 PM
vLLM needs to publish an update first, unfortunately
Dj • 3/17/25, 4:53 PM
You can use vLLM directly from the main branch, but that's not super easy if you're using our vLLM template, iirc
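A rough sketch of what "directly from the main branch" could look like if you build your own image; the dependency line below is an assumption about how the install is wired up, not the template's actual file:

```text
# Hypothetical dependency line (e.g. in a requirements.txt you control):
# install vLLM straight from the main branch instead of the latest PyPI
# release, to pick up model support that hasn't shipped in a release yet.
vllm @ git+https://github.com/vllm-project/vllm.git@main
```

Installing from a git ref builds vLLM from source, so expect the image build to take noticeably longer than pulling a released wheel.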
Jason • 3/18/25, 8:01 AM
I think we can update and build our own vLLM template from that vllm-worker repo on GitHub easily
Jason • 3/18/25, 8:01 AM
Just update the requirements.txt, or wherever installs the vLLM
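Roughly, and assuming the worker repo pins vLLM in a requirements.txt (the exact file name and layout may differ), the change is just bumping the pin to a release with Gemma 3 support and rebuilding the worker image:

```text
# requirements.txt (hypothetical excerpt): bump the vLLM pin, then rebuild the image
vllm>=0.8.0   # v0.8.0 is mentioned later in the thread as the release that added Gemma 3 support
```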
Bj9000 • 3/24/25, 4:59 PM
Looks like vLLM v0.8.0 added Gemma 3 support. Will the serverless vLLM be updated soon?
Jason • 3/24/25, 5:05 PM
usually it's delayed, so probably a few days or weeks late
Aizen • 3/30/25, 10:51 AM
Hi, I have the same issue. Have you resolved it? If so, please help me out with it too
Yobin (OP) • 3/30/25, 11:19 AM
I used ollana
Yobin (OP) • 3/30/25, 11:19 AM
Ollama
Aizen • 3/30/25, 12:07 PM
Okay
Jason • 3/30/25, 1:17 PM
yes, I think vLLM is updated already
Javier • 3/31/25, 4:56 PM
I deployed an endpoint to try to call gemma3:4b, but nothing is happening when I call it. Anybody managed?
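For comparison, a minimal sketch of calling a deployed serverless endpoint from Python; the endpoint ID and API key are placeholders, and the input schema (a plain prompt plus sampling_params) is an assumption about the default vLLM worker handler:

```python
import requests

ENDPOINT_ID = "<your-endpoint-id>"    # placeholder
API_KEY = "<your-runpod-api-key>"     # placeholder

# /runsync waits for the job to finish and returns the result in one response;
# use /run instead if you want to submit the job and poll for the result.
resp = requests.post(
    f"https://api.runpod.ai/v2/{ENDPOINT_ID}/runsync",
    headers={"Authorization": f"Bearer {API_KEY}"},
    json={
        "input": {
            "prompt": "Explain in one sentence what Gemma 3 is.",
            "sampling_params": {"max_tokens": 128},
        }
    },
    timeout=300,
)
print(resp.status_code, resp.json())
```

If the call hangs or only returns a queued job status, the worker may still be cold-starting or failing to load the model; the HF access and config points below are the usual suspects.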
Jason • 4/1/25, 1:28 AM
Yes, it works
Jason • 4/1/25, 1:28 AM
just now I've tried it for you!
Attachment: Output-from-gemma3.txt (5.42 KB)
Jason • 4/1/25, 1:29 AM
you need access on HF plus an HF token to access it
Jason • 4/1/25, 1:31 AM
use vLLM to configure it, and check the allow remote code option in the config (in the Runpod menu when configuring vLLM)
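A sketch of what that configuration amounts to; the exact variable names are assumptions about the serverless vLLM worker's settings (set through the endpoint's environment variables / the template form in the Runpod menu):

```text
# Hypothetical endpoint environment variables for the serverless vLLM worker
MODEL_NAME=google/gemma-3-4b-it    # gated repo: request access on Hugging Face first
HF_TOKEN=<your Hugging Face access token>
TRUST_REMOTE_CODE=true             # the "allow remote code" option mentioned above
```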
Jason • 4/1/25, 1:32 AM
second image is the next page