Unable to deploy my LLM on Serverless with the vLLM template
I am trying to deploy a serverless LLM with the vLLM template, but I cannot get it to work. Is there something wrong with my configuration? Ultimately I want to deploy a model I trained myself, but even deploying "meta-llama/Llama-3.1-8B-Instruct" as shown in the tutorials didn't work.
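For context, this is roughly how I expect to query the endpoint once it's live. It's a minimal sketch assuming the OpenAI-compatible route that the vLLM worker documentation describes; `<ENDPOINT_ID>` and `<RUNPOD_API_KEY>` are placeholders for my actual values:

```python
# Minimal smoke test against a Runpod serverless vLLM endpoint.
# Assumes the template exposes the OpenAI-compatible route at
# /v2/<ENDPOINT_ID>/openai/v1, per the vLLM worker docs.
from openai import OpenAI

client = OpenAI(
    api_key="<RUNPOD_API_KEY>",  # Runpod API key, not an OpenAI key
    base_url="https://api.runpod.ai/v2/<ENDPOINT_ID>/openai/v1",
)

response = client.chat.completions.create(
    # Model name should match the MODEL_NAME configured on the endpoint
    model="meta-llama/Llama-3.1-8B-Instruct",
    messages=[{"role": "user", "content": "Say hello."}],
    max_tokens=32,
)
print(response.choices[0].message.content)
```

If the deployment were healthy, I'd expect this to return a short completion, but it doesn't.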