Runpod · 14mo ago
Sven

Serverless vllm - lora

Is there a way to set the LoRA modules (for the vLLM Docker container: --lora-modules lora_adapter1=abc/efg) in the template, or do I need to use the "standard" vLLM container for it?
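For context, this is roughly how the flag in question is used with the upstream vLLM OpenAI-compatible container. A minimal sketch: the base model is a placeholder, the adapter spec is the one from the question, and flag names should be confirmed against the vLLM docs for your version.

```shell
# Sketch of the upstream vLLM OpenAI-compatible server invocation the
# question refers to. meta-llama/Llama-2-7b-hf is a placeholder base model;
# lora_adapter1=abc/efg is the adapter spec from the question.
docker run --gpus all -p 8000:8000 vllm/vllm-openai:latest \
    --model meta-llama/Llama-2-7b-hf \
    --enable-lora \
    --lora-modules lora_adapter1=abc/efg
```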
7 Replies
Unknown User · 14mo ago — message not public
Sven (OP) · 14mo ago
There are only options for enabling LoRA and so on; the option for adding the adapter URL is missing.
Unknown User · 14mo ago — message not public
Sven (OP) · 14mo ago
For the RunPod vLLM serverless worker, these are the available environment variables: https://docs.runpod.io/serverless/workers/vllm/environment-variables
Environment variables | RunPod Documentation
Configure your vLLM Worker with environment variables to control model selection, access credentials, and operational parameters for optimal performance. This guide provides a reference for CUDA versions, image tags, and environment variable settings for model-specific configurations.
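A template configuration based on that docs page might look like the sketch below. The variable names are assumptions to be checked against the linked reference; the point of this thread is that, at the time, there was a switch to enable LoRA but no variable for registering adapter paths.

```shell
# Hypothetical RunPod vLLM worker template env vars; confirm exact names
# against the environment-variables docs page linked above.
MODEL_NAME=meta-llama/Llama-2-7b-hf   # placeholder base model
ENABLE_LORA=true                      # LoRA enable switch
# No variable existed here to register adapters (the --lora-modules gap).
```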
Unknown User · 14mo ago — message not public
Hawk · 13mo ago
Thanks Sven for raising this. Crazy that the enable_lora flag was added but not the argument to actually add LoRA adapters. Hoping this gets merged soon.
Sven (OP) · 13mo ago
The pull request was merged. Does anyone know when the new version will appear on RunPod? (Also the vLLM worker with the 1.7.4 SDK?)
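Once adapter registration is available, a registered adapter is selected at request time by passing its name as the `model` field of an OpenAI-compatible request. A minimal sketch, using the adapter name from the question's example flag; this only builds the request body, since sending it requires a running endpoint.

```python
import json

# An adapter registered as "lora_adapter1" (per the question's example
# --lora-modules lora_adapter1=abc/efg) is selected by using that name as
# the "model" field of an OpenAI-compatible completion request.
payload = {
    "model": "lora_adapter1",  # registered adapter name, not the base model
    "prompt": "Summarize LoRA in one sentence.",
    "max_tokens": 32,
}
body = json.dumps(payload)
print(body)
```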
