Runpod•2y ago
Hermann

Can we run aphrodite-engine on Serverless?

aphrodite-engine is a fork of vLLM that also supports the exl2 format, which gives it a big advantage. Are there any plans to support aphrodite-engine on RunPod's serverless offering in the future? I believe aphrodite-engine is currently only supported as a single server on RunPod. Thanks
8 Replies
Madiator2011•2y ago
I mean, nothing stops you from building your own worker 🙂
HermannOP•2y ago
How? lol
Madiator2011•2y ago
All the workers are open source, so you can have a look at the code and build your own. It takes some coding and then packaging it as a Docker image.
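To give a rough idea of what "some coding" means here, below is a minimal sketch of a custom worker, not the official worker-vllm code. It assumes the `runpod` Python SDK is installed in the image and that an aphrodite-engine OpenAI-compatible server is already running inside the same container. The URL, port, default model name, and the `build_payload` helper are all hypothetical and only for illustration.

```python
import json
import urllib.request

# Assumption: an aphrodite-engine OpenAI-compatible server is already running
# in the same container at this address (hypothetical URL and port).
ENGINE_URL = "http://127.0.0.1:2242/v1/completions"


def build_payload(job_input):
    """Translate a RunPod job input into an OpenAI-style completion request."""
    if "prompt" not in job_input:
        raise ValueError("job input must contain a 'prompt' field")
    return {
        "model": job_input.get("model", "default"),  # hypothetical model name
        "prompt": job_input["prompt"],
        "max_tokens": int(job_input.get("max_tokens", 256)),
        "temperature": float(job_input.get("temperature", 0.7)),
    }


def handler(job):
    """Called once per queued job; the return value becomes the job output."""
    payload = build_payload(job["input"])
    request = urllib.request.Request(
        ENGINE_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=300) as response:
        return json.loads(response.read().decode("utf-8"))


def main():
    """Entry point for the container: hand the handler to the RunPod SDK."""
    import runpod  # imported here so the helpers above work without the SDK

    runpod.serverless.start({"handler": handler})
```

In a real image you would call `main()` at the bottom of the file, start the engine before the worker (for example from the container's start script), and then package everything into a Docker image as described above.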
HermannOP•2y ago
Seriously, I would be interested in building my own queue and hosting aphrodite-engine on a single pod instead.
HermannOP•2y ago
GitHub
GitHub - runpod-workers/worker-vllm: The RunPod worker template for...
The RunPod worker template for serving our large language model endpoints. Powered by vLLM. - runpod-workers/worker-vllm
HermannOP•2y ago
Or is there any other one you could recommend?
Madiator2011•2y ago
I don't know what aphrodite-engine is, so I can't say for sure, but yes, if it's a fork of vLLM, that will be a good starting point.