tanawatl
tanawatl
RRunPod
Created by tanawatl on 5/24/2025 in #⚡|serverless
Which file of git worker-template handle vllm ?
I need to add langchain to vllm , which file in git that is core llm initialize and able to modify ?
3 replies
RRunPod
Created by tanawatl on 5/21/2025 in #⚡|serverless
Cold start issue
I stuck with cold start issue,that make the response very slow when make a new request after a longtime. Are there anyways to solve this issues ?
4 replies
RRunPod
Created by tanawatl on 5/20/2025 in #⚡|serverless
About building container with Git repo
I'm not sure, can I use buildx command in 'Container start commnd'. And from chatGPT, it said it need to push image to Docker hub before using. This is my command , is it valid ? docker buildx create --name mybuilder --use docker buildx build \ --builder mybuilder \ --cache-from type=local,src=./docker-cache \ --cache-to type=local,dest=./docker-cache \ --load \ -t runpod_worker_typhoon:latest . echo "$PASSWORD" | docker login -u "$USERNAME" --password-stdin docker tag runpod_worker_typhoon:latest bluegoatai/runpod_worker_typhoon:latest docker push bluegoatai/runpod_worker_typhoon:latest
10 replies
RRunPod
Created by tanawatl on 5/17/2025 in #⚡|serverless
Some query take a long time than usual
No description
10 replies
RRunPod
Created by tanawatl on 5/16/2025 in #⚡|serverless
The total token limit at 131
No description
21 replies