Container Image: vllm/vllm-openai:latest) because serverless was getting very expensive.ai SDK to call one of the three pods (I just choose one of the three randomly). This works okay as a fake load balancer, but sometimes the pods are all busy and I fail with: