Atherion Comments - Answer Overflow

Atherion

•Created by LisT_99 on 2/6/2025 in #⚡｜serverless

vLLM serverless output cutoff

I was running into the same issue. There is a key for sampling_params, found the solution here https://docs.runpod.io/serverless/workers/vllm/get-started#sample-api-requests

5 replies