Runpod · 6mo ago
tanawatl

Total token limit stuck at 131

I use vLLM with max model length set to 8000 and max tokens set to 2048, but the output is only 131 tokens (total, output + input), even though I have set max tokens to 2048. I tried 2 models and the result is the same.
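For reference, a minimal sketch of the token budget arithmetic in question. It assumes the usual rule that prompt and completion tokens share the context window (max model length) and that the completion is additionally capped by max tokens. The function name and the prompt sizes are made up for illustration.

```python
def completion_budget(max_model_len: int, max_tokens: int, prompt_tokens: int) -> int:
    """Upper bound on generated tokens under the assumptions stated above."""
    if prompt_tokens >= max_model_len:
        # The prompt alone fills (or overflows) the context window.
        return 0
    # The completion is limited by max_tokens and by the room left in the window.
    return min(max_tokens, max_model_len - prompt_tokens)


# Settings from the post, with an assumed 100-token prompt.
print(completion_budget(max_model_len=8000, max_tokens=2048, prompt_tokens=100))
```

With the settings from the post this bound is far above 131, which suggests the max tokens value may not have reached the engine at all.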
8 Replies
riverfog7 · 6mo ago
Does Runpod do JSON schema validation?
Why did that invalid JSON not cause an error?
riverfog7 · 6mo ago
It should return a 4xx error (if they do any validation).
riverfog7 · 6mo ago
4xx: essentially it's your fault
5xx: uh oh, I messed up
3xx: go somewhere else
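The status classes mentioned above can be sketched as a small lookup on the first digit of the code. The function name is hypothetical; the class names follow the standard HTTP groupings.

```python
def status_class(code: int) -> str:
    """Map an HTTP status code to its class, based on the first digit."""
    classes = {
        1: "informational",
        2: "success",
        3: "redirection",   # go somewhere else
        4: "client error",  # the request was at fault
        5: "server error",  # the server was at fault
    }
    if code // 100 not in classes:
        raise ValueError(f"not a valid HTTP status code: {code}")
    return classes[code // 100]


print(status_class(422))
```

A request with invalid JSON would fall in the client error class, which is why a 4xx response is the expected outcome when validation is in place.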
riverfog7 · 6mo ago
That's unfortunate. Pydantic schema and JSON validation would be nice.
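A rough sketch of the kind of strict validation being wished for here. The field names (`prompt`, `max_tokens`) and the payload shape are assumptions made for illustration, not Runpod's actual request schema. Pydantic could express the same rules declaratively (for example with `extra="forbid"`); this version sticks to the standard library so it runs anywhere.

```python
import json

# Hypothetical request schema for illustration only (not Runpod's actual API).
REQUIRED = {"prompt": str}
OPTIONAL = {"max_tokens": int}


def validate_request(raw: str) -> tuple:
    """Return (200, payload) if valid, else (4xx status, error message)."""
    try:
        payload = json.loads(raw)
    except json.JSONDecodeError as exc:
        return 400, f"invalid JSON: {exc.msg}"
    if not isinstance(payload, dict):
        return 422, "payload must be a JSON object"

    # Reject misspelled or misplaced fields instead of silently ignoring them.
    unknown = sorted(set(payload) - set(REQUIRED) - set(OPTIONAL))
    if unknown:
        return 422, f"unknown fields: {unknown}"
    missing = sorted(set(REQUIRED) - set(payload))
    if missing:
        return 422, f"missing fields: {missing}"

    for name, expected in {**REQUIRED, **OPTIONAL}.items():
        if name not in payload:
            continue
        value = payload[name]
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, expected):
            return 422, f"{name} must be of type {expected.__name__}"
    return 200, payload


print(validate_request('{"prompt": "hi", "max_token": 2048}')[0])
```

The point of rejecting unknown fields is that a typo such as `max_token` produces a 4xx response right away, rather than the setting being dropped and a default limit applied without warning.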
