© 2026 Hedgehog Software, LLC
Twitter
GitHub
Discord
System
Light
Dark
More
Communities
Docs
About
Terms
Privacy
Search
Star
Feedback
Setup for Free
The total token limit at 131 - Runpod
R
Runpod
•
11mo ago
•
20 replies
tanawatl
The total token limit at 131
I use vLLM and set max model length to 8000 a2048 but out is just 131
(total out
+ in
)
, although i have set max tokens to 2048
. I try with 2 models and result is the same
.
Recent Announcements
Similar Threads
Serverless SGLang - 128 max token limit problem.
R
Runpod / ⚡|serverless
15mo ago
Total crash | HELP!!!!
R
Runpod / ⚡|serverless
5mo ago
Automate the generation of the ECR token in Serverless endpoint?
R
Runpod / ⚡|serverless
3y ago
Worker Limit Increase
R
Runpod / ⚡|serverless
4d ago