if it's a text generation model you can lower the `max_tokens`

if it's a text generation model you can lower the max_tokens
Was this page helpful?