Found a somewhat weird bug: models that start with `@hf/thebloke` won't generate a response if max_t

Found a somewhat weird bug: models that start with @hf/thebloke won't generate a response if max_tokens is set to 597 or higher. (Fails with "3025: Unknown internal error".)
Was this page helpful?