Found a somewhat weird bug: models that start with `@hf/thebloke` won't generate a response if max_t
Found a somewhat weird bug: models that start with
@hf/thebloke won't generate a response if max_tokens is set to 597 or higher. (Fails with "3025: Unknown internal error".)



