Runpod•2mo ago

How can I create machine-readable output in a specific order?

I created vLLM serverless and enabled GUIDED_DECODING_BACKEND and access it with API with

extra_body={
"guided_json": {

it does enforce the json output, but the order of properties is not enforced, despite both backends claiming to enforce this (LMFE_STRICT_JSON_FIELD_ORDER = "true").

What's the easiest way to do inference with LoRA confined to structured outputs in a specific order from a small LLM?

Doesn't need to be json. Just output reasoning first, then the answer components, in a way that each component can be 100% extracted without any extra garbage included?

How can I create machine-readable output in a specific order?

Similar Threads

Similar Threads

Similar Threads