And what models does this occur on?
And what models does this occur on?
@hf/nousresearch/hermes-2-pro-mistral-7b and @cf/meta/llama-3.3-70b-instruct-fp8-fast) behaves differently.<tool_call> label in stream mode, as shown in Image 1.

tool_calls field rather than response field, just like non-stream mode







{"code":10000,"message":"Authentication error"} in the REST response, but do get {"message":"Server Error","code":6001}, anyone seen this error before?bge-base-en-v1.5 (768 dimension) model$ curl https://api.cloudflare.com/client/v4/accounts/MY_ACCOUNT_ID/ai/run/@cf/mistral/mistral-7b-instruct-v0.2-lora \
-H 'Authorization: Bearer MY_API_TOKEN' \
-d '{
"messages": [{"role": "user", "content": "Hello world"}],
"raw": true,
"lora": "MY_LORA_ID"
}'
{"errors":[{"message":"AiError: AiError: Finetune MY_LORA_ID not found (07af68df-18b6-4a5d-bfeb-3453830da3be)","code":3037}],"success":false,"result":{},"
messages":[]}AMAAN KHAN Mobile: (+92) 321 322 578 Email: a4amaan@asd.com SUMMARY: ▪ 7+ Years of Software Development Experience. ▪ Experience in design, development of web based applications. ▪ Experience in Python: Django, Flask, Django Rest Framework, JWT Authentication, SaaS Based Applications. ▪ Experience in Web Development: HTML, CSS, Javascript, Nodejs, Express, Mysql, Mongodb ▪ Experience Developing SPA in VUE Js, VUEX, LocalStorage along with Rest APIs in Django Rest Framework. ▪ Experience in const answer = await ctx.env.AI.autorag("ats-rag").aiSearch({
query: 'Who is Amaan Khan',
// model: "@cf/meta/llama-3-8b-instruct",
model: "@cf/meta/llama-3.3-70b-instruct-fp8-fast",
rewrite_query: true,
max_num_results: 10,
ranking_options: {
score_threshold: 0.4,
},
stream: true
});
return new Response(answer, {
headers: {
'Content-Type': 'text/event-stream'
}
});"logs": [
{
"message": [
"SyntaxError: Unexpected token 'd', \"data: {\"no\"... is not valid JSON"
],
"level": "error",
"timestamp": 1744242318653
}
]