issue with a LoRa finetune on the Workers AI platform
The REST API for running inference (/run) fails with a "model does not exist" error, even though the management API (/finetunes) correctly lists the finetune as existing under the same credentials.
1. Proof the Finetune Exists (Successful GET Request): This curl command, using my API token, correctly lists the finetune, proving it exists and my credentials are valid. curl -X GET "https://api.cloudflare.com/client/v4/accounts/ACCOUNT_ID/ai/finetunes" \ -H "Authorization: Bearer {token} Result: {"success":true,"result":[{"id":"214de035-8495-4c51-af6f-6687a2c29fdd","name":"my-gemma-finetune", ...}]}
2,2. Proof the Inference Call Fails (Failing POST Request): This curl command, using the exact same credentials and the ID from the successful call above, fails. curl -X POST "https://api.cloudflare.com/client/v4/accounts/{account_id}/ai/run/@cf/google/gemma-3-12b-it" \ -H "Authorization: Bearer {token} \ -d '{"messages":[{"role":"user","content":"..."}],"lora":"214de035-8495-4c51-af6f-6687a2c29fdd"}' Result: {"errors":[{"message":"AiError: AiError: The model 214de035-8495-4c51-af6f-6687a2c29fdd does not exist. (01e3d7a2-3518-4e2d-a1a5-48b09dba46a0)","code":3043}]}
Happy to send you the AccounT, Finetune ID I have already tried deleting and re-uploading the finetune via wrangler, which resulted in a new ID but the same error pattern. This confirms the issue is not related to a specific finetune asset.
• Failing Request ID: 01e3d7a2-3518-4e2d-a1a5-48b09dba46a0 (from the error message)Could you please have your engineering team investigate this internal discrepancy using the provided Request ID?