yeah im running into the same issue as well im getting an absolute ton of 429 exceeded capacity erro
yeah im running into the same issue as well im getting an absolute ton of 429 exceeded capacity errors when i try to use workersai



@cf/google/embeddinggemma-300m Does anyone know the price of this model?
AiError: 9000: model temporarily unavailable when trying to use @cf/baai/bge-m3is there a status page for Workers AI and model capacity? https://www.cloudflarestatus.com/ says all systems operational
nova-3 work with language spanish websocket via REST API, i'm testing this with the repository: https://github.com/cloudflare/realtime-examples/tree/main/ai-tts-stt when setting the language to 'es' or 'es-419' it gives me a 500 error: Failed to establish Nova WebSocket: 500 while the websocket keeps running, it works for english only, can anyone help me? error: '4002: could not route request to AI model', with model: '@cf/baai/bge-m3',Error: 3030: 1 validation error for VllmBatchRequest does anyone know what doesn't mean and how to resolve it?InferenceUpstreamError: error code: 1031 and now I'm getting InferenceUpstreamError: <!DOCTYPE html>... where it is returning CF html error page, with 500 error. I've tried using my production worker, and then in dev with --remote and dev locally. In all cases it returns some variation of the above but all fail. The code worked fine yesterday and hasn't changed. It fails at the call "@cf/qwen/qwen2.5-coder-32b-instruct". All other requests appear to work.const embeddings: any = await env.AI.run("@cf/baai/bge-m3", {
text: [prompt],
});@cf/google/embeddinggemma-300m{
"errors": [
{
"message": "AiError: AiError: model temporarily unavailable (58da3f75-5ff1-46ba-a45e-a59d4ada1c33)",
"code": 9000
}
],
"success": false,
"result": {},
"messages": []
}AiError: 9000: model temporarily unavailable@cf/baai/bge-m3nova-3 error: '4002: could not route request to AI model', model: '@cf/baai/bge-m3',Error: 3030: 1 validation error for VllmBatchRequest.venv/lib/python3.10/site-packages/llama_index/embeddings/cloudflare_workersai/base.py", line 118, in _aget_text_embeddings
return resp["result"]["data"]
KeyError: 'data'InferenceUpstreamError: error code: 1031InferenceUpstreamError: <!DOCTYPE html>..."@cf/qwen/qwen2.5-coder-32b-instruct"