Hi Cloudflare team, I've been working with AutoRAG and it's a great feature for building RAG syste

Hi Cloudflare team,


I've been working with AutoRAG and it's a great feature for building RAG systems. However, I notice it's currently limited to models available in Workers AI.
Is there any plan to support custom model providers and custom embedding providers in AutoRAG?

Specifically:
Custom LLM providers (OpenAI, Claude, ETC)
Custom embedding providers (OpenAI embeddings, specialized models)
Bring-your-own-model support

This would enable using specialized embeddings for specific domains, cost optimization across providers, and better architectural flexibility for enterprise use cases.
Is this being considered for the roadmap, or is there currently an undocumented way to achieve this?

Thanks for the great work on Workers AI.
Was this page helpful?