No, this is just the good old chat completion API. It’s looking like the compat endpoint is very slow, especially when running from a local dev server. It’s certainly not the network between my laptop and Cloudflare: the total request time in my browser is a bit over 20 seconds, and on the AI gateway I can see that the LLM call from the gateway to Google Gemini alone took 19188 ms, so nearly all of the time is spent on that leg.