Well I thought that maybe it has to load the model into memory the first time it runs on a worker so it takes so long and following requests are faster
does flux work for anyone? i'm getting AiExternal: Couldn't fetch external AI provider response (500, Internal Server Error)AiExternal: Couldn't fetch external AI provider response (500, Internal Server Error)
I'm using Llama via api.cloudflare with Bearer Auth. Where can I check my usage? If I exceed 10,000 free neurons, am I charged automatically or does it stop working?