I am not sure what has happened with

I am not sure what has happened with workflows product recently, but since yesterday, the product just doesn't work for me, i've had to stop everything, and now looking at alternative solutions. I can't trigger / pause / resume / terminate any workflows, l met with the error code 10001 as earlier. Over 3000 workflows failed yesterday with the exact same error This instance experienced an internal error which prevented it from running. If this is persistent, please contact support: https://cfl.re/3WgEyrH Any new workflow l try to start this morning goes straight into queued state and will never run. I have 4K+ messages going into dead-letter-queues because workflows can't be instantiated. @James / @Diogo Ferreira any help would be appreciated, as this product was working perfectly up until yesterday moring.
38 Replies
Ollie
OllieOP•2d ago
Untouchable instances, some are 2 days+ old
No description
Ollie
OllieOP•2d ago
Even the cf dashboard cannot fetch information about workflows:
No description
Ollie
OllieOP•2d ago
waited some time, and tried to trigger a few hundred instances, again either inactive or stuck running, i am a bit shocked how this is product has been broken for over 24hrs now
No description
Ollie
OllieOP•2d ago
@Thomas Ankcorn i am not sure if you could help on any of the above?
Thomas Ankcorn
Thomas Ankcorn•2d ago
Thanks Ollie, flagging with the team
Diogo Ferreira
Diogo Ferreira•2d ago
Hi Ollie, sorry for the issues you are seeing. Since yesterday, we have started monitoring your account, and we have initiated a cleanup process. This was the cause of the 3000 workflow instances failed, as during the cleanup process, the system purges irrecoverable instances. Regarding the new instances created this morning, they should have run normally, which doesn't see to be the case. We are looking into it as we speak.
Ollie
OllieOP•2d ago
Ah amazing, and thanks a lot, sorry to sound like am complaining, l do love the cloudflare products and everything has been working so smoothly, i've ran almost 1 million workflow instances, so real credit to the platform being able to handle all that
Diogo Ferreira
Diogo Ferreira•2d ago
Let me also point out, that the issue is limited to your account.
Ollie
OllieOP•2d ago
Good to know
Diogo Ferreira
Diogo Ferreira•2d ago
Not that it is of any help to you, I know, but just for the sack of clarification
Ollie
OllieOP•2d ago
Deffo good to know, otherwise one can start to wonder if there is a wide spread issue, and then just constantly spamming the status page Also some new instances that i've been triggering seem to be at least running, not sure if all of them are but some of them are, there just seems to be a backlog of stuck "queued" ones
Diogo Ferreira
Diogo Ferreira•2d ago
Ollie, does this issue still persist for you?
Ollie
OllieOP•2d ago
Not always, so say if l try to load the page, maybe 2/5 fail to load, but this happens mainly on workflow heavy pages, e.g. one of the workflows that has ran 800,000+ times so l wonder if its calculating the stats per page load or not?
Diogo Ferreira
Diogo Ferreira•2d ago
Yes, that can affect it @Ollie I can see that for the last hour, your Workflows are being completed. Yes, there are some instances still in running state (we are looking into those), but newer instances are running correctly. Can you please confirm?
Ollie
OllieOP•2d ago
Yep this looks to be correct
Ollie
OllieOP•2d ago
I can see from my external logging thrings are running but now l can no longer see via the cf dashboard:
No description
Diogo Ferreira
Diogo Ferreira•2d ago
That's strange, I can query your account with no issues
Ollie
OllieOP•2d ago
Yeah really weird, still not able to see anything on it, tried on a different PC as well, nothing Same on my mobile, just checked
Diogo Ferreira
Diogo Ferreira•2d ago
All with the same internal_server ?
Ollie
OllieOP•2d ago
Diogo Ferreira
Diogo Ferreira•2d ago
Hummm
Ollie
OllieOP•2d ago
Yeah really weird, and also sometimes it never even makes that network request when reloading Yeah all of the other pages work fine except for any pages under workflows On reloads l seem to get a flash of the page structure loading then it vanishes
Diogo Ferreira
Diogo Ferreira•2d ago
Can you check now, please?
Ollie
OllieOP•2d ago
am not sure what magic you did but that works SO l can see its cleared up a lot now, only have around 65 instances stuck in a running state now
Diogo Ferreira
Diogo Ferreira•2d ago
Yep, I can also see that. We are trying to understand what is causing that
Ollie
OllieOP•2d ago
thanks a lot for the help so far btw
Diogo Ferreira
Diogo Ferreira•2d ago
I believe you are facing a race condition here
Ollie
OllieOP•2d ago
im putting the workflows through their paces then 😛
Jadugar_Jaggu
Jadugar_Jaggu•this hour
Hey, I'm facing the same issue. I think it's related to the Cloudflare dashboard—maybe it was recently updated to a new version, and older versions cached in users' browsers aren't able to fetch the latest version correctly. It could also be a conflict between the old and new versions of the dashboard. Everything was working fine until yesterday, so it's possible that some changes made by the Cloudflare frontend team in the last 24 hours triggered this issue. That said, if you open the dashboard in an incognito tab, everything works fine—probably because incognito loads the latest version without using the cached files.
No description
Jadugar_Jaggu
Jadugar_Jaggu•this hour
this is not related to only workflows but may be related to the frontend of cloudflare
No description
Diogo Ferreira
Diogo Ferreira•this hour
Yes, we had a regression with a Front-end release, which was already reverted. Clearing your cache will fix it.
Jadugar_Jaggu
Jadugar_Jaggu•this hour
Yes after doing force reload ( ctrl + shift + R ) it fixed
Diogo Ferreira
Diogo Ferreira•23h ago
@Ollie you have stop creating/running new instances, correct?
Ollie
OllieOP•23h ago
Yeah l didn't want to interfer with your debugging so just stopped all
Diogo Ferreira
Diogo Ferreira•23h ago
You can continue
Ollie
OllieOP•21h ago
Ah perfect will do All new instances of workflows seem to be running pretty smoothly, though am keeping numbers low, should l start to ramp up to see if the race condition gets hit again? Oh dear @Diogo Ferreira we are back to the same issue after having ramped up: { code: 10001, message: "workflows.api.error.internal_server" }
Ollie
OllieOP•21h ago
Instances are starting to pile up and get stuck:
No description
Ollie
OllieOP•21h ago
CF Dashboards not loading anymore for me either:
No description

Did you find this page helpful?