bit difficult to reproduce that, i'm now pretty sure it's caused by a lot of load by some kind of "message recursion" (in my code), but CF should be blocking at some point and not hanging (it already did)
+ i can't really change something in the client code that quickly since it's all native apps (deployed in a restaurant on 10+ embedded devices lol)
Looks like a WebSocket connection was established at 17:43... Then we get a few more invocations (RPC or HTTP) all successful until we see two requests cancelled at 17:46. There are two WebSocket messages which are sent before the requests are cancelled.
A trace would be helpful because we can see what is happening when you say it is unresponsive, but from our end it looks like nothing is actually happening...
This was the confusing part to me as well, I was looking through observability and nothing was visible.
The client called me a few minutes later and then were saying "nothing works", I immediately checked Observability - nothing. Then used a mounted Outerbase instance which uses the RPC connection - infinitely loading with a pending request.
I deployed a new version, this reset the DO and everything was back to normal.
It's worth mentioning that your DO dropped the server side of the WebSocket connection at 17:45. When you say the DO was not responding was that to WebSocket messages? Could you have been sending them on a closed pipe?
I have not received an answer yet to my ticket, but I also downgraded its priority to normal, since in my case it was most certainly caused by an overload due to my code.
Stranglely enough, my problem was also resolved by deploying a new version of the worker containing the DO, only added a console.log statement, so it should have been something else than an implementation issue in our code