Crawlee & Apify · 5mo ago
extended-yellow

Not scheduling new tasks - system is overloaded - gcp cloud run

I'm getting this "system is overloaded" message while trying to scrape just two URLs. The check has been looping for almost 10 minutes now. I set the CPU to 4 and the memory to 4 GB, but I'm still getting the message. I know Cloud Run doesn't like threads and background tasks; is that the real issue? Not sure, so I'm wondering if anyone has run Crawlee on Cloud Run.
[crawlee.events._event_manager] DEBUG LocalEventManager.on.listener_wrapper(): Awaiting listener task...
[crawlee.events._event_manager] DEBUG LocalEventManager.on.listener_wrapper(): Awaiting listener task...
[crawlee._autoscaling.autoscaled_pool] DEBUG Not scheduling new tasks - system is overloaded
[crawlee.storages._request_queue] DEBUG There are still ids in the queue head that are pending processing ({"queue_head_ids_pending": 1})
[crawlee._utils.system] DEBUG Calling get_memory_info()...
[crawlee._autoscaling.autoscaled_pool] DEBUG Not scheduling new tasks - system is overloaded
[crawlee.storages._request_queue] DEBUG There are still ids in the queue head that are pending processing ({"queue_head_ids_pending": 1})
[crawlee._autoscaling.autoscaled_pool] DEBUG Not scheduling new tasks - system is overloaded
[crawlee.storages._request_queue] DEBUG There are still ids in the queue head that are pending processing ({"queue_head_ids_pending": 1})
[crawlee._utils.system] DEBUG Calling get_cpu_info()...
[crawlee._autoscaling.autoscaled_pool] DEBUG Not scheduling new tasks - system is overloaded
[crawlee.storages._request_queue] DEBUG There are still ids in the queue head that are pending processing ({"queue_head_ids_pending": 1})
[crawlee._autoscaling.autoscaled_pool] DEBUG Not scheduling new tasks - system is overloaded
[crawlee.events._event_manager] DEBUG LocalEventManager.on.listener_wrapper(): Awaiting listener task...
[crawlee.events._event_manager] DEBUG LocalEventManager.on.listener_wrapper(): Awaiting listener task...
[crawlee._autoscaling.autoscaled_pool] DEBUG Not scheduling new tasks - system is overloaded
[crawlee.storages._request_queue] DEBUG There are still ids in the queue head that are pending processing ({"queue_head_ids_pending": 1})
[crawlee._utils.system] DEBUG Calling get_memory_info()...
[crawlee._autoscaling.autoscaled_pool] DEBUG Not scheduling new tasks - system is overloaded
[crawlee.storages._request_queue] DEBUG There are still ids in the queue head that are pending processing ({"queue_head_ids_pending": 1})
[crawlee._autoscaling.autoscaled_pool] DEBUG Not scheduling new tasks - system is overloaded
[crawlee.storages._request_queue] DEBUG There are still ids in the queue head that are pending processing ({"queue_head_ids_pending": 1})
[crawlee._utils.system] DEBUG Calling get_cpu_info()...
[crawlee._autoscaling.autoscaled_pool] DEBUG Not scheduling new tasks - system is overloaded
[crawlee.storages._request_queue] DEBUG There are still ids in the queue head that are pending processing ({"queue_head_ids_pending": 1})
[crawlee._autoscaling.autoscaled_pool] DEBUG Not scheduling new tasks - system is overloaded
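
For anyone reading the log above: the pattern (resource checks, then "Not scheduling new tasks") is what you would expect from a pool that samples CPU and memory and holds back work while too many recent samples look overloaded. Here is a rough sketch of that idea. It is not Crawlee's actual code; the `Snapshot` type, the function names, and all thresholds are made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class Snapshot:
    """One hypothetical resource sample (not a Crawlee type)."""
    cpu_ratio: float     # fraction of CPU in use, 0.0 to 1.0
    memory_ratio: float  # fraction of memory in use, 0.0 to 1.0


def is_overloaded(snapshots, max_cpu=0.95, max_memory=0.9, max_overloaded_share=0.2):
    """Return True when too many recent samples exceed a limit.

    All three thresholds are assumed values chosen for this sketch.
    """
    if not snapshots:
        return False
    overloaded = sum(
        1 for s in snapshots
        if s.cpu_ratio > max_cpu or s.memory_ratio > max_memory
    )
    return overloaded / len(snapshots) > max_overloaded_share


def tasks_to_schedule(pending, running, max_concurrency, snapshots):
    """How many new tasks to start right now; zero while overloaded."""
    if is_overloaded(snapshots):
        return 0  # the "Not scheduling new tasks" branch
    return max(0, min(pending, max_concurrency - running))
```

In a model like this, a pending request stays in the queue for as long as the sampled readings stay above the limit, even if only one or two URLs are waiting, which would look like the loop in the log.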
2 Replies
Hall · 5mo ago
Someone will reply to you shortly. In the meantime, this might help:
Mantisus · 5mo ago
I haven't run this on GCP, but I have several projects on Apify that use Crawlee for Python, and they work well. Could you tell me more about the configuration you are running? Crawlee is built on asyncio, but yes, it uses threads to handle non-asynchronous code.
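
To illustrate the general "asyncio plus threads" pattern mentioned here, below is a minimal standard-library sketch of how async code can hand blocking work to a worker thread. It only shows the technique and does not reproduce Crawlee's internals; `blocking_parse`, `handle`, and the example URLs are hypothetical.

```python
import asyncio
import threading
import time


def blocking_parse(url: str) -> str:
    """Stand-in for synchronous, blocking work (hypothetical helper)."""
    time.sleep(0.05)  # simulate blocking I/O or CPU-bound parsing
    return f"{url} parsed in {threading.current_thread().name}"


async def handle(url: str) -> str:
    # asyncio.to_thread (Python 3.9+) runs the blocking call in a worker
    # thread so the event loop stays free for other tasks.
    return await asyncio.to_thread(blocking_parse, url)


async def main(urls):
    # Process all URLs concurrently; results come back in input order.
    return await asyncio.gather(*(handle(u) for u in urls))


if __name__ == "__main__":
    for line in asyncio.run(main(["https://example.com/a", "https://example.com/b"])):
        print(line)
```

The worker threads here only live while the process has CPU time, so how the platform allocates CPU outside of request handling is worth checking when running this kind of code in a container service.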
