crawlee-js
apify-platform
crawlee-python
💻hire-freelancers
🚀actor-promotion
💫feature-request
💻devs-and-apify
🗣general-chat
🎁giveaways
programming-memes
🌐apify-announcements
🕷crawlee-announcements
👥community
What does produce this error?
setCookie and session.getCookies don't work together
One-proxy, many-sessions?
proxyUrl
from which any number of connections can be opened, i.e. each connection having a different IP despite proxyUrl
all being seemingly the same. I wonder, is Crawlee able to create e.g. 100 Session
s (and rotate upon retire
/markBad
), despite there only being one proxyUrl
specified? From reading the docs, a round robin rotation mechanism is referred to and a sessionId-proxyUrl pair is mentioned, so I get the sense that Crawlee every one Session
locks exactly one proxyUrl
, making redundant the setting sessionPoolOptions.maxPoolSize > 0
. In other words, each proxyConfiguration.proxyUrl
can have max. 1 session attached to it.
Hypothesis: When ProxyConfiguration.proxyUrls.length === 1
, even though e.g. sessionPoolOptions.maxPoolSize === 100
, Crawlee can/will only create one Session
because Crawlee thinks there's only one IP available (or something). Does this happen?
Sorry for long winded question, it's my first time using Crawlee and I'm unsure how the details fit together. Thanks for any attention this may get....Request works in Postman but doesnt work with Cheerio Crawler, request object headers empty
Retire session after request handler timed out
parallel Login Scraping
Error: browserController.newPage() failed on basic puppeteer example
Elements not rendering
Pupeteer unable to find element (dev tools show the element)

running multiple scrapers with speed
How to authenticate PlaywrightCrawler
Random disappearing requests
Errors when trying to send a request
DEBUG Error while disabling request interception {"error":{"name":"ProtocolError","message":"Protocol error (Network.setCacheDisabled): Target closed","stack":"ProtocolError: Protocol error (Network.setCacheDisabled): Target closed\n at new Callback (x/Connection.js:61:35)\n at CallbackRegistry.create (x/Connection.js:106:26)\n at Connection._rawSend (x/Connection.js:216:26)\n at CDPSessionImpl.send x/Connection.js:425:78)\n at NetworkManager._NetworkManager_updateProtocolCacheDisabled (x/NetworkManager.js:198:69)\n at NetworkManager._NetworkManager_updateProtocolRequestInterception (x/NetworkManager.js:191:119)\n at NetworkManager.setRequestInterception (x/NetworkManager.js:163:127)\n at CDPPage.setRequestInterception (x/Page.js:297:88)\n at disableRequestInterception (x/puppeteer_request_interception.js:221:16)\n at ObservableSet.onDelete (x/puppeteer_request_interception.js:207:31)"}}
WARN PuppeteerCrawler: Reclaiming failed request back to the list or queue. net::ERR_INVALID_ARGUMENT at https://www.x.com/grading/set_match/8897845
DEBUG Error while disabling request interception {"error":{"name":"ProtocolError","message":"Protocol error (Network.setCacheDisabled): Target closed","stack":"ProtocolError: Protocol error (Network.setCacheDisabled): Target closed\n at new Callback (x/Connection.js:61:35)\n at CallbackRegistry.create (x/Connection.js:106:26)\n at Connection._rawSend (x/Connection.js:216:26)\n at CDPSessionImpl.send x/Connection.js:425:78)\n at NetworkManager._NetworkManager_updateProtocolCacheDisabled (x/NetworkManager.js:198:69)\n at NetworkManager._NetworkManager_updateProtocolRequestInterception (x/NetworkManager.js:191:119)\n at NetworkManager.setRequestInterception (x/NetworkManager.js:163:127)\n at CDPPage.setRequestInterception (x/Page.js:297:88)\n at disableRequestInterception (x/puppeteer_request_interception.js:221:16)\n at ObservableSet.onDelete (x/puppeteer_request_interception.js:207:31)"}}
WARN PuppeteerCrawler: Reclaiming failed request back to the list or queue. net::ERR_INVALID_ARGUMENT at https://www.x.com/grading/set_match/8897845
running numerous scrapers from one start file with speed
requestHandler: [router, router2]
requestHandler: [router, router2]
Custom user agent playwright browser

RequestQueue.open issue in dockerized app
cookies help
Could not find file at storage/key_value_stores/default/SDK_SESSION_POOL_STATE.json
Maintain the same browser/scope
Cheerio Crawler inner text