crawlee-js
apify-platform
crawlee-python
💻hire-freelancers
🚀actor-promotion
💫feature-request
💻devs-and-apify
🗣general-chat
🎁giveaways
programming-memes
🌐apify-announcements
🕷crawlee-announcements
👥community
New Tab
Efficient css selectors
How to transfer data between playwrightcrawler and cheeriocrawler?
Ignore previously crawled URLs
How to make Puppeteer crawler ignore errors on page?
chromium.launchpersistentcontext with crawlee
Page.goto never resolves in headful (using XVFB) using `apify/actor-node-puppeteer-chrome` Docker
puppeteer.page.goto
never resolves in a page load event (either load
or any of the other events).
We are not seeing this behavior when we run the same script (using chromium116, puppeteer21, and the latest version of crawlee) outside of a Docker container. We also don't see this behavior on the Apify platform using Actors, but currently can't use the service due to our security requirements.
Happy to share more detail but would appreciate any ideas on where to look. Thanks!...Throw error that respects maxRequestRetries
Basic Crawlee how do I use my own proxies?
proxyConfiguration
to exist, got [object Object]
in object BasicCrawlerOptions
'How to run cheerio crawler with Bun?

Webscraper.io
Playwright crawler failing when element is not found
page.locator
functions to find elements and scrap text from them. Most of the elements are always on the page, but few elements like reviews are not always there since the product may be new and doesn't have any review yet. That would be no problem at all if not the playwright / crawlee failing because of it. What I saw is that when page.locator
can't find an given element it throws an error - that's okay. But crawlee is picking this error as like "the whole page error" and marks request to the page as failed. Even though other locators are working and there's a lot of data that has been found with other page.locator
I'm getting messages that request to url someshop/product-55 failed. How can I somehow fix this and tell crawlee / playwright to not fail if the page.locator
fails? I'm okay with having empty string if there's no reviews found, but I'm not okay with igoring other data because of one page.locator
failure. Example code:
`const a = await page
.locator(a_locator)
.textContent(); // element found...Sorting Quora's questions by number of answers and views
Parse facebook posts in group
How to open multiple browsers?
TSConfig in Crawlee projects.
Cannot find module 'crawlee'. Did you mean to set the 'moduleResolution' option to 'nodenext', or to add aliases to the 'paths' option?ts(2792)
Cannot find module 'crawlee'. Did you mean to set the 'moduleResolution' option to 'nodenext', or to add aliases to the 'paths' option?ts(2792)
how deep can website-content-crawler go?
Crawlee request handler no access to class functions in NestJS
Set debug breakpoint in VS Code