Unable to crawl

Hi, I'm trying to crawl a few pages, with this script: Now, I don't know if I'm doing something wrong, but I'm getting timeouts and not crawling at all sometimes. The script will run, but not log any pdf, or link, it just says it crawled the two pages, and that's it. Someone, please help.
2 Replies
statutory-emerald
statutory-emerald2y ago
Hi. You can start by running the crawler with headless: false to see what's happening on the page. Also I see you are navigating to the request url in the request handler function, you can remove that as it is done automatically by the crawler
national-gold
national-goldOP2y ago
Hey man, thanks. I found out that the sites I want to scrape are totally fine with CheerioCrawler, it was easier that way. I'm still finding my way with the amount of options (I don't know what most them do tbh)

Did you find this page helpful?