Trying out Crawlee, Etsy not working
Hi Apify,
Thank you for this fine auto-scraping tool, Crawlee! I wanted to follow along with the tutorial but with a different URL, e.g. https://www.etsy.com/search?q=wooden%20box, and it failed with PlaywrightCrawler. It seems to fail at the "Checking device" step. I thought Crawlee injected a TLS fingerprint and a browser fingerprint, but it seems Etsy still blocks it with a 403! Thank you!
4 Replies
Try using proxies and bump the retry count a little.
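For anyone following along, here is a rough sketch of what that suggestion could look like. The helper name `buildRetrySettings`, the proxy URL, and the retry count are made up for illustration. The commented wiring assumes Crawlee v3, where `PlaywrightCrawler` accepts `proxyConfiguration` and `maxRequestRetries`; check the docs for your version.

```typescript
// Hypothetical helper: gathers the two settings suggested above in one place.
interface RetrySettings {
    proxyUrls: string[];
    maxRequestRetries: number;
}

function buildRetrySettings(proxyUrls: string[], maxRequestRetries = 5): RetrySettings {
    // Drop blank entries so an unset value does not turn into an empty proxy URL.
    const cleaned = proxyUrls.map((u) => u.trim()).filter((u) => u.length > 0);
    if (cleaned.length === 0) {
        throw new Error('At least one proxy URL is required');
    }
    return { proxyUrls: cleaned, maxRequestRetries };
}

// Placeholder proxy URL; replace with a real one from your provider.
const settings = buildRetrySettings(['http://user:pass@proxy.example.com:8000'], 8);

// With Crawlee installed, the settings would be passed in roughly like this
// (an assumption based on Crawlee v3 option names, not tested against Etsy):
//
//   import { PlaywrightCrawler, ProxyConfiguration } from 'crawlee';
//
//   const crawler = new PlaywrightCrawler({
//       proxyConfiguration: new ProxyConfiguration({ proxyUrls: settings.proxyUrls }),
//       maxRequestRetries: settings.maxRequestRetries,
//       requestHandler: async ({ page, log }) => {
//           log.info(await page.title());
//       },
//   });
//   await crawler.run(['https://www.etsy.com/search?q=wooden%20box']);
```

Note that more retries only help if each retry goes out through a different proxy or session; retrying from the same blocked IP will most likely return the same 403.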
rare-sapphireOP•4mo ago
Thanks azzouz, but I don't think it helped, since I can hit the URL with my real IP. It's a 403 every time: ERROR PlaywrightCrawler: Request failed and reached maximum retries. Error: Request blocked - received 403 status code.
I read into it further, and I think it's the interstitial device-check page that blocked Crawlee.
sensitive-blue•4mo ago
1. Confirm that you have correctly set and replaced the proxy in your scraper script.
2. Try changing the User-Agent.
3. Check whether it is related to headless browser characteristics.
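To try points 2 and 3 above, one option is a debugging run with a visible browser window and, optionally, an explicit User-Agent. This is only a sketch: `buildDebugOptions` is a hypothetical helper, and the `headless` and `launchContext.userAgent` option names are assumed from Crawlee v3's `PlaywrightCrawler` options. Crawlee's fingerprint injection may also set its own User-Agent, so verify which one is actually sent.

```typescript
// Hypothetical helper for a debugging run; option names assume Crawlee v3.
interface DebugOptions {
    headless: boolean;
    launchContext: { userAgent?: string };
}

function buildDebugOptions(userAgent?: string): DebugOptions {
    return {
        // Run headful to see whether the block depends on headless mode.
        headless: false,
        // Only set a User-Agent when one is given explicitly.
        launchContext: userAgent ? { userAgent } : {},
    };
}

const debugOptions = buildDebugOptions();

// With Crawlee installed, the options would be spread into the crawler
// (an assumption to check against the docs for your version):
//
//   import { PlaywrightCrawler } from 'crawlee';
//
//   const crawler = new PlaywrightCrawler({
//       ...debugOptions,
//       requestHandler: async ({ page, log }) => {
//           log.info(await page.title());
//       },
//   });
```

If the page loads in the headful run but still returns 403 in headless mode, that points to headless detection rather than the proxy or the User-Agent.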