Bypassing cookies consent

Hello everyone. I want to scrape data from Google Maps using Crawlee. However, after scraping the content of a certain tag, I realized that the content comes from the cookie consent page that Google Maps shows first. As some of you may know, if you visit Google Maps for the first time, you get a separate page asking you to accept cookies, and only after you click "Accept all" are you forwarded to Maps itself. How can I go straight to Google Maps and start scraping, either by bypassing the consent page or by accepting it automatically?
12 Replies
correct-apricot•3y ago
Do not use cookies or sessions. When you land on the consent page, just retry (assuming you are using proxies, so the retry will come from a new IP). You should be able to access the data within 5 retries at most.
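A minimal TypeScript sketch of the "retry on consent" idea. Everything here is illustrative: `isConsentPage`, `loadWithoutConsent` and the `fetchPage` callback are hypothetical names, and the check assumes the consent page is served from a `consent.google.*` host.

```typescript
// Assumption: Google's consent interstitial lives on a consent.google.* host.
function isConsentPage(url: string): boolean {
    return /^https?:\/\/consent\.google\.[a-z.]+\//i.test(url);
}

// `fetchPage` is a hypothetical callback: it should load the target URL
// through a fresh proxy IP, without stored cookies or a reused session,
// and resolve to the final URL after redirects.
async function loadWithoutConsent(
    fetchPage: (attempt: number) => Promise<string>,
    maxRetries = 5,
): Promise<string> {
    // One initial attempt plus up to `maxRetries` retries.
    for (let attempt = 0; attempt <= maxRetries; attempt++) {
        const finalUrl = await fetchPage(attempt);
        if (!isConsentPage(finalUrl)) return finalUrl;
    }
    throw new Error(`Still on the consent page after ${maxRetries} retries`);
}
```

In Crawlee you would normally not write the loop yourself: throwing an error from the request handler when the consent page is detected makes the crawler retry the request, up to the `maxRequestRetries` option.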
wise-whiteOP•3y ago
I'm sorry, I don't really understand. I am a beginner when it comes to scraping, is it possible to explain it like I'm 5?
adverse-sapphire•3y ago
Well, I'm not Alexey, but I can try to explain, step by step. 1. Are you using a pool of "rotating" proxies? Rotating means that every time you make an HTTP request, the target website (Google Maps or some other website) sees the request coming from a DIFFERENT IP. Which rotating proxies are you using? What is the name of the service?
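To make "rotating" concrete, here is a small TypeScript sketch of round-robin rotation over a list of proxy URLs. The `ProxyRotator` class and the proxy URLs are made up for illustration; a real list comes from whichever proxy service you use.

```typescript
// Hypothetical proxy URLs; replace with the ones from your provider.
const proxyUrls: string[] = [
    'http://user:pass@proxy-1.example.com:8000',
    'http://user:pass@proxy-2.example.com:8000',
    'http://user:pass@proxy-3.example.com:8000',
];

class ProxyRotator {
    private index = 0;
    private readonly urls: string[];

    constructor(urls: string[]) {
        if (urls.length === 0) throw new Error('At least one proxy URL is required');
        this.urls = [...urls];
    }

    // Returns the next proxy URL, wrapping around at the end of the list,
    // so consecutive requests leave through different IPs.
    next(): string {
        const url = this.urls[this.index];
        this.index = (this.index + 1) % this.urls.length;
        return url;
    }
}

const rotator = new ProxyRotator(proxyUrls);
```

You usually do not need to write this yourself: Crawlee's `ProxyConfiguration` accepts a `proxyUrls` list and rotates through it for you, and many paid services rotate IPs behind a single proxy URL.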
wise-whiteOP•3y ago
Thanks for the explanation. I'm not using proxies. But somewhere in the Crawlee documentation it says I can configure proxies, right?
adverse-sapphire•3y ago
"I'm not using proxies."
That is mistake number 1.
adverse-sapphire•3y ago
Go and read about using (rotating) proxies for scraping/crawling. You can start here https://developers.apify.com/academy/anti-scraping/mitigation/proxies#understanding-proxy-links
Apify
Proxies · Apify Developers
Learn all about proxies, how they work, and how they can be leveraged in a scraper to avoid blocking and other anti-scraping tactics.
adverse-sapphire•3y ago
Using (rotating) proxies almost always means you PAY somebody to provide them. We discuss such services here: https://discord.com/channels/801163717915574323/1060179502392684594 and I am using smartproxy.com. So, pick something...
absent-sapphire•3y ago
@1chbinamin See https://blog.apify.com/step-by-step-guide-to-scraping-google-maps/ As Apify puts it: "One slight caveat is that it's preferable to scrape such a huge website as Google Maps by using proxies, that way, it's faster and more efficient"
Apify
How to scrape Google Maps
Extract data without limits with this unofficial Google Maps API.
continuing-cyan•3y ago
The consent screen on Google Maps always appears for EU proxies, so just use US-only proxies. If you do get the consent screen, you need to click through it dynamically. The scraper in Apify Store handles all of this: https://apify.com/compass/crawler-google-places
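A rough TypeScript sketch of "dynamically clicking" the consent screen. The `PageLike` interface and `acceptConsentIfShown` are hypothetical names; the interface only lists the two methods used here, which both Playwright's and Puppeteer's `Page` provide. The button selector is an assumption: the real markup changes over time and the label depends on the page language, so inspect the consent page and adjust it.

```typescript
// Minimal shape of a browser page, so the helper does not depend on a
// specific browser library.
interface PageLike {
    url(): string;
    click(selector: string): Promise<unknown>;
}

// Assumption: the "Accept all" button can be found by its aria-label.
const ACCEPT_ALL_SELECTOR = 'button[aria-label="Accept all"]';

// Clicks the consent button if the page is on a consent.google.* URL.
// Returns true when a click was made, false when no consent page was shown.
async function acceptConsentIfShown(
    page: PageLike,
    selector: string = ACCEPT_ALL_SELECTOR,
): Promise<boolean> {
    if (!page.url().includes('consent.google.')) return false;
    await page.click(selector);
    return true;
}
```

You would call this at the start of the request handler and, when it returns `true`, wait for the redirect back to Maps before reading any data.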
wise-whiteOP•3y ago
Thank you everyone. I understand what I have to do now.
rival-black•2y ago
Did you ever solve this @1chbunamin ?
