Captcha detection?

How to detect captcha? I see this in the response HTML:
<head>
...
<meta name="captcha-challenge" content="1">
...
<head>
...
<meta name="captcha-challenge" content="1">
...
but I would prefer to use some function in Playwright/Crawlee. I mean, some generic way to detect captcha - who knows which variant of captha I will get in the future.... I can not use HTTP status - this page returns status=200 but it shows captcha!
1 Reply
exotic-emerald
exotic-emerald3y ago
You can try this - https://github.com/apify-projects/store-website-checker/blob/master/checker-cheerio/src/checkers.ts We might move it to Crawlee but these will always be a bit unstable because it can change
GitHub
store-website-checker/checkers.ts at master · apify-projects/store-...
Analyzes target website for anti-scraping protections and performance. Saves screenshots/HTML snapshots. - store-website-checker/checkers.ts at master · apify-projects/store-website-checker

Did you find this page helpful?