Robots.txt
Hey, do you have any idea how to respect robots.txt? We must code that ourself?
2 Replies
afraid-scarletOP•10mo ago
I have made this scripts if you want
and to use it:
wise-white•10mo ago
Hey @Jourdelune thanks! I think this better belongs to Crawlee for Python repository issue section: https://github.com/apify/crawlee-python/
feel free to open an feat request/issue and propose the contribution, our team will be happily look into it 🙂
GitHub
GitHub - apify/crawlee-python: Crawlee—A web scraping and browser a...
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo...