Unable to crawl
I'm unable to crawl this website: https://www.coditas.com
This is my req body:
{
"url": "https://www.coditas.com",
"limit": 5,
"scrapeOptions": {
"formats": [
"markdown"
],
"waitFor": 1000
}
}
But when I tried on firecrawl playground it was able to crawl, I'm running it using docker.
7 Replies
I have one more question: If I want to scrape the homepage using a crawler, how can I do that? I am also passing some specific URLs, so I assume that I cannot include '/' in them. Is that correct?
I am having a similar issue on another website self hosted on Docker
response is
This is working on the playground though
@Moderator Any update on it?
@Arbaz are you self hosting it or not?
Yes, I've self-hosted it
We will take a look. Just opened a github issue: https://github.com/mendableai/firecrawl/issues/625
GitHub
[Self-host] Unable to crawl certain pages · Issue #625 · mendableai...
(from discord) +2 This only happens in the self hosted version: Unable to crawl this website: https://www.coditas.com/ This is my req body: { "url": "https://www.coditas.com/", ...
Hey @Arbaz @Benos we've made several improvements to our self-host codebase. Could you please update your local repository and give it another try? Additionally, feel free to check out the self-host guide for further assistance.
Let me know how it goes!
Firecrawl Docs
Self-hosting | Firecrawl
Learn how to self-host Firecrawl to run on your own and contribute to the project.