Firecrawl•8mo ago

So what do you guys suggest in this

So what do you guys suggest in this scenario? I would like to keep using Firecrawl, but if it can't crawl some domains, that's a real problem for me.

6 Replies

ayang.•8mo ago

You can try allowing backward links.

YouKn0wWhoOP•8mo ago

But I don't want backward-linked pages.

ayang.•8mo ago

Just to clarify what you need: do you need to crawl through all the sublinks of https://docs.tokenterminal.com/ ?


ayang.•8mo ago

Not sure why, but for this link, if you allow backward links, it looks like it gets all the pages it misses from just crawling.

This would work if your need is to get all the sublinks of the website.

Just to clarify, allowing backward links won't follow links to external websites.

YouKn0wWhoOP•8mo ago

So what should be the general algorithm? According to Firecrawl's documentation, backward links are only needed if we want to scrape a URL which is not a sublink of the starting URL, but in this case all of them are sublinks.

I sometimes need to ignore backward links as well. For example, for https://medium.com/etherfi I don't want to get all possible medium.com URLs, only the etherfi posts.

Yeah, for the tokenterminal docs, allowing backward links works, but for https://medium.com/etherfi I can't seem to make it work.
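For anyone reading later, here is a rough sketch of one way a "general algorithm" for crawl scope could look. It is not Firecrawl's actual implementation: the function `in_scope` and its parameters `allow_backward` and `include_patterns` are hypothetical names made up for illustration. The idea is to never leave the starting host, to stay under the starting path by default, and, when backward links are allowed, to narrow the result again with path patterns.

```python
import re
from urllib.parse import urlparse


def in_scope(link, start_url, allow_backward=False, include_patterns=None):
    """Decide whether `link` should be crawled (hypothetical helper, not Firecrawl's API)."""
    start = urlparse(start_url)
    target = urlparse(link)

    # Links to other hosts are always out of scope in this sketch.
    if target.netloc.lower() != start.netloc.lower():
        return False

    path = target.path or "/"

    # If path patterns are given, they alone decide which same-host pages are kept.
    if include_patterns:
        return any(re.search(pattern, path) for pattern in include_patterns)

    # Backward links allowed: any page on the same host is in scope.
    if allow_backward:
        return True

    # Default: only pages at or below the starting path.
    base = start.path.rstrip("/")
    return path == base or path.startswith(base + "/")
```

With this kind of rule, the tokenterminal docs case would use `allow_backward=True` with no patterns, while the Medium case would add something like `include_patterns=[r"^/etherfi(/|$)"]` so that other medium.com pages are dropped. Whether Firecrawl exposes equivalent options is something to check in its documentation.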

ayang.•8mo ago

I'm not a dev or too technical, but let me look at the source code to see if I can understand it and give you an answer on that, because I'm not sure either.
