Firecrawl•14mo ago

ideal input for `search` param in /map endpoint

hi @rafaelmiller @Caleb i was testing out the /map endpoint to fetch a given specific subset of links for a company's domain. so i wanted to understand what would an ideal query to the search param should look like ? for eg. if i want a company's global offices information i would ideally go to the page where locations , offices information is there, soo in the search query i passed location keyword as the param value. I got the website where the locations are present however i got additional > 50 websites which were not required at all. so i am hoping to narrow down the filtered list .. i mean one potential solution i can think of is adding an additional llm call on top of these filtered queries to further filter the specific web pages but i believe it will add an overhead in the service time ? another thing i noticed was given a url and search param value, if we run multiple iterations the total number of results is varied across iterations ( even though url and search param value is same ). This inconsistency might be an issue for enterprise solutions. the frequency of this is less and i am sure u guys might have thought of this or considered but still any visibility would surely help ! potential solution that i can think of is maybe implement a caching mechanism with a specified ttl value ? - update : i noticed u guys have already implemented this .. but what if i trigger the same query after the ttl ? will i get the same number of results ? sharing this sheet for some reference : https://docs.google.com/spreadsheets/d/1EmIxn6sVOheugBu1RASAadJab5C0OqysLmaYjTpb-i8/edit?usp=sharing

Google Docs

firecrawl testing

3 Replies

eciarla•14mo ago

Hi @lazycoderai ! Great question, ccing @Adobe.Flash here.

Caleb•14mo ago

Hey! This is a great question. Right now, we just return as many results as we can find. In the short term, you may want to pick only the first 10 results for example. Combinded with your solution, this would probably work well, even if not perfect. In the long term, we should probably add a relevancy threshold @Adobe.Flash On the second point, I’m not sure why you’re getting different results, we will look into it!

lazycoderaiOP•14mo ago

Thanks @Caleb for the response.

Gaming

Programming

ideal input for `search` param in /map endpoint

Did you find this page helpful?