<head> on html content
Hey, is there a way I can get the <head> and scripts in it. I have a scenario where I want to check for any markups in there.
5 Replies
@Community Moderator anyone you know that can help with this?
Use format
rawHtml
thanks @mogery I still don't see <script> elements within the rawHtml. Is that expected?
I tried setting onlyMainContent to false as well
https://github.com/mendableai/firecrawl/blob/main/apps/api/src/scraper/scrapeURL/lib/removeUnwantedElements.ts -- probably related to line 92 here. can you help confirm
GitHub
firecrawl/apps/api/src/scraper/scrapeURL/lib/removeUnwantedElements...
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. - mendableai/firecrawl
I want to propose that at least the ld+json script is tagged as a metadata just like the rest of the meta tags
@mogery any ideas/
@mogery ping ping 🙂