Firecrawl · 13mo ago
Kaleb

`format: ["links"]` doesn't respect `excludeTags`

`formats: ["links"]` does not appear to respect `excludeTags`. For example:
const INPUT_URL = "https://www.jndla.com/cases/class-action-administration";

const response = await app.scrapeUrl(INPUT_URL, {
  formats: ["links"],
  excludeTags: ["a"],
});

console.log(response.links);
// this still has links, even though we excluded the <a> tags
Since we excluded <a> tags, I expected Firecrawl to return an empty response.links array. Instead, we still get the entire list of links on the page.
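To make the expectation concrete, here is a minimal sketch of the behaviour I had in mind: remove the excluded elements first, then collect links from whatever is left. `linksAfterExclusion` and `sampleHtml` are hypothetical names made up for illustration and are not part of the Firecrawl SDK. The regex-based parsing is only an approximation (plain tag names only, double-quoted `href` values only, no nested elements of the same tag), and I am not claiming this is how Firecrawl processes pages internally.

```typescript
// Hypothetical helper, NOT part of the Firecrawl SDK: illustrates the behaviour
// this report expects when excludeTags is combined with the "links" format.
function linksAfterExclusion(html: string, excludeTags: string[]): string[] {
  let remaining = html;
  for (const tag of excludeTags) {
    // Only plain tag names are handled in this sketch; anything else is skipped.
    if (!/^[a-zA-Z][a-zA-Z0-9-]*$/.test(tag)) continue;
    // Drop each <tag ...>...</tag> element. Nested elements of the same tag
    // are not handled correctly by this regex-based approximation.
    const element = new RegExp(`<${tag}\\b[^>]*>[\\s\\S]*?</${tag}\\s*>`, "gi");
    remaining = remaining.replace(element, "");
  }

  // Collect double-quoted href values from the <a> elements that survive.
  const links: string[] = [];
  const anchor = /<a\b[^>]*?\bhref\s*=\s*"([^"]*)"/gi;
  let match: RegExpExecArray | null;
  while ((match = anchor.exec(remaining)) !== null) {
    links.push(match[1]);
  }
  return links;
}

// Made-up sample markup, not taken from the page in the report.
const sampleHtml =
  '<nav><a href="/home">Home</a></nav><p>See <a href="/cases">cases</a>.</p>';

console.log(linksAfterExclusion(sampleHtml, []));      // no exclusions: both links
console.log(linksAfterExclusion(sampleHtml, ["nav"])); // link inside <nav> is dropped
console.log(linksAfterExclusion(sampleHtml, ["a"]));   // every <a> is dropped: empty array
```

With `excludeTags: ["a"]` the sketch returns an empty array, which is the result I expected from `response.links` in the scrape above.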
1 Reply
mogery · 13mo ago
Hey, thanks for bringing this up. I've opened a GitHub issue for tracking this one: https://github.com/mendableai/firecrawl/issues/701
