Images in PDF

Hello, when we try to scrape a PDF, it gets image links for correct locations but the links are sadly not working which results on the response in the image. What can we do?
No description
7 Replies
micah.stairs
micah.stairs6d ago
Hmm, they must use short-term image links when rendering their PDFs. I logged a feature request for Firecrawl to host the images for you and return those instead. I'll let you know if/when we implement it.
Okan
OkanOP6d ago
Thank you! @micah.stairs We are using one of these plans, not sure but if enterprise can help for that, please let us know, so that we can contact you
No description
micah.stairs
micah.stairs6d ago
Nothing to do with enterprise! If we build this feature, we would likely include it as a general feature.
Okan
OkanOP2d ago
Thank you! @micah.stairs great to hear Hello @micah.stairs is there any workaround we can do till it is released? Maybe doing multiple calls etc.
micah.stairs
micah.stairs2d ago
Hmm I can't think of a good workaround, since the issue is that the site serving the PDF is only using short-lived links to render the images (if I'm understanding the issue correctly). If you share the PDF URL, I will take a closer look to confirm.
Okan
OkanOP2d ago
Example PDF: the-guild.eu/publications/position-papers/the-guild-s-position-paper-on-the-use-of-ai-in-research_nov2024.pdf THE URL to image: https://www.the-guild.eu/publications/position-papers/images/23dc0b86a720a02fff852d34bed200c17e190396db045212a3403fbc93491909.jpg It is actually happening for any PDF I am trying
micah.stairs
micah.stairs2d ago
Oh thanks for flagging! We will definitely dig into this.

Did you find this page helpful?