/scrape endpoint fails with 500, can I see the error message somewhere in the dashboard?
Like the title says. I fired a call to the /scrape endpoint with a URL and got a 500 error. Am I allowed to post the job ID and/or the URL of the page?
Hey @endless90 can you share which URL you are having trouble with? I can see if I can reproduce it.
I'm trying to scrape the "unsere Wochenkarte" (our weekly menu) section of this page: https://www.cafebar-central.de/speisekarte
I'm using the API. I'll paste the prompt and settings here later. Thanks for the help.
Sounds good! Once you share, I can try to reproduce it locally.
This is the request with actions. Sadly it returns a 500 because it can't find the element. Maybe you have a hint for me why it can't be found.
The other request, without actions, now just times out. Maybe it is a problem with Charles Proxy. I'll disable the proxy and try again.
Yeah, that worked. But sadly without actions, so I cannot click on the second tab on the page.
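(For reference, a click on the tab via the actions parameter would look roughly like this. This is only a sketch: it assumes the hosted v1 /scrape endpoint, and the tab selector below is a placeholder, not the one from the attached request.)

import requests

API_KEY = "fc-..."  # placeholder Firecrawl API key

payload = {
    "url": "https://www.cafebar-central.de/speisekarte",
    "formats": ["markdown"],
    "actions": [
        # hypothetical selector for the "Wochenkarte" tab -- replace with the real one
        {"type": "click", "selector": "a[data-hook='tab-title']"},
    ],
}

resp = requests.post(
    "https://api.firecrawl.dev/v1/scrape",  # assuming the hosted v1 endpoint
    headers={"Authorization": f"Bearer {API_KEY}"},
    json=payload,
    timeout=120,
)
print(resp.status_code)
print(resp.json())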
This is interesting: when I use the playground, I get the "Wochengerichte" without any actions or any settings. You can see that because it got the "Asia Nudelsuppe", which is on the second page.
When I run my code and send the prompt over the API, I only get the first page. Maybe I have to add an action which waits for 2 seconds.
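(A 2-second wait would just be another entry in the actions list, placed before the click. Building on the payload from the sketch above, and still assuming the v1 actions format:)

payload["actions"] = [
    {"type": "wait", "milliseconds": 2000},  # give the Wix iframe time to render
    {"type": "click", "selector": "a[data-hook='tab-title']"},  # still a placeholder selector
]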

Hmm, sadly that did not work.
@endless90 to debug why it's not finding the link, I would try two things (there's a sketch after this list):
- use the "screenshot" action so that you can see what Firecrawl sees: https://docs.firecrawl.dev/advanced-scraping-guide#screenshot
- ask for the "rawHtml" of the page, so that you can confirm that the HTML element you are looking for is actually there
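(A debug request combining both ideas might look like the following, reusing API_KEY from the earlier sketch. The rawHtml format and the screenshot action are the ones named above; the exact response layout is an assumption based on the docs.)

debug_payload = {
    "url": "https://www.cafebar-central.de/speisekarte",
    "formats": ["rawHtml"],      # confirm the element is actually in the served HTML
    "actions": [
        {"type": "screenshot"},  # see what Firecrawl sees when the page loads
    ],
}

resp = requests.post(
    "https://api.firecrawl.dev/v1/scrape",
    headers={"Authorization": f"Bearer {API_KEY}"},
    json=debug_payload,
    timeout=120,
)
data = resp.json().get("data", {})  # assumed layout: {"success": ..., "data": {...}}
print(data.keys())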
Another idea is trying a simpler selector:
#menu0c689dce-b142-432f-bcdd-ddb0e29edfc1
Seems brittle since I'm not sure what will cause the UUID in there to change, but depending on what your use case is, this may be sufficient.
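(Swapping that selector into the click action from the earlier sketch would look like this:)

payload["actions"] = [
    {"type": "click", "selector": "#menu0c689dce-b142-432f-bcdd-ddb0e29edfc1"},
]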
Damn, seems like I get a timeout now. I added rawHtml, html and the screenshot action.
That's the job ID: c8047128-da10-4d4b-81ae-4d88f7b3fd95
Maybe rawHtml and html together are too much for the crawler.
I tested a couple of requests, and I think it has something to do with the Wix page. When I don't wait for it to load, it sometimes works. When I add a wait action > 1 second, it crashes with a 500 error. If I use my private page, it works like a charm.


Hey @micah.stairs, I saw Firecrawl had a couple of updates. I tried it again, but sadly I still get a timeout error when I try to scrape the Wix iframe page.
I attached my request
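(Related to the original question about seeing the error message: the HTTP response body often says more than the bare status code, so it's worth printing it on failure. A minimal sketch, assuming the same requests-based call as above; if the API returns a JSON error body it is printed as parsed JSON, otherwise the raw text is shown.)

resp = requests.post(
    "https://api.firecrawl.dev/v1/scrape",
    headers={"Authorization": f"Bearer {API_KEY}"},
    json=payload,
    timeout=120,
)
if not resp.ok:
    try:
        print(resp.status_code, resp.json())  # may explain which action or selector failed
    except ValueError:
        print(resp.status_code, resp.text)    # body wasn't JSON, print it raw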