/scrape Endpoint fails with 500, can i see the error message somewhere in the dashboard?

Like the title says. I fired a call to the /scrape endpoint with an url and get a 500 error. Am i allowed to post the job id and or the url to the page?
15 Replies
micah.stairs
micah.stairs5mo ago
Hey @endless90 can you share which URL you are having trouble with? I can see if I can reproduce it.
endless90
endless90OP5mo ago
endless90
endless90OP5mo ago
iam trying to scrape the "unsere wochenkarte" section of this page https://www.cafebar-central.de/speisekarte
central
Wochenkarte/Speisekarte | central
Speisekarte - Tageskarte - Wochenkarte "regional• bio• mediterran • saisonal • vegan•Frühstück•Mittagstisch•hausgemachte Kuchen•welchselnde Tageskarte• erlesene Weine• glutenfrei• Gutscheine • Kaffeespezialitäten• täglich geöffnet • Nachhaltigkeit• wir kochen frisch ..... und vieles mehr"
micah.stairs
micah.stairs5mo ago
@endless90 I was able to scrape it fine without any errors: https://www.firecrawl.dev/app/playground?url=https%3A%2F%2Fapps.wixrestaurants.com%2F%3Ftype%3Dwixmenus.client%26pageId%3Dc1aeq%26compId%3Didgzhkta_0%26viewerCompId%3Didgzhkta_0%26siteRevision%3D230%26viewMode%3Dsite%26deviceType%3Ddesktop%26locale%3Dde%26tz%3DEurope%252FBerlin%26regionalLanguage%3Dde%26width%3D980%26height%3D4398%26instance%3DmjK7u6npsH7USXwj657ONoCpQWXNmE7A9IEJKYPOjv0.eyJpbnN0YW5jZUlkIjoiMmRjMWNiOTUtZTc3ZS00ZThhLThhNDYtZGZhNjRhYWZhZjk0IiwiYXBwRGVmSWQiOiIxM2MxNDAyYy0yN2YyLWQ0YWItNzQ2My1lZTdjODllMDc1NzgiLCJtZXRhU2l0ZUlkIjoiMzljYjIwN2ItYTc2OC00NmNkLWE4MjUtYmJlYzViOGYwZWZiIiwic2lnbkRhdGUiOiIyMDI1LTA1LTI0VDA3OjIwOjQxLjYzOFoiLCJkZW1vTW9kZSI6ZmFsc2UsIm9yaWdpbkluc3RhbmNlSWQiOiI2N2UzYzM5ZS03NzZiLTQ0YTQtYWM5OS1hNzQ4NTU2MWMxMmIiLCJhaWQiOiIwMWNkNDZmNS03MDViLTQyZmEtOTFkNi02NTlmNGQ3NTkzYTYiLCJiaVRva2VuIjoiMTQwYWViZWUtNDAxNi0wODQ3LTIyNjMtNjQ0YTExMjBhMTZmIiwic2l0ZU93bmVySWQiOiI3NDVjNDE1ZS0xZGJlLTRiZGQtOWRhOC03YWY3YzRlOGIyMjQiLCJicyI6Ii1IM3JFbWxQYl8yVFlOSHVIQTNPdWZ0N0ltZ3NqU2xoQURwTFl6QlNiSmciLCJzY2QiOiIyMDIwLTA2LTE2VDEzOjMwOjUzLjczM1oifQ%26currency%3DEUR%26currentCurrency%3DEUR%26commonConfig%3D%257B%2522brand%2522%253A%2522wix%2522%252C%2522host%2522%253A%2522VIEWER%2522%252C%2522bsi%2522%253A%2522d5f9be0d-40a8-4446-83d3-df00a144372c%257C1%2522%252C%2522siteRevision%2522%253A%2522230%2522%252C%2522renderingFlow%2522%253A%2522NONE%2522%252C%2522language%2522%253A%2522de%2522%252C%2522locale%2522%253A%2522de-de%2522%252C%2522BSI%2522%253A%2522d5f9be0d-40a8-4446-83d3-df00a144372c%257C1%2522%257D%26currentRoute%3D.%252Fspeisekarte%26target%3D_top%26section-url%3Dhttps%253A%252F%252Fwww.cafebar-central.de%252Fspeisekarte%252F%26vsi%3D7d42948a-bf40-4988-9933-47f5b26872e1&mode=scrape&limit=10&excludes=&includes=&formats=markdown&onlyMainContent=true&excludeTags=&includeTags=&proxy=undefined&includeSubdomains=true&mapSearch=&sessionId=fc-33e2-pL2bEaz4bWmyJlb7NtmG&uniqueKey=1748110346207 I'm curious, are you using the playground or the API?
endless90
endless90OP5mo ago
the api. ill later paste the prompt and settings here. thanks for the help
micah.stairs
micah.stairs5mo ago
Sounds good! Once you share, I can try to reproduce it locally.
endless90
endless90OP5mo ago
this is the request with actions. sadly this returns 500 because it cant find the element. maybe you have a hint for me why it cant be found.
endless90
endless90OP5mo ago
the other request withour actions now just times out. maybe it is a problem with charles proxy. ill disable the proxy and try again
endless90
endless90OP5mo ago
yeah that worked. but sadly without actions so i cannot click on the second tab on the page.
endless90
endless90OP5mo ago
this is interesting. when i use the playground ill get the "wochengerichte" without any actions or any settings. you can see that because it got the "Asia Nudelsuppe" which is on the second page. when i run my code and send the prompt over the api i only get the first page. maybe i have to add one actions which waits for 2 seconds.
No description
endless90
endless90OP5mo ago
hmm sadly that did not work
micah.stairs
micah.stairs5mo ago
@endless90 to debug why it's not finding the link, I would try two things: - use the "screenshot" action so that you can see what Firecrawl sees: https://docs.firecrawl.dev/advanced-scraping-guide#screenshot - ask for the "rawHtml" of the page, so that you can confirm that the HTML entity that you are looking for is actually there Another idea is to trying a simpler selector: #menu0c689dce-b142-432f-bcdd-ddb0e29edfc1. Seems brittle since I'm not sure what will cause the UUID in there to change, but depending on what your use case is, this may be sufficient.
endless90
endless90OP5mo ago
damn seems like i get a timeout now. i added rawHtml, html and the screenshot action. thats the job id c8047128-da10-4d4b-81ae-4d88f7b3fd95 maybe raw html and html are too much for the crawler
endless90
endless90OP5mo ago
I tested a couple of requests. And i think it has to do something with the wix page. When i dont wait for it to load it sometimes work. When i add a wait action > 1 second it crashes with an 500 error. if i use my private page it works like a charm
No description
No description
endless90
endless90OP4mo ago
hey @micah.stairs i saw firecrawl had a couple of updates. now i tried it again but sadly i still get an timeout error when i try to scrape the WIX iframe page. I attached my request

Did you find this page helpful?