Running crawlee multiple times with the same URL
Hi!
I am trying to build a crawler using PuppeteerCrawler. The crawler will be started by sending a POST to an API endpoint. The API is implemented using azure durable functions.
The first time I call the API it works as expected. The next time I call it I get no output. This is the log output on the second run:
How do I configure crawlee such that every call to the API runs a new crawl?
Here is my current implementation. This function is called from an orchestrator function.
2 Replies
You have to use “useExtendedUniqueKey” on your requests .
wise-whiteOP•2y ago
Thanks!
"useExtendedUniqueKey" didn't work for me, because in my case, the request urls are identical. But you put me on the right track, and I solved it by modifying the uniqueKey of each request, and adding an "invocationId" from azure, which is unique for each run.