Crawlee & Apify

CA

Crawlee & Apify

This is the official developer community of Apify and Crawlee.

Join

crawlee-js

apify-platform

crawlee-python

šŸ’»hire-freelancers

šŸš€actor-promotion

šŸ’«feature-request

šŸ’»devs-and-apify

šŸ—£general-chat

šŸŽgiveaways

programming-memes

🌐apify-announcements

šŸ•·crawlee-announcements

šŸ‘„community

firm-tan
firm-tan10/4/2023

SCRAPE IMAGES, TEX AND URLS FROM WEBSITES

Hi, been navigating trough the store and tried some actors, but i have not a found a scraper where if you put 1 or more urls, he will return back with images (not only the link) urls of subpages, and text data all toghether, If someone can drive me to something like this or maybe building one for me, thanks...
xenial-black
xenial-black10/4/2023

Invite users

Is there a way to privately invite or allow a user to be able to use run an actor without subscribing to it.
fair-rose
fair-rose10/4/2023

question about the Apify API

hey everyone, I just have one quick question. Is there a way to integrate a specific Apify actor into Sheets? thank you...
rising-crimson
rising-crimson10/3/2023

playwright crawler + firebase functions

Hey guys I have a firebase functions as endpoint that is suppose to run my playwright crawler but it outputs an error saying to run the following command: npx playwright install-deps Any help?...
sensitive-blue
sensitive-blue10/3/2023

how to get run cost and time from API?

I’m wondering if there’s any way to grab what you see in the consult regarding the time of a runner and a cost of a run and I’d like to be able to grab that we are the API in the JSON. I don’t see anything in the documentation and I’ve searched on discord here, but don’t say anything so curious if anybody has a way of how to do this or is it not possible?
ambitious-aqua
ambitious-aqua10/3/2023

"@crawlee/core"' has no exported member named 'RequestQueueOptions'.

hi, im using crawlee 3.5.5 followed the tutorial to run it on apify platform but at build time i have this error: node_modules/apify/index.d.ts:6:207 - error TS2724: '"@crawlee/core"' has no exported member named 'RequestQueueOptions'. Did you mean 'RequestOptions'? 2023-10-03T07:35:56.741Z 6 export { Dataset, DatasetDataOptions, DatasetIteratorOptions, DatasetConsumer, DatasetMapper, DatasetReducer, DatasetOptions, DatasetContent, RequestQueue, QueueOperationInfo, RequestQueueOperationOptions, RequestQueueOptions, KeyConsumer, KeyValueStoreOptions, RecordOptions, KeyValueStoreIteratorOptions, log, Log, LoggerOptions, LogLevel, Logger, LoggerJson, LoggerText, } from '@crawlee/core'; 2023-10-03T07:35:56.742Z...
wise-white
wise-white10/2/2023

How to bulk download content from the key-value store?

Hi community! I used the TikTok scraper to get some videos and now they are stored i nmy key-value store. There I only see how to download them one by one ... but acutally I would like to download them all at once. Can anybody let me know if this is possible and how?
afraid-scarlet
afraid-scarlet10/1/2023

Website content crawler - How to wait until network idle or all apis to finish

I tried using "Wait for dynamic content" to wait for the website to load but it doesnt seem to work, a lot of js content is still missing from the results, is there any other option for this? The url I'm using is https://aecinspire.com/
gradual-turquoise
gradual-turquoise9/30/2023

How do i use Woocommerce tool?

Im trying to learn how I use the woocommerce scraper, but I dont understand it. When I add the homepage, only 29 items display, what about all the other hundreds of products? And adding the sitemap didnt work either. Do I have to add each site manually??
gradual-turquoise
gradual-turquoise9/29/2023

Best way for adding ecommerce website to ai chatbot

Hi, I am looking for a solution where I can scrape and add products which are divided into categories, and the products have pictures and names of the product, and also store possibility to ask for product info. What would be the best way to do this? I am kinda new to this so all help is valuable!
xenial-black
xenial-black9/29/2023

Help regarding Apify Output schema nested fields

```json { "actorSpecification": 1, "name": "my_actor", "title": "MY Acror",...
wise-white
wise-white9/29/2023

How can I use an old version of an actor?

I'm using Facebook ads scrapper, but the scrapping information has changed, it's no longer bringing some information like collationCount or the domain, I'd like to use the old version of this actor, what can I do?
sensitive-blue
sensitive-blue9/28/2023

"failed to verify identity"

I started by having an account with an email. An error then occurred and I deleted my account and then created a new one. But when I had to create a new one, it said "failed to verify identity". I then tried to create a new account with a different email, but it said "failed to verify identity" again. I have written an email to Apify, but they have not responded and a few days have passed. Is there someone that can help?? plz
rare-sapphire
rare-sapphire9/28/2023

Apollo.io leads scraper. how to bypass block?

Error from apollo: Prospecting is blocked for 24 hours because you have violated our Terms of Services for the Unlimited plan. (Code 8) Our system identified a prospecting rate that is unusual for a human, and a limit was placed on your team to protect our database from any automations. ...
stuck-chocolate
stuck-chocolate9/28/2023

2 part Newbie Question

1. Using the google maps actor, can I scrape for only business that are perm/temp closed? Trying to clean my data. 2. What is the best way/actor to scrape all the date from this site? https://www.appyhourmobile.com...
optimistic-gold
optimistic-gold9/28/2023

Builds getting tagged incorrectly (Github Actions)

I've managed to set it up nicely according to instructions, and i have made 2 pipelines, with separate .yml-files. The issue is that both builds get tagged with beta in apify, the only way i got it to tag it with latest was when using the webhook (same webhook url mind you) directly inside github repo settings, not via Actions. It looks good inside Github actions, it runs the different .yml-files depending on branches like it should....
rising-crimson
rising-crimson9/27/2023

How to pass data (req.body) to an Actor?

I attached a screenshot showing my endpoint (express) that receives data from the request body, but I'm not sure how to pass that data down to my actor?
No description
jolly-crimson
jolly-crimson9/27/2023

question about instagram scrapers

Is it possible to access the source code behind the Instagram scrapers? I want to use these scrapers but have them store the scraped info directly into a postgres database, so having the source code would be useful to tweak and deploy the existing code.
harsh-harlequin
harsh-harlequin9/26/2023

Get tweets by date

Hi. Is it possible to get most recent tweets of a twitter user? Preferably since a datetime (e.g. from yesterday)?