Web Scraping in Google Sheets: I replaced importXML with Make (Integromat) and ScrapeNinja

Поделиться
HTML-код
  • Опубликовано: 16 дек 2022
  • In this video I develop a simple low-code Make.com scenario which iterates over Google Sheets rows and scrapes websites from each row, using ScrapeNinja.net, and puts results back to the same Google Sheet.
    Why ImportXML is not perfect for web scraping: pixeljets.com/blog/importxml-...
  • СпортСпорт

Комментарии • 35

  • @JustPromptMe
    @JustPromptMe Год назад +1

    You changed my life man. Keep up the the great instruction.

    • @pixeljets
      @pixeljets  Год назад

      Thanks mate, I appreciate it.

  • @EmanueleCannizzaro
    @EmanueleCannizzaro Год назад

    Hello,
    thank you for the video.

  • @LuizFSAlmeida
    @LuizFSAlmeida 6 месяцев назад

    Geat video.

  • @double-H2
    @double-H2 Год назад +1

    Thanks for this, looks great. I'm just getting started and following along, but I don't see ScrapeNinja in the list when I try to 'Add Module'. I did subscribe to it via RapidAPI (free subscription to start). I must be missing a step, would appreciate any advice.

    • @pixeljets
      @pixeljets  Год назад +1

      Thanks! ScrapeNinja module was approved by Make team, but it will be available in public list only in a few weeks during next Make update. To use ScrapeNinja now, you need to click invitation link from the description of this video to see the module: eu1.make.com/app/invite/6a5739aa760491ee365289b800649846

    • @pixeljets
      @pixeljets  Год назад

      UPD: ScrapeNinja integration is now available in official Make integrations list: www.make.com/en/integrations/scrapeninja so you don't need to click invite link anymore. Yay!

  • @hoangphucnguyenma1945
    @hoangphucnguyenma1945 2 месяца назад

    I want to get the product price, how can I do that? Please help me

  • @henryadams4915
    @henryadams4915 Год назад

    I had this working a couple days ago, but not anymore. I tried everything... even started over and copied all your steps using the same URLs in my google sheet. No matter what I try, I get a ModuleTimeoutError for each operation before ScrapeNinja gives an output. Any tips?

    • @pixeljets
      @pixeljets  Год назад

      I think we figured it out over email. Thanks for reporting!

  • @ricard_o21
    @ricard_o21 8 месяцев назад

    Nice video! In my case i need to make a scrap of different news pages, blogs, etc; to collect these news, organize and compose it for my Newsletter with the Open AI API; I have been trying with ninja scrapper and make but I can’t get it to enter each of the written articles, it only takes the text that is outside, like titles, labels, etc. Any ideas? I want to automate the entire process of collecting and writing content from different websites

    • @pixeljets
      @pixeljets  6 месяцев назад

      did you see my another video related to content extraction pipeline? ruclips.net/video/hRQqJtgYz_Q/видео.html

  • @MannyBernabe
    @MannyBernabe Год назад

    How do I scrape data from a number of items listed on a page. For example, I want to see all companies in vc portfolio, scape name, url, etc. I'd like this in a spreadsheet. Can I use ScrapeNinja for that?

    • @pixeljets
      @pixeljets  Год назад +1

      Sure, ScrapeNinja can definitely be used for this kind of task. The implementation details depend on a particular webpage.

    • @MannyBernabe
      @MannyBernabe Год назад

      @@pixeljets Do you have documentation I can reference. I'm new to scraping. I tried, but could not get it work with scrapeNinja.

  • @olkam4803
    @olkam4803 2 месяца назад

    Hi! Can you help me? When I try to add scrapeNinja to “make” I have a problem with “creating connection”. Because I need RapidAPI key. But I don’t understand where I can generate this one.

  • @EmanueleCannizzaro
    @EmanueleCannizzaro Год назад

    Can you please share your list of automation service?
    An article that compare them would be great!

    • @pixeljets
      @pixeljets  Год назад

      sure, I have such an article: pixeljets.com/blog/zapier-make-com-pipedream-from-a-developer-perspective/

  • @nurmimika
    @nurmimika 6 месяцев назад

    Hi, thank you for the great video! Been looking for this kind of tutorial for web scraping. I'm running into cookie consent and cloudfare problems on some of the websites. If you havent visited the website, you're trying to scrape and they have a cookie consent, then the scraper only gets the data from the cookie consent. Or sometimes cloudfare stops the bot. Got any tips for these? Thank you again!

    • @pixeljets
      @pixeljets  6 месяцев назад

      Hi, thank you! I would try to use playwright/puppeteer with chrome extension (like "I dont care about cookies" extension) to auto-hide the consent. playwright.dev/docs/chrome-extensions

    • @nurmimika
      @nurmimika 6 месяцев назад

      @@pixeljets Thank you for the quick answer! I decided to go the hard way and found out that when making a scraper with python, it autopasses these problems and same with autogen + anaconda combo, which i definetly would recommend to check :)

    • @pixeljets
      @pixeljets  6 месяцев назад

      ​@@nurmimika thanks for sharing your experience! Using plain python (like, using requests module) will probably open another can of worms though. do you use autogen agents for real world scraping? how its going?

  • @markbenjamin7940
    @markbenjamin7940 Год назад

    How do i find my API key to add scrapeninja in Make?

    • @KillfeedBO2
      @KillfeedBO2 Год назад

      If you are on this page at 1:30 , click the button labled "Subscribe to test" then click the free subscription. Go back and scroll down and to the right youll see code. Look for 'X-RapidAPI-Key" Copy that, this is your ScrapeNinja Key.

    • @pixeljets
      @pixeljets  Год назад

      Hi! Get your key by subscribing: rapidapi.com/restyler/api/scrapeninja

  • @correaa2009
    @correaa2009 Год назад

    looks great video,
    please, how to get my api key?

    • @pixeljets
      @pixeljets  Год назад +1

      For ScrapeNinja, you can get the API key here: rapidapi.com/restyler/api/scrapeninja/

    • @correaa2009
      @correaa2009 Год назад

      @@pixeljets thanks boss

  • @rayfellers
    @rayfellers 10 месяцев назад

    Like so many this video's volume is far too low for the hard of hearing. Using CC is the only way I can folllow what's being done.

    • @nurmimika
      @nurmimika 6 месяцев назад

      Have you tried using headphones? If the problem still consists, i doubt it's because of the volume on this video. If i put system volume and RUclips video volume to max. i'm going to break my eardrums.

  • @user-iz9sj1nn5q
    @user-iz9sj1nn5q 11 дней назад

    3:06
    7:18
    7:52
    8:45
    10:18