Puppeteer: Headless Automated Testing, Scraping, and Downloading

  • Published: Jul 24, 2024
  • This tutorial walks you through everything you need to know about Puppeteer and headless browsers, so you can automate website testing, web scraping, fetching and downloading content, and more.
    code from video: github.com/prof3ssorSt3v3/pup...
    Puppeteer Reference: pptr.dev/
    Puppeteer Extra Stealth Plugin: www.npmjs.com/package/puppete...
    Chrome Dev Tools Protocol: chromedevtools.github.io/devt...
    Pages for testing headless status:
    arh.antoinevastel.com/bots/ar...
    bot.sannysoft.com/
    0:00:00 intro and install
    0:03:26 basic commands
    0:18:50 interface testing
    0:47:26 web scraping
    1:05:33 downloading images and files

Comments • 51

  • @user-tj6py8sk9x
    @user-tj6py8sk9x 8 months ago +5

    40-year-old guy here diving into frontend. I'm only 15 mins in and I've already learned so much! Thank you, Steve. You're awesome!

  • @patrickconrad396
    @patrickconrad396 10 months ago

    Dude, for years I've looked for great in-depth Node content, and I always come across your videos, and they are so freaking helpful. Thanks for everything.

  • @Chris-qg6kc
    @Chris-qg6kc 1 year ago +7

    I love and appreciate what you bring to the table. Very refined and polished teaching technique with a good voice.

  • @fredthefed3144
    @fredthefed3144 1 year ago +2

    Felt like the Bob Ross of coding. Very informative. Concise and clear. Great presentation. TY!

  • @barungh
    @barungh 10 months ago +1

    Awesome. I mean, saving the screenshot along with the data in a JSON file is a really nice approach to web scraping, and you showed both in your tutorial. Thanks!

  • @loydcose2780
    @loydcose2780 10 months ago

    I'm absolutely thrilled that I stumbled upon this video. It's exactly what I was looking for. Thank you for creating such fantastic content! 🙌🙌

  • @real23lions
    @real23lions 7 months ago +3

    Excellent video. I've been trying to get into web scraping for a few months and the slow and methodical way of teaching really helped. Thanks from a self-taught programmer.

  • @picozzimichele
    @picozzimichele 7 months ago

    One of the best videos about this so far, explained very well!

  • @MikeChalmers
    @MikeChalmers 2 months ago

    What a brilliant tutorial. This is the clearest and most engaging coding video I can remember. Thanks!

  • @BitWizCoder
    @BitWizCoder 10 months ago

    Thanks for the video, really enjoyed it. Can't wait to try some scraping with Puppeteer.

  • @kranzaklilu496
    @kranzaklilu496 1 year ago

    Loving the new thumbnails. Thanks for your work

  • @RafaelSales55
    @RafaelSales55 8 months ago

    I'm in my first job, and because of you I could learn how to use Puppeteer. I hope I can learn more with you, good job and thank you a lot!

  • @kubataiupov1245
    @kubataiupov1245 5 months ago

    Awesome stuff, man. Awesome explanations. Thanks a lot.

  • @brucewayne252
    @brucewayne252 1 year ago

    Learning something new with this vid. Thanks Steve, again!

  • @israelortiz8820
    @israelortiz8820 1 year ago

    Man, you always do great content. Thanks a lot for sharing your knowledge. 👍

  • @BunniesAI
    @BunniesAI 10 months ago

    Very, very good walkthrough. Loved it 🫶🏻

  • @michaellavelle7354
    @michaellavelle7354 5 months ago

    Great job Steve. Thanks.

  • @_ademmeral
    @_ademmeral 1 year ago

    Great tuto, Sir

  • @seanduignan294
    @seanduignan294 1 year ago

    I built a few tools with this framework a year or so ago for work. It forced me to learn some JavaScript properly, from your channel. FYI: have a look at Playwright.

  • @EltonSantanaRockerForLife
    @EltonSantanaRockerForLife 6 months ago

    wow, thank you so much for this!

  • @isaacdruin
    @isaacdruin 1 year ago

    Man, your channel is fantastic.

  • @AlexBodrovFamily
    @AlexBodrovFamily 7 months ago

    So nice! I didn't know such a headless browser existed, and I'm 36 now :( So it looks like you could easily write end-to-end tests with it without using any framework? I can also think of a Puppeteer + ChatGPT API combo. Thanks for this super clear and easy-to-grasp content, Mr. Steve!

  • @greatnile
    @greatnile 1 year ago

    Thanks a lot for the valuable video.
    A question in a different context:
    Are you planning to make a deep-dive video on the new OPFS (origin private file system) APIs?

  • @sharvilprabhudesai3210
    @sharvilprabhudesai3210 4 months ago

    Is there any way to click ALLOW or BLOCK when the website opens a dialog box before loading the scripts?
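
The question above has no reply in the thread, and it is not clear which kind of dialog is meant. A hedged sketch of two common cases: a browser permission prompt (Allow/Block), which can be decided in advance with `overridePermissions`, and a JavaScript `alert`/`confirm`/`prompt`, which can be answered through the page's `dialog` event. The helper names `grantPermissions` and `autoAnswerDialogs` are made up for this example, as are the origin and permission shown in the usage comment.

```javascript
// Grant site permissions up front so Chrome's Allow/Block prompt never appears.
// `browser` is a Puppeteer Browser; origin and permissions are caller-supplied.
async function grantPermissions(browser, origin, permissions) {
  const context = browser.defaultBrowserContext();
  await context.overridePermissions(origin, permissions);
}

// Answer JavaScript dialogs (alert, confirm, prompt) as soon as they open.
// Register this before page.goto(), otherwise an early dialog can stall the page.
function autoAnswerDialogs(page, accept = true) {
  page.on("dialog", async (dialog) => {
    if (accept) await dialog.accept();
    else await dialog.dismiss();
  });
}

// Usage (hypothetical origin and permission):
//   await grantPermissions(browser, "https://example.com", ["notifications"]);
//   autoAnswerDialogs(page, false); // dismiss every confirm()/alert()
//   await page.goto("https://example.com");
```

Neither helper covers consent banners that are ordinary HTML inside the page; those have to be found and clicked with a selector like any other element.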

  • @nattanontesavibul4450
    @nattanontesavibul4450 10 months ago

    I keep getting this error. Does anyone know how to fix this?
    Error: Could not find Chromium (rev. 1108766). This can occur if either
    1. you did not perform an installation before running the script (e.g. `npm install`) or
    2. your cache path is incorrectly configured (which is: /Users/ohmnattanon/.cache/puppeteer).

  • @freefd3929
    @freefd3929 1 year ago

    The stealth plugin has not worked since Chrome updated to 114.

  • @twansrude
    @twansrude 13 hours ago

    I can't be the first one to think Steve sounds exactly like Robert Downey Jr. 🤣

  • @JoopKomkommerShake
    @JoopKomkommerShake 4 months ago

    Pity you don't show how to find the selector names and button names on the YouTube page.
    For other readers: right-click the required item in your browser, choose Inspect, and voilà.

  • @nori_san10
    @nori_san10 6 months ago

    How do you deal with timeout errors? Is it OK to rerun the script every time it encounters a timeout error until you have scraped all the data you need?

    • @SteveGriffith-Prof3ssorSt3v3
      @SteveGriffith-Prof3ssorSt3v3 6 months ago

      It depends on what you are doing and whether there are restrictions on the server or with authentication.
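
One alternative to rerunning the whole script is to retry only the step that timed out, with a cap on attempts so a rate-limited or authenticated server is not hammered. The sketch below is one possible approach, not something shown in the video. The helper `retryOnTimeout` and its options are invented for this example, and the default check assumes the thrown error reports its `name` as `TimeoutError`; pass your own `shouldRetry` (for instance an `instanceof` check against the `TimeoutError` class exported by Puppeteer) if that assumption does not hold in your version.

```javascript
const sleep = (ms) => new Promise((resolve) => setTimeout(resolve, ms));

// Retry an async task a limited number of times, waiting longer after each failure.
// Errors that `shouldRetry` rejects are rethrown immediately.
async function retryOnTimeout(task, options = {}) {
  const {
    attempts = 3,
    delayMs = 1000,
    shouldRetry = (err) => Boolean(err) && err.name === "TimeoutError",
  } = options;

  for (let attempt = 1; ; attempt++) {
    try {
      return await task(attempt);
    } catch (err) {
      if (!shouldRetry(err) || attempt >= attempts) throw err;
      await sleep(delayMs * attempt); // linear backoff: 1x, 2x, 3x ...
    }
  }
}

// Usage with a Puppeteer page (URL is a placeholder):
//   await retryOnTimeout(() => page.goto("https://example.com", { timeout: 30000 }));
```

Whether retrying is acceptable at all still depends on the site, as the reply above says.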

  • @stefanotosolini
    @stefanotosolini 26 days ago

    Thank you.

  • @davidhusted817
    @davidhusted817 7 months ago

    Hello Mr. Steve, I'm working on a similar project, but I'm scared about using streams instead of events, because after saving the images I want to send them to an API for image processing with Tesseract.js and sharp. Is that the right solution, or can I get a nicer solution from you? Thanks, I learn a lot from you 😊😊

    • @SteveGriffith-Prof3ssorSt3v3
      @SteveGriffith-Prof3ssorSt3v3 7 months ago

      The "right" solution depends on what you are trying to do with your project. I can't really tell you what to do without being part of the project. Sorry, I don't have any videos on Streams in JS yet.
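
For readers with the same worry about streams, one way to be sure a file is completely written before the next step runs is Node's promise-based `pipeline`. This is a sketch under assumptions, not the approach from the video: `saveStream`, `saveThenProcess`, and the `processImage` callback are hypothetical names, and the callback stands in for whatever OCR or resizing call a project uses.

```javascript
const { createWriteStream } = require("node:fs");
const { pipeline } = require("node:stream/promises");

// Write a readable stream to disk; the promise resolves only after
// the file has been completely written and the streams are closed.
async function saveStream(readable, destPath) {
  await pipeline(readable, createWriteStream(destPath));
  return destPath;
}

// Save first, then hand the finished file to the next step.
// `processImage` is a hypothetical callback supplied by the caller.
async function saveThenProcess(readable, destPath, processImage) {
  const savedPath = await saveStream(readable, destPath);
  return processImage(savedPath);
}

// Usage on Node 18+ (URL and file name are placeholders):
//   const { Readable } = require("node:stream");
//   const res = await fetch("https://example.com/picture.png");
//   await saveThenProcess(Readable.fromWeb(res.body), "picture.png", async (file) => {
//     console.log("ready for processing:", file);
//   });
```

Because `pipeline` returns a promise, the code reads top to bottom with `await` and there is no need to wire up `finish` and `error` event listeners by hand.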

  • @owszystkim5415
    @owszystkim5415 1 year ago

    Hey, why don't you get the cookies modal when you open YouTube, when I do? I ran your code and it doesn't work because I can't accept or decline cookies. I tried accepting or declining them on my own, but for some reason, even though I was passing the correct classes, it didn't work.

    • @SteveGriffith-Prof3ssorSt3v3
      @SteveGriffith-Prof3ssorSt3v3 1 year ago

      This is why you start your testing with the non-headless version. So you can look at the HTML in the elements tab and find the cookie dialog reference. Then you can deal with it as part of your script.

    • @owszystkim5415
      @owszystkim5415 1 year ago

      @SteveGriffith-Prof3ssorSt3v3 Thanks. I managed to decline cookies, but I was just curious why this modal didn't appear on your screen.

    • @SteveGriffith-Prof3ssorSt3v3
      @SteveGriffith-Prof3ssorSt3v3 1 year ago +1

      @owszystkim5415 I have already visited the site previously and dismissed it.
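
The thread above boils down to: the cookie dialog may or may not appear, so a script should handle both cases. A minimal sketch of that idea follows. The helper `dismissCookieDialog` is invented for this example, the selector in the usage comment is hypothetical and must be replaced with the one found in the Elements tab, and the timeout check assumes the error reports its `name` as `TimeoutError`.

```javascript
// Click a cookie-consent button if it shows up within `timeout` ms.
// Returns true when a button was clicked, false when none appeared.
async function dismissCookieDialog(page, buttonSelector, timeout = 5000) {
  try {
    const button = await page.waitForSelector(buttonSelector, {
      visible: true,
      timeout,
    });
    await button.click();
    return true;
  } catch (err) {
    // Assumption: a wait that times out throws an error named "TimeoutError".
    if (err && err.name === "TimeoutError") return false; // no dialog this time
    throw err;
  }
}

// Usage (hypothetical selector, found by inspecting the page with headless: false):
//   await page.goto("https://www.youtube.com/");
//   await dismissCookieDialog(page, 'button[aria-label="Reject all"]');
//
// To keep a dismissed dialog dismissed between runs, a persistent profile can be used:
//   const browser = await puppeteer.launch({ headless: false, userDataDir: "./profile" });
```

A fresh Puppeteer profile has no cookies, which is one plausible reason the dialog shows up in a script even when it no longer appears in a regular browser.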

  • @rickrhodes2506
    @rickrhodes2506 4 months ago

    In all the tutorials, nobody mentions what you are running when typing all these commands and installing Puppeteer. Is it PowerShell, CMD, Node.js, Visual Studio?

    • @SteveGriffith-Prof3ssorSt3v3
      @SteveGriffith-Prof3ssorSt3v3 4 months ago +1

      It is an npm module, so it is the same process as for any npm module. It uses Node.js to install modules.
      You can use whatever terminal / command prompt you want.
      I'm using the one that is built into VS Code.

  • @kingwindie
    @kingwindie 1 year ago +1

    Wow, I have never contributed to any YouTuber, but you bring top-notch, amazing content. This was exceptional. I actually want to donate or do something to help the channel out.

    • @SteveGriffith-Prof3ssorSt3v3
      @SteveGriffith-Prof3ssorSt3v3 1 year ago +1

      Thanks!
      Please share the channel to let others know about content that helped you.
      If you are wanting to donate, you can use the Super Thanks button (the heart icon with the dollar sign).

    • @kingwindie
      @kingwindie 1 year ago

      @SteveGriffith-Prof3ssorSt3v3 Thanks, I wasn't sure how Super Thanks works, but I will use that.

  • @athwala
    @athwala 9 months ago

    I tried to scrape data from a Facebook post but I couldn't. How can I do it?

    • @SteveGriffith-Prof3ssorSt3v3
      @SteveGriffith-Prof3ssorSt3v3 9 months ago

      For any advanced site with credentials, it can be a trial and error process. You have to move through each step one at a time, testing after each command until you find the right combination. It's different for each site.

  • @juliushernandez9855
    @juliushernandez9855 3 months ago

    How do I deploy Puppeteer on Vercel, Netlify, and Render?

    • @SteveGriffith-Prof3ssorSt3v3
      @SteveGriffith-Prof3ssorSt3v3 3 months ago

      I made a few videos about using Netlify. This is the first one - ruclips.net/video/TSQAjWq49iw/видео.html No videos on Vercel or Render yet.