Puppeteer: Headless Automated Testing, Scraping, and Downloading
HTML-код
- Опубликовано: 24 июл 2024
- This tutorial walks you through every thing you need to know about Puppeteer and headless browsers, so you can automate website testing, web scraping, fetching and downloading content, and more.
code from video: github.com/prof3ssorSt3v3/pup...
Puppeteer Reference: pptr.dev/
Puppeteer Extra Stealth Plugin: www.npmjs.com/package/puppete...
Chrome Dev Tools Protocol: chromedevtools.github.io/devt...
Pages for testing headless status:
arh.antoinevastel.com/bots/ar...
bot.sannysoft.com/
0:00:00 intro and install
0:03:26 basic commands
0:18:50 interface testing
0:47:26 web scraping
1:05:33 downloading images and files
40 years old guy here diving into frontend. I''m only 15 mins in and I've already learned so much! Thank you, Steve. You're awesome!
Dude for years I loom for great in depth Node content and i always come across your videos and they are so freaking helpful. Thanks for everything
I love and appreciate what you bring to the table. Very refined and polished teaching technique with a good voice.
Felt like the Bob Ross of coding. Very informative. Concise and clear. Great presentation. TY!
Awesome, I mean saving the screenshot along with data in json file - it is really nice approach to Web Scraping, and you showed both in your tutorial, thanks
I'm absolutely thrilled that I stumbled upon this video. It's exactly what I was looking for. Thank you for creating such fantastic content! 🙌🙌
excellent video. i've been trying to get into web scraping for a few months and the slow and methodical way of teaching really helped. thanks from a self-taught programmer
One of the best videos about this so far, explained very well!
what a brilliant tutorial. this is the clearest and most engaging coding video i can remember. thanks!
Thanks for the video, really enjoyd it. can't wait tor try some scrapping with the Puppeteer.
Loving the new thumbnails. Thanks for your work
I'm in my first job, and because of you I could learn how to use Puppeteer. I hope I can learn more with you, good job and thank you a lot!
Congratulations on the job!
Awsome stuff man. Awesome explanations. Thanks a lot.
Learning something new with this vid. Thanks Steve, again!
Man. you always do great content. Thanks a lot for sharing your knowledge. 👍
Very vey good walkthrough. Loved it 🫶🏻
Great job Steve. Thanks.
Great tuto, Sir
I built a few tools with this framework a year or so ago for work. It forced me to learn some javascript properly - from your channel. FYI: have a look at playwright.
wow, thank you so much for this!
Man, your channel is fantastic.
Thanks. Please share it with others.
so nice! Didn't know such headless browser exists at my 36 now :( So looks like you could write end-to-end tests with it easily without using any framework ? I can also think of puppeteer + ChatGPT API combo. Thanks for this super clear and easy to grasp content, Mr. Steve!
thanks a lot for the valuable video.
a question in a different context:
are you planning to make video of a deep dive into the new OPFS (origin private file system) APIs?
It's on my list but I have no idea when I will be able to get to it.
Any way to click ALLOW or BLOCK when the website opens a dialog box before loading the scripts ????
kept getting this error. Does anyone knows how to fix this?
Error: Could not find Chromium (rev. 1108766). This can occur if either
1. you did not perform an installation before running the script (e.g. `npm install`) or
2. your cache path is incorrectly configured (which is: /Users/ohmnattanon/.cache/puppeteer).
plugin-stealth from chrome update to 114 not work
I can't be the first to one to think Steve sounds exactly like Robert Downey Jr. 🤣
Pitty you dont show how to find the selector names en button names on the RUclips page.
For the other readers: right click in your browser on the required item choose inspect and voila.
How do you deal with timeout errors? Is it ok to rerun the script everytime it encounters timeout error until you scraped all the data you needed?
Depends on what you are doing and if there are restrictions on the server or with authentication
Grazie.
Thank you very much 😀🙏❤️
Hello mr. Steve I'm working at project Similar but I'm scared about using streams instead of events because I want after saving the images i 'll send it into api to make image processings with tesseract js and sharp is it the right solution or can get a nice solution from you Mr, and thanks I learn much from you 😊😊
The "right" solution depends on what you are trying to do with your project. I can't really tell you what to do without being part of the project. Sorry, I don't have any videos on Streams in JS yet.
Hey why you don't have cookies modal when you open youtube and i have. I run your code and it doesn't work because i cannot accept or decline cookies. I tried accepting or declining them on my own but for some reason even tough i was passing correct classes it didn't work.
This is why you start your testing with the non-headless version. So you can look at the HTML in the elements tab and find the cookie dialog reference. Then you can deal with it as part of your script.
@@SteveGriffith-Prof3ssorSt3v3 thanks. I managed to decline cookies, but i was just curious why this modal didn't appear on your screen.
@@owszystkim5415 I have already visited the site previously and dismissed it
in all the tutorials nobody mentions what you are running when typing all these commands and installing puppeteer??? Is it Powershell, CMD, Node.js, Visual Studio???
It is an NPM module so, it is the same process as any npm module. It uses NodeJS to install modules.
In whatever Terminal / command prompt you want to use.
I'm using the version that is built into VS Code.
wow, i have never contributed to any RUclipsr but you bring topnotch amazing content this was exceptional, I actually want to donate or do something to help the channel out
Thanks!
Please share the channel to let others know about content that helped you.
If you are wanting to donate, you can use the Super Thanks button (the heart icon with the dollar sign).
@@SteveGriffith-Prof3ssorSt3v3 thanks, i wasn't sure how the super thanks work, but i will use that
I tried to scrape data from Facebook post but I couldn't. How can I do
For any advanced site with credentials, it can be a trial and error process. You have to move through each step one at a time, testing after each command until you find the right combination. It's different for each site.
How to deploy puppeteer? Vercel, netlify and render
I made a few videos about using Netlify. This is the first one - ruclips.net/video/TSQAjWq49iw/видео.html No videos on Vercel or Render yet.