*Timestamps*
0:07 introduction (Eric Bidelman)
1:17 agenda
1:38 Intro to Headless Chrome
3:00 What does it unlock for us?
4:54 [Pyramid of Puppeteer] Chrome DevTools Protocol (CDP)
5:45 [Pyramid of Puppeteer] Puppeteer w/ usage examples
10 Things you can do (your scripts at the top of Pyramid of Puppeteer)
9:02 0. Quick Intro
9:30 1. *Pre-rendering JS sites*
12:46 1.1 Performance Comparison Demo
13:51 1.2 4 Pre-rendering Optimization tips
17:29 2. *Is Lazy-loading paying off?*
19:22 3. *A/B Testing*
21:09 4. *Catch issues for the Google crawler*
22:54 5. *Create custom PDFs*
24:20 6. *Make your browser talk*
26:28 7. *Test a Chrome extension*
28:10 8. *Crawl a SPA*
29:13 9. *Verify service worker offline caching*
30:52 10. *Procrastinate* (showcase touch/keyboard API)
31:39 Closing Remarks
Thank youu :)
You're doing God's work here
awesome
I have been using Puppeteer to automate the process of finding a new apartment and it has been such a great experience. The API is really well documented and it's really straightforward to use.
Having a hard time with this. I need to do the same thing. Are you able to give a step by step please?
This is cool and all, but at 9:43 he put his words where his mouth is, and I think that's the greatest thing of all.
24:55 "The rise of the machines has begun" _me nervously laughing_
headless chrome with puppeteer is a bliss
In case someone is trying the 2nd example of server.mjs: the example is missing a line of code.
If your index.html is located in the root folder of the server, add: app.use(express.static('.'))
Such a genius idea to SSR with puppeteer. Great work!
I don't understand tho...why ssr with puppeteer when you can ssr the whole thing?
Headless Chrome is the best. I have used a lot of headless browsers, and many of them don't support some JavaScript, so sometimes I couldn't load a page. With Chromium you can load any website; I have never had an issue loading pages or running their JavaScript. You can do everything, even watch a YouTube video ^^. Anything you do with Chrome you can do with headless Chrome, and you can run it on Linux. I have been using Puppeteer for a while and it's the best headless browser. Thanks a lot for making this.
Very cool tech that I didn’t know existed. Thanks for sharing. Now to learn more about this
Headless Chrome + Karate + Gatling is the bomb.
14:35 “How to wait until pages get partly or completely loaded”
22:54 *Creating custom PDFs* I would very much like support for adding metadata to PDFs, like you can with PDFKit. For storing PDF page snapshots it would be great to have them associated with searchable metadata (like Title, Author, Subject, Keywords, Comments). Where should I create the feature request? Puppeteer? SkiaPDF? Chromium?
That's not a pyramid, mate; that's a triangle.
it can still be a pyramid from an orthographic point of view :p
ROFL
How does it compare to Nightmare?
apparently it made it obsolete
What about concurrency? Is it really OK to call a Puppeteer browser in an endpoint that can be called by multiple clients at the same time?
Why wouldn't it be? Just create a module that exposes one instance of headless Chrome. Now you can create as many tabs as your system can handle. Or, more realistically, you can create a queue for rendering tasks and reuse, let's say, 10 tabs to render the queued requests.
I don't think it is a major issue if you put caching in place.
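The queue idea from the reply above could be sketched roughly like this. It is a minimal sketch, not a tested production setup: createRenderQueue and maxTabs are made-up names, and `browser` is assumed to be the object returned by puppeteer.launch(). As a simplification it opens a fresh tab per task and only caps how many are open at once, rather than reusing the same 10 tabs.

```javascript
// Sketch: share one browser and cap the number of tabs rendering at once.
// `browser` is anything with an async newPage() method, e.g. the result of
// puppeteer.launch(). createRenderQueue and maxTabs are hypothetical names.
function createRenderQueue(browser, maxTabs = 10) {
  let active = 0;      // slots currently in use
  const waiting = [];  // resolvers of tasks waiting for a free slot

  async function acquire() {
    if (active < maxTabs) {
      active += 1;
      return;
    }
    // All slots are busy: wait until a finishing task hands its slot over.
    await new Promise((resolve) => waiting.push(resolve));
  }

  function release() {
    const next = waiting.shift();
    if (next) {
      next();          // pass the slot straight to the next waiting task
    } else {
      active -= 1;     // nobody is waiting, so free the slot
    }
  }

  // Render `url` in its own tab and resolve with the serialized HTML.
  return async function render(url) {
    await acquire();
    let page;
    try {
      page = await browser.newPage();
      await page.goto(url, { waitUntil: 'networkidle0' });
      return await page.content();
    } finally {
      if (page) await page.close();
      release();
    }
  };
}
```

An endpoint would then call `await render(url)` for each request; requests beyond the cap simply wait their turn instead of opening more tabs.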
Hi, can I insert a script that uses the Puppeteer library in a script tag of a web page?
Thank you
Nice video. Is Puppeteer SSR more effective than template-engine SSR? My point is, template engines don't require you to take care of filtering requests, and you can also inject data into templates without calling external services.
I like the Puppeteer SSR approach, I'm just asking about any advantages over template-engine SSR.
Nicely explained
How can I set a background image and use @page margins for each page in a PDF file? I can set top and bottom margins for each page but can't set a background image. Please help me, thanks!
Why didn't he use a Chromebook?
I do not understand why this approach does not wait for the whole browser to load. The server side has to do all the work before res.send to the client.
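On the waiting question, here is a rough sketch of one way to wait only for what the response needs. Puppeteer's page.goto() takes a waitUntil option ('load', 'domcontentloaded', 'networkidle0', 'networkidle2'), and page.waitForSelector() waits for a specific element. The helper name renderWhenReady and the readySelector parameter are made up for this example, and the right selector depends on your own page.

```javascript
// Hypothetical helper: `browser` comes from puppeteer.launch(), and
// `readySelector` is an assumed selector for an element that the client-side
// JS adds once the content is in place.
async function renderWhenReady(browser, url, readySelector) {
  const page = await browser.newPage();
  try {
    // Return from goto() as soon as the initial HTML has been parsed...
    await page.goto(url, { waitUntil: 'domcontentloaded' });
    // ...then wait only for the element the response actually depends on.
    await page.waitForSelector(readySelector);
    // Serialized DOM, which an Express handler could pass to res.send().
    return await page.content();
  } finally {
    await page.close();
  }
}
```

Swapping 'domcontentloaded' for 'networkidle0' would instead wait until the network has gone quiet, which is slower but needs no selector.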
I would like to run it "headful", in incognito mode, and save the interactive timeline report from the Performance tab as is. Can this be done? Where can I find a video that shows this?
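A possible starting point, with caveats: puppeteer.launch({ headless: false }) opens a visible browser, and page.tracing.start() / page.tracing.stop() write a trace file that, as far as I know, can be loaded into the DevTools Performance panel. This sketch does not set up incognito explicitly; by default Puppeteer launches with a fresh temporary profile unless userDataDir is set, which may or may not be close enough. recordTrace is a made-up name, and the `puppeteer` argument is assumed to be the imported puppeteer module.

```javascript
// Hypothetical helper: `puppeteer` is the imported puppeteer module,
// `tracePath` is where the trace JSON should be written.
async function recordTrace(puppeteer, url, tracePath) {
  // headless: false shows the browser window while the script runs.
  const browser = await puppeteer.launch({ headless: false });
  try {
    const page = await browser.newPage();
    // Start recording before navigation so the page load is captured.
    await page.tracing.start({ path: tracePath, screenshots: true });
    await page.goto(url, { waitUntil: 'load' });
    // Stop recording; the trace is written to tracePath.
    await page.tracing.stop();
  } finally {
    await browser.close();
  }
}
```

Usage would look like `await recordTrace(puppeteer, 'https://example.com', 'trace.json')`, after which trace.json can be opened from the Performance panel's load-profile option.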
First of all, many thanks for uploading the info about headless Chrome.
Problem:
I am trying to access a password-protected URL which loads a PDF. This works absolutely fine in the normal Chrome browser; however, in headless mode the PDF never gets loaded for some reason.
I need your help to approach this problem.
It would really be a BIG help if you could help me in any way…
Many thanks to you in advance
"Kinda cool"..."Really Awesome !!!"
Help me..
I want to use Puppeteer to fetch a page and press a specific button on the page.. how can I do that?
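For fetching a page and pressing a button, something along these lines should be close. It is only a sketch: openAndClick is a made-up name, `browser` is assumed to come from puppeteer.launch(), and `selector` is whatever CSS selector matches the button on your page. page.goto(), page.waitForSelector(), page.click() and page.url() are regular Puppeteer Page methods.

```javascript
// Hypothetical helper: open `url`, click the element matching `selector`,
// and return the URL the tab is on afterwards.
async function openAndClick(browser, url, selector) {
  const page = await browser.newPage();
  try {
    await page.goto(url);                  // waits for the load event by default
    await page.waitForSelector(selector);  // make sure the button exists first
    await page.click(selector);            // scrolls it into view and clicks it
    return page.url();
  } finally {
    await page.close();
  }
}
```

If the click triggers a navigation, the returned URL may still be the old one; waiting for the navigation to finish would need extra handling that is left out here.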
I tried this the other day, but I couldn't bypass Cloudflare DDOS protection, don't know why...
So we've gone from PHP or server-side Node.js that renders all this directly to HTML, to a client-side language that should create it, but instead we do this server-side using a headless browser, which scales in much the same way that a banana doesn't, all instead of a fast script interpreter, and then send it to the client to be handled as static HTML... only to have it all in one language? Is that really the only reason?! The laziness of being able to use one language?
Still can't easily download files with Puppeteer without crutches.
Nice ...
Unable to use download options in headless chrome.
10:38 First code sample
WOW! WOW! WOW!
Can we use Puppeteer across browsers, like Selenium?
Puppeteer works only with Chromium or Chrome. However, many teams only run unit tests with a single browser (e.g. PhantomJS). In non-testing use cases, Puppeteer provides a powerful but simple API because it's only targeting one browser that enables you to rapidly develop automation scripts.
Puppeteer bundles the latest versions of Chromium.
Sad thing is, this is only for node.js
I want to import expression in control panel is not it?
Great
"Basically"
Yes. Let's use eager Promises instead of lazy Futures. And, wow, we also get a then then then then. Bravo.
I wonder why we even care about 3G for the most part. It’s like the same old discussion about supporting IE, but now it’s the network.
If your page requires animations and high performance, seriously, don’t even think about what is better for the first paint. Think of what is best overall.
This first paint strategy is Google dogma
good
Am I the only one feeling that web developers spent 20 years slowing down the internet, only to realize that all of this wasn't necessary in the first place?
And now the pyromaniac is the fireman. We'll soon have to pay Google to generate HTML. Just like Maps. 😔
I can't come up with a reason why all this shit is needed when you have Varnish and a random scripting language like PHP 7.2, or HELL.. server-side NODEJS even... just sitting there that can render this stuff at 5000 hits per second instead of 4...
LUV!
So far ahead of me automation
Can you simulate mouse clicks at a specific location?
With Puppeteer you can click on CSS selectors (pptr.dev/#?product=Puppeteer&version=v1.13.0&show=api-pageclickselector-options), or you can use the DevTools Protocol Input.dispatchMouseEvent method to specify an (x, y) location. See Puppeteer's CDPSession class for examples of how to use the DevTools Protocol with Puppeteer.
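To make the coordinate options concrete, here is a hedged sketch of two ways to do it. The first uses page.mouse.click(x, y), which is part of Puppeteer's Mouse API; the second sends Input.dispatchMouseEvent through a CDP session, as described above. The function names clickAt and clickAtViaCDP are made up, and page.target().createCDPSession() is how the session was opened in Puppeteer versions I know of (newer releases also offer page.createCDPSession()).

```javascript
// Click at viewport coordinates (x, y) with Puppeteer's mouse API.
async function clickAt(page, x, y) {
  await page.mouse.click(x, y);
}

// The same click sent directly over the DevTools Protocol.
async function clickAtViaCDP(page, x, y) {
  const client = await page.target().createCDPSession();
  const base = { x, y, button: 'left', clickCount: 1 };
  // A click is a press followed by a release at the same position.
  await client.send('Input.dispatchMouseEvent', { type: 'mousePressed', ...base });
  await client.send('Input.dispatchMouseEvent', { type: 'mouseReleased', ...base });
  await client.detach();
}
```

For most scripts the first form is probably enough; the CDP route is mainly useful when you need protocol features that Puppeteer does not wrap.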
nothing works -
ruclips.net/video/lhZOFUY1weo/видео.html
The presentation mentioned a protocol monitor panel there. It's not in the menu in Chrome 72, even when I'm in the right context (localhost:9222).
Did it get removed?
Yes, it was removed. I saw this protocol monitor panel for a while in an early version. I don't remember exactly when... maybe 8 months ago... maybe...
If Puppeteer is the way, then why not use PHP, as it does server-side rendering? Personally I love JS, but are we not reinventing the wheel in another language here?
You can also control headless Chrome with PHP, via websockets. So someone could reimplement Puppeteer in PHP, or is there a reason not to?
Because you can re-use the JS that you wrote for the client to work for server-side rendering. You don't have to duplicate the logic in two places.
Furthermore, Puppeteer gives you more than just SSR - you can take screenshots, make PDFs. Those things you cannot do with just PHP (of course you could use PHP and websockets to control a headless chrome, but I sense that's not the point you're trying to make, correct me if I'm wrong).
That said, using a bunch of headless chrome instances might use more computing resources than you want, at which point PHP or Go or any other language would work. Tons of ways to do this, it's all dependent on your requirements.
Stephan Bijzitter I guess the benefit here is using one language. Thanks
Nope. The benefit is that you can build apps with a great user experience (= pure JS frontend) without the disadvantages (in terms of SEO, SSR complexity and initial page load).
@rckd5903 but wait.. isn't all the JavaScript removed from the page when it's sent to the browser... soooo.. the next request HAS to go through that headless monstrosity again?
still too slow in x2...
How do I download files in headless Chrome during automation? Can you please help me with this?
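One approach that has been used for downloads in headless mode is the DevTools Protocol method Page.setDownloadBehavior, sent through a CDP session. Treat this as a sketch under assumptions: the method is marked experimental in the protocol and may behave differently across Chrome versions, allowDownloads is a made-up name, and downloadPath is whatever directory you want the files saved to.

```javascript
// Hypothetical helper: tell the tab behind `page` to save downloads into
// `downloadPath` instead of blocking them. Returns the CDP session.
async function allowDownloads(page, downloadPath) {
  const client = await page.target().createCDPSession();
  await client.send('Page.setDownloadBehavior', {
    behavior: 'allow',  // accept downloads rather than denying them
    downloadPath,       // directory where downloaded files are written
  });
  return client;
}
```

After calling it, clicking a download link with page.click() should save the file into that directory; knowing when the download has finished is a separate problem not covered here.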
too long...
You can do all this stuff in normal Chrome.
Puppeteer is opening doors to a lot of hacking methods and will be banned, I think...
good