Playwright Python Tutorial - Introduction
HTML-код
- Опубликовано: 19 июн 2024
- This is a quick introduction to Playwright using Python. Playwright is a browser automation testing tool or a headless browser that is better than Selenium. Developed by Microsoft, it supports multiple programming languages and almost all browsers.
🔥 codeRECODE.com EXCLUSIVE
⮕ Become a member and get access to all the courses on my site:
coderecode.com/membership
⮕ Take the course on Scrapy Basics for $1 or free 😀
(Use coupon code FREE on checkout page)
coderecode.com/scrapy-crash-c...
🎥 RELATED VIDEOS
⮕ Selenium for Web Scraping - Real World - • Selenium - Real World ...
⮕ Splash for Web Scraping - • Scrape Dynamic Sites w...
🔓 SOURCE CODE
github.com/coderecode-com/pla...
📠 GEAR I USE and RECOMMEND
Note: These are affiliate links. I get a small commission if you click on these links and buy. This does not cost you anything.
⮕ Røde PodMic
Amazon US - amzn.to/3FeWjzo
Amazon India - amzn.to/2YkRVhq
⮕ Audio Interface - Focusrite Scarlett 2i2
Amazon US - amzn.to/3a16Obc
Amazon India - amzn.to/3Ddj598
📕 CHAPTERS
00:00 What is Playwright
02:07 Plan for This Video
2:45 Getting Started
09:40 Selectors in Playwright
16:10 Checking Presence of Elements
19:21 Reading / Scraping Data
#playwright #python #codeRECODE #upendra
~-~~-~~~-~~-~
Please watch: "Making Scrapy Playwright fast and reliable"
• How to make Scrapy Pla...
~-~~-~~~-~~-~
I just recently switched to playwright from selenium and I must say I am impressed, I vote for more playwright videos please 😁
Happy to hear that! So you are using Python OR JS?
@@codeRECODE Sir I'm using python and waiting for the next tutorial. Your teaching method is very easy to understand. Please upload the next video of this tutorial series.
@@Pylogicx Thank you :-)
Uploading one today.
Absolutely great videos. I been following along for quite few videos now and am very grateful for your teachings... would definitely be interested on seeing more videos like this. (about scraping dynamics webpages)
Very quick and clear. Going to now watch your scrapy+playwright video. I'm glad I found your channel.
Awesome, thank you!
Nice tutorial. Expecting more videos in playwright-python series.
🔥🔥🔥🔥🔥🔥 so easy and fast. Great video. I would love to see more in depth videos about playwright from you.
Noted!
great tutorial - thanks for demonstrating the different selector choices
Glad it was helpful!
Loved the video!
Please make this a series🙏
I am seriously considering this. Thanks for the feedback 🙂
Concise video. Thanks. Please do more in-depth web scraping project videos.
Thank you for the initiative.
It was a really great hands-on session.
Would request you to make a playlist on Playwright.
Yes, I need to get to it soon. Thanks for sharing your interest :-)
I just love your content ❤️ you make it look so simple 😅 I've done no professional course of python yet I'm using scrapy to scrape sites and doing automation just because of your videos I've learnt alot ❤️ thank you and please keep up the good work.
Thank you so much for your kind words. This is exactly why I make these videos. 🙂
@@codeRECODE please if possible create some tutorials for scrapy and playeright together.
Very very very good explanation. You have convinced me to switch to playwright. Scrapy + playwright is a deadly combo and thank you for such a beautiful explanation
Great to hear!
Love to see new content
Yeah, it's looking very interesting.
Waiting for more detail videos on playwright.
More to come soon.
This is an awesome! new subscriber here!😃
thanks, will give playwright a try today...
Best of luck!
Thanks for the great video !
Glad you liked it!
very good tutorial :)
Wonder and great teaching. I want from beginer level playwright with java videos
Interesting! Good to see interest in Java 😃
great sir... waiting for next video
Very Soon.
Great Video Sir🙏
Thank you 🙂
Hope it will be a good Selenium alternative. Thanks for this addition.💖
This is definitely a good alternative, especially for those who don't know JS and for those who need multiple Browsers support.
As a user of both Selenium and Playwright at work, I must say Playwright is far superior, much faster, better documentation, and a lot less flaky. Once the Playwright community grows, it will become the industry standard without a doubt.
We want more videos on playwright. Also a video on Why playwright is better than selenium
Thank you for the video. Maybe be you can try the scrapy-playwright package.
Sure.
Please make more videos on python playwright
I've done scraping the contents on the first page and all the next pages but what to do when we need to store the scraped data. Do I need to use another library for that or playwright has inbuilt storing in different format commands??
Use usual python libraries like csv, etc.
I am getting error:
It looks like you are using Playwright Sync API inside the asyncio loop.
Please use the Async API instead.
I tried async calls too but still error. Any idea how to solve this? I am using jupyter notebook.
Can we see more website interactions, it is much more interesting 👌 want to learn more about it 🙏
Sure. More videos coming up soon
Sir please make more videos on scrapy playwright Thanks !
Sure I will
How can I export to playwright exe pyinstaller does not work
Thank you!
Can you show how to write information in csv or json file ?
And how to get Title from all pages?
For writing to files, the usual CSV, JSON modules can be used.
To extract title from all pages, visit all pages and use selectors. For this specific scenario, scrapy crawl spider is more suited though.
Would’ve been helpful to mention why someone might use this over selenium, it’s advantages and disadvantages. Saying one is interesting doesn’t explain much. Thanks for the video.
Great point! I will post another video with a detailed comparison.
Can you guide on how we can use playwright with scrappy?
Will post this soon 🙂
Nice video bro. When installing pip install playwright browser in vscode, i'm getting failed error which says "failed install chromium v100 and unable to get local issuer certificate" in my company laptop. Do you have any suggestion?
I did a google search with your error and this is what I got:
1. Reported here github.com/microsoft/playwright/issues/5636
2. Solution here: playwright.dev/docs/browsers#install-behind-a-firewall-or-a-proxy
Great video. Could you make a Go version of this video please?
Web scraping with Go is a good idea. Not sure about Go with Playwright.
There is a Playwright for Go - pkg.go.dev/github.com/mxschmitt/playwright-go
It's not official though. Officially supported languages are JS, Python, Java and C#
@@codeRECODE I was able to start using Playwright with Go by translating your Python code to Go.
Selenium miss match the time of selectors in headless it is so much easy with playwright and fast.. also we need to use explicit wait in each element find in selenium but in playwright we can simple pass the wait with so much ease... I will definitely go with playwright
True!
how to find multiple elements using python? which locator or selector should i need?
A selector matches all elements - so if the selector is div.alert it will match all divs that have this class.
Hi how can we do this with playwright or if is it even possible? in Scrapy response.replace(body=webdriver.page_source) using selenium, how can we do this with playwright, i tried response.replace(body=page.content()) but this doesn't work, kindly help!
Why replace the response? You can create a new selector object
from scrapy.selector import Selector
s = Selector(text=page.content())
s.xpath('').get()
i see that ti has been 4 months withouth a new video in the list..will you continue this list? its a goog one , thanks
Yeah! I do plan to continue this. Thanks for the reminder :-)
which theme are you running on vs code ?
Monokai Contrast with colors further customized in Settings
How to click on "logout"? it doesn't look like button in html although it is. I am confused with that
Hey! Somehow missed your comment. Are you asking about the website in this video or something else?
Hello Sir,
I have watched your Scrapy selenium tutorial and tried to implement the same procedure with Scrapy-Playwright. When I tried to run the script "raise Error(
playwright._impl._api_types.Error: It looks like you are using Playwright Sync API inside the asyncio loop.
Please use the Async API instead." this error is raised every time. Why this thing is happening?
By the way, eagerly waiting for your scrapy playwright tutorial.
The error is obvious but I did not cover async API. Will create a video on async api to help you understand the concept.
@@codeRECODE sir waiting for your video on that. I am badly stuck here.
if this is on Window O/S, can it use Webkit??
Yes
Somehow missed replying to this. Yes, webkit does work on Windows.
What are the benefits of Playwright over Selenium?
Playwright is much faster than selenium. That's the most important point. I will post another video with detailed comparison.
Can you try next a tutorial on Playwright with Scrapy?
Thanks for the specific suggestion.
Yes, I have more tutorials planned around this subject.
Selenium is like Vikram Rathore from Jawaan
🙂
Why does webscraping look so complex?
I don't understand a thing.
I think Selenium is still interesting enough because i can use VPN and proxy at the same time to scrape any sites i want
Hi, could you direct me to a tutorial explaining how to use VPN with Selenium? I am scraping a website that's blocked in my country! 🙁
Proxies can be used with Playwright. VPN would be at OS level. The biggest win for Selenium is the community support.
@@sheikhakbar2067 use any vpn. I recommend vyprvpn because it's cheap.
VPN is not at browser level but machine level. Once you connect to VPN, all apps including selenium will use VPN
@@sheikhakbar2067 www.vyprvpn.com/buy-vpn
Less than 2$ a month if you take long term plan.
More
Definitely! More to come soon 🙂
My system does not show browser even if i have set timeout to page.wait_for_timeout(50000) simply terminal shows [Done] exited with code=0 in 0.175 seconds, headless=False
Run this and see if this works - github.com/coderecode-com/playwright_sync_api_intro/blob/main/quotes.py
If this doesn't work, try another browser. For example, for firefox, change line 6 to the following
```
browser = p.firefox.launch(headless=False)
```