Python Upwork web scraper - Easy step by step guide | How to make money with Python Episode 2

Поделиться
HTML-код
  • Опубликовано: 24 сен 2024

Комментарии • 138

  • @epilefchannel9424
    @epilefchannel9424 3 года назад +25

    dude this was clear as fuuuck, thanks for just saying what's neccesary, and being so clear ... LLLLLLOVE ITT

    • @MakeDataUseful
      @MakeDataUseful  3 года назад +2

      Thanks for the feedback! Feeling motivated to put out more content!

  • @russelldjmcguire
    @russelldjmcguire Год назад +24

    Just started learning Python last week. Stumbled onto your channel today and it’s a breath of fresh air to provide some excitement halfway through my coursework! You explain everything in easy terms and it’s a lot more exciting than basic programs. I’m looking forward to finishing my course now!!!

    • @MakeDataUseful
      @MakeDataUseful  Год назад +8

      Woohoo! That's awesome to hear. I really struggled learning Python years ago and I think it helps to have a project to work through.

    • @Aoi_Hikari
      @Aoi_Hikari Год назад +2

      Same!

  • @namangadia9056
    @namangadia9056 3 года назад +4

    How am I not exhausted after watching a 30 min video. Man you are awesome.

  • @ji23delgado
    @ji23delgado 3 года назад +3

    I really like the fact that you explain things so clearly, and I regret the fact that I haven't been able to see your videos in the past few months
    Great work!

  • @gianfrancodagostino3938
    @gianfrancodagostino3938 2 года назад +4

    Man you are pure gold. Your examples are very useful and clear. Definitely worth every second of it. Keep the great job man!

  • @pw5687
    @pw5687 3 года назад +8

    hey mate, i recently found your channel when looking for some projects to busy myself with in lockdown and i love how simple you explain things and i hope you get many more subs because your amazing 😀

    • @MakeDataUseful
      @MakeDataUseful  3 года назад +1

      Thank you!!

    • @pw5687
      @pw5687 3 года назад

      @@MakeDataUseful 🙂
      Also, if you need a video idea Id love to see a notepad / MS word kind of thing in python. It doesn't have to have a GUI but I've been trying to make one and I'd love to see how someone else would try this. I'm using as few external librarys as possible so it could be a kind of plugandplay solution for quick edits?

    • @qasmb1546
      @qasmb1546 Год назад

      You're*@@pw5687

  • @balewgizem
    @balewgizem Год назад +1

    Many thanks to you! I found an Upwork job after watching your tutorial and creating portfolio project. Keep it up, your tutorial are indeed practical. Thanks agian!

  • @rafidrahman8654
    @rafidrahman8654 4 года назад +21

    Hey there, absolutely loving this series. It is quite hard for new comers to get their first job on upwork, if you had any tips regarding how to land the first job, smooth and easy it would be a real help ! Anyways, keep the series going. Loving these videos

    • @MakeDataUseful
      @MakeDataUseful  4 года назад +11

      Thank you for the feedback and yes totally agree about landing those first few jobs! I will do some more client communication and landing the job in upcoming videos :)

    • @rafidrahman8654
      @rafidrahman8654 4 года назад +9

      @@MakeDataUseful hello, thanks to you I got my first contract ! Keep inspiring, keep building !

    • @whs861
      @whs861 Год назад

      ​@@MakeDataUsefulhi

  • @heritage1834
    @heritage1834 4 месяца назад

    Great content 👍. It's very explanatory and easy to follow along

  • @ignaciojavierkairuzeguia599
    @ignaciojavierkairuzeguia599 3 года назад +1

    I'm happy I discovered your channel. Just what I was looking for!

  • @carlnysten2137
    @carlnysten2137 4 года назад +5

    You'll have 1k subscribers way before the end of the year if you continue, i'm sure of it

  • @junaidalikhan0704
    @junaidalikhan0704 5 месяцев назад

    Really helpful bro. Your content is unique. I just found your channel accidently. It gives a lot of information in easy way. Thanks a lot

  • @stuarthoughton3517
    @stuarthoughton3517 4 года назад +1

    Superb video. 🤯🤯Loving the real-world examples for scripts!!! Great work!! 👏🏻👏🏻

  • @MrLj1983
    @MrLj1983 4 года назад +1

    This work is really great. It is a real problem example.thanks for this video.

  • @everlastingideas8625
    @everlastingideas8625 2 года назад

    This is the second tutorial I am following with you and I like this one way better than the first (the website in the 1st one didn't work), it s good to see the process of webscraping working, thank you!

    • @AdamHammel
      @AdamHammel Год назад

      I was able to accomplish the first video, if you need help ask away. The xml sitemaps page changed in this video's (#2) and now has images instead of the text in 4:38 , but I don't think it is important, but the speed at which this site has changed over 3 years is amazing.

  • @halawani939
    @halawani939 9 месяцев назад

    Thank you for this wonderful work, it is very useful, easy and amazing

  • @Lekichu
    @Lekichu Год назад

    Great job, just found your chanel and I already love your videos! Keep the good work.

  • @dmitrywebsmith4031
    @dmitrywebsmith4031 Год назад

    Thank you very much! These videos are very useful. It would be nice if you continue publish them.

  • @_xji_
    @_xji_ 2 года назад

    You are a wonderful teacher!!! I just had to say.

  • @oscarmartinezbeltran
    @oscarmartinezbeltran 3 года назад

    fantastic tutorial !!!!!
    your approach to try to solve real life problems is incredibly useful !

  • @jeffsinason4336
    @jeffsinason4336 4 года назад +1

    Even better than episode one. Thank you

    • @MakeDataUseful
      @MakeDataUseful  4 года назад

      Thanks for the feedback Jeff, appreciate it! :)

  • @pkavenger9990
    @pkavenger9990 3 года назад +1

    This is a good video for me as i am also a Attack on Titan fan as well.

  • @ahmedjalali1
    @ahmedjalali1 10 месяцев назад

    Keep posting! Now it’s my wish to learn and solve problems like you do :)

    • @MakeDataUseful
      @MakeDataUseful  10 месяцев назад

      Thank you!! Lots more to come 😎

  • @NexusGuru
    @NexusGuru Год назад

    masterful tutorial dude by far the best explanations of everything even though im still trying to understand some for loops and exception handling also what packages to use but with practice i know i can do it i just stumbled on your videos so i hope i can start learning to do jobs like this so i can do this on the side

  • @helioradial
    @helioradial Год назад +1

    dude ! you are awesome thank you very much for the content! respect!

  • @fazlaynur4509
    @fazlaynur4509 3 года назад +1

    Thank you Sir, for your tutorials

  • @scott_itall8638
    @scott_itall8638 Год назад

    AsyncIO or Multiprocessing can speed this up a lot and request sessions.

  • @RyanParnow
    @RyanParnow 4 года назад +1

    Love the videos. I have learned a ton! Towards the end you mentioned the virtual machine. Could you do a video on how that works?

    • @MakeDataUseful
      @MakeDataUseful  4 года назад

      That's really great to hear Ryan! And yep, virtual machine video is on the cards.

  • @danielderma
    @danielderma 4 года назад +1

    Excellent, nice video. Keep doing this work please, man.

  • @Decolsin
    @Decolsin Год назад

    love that shirt, just like my high school teachers wearing their batiks 😄

  • @zerogruen
    @zerogruen 4 года назад +1

    Really good videos! Thanks a lot. I’d be interested in the server deployment / infrastructure process one day

    • @MakeDataUseful
      @MakeDataUseful  4 года назад

      Hey great idea! Thanks for the idea, I'll share some hosting examples in upcoming vids

  • @rayolcode
    @rayolcode 2 месяца назад

    Which IDE are you using for coding Python? I've never seen this.

  • @abdulnuhu
    @abdulnuhu Год назад

    Enjoying your videos!
    Thank you

  • @pw5687
    @pw5687 3 года назад

    who tf disliked this amazing guy!!!!

  • @ataimebenson
    @ataimebenson 3 года назад

    Very awesome video

  • @dannyrock7261
    @dannyrock7261 4 года назад +1

    Great video mate, easy to follow

    • @MakeDataUseful
      @MakeDataUseful  4 года назад

      Thank you Danny Rock, really means a lot to me :)

  • @syedhyder5630
    @syedhyder5630 3 года назад +1

    Please do more videos like this .... U will have many subscribers

    • @MakeDataUseful
      @MakeDataUseful  3 года назад

      You got! Another video coming up!

    • @syedhyder5630
      @syedhyder5630 3 года назад

      @@MakeDataUseful wow... Really excited... U know it's really useful... I was struck at a point thinking what will i do after learning python. How will i convert my skills to a profession... Now this tutorials really helped me... I am thinking of learning web scraping with python and if u teach any webscraping courses with python.. please let us know.... Will be happy to be your student...

  • @isaacbawangisah6096
    @isaacbawangisah6096 Год назад

    I am a new subscriber and I am happy to join this family.

    • @MakeDataUseful
      @MakeDataUseful  Год назад

      Welcome to the family! Lots of great people here all helping each other learn and grow

  • @arnoor077
    @arnoor077 4 года назад

    Very nice video, keep up the good work.

  • @martin-xq7te
    @martin-xq7te 3 года назад

    Great video, well explained. how about a video showing how to use headers

  • @daniyalmehmood2912
    @daniyalmehmood2912 3 года назад

    Superb video

  • @portraying_my_life
    @portraying_my_life 3 года назад +1

    You will have 20 k subs by the end of 2021 don't worry.

  • @benediktbiechele9121
    @benediktbiechele9121 4 года назад

    Ripper content (did I do that right?). Subscribed and looking forward to more. Thank you!

  • @MndTheNerd
    @MndTheNerd Год назад

    Great content and inspiring ❤

  • @drivelement
    @drivelement 3 года назад

    Bro I just love this serie

  • @DenisAnzoategui
    @DenisAnzoategui Год назад

    Wonderful!

  • @keifer7813
    @keifer7813 2 года назад

    Absolute legend. Thanks a lot man

  • @mattdorsey395
    @mattdorsey395 Год назад

    You got a new subscriber. This was great thank you

  • @og420obi
    @og420obi Год назад

    Thanks brother, just subscribed.

  • @czarnobrody9992
    @czarnobrody9992 Год назад

    Im wondering if this days you can still make money on that

  • @BookSays03
    @BookSays03 Год назад

    Great video❤, Can you teach us about the ide you use?

  • @GamingOzzz
    @GamingOzzz 3 года назад

    You deserve a million subs

  • @DaneNanahara
    @DaneNanahara Год назад

    can i do this to google my business scraping?

  • @zbigniewloboda3393
    @zbigniewloboda3393 Год назад

  • @TechX5others
    @TechX5others 2 года назад

    Please do you have a community for web scrapers especially for beginners like us that hardly gets the job and feels they are not good enough

  • @NexusGuru
    @NexusGuru Год назад

    when you do work with python yourself do you usually only target web scrapping or do you other jobs as well. by the way i really like web scrapping with some practice i think i can become good at this

    • @MakeDataUseful
      @MakeDataUseful  Год назад

      I do a lot of web scraping when I am building my own databases/APIs. A lot of my work is data cleaning and remediation prep for the visualization of data. The viz component is usually 10-20% of the effort, most of it is extracting and transforming data. Keep practicing an let me know if you need any help!

  • @keshavdk3080
    @keshavdk3080 3 года назад

    Thanks for the video.
    I have a python experience as a backend developer for 1 year. I have used python flask framework.
    Due to covid ,the start i was working has stopped its operation and i lost my job.
    Can i know how can i start earning as a freelancer with the python skills.

  • @salehramadan5407
    @salehramadan5407 3 года назад

    thanks man I really appreciate your efforts!

  • @luispunq
    @luispunq Год назад +1

    Arigato Kosaimas Sensei!

  • @fcoatis
    @fcoatis 3 года назад

    Hey there, great video. Can you post the consolidated code I´m stucked trying to put it all together. Thanks

  • @marwanhex
    @marwanhex Год назад

    best regards

  • @DrKnowsMore
    @DrKnowsMore Год назад +1

    Here's a question. Why would someone who's going to utilize 50,000 images in a machine learning environment need someone to scrape the web for them?

    • @MakeDataUseful
      @MakeDataUseful  Год назад +1

      There are several no-code/low-code tools available that allow you to perform machine learning tasks on images without having to write code. I imagine they may be utilising one of those. Also might be easier to give the task to someone else to do to save time and focus on other aspects of their project

  • @imperialwes49
    @imperialwes49 Год назад

    I tried to follow along but couldnt get the xml to work.
    FeatureNotFound: Couldn't find a tree builder with the features you requested: lxml. Do you need to install a parser library?

    • @boraaslan1890
      @boraaslan1890 Год назад

      yeah i failed too i thought site changed its structure but im new in web scraping so i dunno

    • @tralfazy
      @tralfazy Год назад

      In Aug 2023 the page format is changed and BeautifulSoup has changed too.
      Fixes that worked for me:
      1) install lxml (letter L in front) with pip install lxml
      2) Then use this at about line 20: soup = BeautifulSoup(response.text, features="xml")
      The web page structure has changed so that the id number is on one line and the PNG on the line following it.
      I used the following code (may not be the best way but it works):
      for image_url in master_list[0:5]: # get a sample of some images. (requires two lines per image) this 0:5 downloads 2 images
      if 'image' in image_url: # the id number line has the word 'image' in it
      id_number = image_url.split('/')[-1] # get the last string following the backslash
      print (id_number)
      elif 'files' in image_url: # the png line follows the id line and has the word 'files' in it
      print(image_url)
      image = requests.get(image_url) # download the PNG image
      image_title = image_url.split('/')[-1] # get the file name in the last string after the final backslash
      file_name = id_number + "-" + image_title # add the id number + image name together
      # write the PNG file
      # example: 755899-Poker-Chip-PNG-Photos-80x80.png
      with open(file_name, 'wb') as file:
      file.write(image.content)

  • @elcanbayramli4444
    @elcanbayramli4444 3 года назад

    great tutorial!

  • @omarazhar2749
    @omarazhar2749 Год назад

    i understood all but i have confusion. there were 5k link so how did we download 50k images??

  • @pkavenger9990
    @pkavenger9990 3 года назад +1

    Did RUclips suggested me this video because it has Eren Jeager's images in this video or is it because i am learning python lol. every where i go my favorite anime follows me.

  • @AdventuresStories
    @AdventuresStories 11 месяцев назад

    I applied to. numerous projects in upwork, no one is contacting. Competitions are too much.

  • @OmerHarbi
    @OmerHarbi 7 месяцев назад

    thanks so much for the video, but unfortunately the website didn't works for me , (page not found). but i'll keep learning

    • @MakeDataUseful
      @MakeDataUseful  7 месяцев назад

      Dang, about time I did a new web scraping tutorial! Watch this space!

  • @beeohenji
    @beeohenji Год назад

    In 8:29 and 14:04 you defined "soup" and "loc" on different lines. Does that mean we can use the same name for variables? Sorry im still new to python. Also i noticed you mispelled "response" as "repsonse" in 11:50, was that intentional? Thank you for a real practical use of python though :)
    Edit: nevermind, you said its okay for repeating the "soup", my bad :)

    • @Gasterus69
      @Gasterus69 Год назад

      Python is both a strongly typed and a dynamically typed language. Strong typing means that variables do have a type and that the type matters when performing operations on a variable. Dynamic typing means that the type of the variable is determined only during runtime.
      So yes, you can use the same variable

    • @beeohenji
      @beeohenji Год назад

      @@Gasterus69 thanks!

  • @moheb_shia
    @moheb_shia 9 месяцев назад

    hi body.
    you forgot the insert loop for part xml link.(part1,part2,.....
    )

  • @kerolosmakram5364
    @kerolosmakram5364 Год назад

    thank you so so much for this ❤❤

  • @Harounu_35Dz
    @Harounu_35Dz Год назад

    great tutorial my friend , i want to ask you how you send this script to the client ? as .exe file or just the script?

    • @MakeDataUseful
      @MakeDataUseful  Год назад

      Depends on the client, a lot of the time they just care about the data and solving the problem.

    • @Harounu_35Dz
      @Harounu_35Dz Год назад

      @@MakeDataUseful ok, thank you

  • @OrianaVerity
    @OrianaVerity Год назад

    You're a champ!

  • @caribepedro
    @caribepedro Год назад

    Please create a video in which you interact with the employer? I'm looking for my first job at the platform, but I'm not sure how to proceed when I finally get "selected" and have to interact with the employer. Thanks!

  • @csspace441
    @csspace441 2 года назад

    Very helpful

  • @SonoftheMustardTiger
    @SonoftheMustardTiger Год назад

    Do you have a video about how to deal with clients?

  • @crees6447
    @crees6447 3 года назад

    Awesome tutorial, Can you post here the code for incremental downloading up to 50k to not stress down the web server.

    • @simonquvang6073
      @simonquvang6073 3 года назад

      If you run it in the cloud, just use something like time.sleep(time_in_sec). There is other ways to do it, but that one is quite easy.

  • @thatolebethe3238
    @thatolebethe3238 3 года назад

    Could u do a vid using urllib and beautiful soup please

  • @rahalmehdiabdelaziz8121
    @rahalmehdiabdelaziz8121 3 года назад

    That's great, but I have a ConnectionError problem, even StackOverflow solution doesn't work perfectly

    • @MakeDataUseful
      @MakeDataUseful  3 года назад +1

      Hi Rahal what is your error?

    • @rahalmehdiabdelaziz8121
      @rahalmehdiabdelaziz8121 3 года назад

      @@MakeDataUseful HTTPConnectionPool(host='www.pngmart.com', port=80): Max retries exceeded with url: /image/246105 (Caused by NewConnectionError(': Failed to establish a new connection: [WinError 10060] A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond'))

    • @MakeDataUseful
      @MakeDataUseful  3 года назад +1

      @@rahalmehdiabdelaziz8121 looks like a timeout, a couple of things to test. Try visiting the url with your browser and I'd you are successful they connection may be getting refused due to incorrect headers. Another option you could try is adding timeout=X where X is the number of seconds you want to wait. More details available here requests.readthedocs.io/en/master/user/quickstart/

    • @rahalmehdiabdelaziz8121
      @rahalmehdiabdelaziz8121 3 года назад

      @@MakeDataUseful I tried those type of solution but there is no efficient result ... however I found a " SOLUTION" : the error doesn't occur everytime (I suspected the weaknesss of my internet connection) so I've done a WHILE loop to repeat the request until it works ... thank you very much for your response ... I wish you do series of real world tasks other than webscraping ... Thank you and good luck

  • @dopplegangerdavid
    @dopplegangerdavid 4 месяца назад

    I didn't know Matt Walsh was a programmer.

  • @artabra1019
    @artabra1019 4 года назад

    this is great but how the extracted html file become a picture

    • @MakeDataUseful
      @MakeDataUseful  4 года назад

      Hey Art,
      Thanks for the feedback! At around 20:10 I source the image URL then use a response = request.get(image_url) to source the image then use a with open... and access the response.content to obtain the binary image file.
      Best of luck!
      Adam

  • @arungomathi1224
    @arungomathi1224 Год назад

    Do you get any money

  • @ladykilla85
    @ladykilla85 3 года назад

    You have a skinny seth rogen look going on here

  • @ollie6845
    @ollie6845 Год назад

    Webscraping mission on upwork for 20$..you're never gonna be rich doing that. I hate those fakes titles

    • @_Lumiere_
      @_Lumiere_ Год назад

      It says making money with web scraping on upwork, not getting rich lol

    • @MakeDataUseful
      @MakeDataUseful  Год назад +1

      Agree! A lot has changed over the last couple of years. Great opportunity to learn Python