Web Scraping with Python and BeautifulSoup is THIS easy!

  • Published: 21 Nov 2024

Comments • 61

  • @gonzalobarral2357
    @gonzalobarral2357 10 days ago +1

    Hi Thomas, thanks so much for sharing it! I was looking all day trying to find some good material, and this by far saved the day!

    • @TomsTechAcademy
      @TomsTechAcademy 10 days ago

      Thank you for your comment! Glad to hear my video has been helpful for you :) happy coding!

  • @ahassan7270
    @ahassan7270 7 months ago +3

    Thank you so much for your valuable information and the easy-to-follow explanation style.👏👏

    • @TomsTechAcademy
      @TomsTechAcademy 7 months ago

      You're very welcome Abdullah 😃 is there anything else you would like to learn?

  • @rkcorbes
    @rkcorbes 3 months ago +3

    Thank you so much, man. I'm a total beginner and it's very easy to follow.

    • @TomsTechAcademy
      @TomsTechAcademy 3 months ago

      Thank you for the kind words! Happy coding 😀

  • @priyeshudawat
    @priyeshudawat 9 months ago +3

    Very good explanation. Thanks. I hope you make some more videos on data scraping using BeautifulSoup.

    • @TomsTechAcademy
      @TomsTechAcademy 9 months ago

      Thank you for the kind words, there are more data scraping videos in the pipeline 😀

  • @drendelous
    @drendelous 4 days ago

    Thanks, please consider covering other frameworks or websites.

  • @Test-ne6bt
    @Test-ne6bt 2 months ago +1

    Thanks for the video! My first Python experience. I got a "No module named 'openpyxl'" error when trying to export to Excel, even though CSV worked and even though I had tried to install it via pip in VS. I ended up having to run 'py -m pip install --user xlsxwriter' in my Windows terminal, and then it worked. Thanks!

    • @TomsTechAcademy
      @TomsTechAcademy 2 months ago

      Congrats on your first Python script! Here's to many more scripts ;)
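
A "No module named 'openpyxl'" error like the one above often means pip installed the package into a different Python interpreter than the one running the script. Here is a minimal sketch for checking that, using only the standard library. The helper names `module_available` and `install_hint` are made up for illustration and are not from the video.

```python
import importlib.util
import sys


def module_available(name: str) -> bool:
    """Return True if the interpreter running this script can import `name`."""
    return importlib.util.find_spec(name) is not None


def install_hint(name: str) -> str:
    """Build a pip command that targets the interpreter running this script."""
    return f'"{sys.executable}" -m pip install {name}'


# Report which interpreter is in use and whether it can see openpyxl.
print("Interpreter:", sys.executable)
if not module_available("openpyxl"):
    print("openpyxl is missing; try:", install_hint("openpyxl"))
```

Running pip through the interpreter itself (`python -m pip ...` or `py -m pip ...`, as in the comment above) avoids guessing which environment a bare `pip` command belongs to.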

  • @islamicdesiretv
    @islamicdesiretv 5 months ago +1

    Hi Thomas, you've done a great job. I learned a lot of important things from you by watching only this video.
    Please make a playlist on web scraping from basic to advanced so that a new person can understand it better.
    Once again, amazing video.

    • @TomsTechAcademy
      @TomsTechAcademy 5 months ago +2

      Thank you for your comment, expect more videos about web scraping soon! Cheers, Thomas

    • @islamicdesiretv
      @islamicdesiretv 5 months ago +1

      @TomsTechAcademy Thomas, that's good for all new web scrapers because your style of conveying useful content is very simple. Thanks ☺☺☺

    • @japhethmutuku8508
      @japhethmutuku8508 3 months ago

      Hello, I can teach you web scraping from the basics to advanced.

  • @jesusleguiza77
    @jesusleguiza77 17 days ago

    Hello, very good. What about products that have variants that need a click to show the price? How are different prices handled, whether they vary by color or by other features? This is loaded without AJAX, on the detail page.

  • @ebinmolloy7596
    @ebinmolloy7596 7 months ago +4

    Great video, really very helpful, thanks!

  • @alexanonym1584
    @alexanonym1584 29 days ago +1

    An f-string could be used instead of + for building the strings.
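
To illustrate the suggestion above, here is a small sketch comparing string concatenation with an f-string. The base URL and the page-naming pattern are hypothetical placeholders, not taken from the video.

```python
BASE_URL = "https://example.com/catalogue"  # hypothetical URL, for illustration only


def page_url_concat(page: int) -> str:
    """Build the page URL with +, which needs an explicit str() conversion."""
    return BASE_URL + "/page-" + str(page) + ".html"


def page_url_fstring(page: int) -> str:
    """Build the same URL with an f-string; the int is formatted automatically."""
    return f"{BASE_URL}/page-{page}.html"


print(page_url_fstring(2))
```

Both functions return the same string; the f-string version is usually easier to read once several values are interpolated.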

  • @illuminaughty666
    @illuminaughty666 11 months ago +1

    Awesome tutorial 👍

    • @TomsTechAcademy
      @TomsTechAcademy 11 months ago

      Thank you for the kind words! Happy coding :)

  • @abigailmapuladikobo9941
    @abigailmapuladikobo9941 7 months ago

    Thank you for this video. What is the risk of web scraping a Google Scholar page (only one page) with an unhidden IP address?

    • @TomsTechAcademy
      @TomsTechAcademy 7 months ago +1

      If you only scrape 1 page and only do it one time (so not, for example, every hour), I would say the risk of your IP being banned is pretty low. In case you have any doubt, just google "free proxy server"; there are a lot of proxy servers you can use for free. The challenge with free proxy servers is that they aren't reliable, but if you only scrape one page that's not a big deal. Good luck with your project :)

    • @abigailmapuladikobo9941
      @abigailmapuladikobo9941 7 months ago +1

      @TomsTechAcademy Thank you 🙏🏽❤️
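
For readers wondering what routing a request through a proxy looks like, here is a minimal sketch using the `proxies` argument of `requests.get`. The function names and the proxy address are placeholders chosen for illustration, and the thread does not say which proxy setup the video uses.

```python
def build_proxies(proxy_url: str) -> dict:
    """Map both URL schemes to one proxy, the shape requests' `proxies` argument expects."""
    return {"http": proxy_url, "https": proxy_url}


def fetch_via_proxy(url: str, proxy_url: str, timeout: float = 10.0) -> str:
    """Fetch `url` through the proxy and return the response body as text."""
    import requests  # third-party: pip install requests

    response = requests.get(url, proxies=build_proxies(proxy_url), timeout=timeout)
    response.raise_for_status()  # fail loudly on 4xx/5xx instead of parsing an error page
    return response.text


# Example call (203.0.113.10 is a documentation-only address, replace it with a real proxy):
# html = fetch_via_proxy("https://example.com", "http://203.0.113.10:8080")
```

The explicit `timeout` matters with free proxies, since an unreliable one can otherwise leave the request hanging.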

  • @mahdiosman7921
    @mahdiosman7921 6 months ago

    Great video and flow. I did this for another website, but when I run the code it should stop after scraping 235 pages, yet it keeps scraping. How do I stop it at the number of pages I would like to scrape from the website?

    • @TomsTechAcademy
      @TomsTechAcademy 6 months ago +1

      Hi, there’s a link to my Discord server in the description of this video. If you share your script and the website you want to scrape there, I’m happy to help you with your script 😀
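
One common way to keep a pagination loop from running forever is to cap it with an explicit page limit and also stop when a page comes back empty. The sketch below assumes a hypothetical `fetch_page` callable that returns the list of items found on a given page number; it is not the script from the video.

```python
from typing import Callable, List


def scrape_pages(fetch_page: Callable[[int], List[dict]], max_pages: int) -> List[dict]:
    """Collect items page by page, stopping at `max_pages` or at the first empty page."""
    items: List[dict] = []
    for page in range(1, max_pages + 1):  # the range itself enforces the upper bound
        page_items = fetch_page(page)
        if not page_items:  # no items found: assume we ran past the last page
            break
        items.extend(page_items)
    return items
```

For the case described above, calling `scrape_pages(fetch_page, max_pages=235)` would stop after page 235 even if the site keeps returning content for higher page numbers.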

  • @mahatechtalks9496
    @mahatechtalks9496 5 months ago +1

    Great video, thanks!

    • @TomsTechAcademy
      @TomsTechAcademy 5 months ago

      Thank you, glad to hear my video is useful for you 😀

  • @nmsv1
    @nmsv1 6 months ago +1

    Since the items are added to a set, how do you ensure that the columns always come back in the same order?

    • @TomsTechAcademy
      @TomsTechAcademy 6 months ago

      Hi, the items in the dictionary (e.g. Title, Link, Price) will always be in the same order.
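
Some background on the reply above: a Python `set` has no guaranteed order, but a `dict` preserves insertion order (guaranteed since Python 3.7). The sketch below assumes each scraped item is a dict with the keys Title, Link and Price, as the reply suggests, and shows how to pin the column order explicitly with the standard library's `csv.DictWriter`. The sample rows are invented.

```python
import csv
import io

FIELDNAMES = ["Title", "Link", "Price"]  # explicit column order for the output


def rows_to_csv(rows: list) -> str:
    """Serialize dict rows to CSV text with columns in FIELDNAMES order."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDNAMES)
    writer.writeheader()
    writer.writerows(rows)  # each row's keys are looked up by name, not by position
    return buffer.getvalue()


# Invented sample data; the second row lists its keys in a different order on purpose.
sample = [
    {"Title": "Book A", "Link": "https://example.com/a", "Price": "10.00"},
    {"Price": "12.50", "Title": "Book B", "Link": "https://example.com/b"},
]
print(rows_to_csv(sample))
```

Because the writer looks values up by key, the columns stay aligned even when a row was built with its keys in a different order.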

  • @codecobber1107
    @codecobber1107 6 months ago +1

    Good video. Many thanks.

    • @TomsTechAcademy
      @TomsTechAcademy 6 months ago

      Thank you for your comment Code Cobber! Glad my video was useful for you 😀

  • @raagha7733
    @raagha7733 5 months ago +1

    Can I scrape the reviews from Amazon with this same method? I'm asking because I'm unable to access the page through my requests.

  • @daviannajones5648
    @daviannajones5648 3 months ago

    Thanks for the video! The only issue I have is that it will not allow me to put the results into an Excel file. It works for a CSV, but not for Excel.

    • @japhethmutuku8508
      @japhethmutuku8508 3 months ago

      Hello, I can help you with this.

    • @TomsTechAcademy
      @TomsTechAcademy 3 months ago

      Hi, you mention "it will not allow" - which error are you getting exactly?

    • @Blackops1990
      @Blackops1990 3 months ago +1

      Hey, I also had this problem. I fixed it by installing openpyxl in my Python interpreter.
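
The Excel problem in this thread can be sketched as follows, assuming the results live in a pandas DataFrame and that pandas raises `ImportError` (of which `ModuleNotFoundError` is a subclass) when no Excel engine such as openpyxl is installed. The helper name `export_table` is hypothetical, and falling back to CSV is just one possible design, not something the video prescribes.

```python
from pathlib import Path


def export_table(table, stem: str) -> Path:
    """Write `table` to <stem>.xlsx, or to <stem>.csv if no Excel engine is installed.

    `table` is expected to behave like a pandas DataFrame (to_excel / to_csv methods).
    """
    excel_path = Path(f"{stem}.xlsx")
    try:
        table.to_excel(excel_path, index=False)
        return excel_path
    except ImportError:
        # Typically "No module named 'openpyxl'": keep the data by writing CSV instead.
        csv_path = Path(f"{stem}.csv")
        table.to_csv(csv_path, index=False)
        return csv_path


# Usage with pandas (third-party), where `books` is a list of dicts:
# import pandas as pd
# saved_to = export_table(pd.DataFrame(books), "books")
```

Installing openpyxl into the interpreter that runs the script, as described in the reply above, removes the need for the fallback.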

  • @abigailmapuladikobo9941
    @abigailmapuladikobo9941 7 months ago +1

    I followed the same code for a different website, except I left out the if/else code because I wanted to scrape only the first page. Everything worked, except that my output all came back in the same column and is very disorganized. How can I fix this in my code?

    • @TomsTechAcademy
      @TomsTechAcademy 7 months ago +1

      Hi, the best way to help you is if you share the website + your code on my Discord, then I can help you right away :)
      Join my Discord here: discord.gg/WHJWFNDXXX

  • @bouncepen932
    @bouncepen932 5 months ago +1

    Good tutorial.

    • @TomsTechAcademy
      @TomsTechAcademy 5 months ago

      Thank you for your comment, glad my video is useful for you 😀

  • @SaifAliShugan
    @SaifAliShugan 3 months ago +1

    Very nice.

    • @TomsTechAcademy
      @TomsTechAcademy 3 months ago +1

      Thank you for your comment, glad the video was valuable for you 😀

  • @y.hidayat7391
    @y.hidayat7391 5 months ago

    Nice tutorial, thanks.

    • @TomsTechAcademy
      @TomsTechAcademy 5 months ago

      You are welcome, glad my video was helpful for you!

  • @BhushanAhire-gg5wb
    @BhushanAhire-gg5wb 7 months ago

    Thank you so much.

    • @TomsTechAcademy
      @TomsTechAcademy 7 months ago

      Thank you for your comment, glad my video is helpful for you :)

  • @hotdoggyboomboom411
    @hotdoggyboomboom411 10 months ago

    Does it work on Similarweb?

    • @TomsTechAcademy
      @TomsTechAcademy 10 months ago

      I can't tell you whether this method works for specific websites. It should work for most websites, although if you need to log in or the website uses lazy loading, it takes a bit more work to get it up and running.

  • @sleex_official
    @sleex_official 3 months ago +1

    I smell German

    • @TomsTechAcademy
      @TomsTechAcademy 3 months ago

      You’re almost right, my native tongue is Dutch :)

  • @teclote
    @teclote 6 months ago

    Brilliant. Thank you.

  • @LastGamerName
    @LastGamerName 3 months ago

    Hi! I'm practiced in C#, but new to Python. I watched your video all the way through before watching again to try myself. My requests.get() works off and on without changing any of the code. I keep getting 'getaddrinfo failed' error even though it just worked with no changes made. I didn't add a proxy yet since that's later in the tutorial. Would that cause this?

    • @TomsTechAcademy
      @TomsTechAcademy 3 months ago

      Hi, this shouldn't be about the proxy. It could be either an issue with the website or the connection. Could you try:
      - Running the same script on a different network / PC
      - Scraping a different website (preferably a very simple one like the one I use in the demo, definitely not something behind Cloudflare)
      This will help you figure out whether the issue is network/PC or website related. Good luck!
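
A "getaddrinfo failed" error means the hostname could not be resolved at that moment, which tends to come and go with network or DNS hiccups. While narrowing down the cause as suggested above, one option is to retry the request a few times. The sketch below is an assumption-laden illustration: `call_with_retries` is a hypothetical helper, and it catches `OSError`, which covers `socket.gaierror` and also the exceptions raised by requests (they derive from `IOError`, an alias of `OSError`).

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def call_with_retries(action: Callable[[], T], attempts: int = 3, delay_seconds: float = 1.0) -> T:
    """Call `action`, retrying on network-style errors up to `attempts` times in total."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(1, attempts + 1):
        try:
            return action()
        except OSError:
            if attempt == attempts:
                raise  # out of retries: let the last error propagate
            time.sleep(delay_seconds)  # brief pause before trying again
    raise AssertionError("unreachable")  # the loop always returns or raises


# Example with requests (third-party), using a placeholder URL:
# import requests
# response = call_with_retries(lambda: requests.get("https://example.com", timeout=10))
```

If the error still appears on every attempt, that points toward a persistent network or DNS problem rather than a momentary glitch.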

  • @Mrshanmugham1
    @Mrshanmugham1 8 months ago +1

    Nice explanation, very good. I have a question. Could you kindly provide your email, please?

    • @TomsTechAcademy
      @TomsTechAcademy 8 months ago +1

      Thank you for your comment, glad my video is useful for you 😀 you can ask any questions in the comments, this way others who have the same question can also benefit from the answer.