How to Scrape Websites Without Getting Blacklisted or Blocked

Поделиться
HTML-код
  • Опубликовано: 16 янв 2025

Комментарии • 71

  • @kertiz74
    @kertiz74 2 года назад +4

    I love this! Very in-depth thank you! and I can also add that it's better to use the right package of proxies like from proxy-store for web scraping specifically to minimize chances of being blocked

  • @michaelzumpano7318
    @michaelzumpano7318 2 года назад +3

    Wow, that was very well done. I like how you explained each part so that a novice could follow everything. I’m going to look at your other videos. You should get recommended by the algorithm more often.

  • @Meleeman011
    @Meleeman011 2 года назад +1

    my plan is to cache and save all queries till I eventually have all the data I need

  • @SF-fb6lv
    @SF-fb6lv 3 года назад +4

    Wow what a great tutorial! Nice work.

  • @ninjamaster7986
    @ninjamaster7986 3 года назад +3

    Thanks for the info!

  • @mahmoodsanglay
    @mahmoodsanglay 3 года назад +4

    Great tips and exceptional utility value.

  • @hassangill2732
    @hassangill2732 4 года назад +3

    When I change proxies while scraping Instagram it asks for phone verification and scraping stops. How to overcome this problem. Please guide.

    • @Octoparsewebscraping
      @Octoparsewebscraping  4 года назад +1

      Hi Hassan. You can send a request to our support. They are professional on this: helpcenter.octoparse.com/hc/en-us/requests/new (They will reply within 1-2 working days, so go ahead). Have a nice day.

  • @richardmhain
    @richardmhain 4 года назад +5

    Cool, that's a practical view of this activity, much better sounds too. Thanks for the info.
    Cheers!

  • @hymerrathebarbarian
    @hymerrathebarbarian Год назад

    Nice info. After this tutorial would be awesome to see an actual tutorial where all the information is applied in a project. Can you make one please?

  • @Curtis3600
    @Curtis3600 4 года назад +5

    Excellent video, graphics, and description of scraping problems to avoid.

  • @SMacCuUladh
    @SMacCuUladh 4 года назад +7

    That's a lovely presenter, warm and clear and a great coat. Pretty too, which never hurts.

  • @brettadler1013
    @brettadler1013 2 года назад +1

    Thank you ma'am!

  • @cookingloverswithhania
    @cookingloverswithhania 3 года назад

    how u access the auto user agent rotatatio setting? is this option we can get in paid version?

  • @tomcha75
    @tomcha75 2 года назад

    Is it possible to use geolocation proxy to simulate a localized Google search?

    • @Octoparsewebscraping
      @Octoparsewebscraping  2 года назад

      Hi, yeah it is possible. You can use the built-in proxies to select the location according to your needs.

    • @tomcha75
      @tomcha75 2 года назад

      @@Octoparsewebscraping Is it only for cloud based scraping? I use the desktop app version and can't seem to find it anywhere.

    • @Octoparsewebscraping
      @Octoparsewebscraping  2 года назад

      @@tomcha75 Yeah it is for cloud scraping.

  • @haifengsu
    @haifengsu 2 года назад +1

    nice one!

  • @criscanlas1784
    @criscanlas1784 3 года назад

    May i ask what version of octoparse? 7 or 8?

    • @Octoparsewebscraping
      @Octoparsewebscraping  3 года назад

      This video is based on version 7.

    • @criscanlas1784
      @criscanlas1784 3 года назад

      @@Octoparsewebscraping I cannot create a pagination loop.. Octoparse extracted 2pages only??

    • @Octoparsewebscraping
      @Octoparsewebscraping  3 года назад +1

      @@criscanlas1784 Hi, sorry for the inconvenience caused. You may reach out to support@octoparse.com and the customer service team can help you step by step.

  • @birdsculptures
    @birdsculptures 3 года назад

    Does Octoparse provide the proxy IP addresses?

    • @Octoparsewebscraping
      @Octoparsewebscraping  3 года назад

      Yeah, this article can be helpful: helpcenter.octoparse.com/hc/en-us/articles/900004936243-Set-up-IP-proxies-Version-8-

    • @ridamahmood3342
      @ridamahmood3342 2 года назад

      @@Octoparsewebscraping This link is not working. Please provide a functional link.

  • @archytekt
    @archytekt 3 года назад

    How can avoid cloudfare security on a web scraping?

    • @Octoparsewebscraping
      @Octoparsewebscraping  3 года назад

      Hi, please reach out to support@octoparse.com and the customer service team can help you.

  • @patrickstar8585
    @patrickstar8585 Год назад

    would a VPN keep me from getting blocked?

    • @Octoparsewebscraping
      @Octoparsewebscraping  Год назад

      Hi there, there are many reasons that can cause it to be blocked, but usually, a VPN won't keep you from getting blocked. If you run into any problems, please contact our customer service team to get help.😀

  • @faizanasif3196
    @faizanasif3196 4 года назад +1

    Do you guys know about content grabber ??

  • @aMODiEswede
    @aMODiEswede 4 года назад +1

    My god , what else you dont already have , thanks for video

  • @transientaardvark6231
    @transientaardvark6231 2 года назад +2

    It baffles me why scraping is even necessary, and even more so why it would be actively blocked (obviously assuming that the scraping is being done "politely"). Most of the pages you want to scrape are dynamically generated from a database. Why do web sites not just offer a download-as-CSV link ? They seem insistent that you can only look at the data *though their UI* while at the same time refusing to make their own UI any good, indeed actively making their own UI rubbish for the sake of prettiness (like overly graphics intensive, poor search/filter/sort options, slow client-side scripting). Anyone who wants the data as CSV has already identified themselves as someone who finds "pretty" annoying and will not be manipulated by it, and already proved they are sufficiently engaged that they don't need superficial temptations.

    • @Octoparsewebscraping
      @Octoparsewebscraping  2 года назад

      Hello, Transient. People scrap the web for various reasons. A web scraping tool helps them to collect the data they want conveniently for any further uses, such as data analysis and more. We insist on making a good web scraping experience for all of you. We are sorry if you feel Octoparse is not good enough or brings any inconvenience to you. We will continue to improve and thank you for your feedback. Here is our latest version if you'd like to see any updates. www.octoparse.com/download/windows

    • @transientaardvark6231
      @transientaardvark6231 2 года назад +2

      @@Octoparsewebscraping OMG I'm so sorry if you thought my comments were a criticism of your video. The video is informative and well constructed. My point was about how web sites exist to deliver information but then make it hard to automate access. I know why scraping is necessary, but web site designers should just make their data available without involving these difficulties.

    • @Octoparsewebscraping
      @Octoparsewebscraping  2 года назад +1

      @@transientaardvark6231 I got you😀. Some websites do have difficulties in scraping due to different reasons, such as they don't want their data to be scraped and so on. But we always keep solving those problems. Thanks for your reply and feedback. We really appreciate!

    • @sdwone
      @sdwone 2 года назад +4

      @@Octoparsewebscraping If some websites don't want their data to be scraped, then why scrape them?

    • @emilianodelia98
      @emilianodelia98 2 года назад

      @@sdwone because fuck them that's why

  • @aireisorentertainment3143
    @aireisorentertainment3143 3 месяца назад

    1:11

  • @hh3739
    @hh3739 4 года назад

    I think this application is designed for people who don't know how to coding with python

    • @cheeseIT1992
      @cheeseIT1992 3 года назад +1

      There's still some good tips.

  • @Octoparsewebscraping
    @Octoparsewebscraping  4 года назад +2

    And here's our latest XPath tutorial! helpcenter.octoparse.com/hc/en-us/articles/360041118892-Everything-you-should-know-about-XPath-when-using-Octoparse

  • @denizsevinc9334
    @denizsevinc9334 Год назад +1

    music is very annoying

  • @talba9596
    @talba9596 4 года назад

    nice music and infographics ..good speaker -- my guys use python and anaconda and I do too .. lol .. but your anti block solutions look great

  • @julianabbott5381
    @julianabbott5381 4 года назад +1

    Excellent

  • @MuhammadAhmad-bx2rw
    @MuhammadAhmad-bx2rw 4 года назад

    Amazing

  • @Octoparsewebscraping
    @Octoparsewebscraping  5 лет назад +4

    Check out an easy-to-use web scraping tool Octoparse to reduce the chances of being blocked! www.octoparse.com/download What other anti-blocking techniques do you use? Share with us in the comments :)

  • @joshhoek8082
    @joshhoek8082 4 года назад

    Smart

  • @Octoparsewebscraping
    @Octoparsewebscraping  2 года назад +1

    🎇What is data extraction?
    🎇Why do we need it?
    🎇Intro to data extraction tool
    Don’t miss this one with the basics of data extraction info: ​ruclips.net/video/E7oACf4a24Y/видео.html

  • @lotsofpixels
    @lotsofpixels 3 года назад +6

    Also make a video how to break into somebody"s house without getting caught! Thats almost the same!!! Why do you think website owners build anti scraping technics into ther websites? Because youre not welkom as a scraper! It"s their hard work you are stealing!

    • @ninjamaster7986
      @ninjamaster7986 3 года назад +1

      Have you ever maintained a large e-commerce website?

    • @Meleeman011
      @Meleeman011 2 года назад +3

      I mean you could just copy and paste their data too. I'm sorry dude copying isn't stealing especially when they are providing the data publicly

  • @Octoparsewebscraping
    @Octoparsewebscraping  2 года назад +1

    ✨ What are the 3 methods of web scraping?
    ✨What are the pros and cons of each web scraping way?
    ✨ Which approach is your cup of tea?
    This video got all the answers well covered: ruclips.net/video/AeA-neSgON8/видео.html

  • @Octoparsewebscraping
    @Octoparsewebscraping  2 года назад +1

    ✨What is a web crawler?
    ✨How does a web crawler work?
    ✨What are the differences between it and a web scraper?
    Get yourself refilled with all info related!
    ruclips.net/video/Vjayaft_1Pc/видео.html

  • @Octoparsewebscraping
    @Octoparsewebscraping  2 года назад

    ✨ Why do we need web scraping? What is web scraping? Is web scraping right for you?
    Check out now and more is coming: ruclips.net/video/Pm1P5hvsc-k/видео.html

  • @Octoparsewebscraping
    @Octoparsewebscraping  2 года назад

    💥 Check out Octoparse's Summer Sale 2022:
    www.octoparse.com/summer-sale-2022/?
    👏 Take an EXTRA 10% off everything on Jun.15th only!
    ✨ Take 30% OFF when Renew or Upgrade from Jun.16th to Jun.28th EST!

  • @Octoparsewebscraping
    @Octoparsewebscraping  3 года назад

    💥 Check out Octoparse's Black Friday Sale:
    www.octoparse.com/2021-black-friday-sale/?comment=
    👏 Save up to 40% on Nov.17th only!
    ✨ Take 30% OFF when Renew or Upgrade from Nov.18th to Dec.3rd EST!
    🤩 Get FREE custom crawlers & 1-on-1 training~

  • @Octoparsewebscraping
    @Octoparsewebscraping  2 года назад +1

    ✨ Is web scraping legal?
    ✨What kinds of data can be scraped?
    ✨ What are common applications of web scraping?
    Check out this video and find answers for all questions related to web scraping: ruclips.net/video/WOuzDxHdz6I/видео.html