Is this how pro's scrape HUGE amounts of data?

Поделиться
HTML-код
  • Опубликовано: 10 янв 2025

Комментарии •

  • @SweetInsanity
    @SweetInsanity 5 месяцев назад +1

    awesome stuff! it's hard to find best practices around data scraping so it's always great to see an actual helpful "advanced" video.
    Keep em coming, I'd love to see more of your workflow as well!

  • @dandyexplorer4252
    @dandyexplorer4252 5 месяцев назад +1

    Thank you for delivering such content. Keep it up

  • @jacqueskraemer7538
    @jacqueskraemer7538 5 месяцев назад +3

    Hello John, I have been learning web scraping from your videos for quite some time. I am currently struggling with web page analysis prior to deciding which scraping approach to follow between Requests, Playwright and Selenium or API. If you could make a video around this topic i would really appreciate it.
    PD: Greetings from Argentina.

  • @gentrithoxha7797
    @gentrithoxha7797 5 месяцев назад +3

    Very handy advices and techniques 👍

  • @geertdepuydt2683
    @geertdepuydt2683 5 месяцев назад +5

    Ah. I thought everybody did it like that. Ok.

  • @naradakandawala4278
    @naradakandawala4278 5 месяцев назад +1

    Good as always ❤

  • @muhammedjaved4322
    @muhammedjaved4322 5 месяцев назад +1

    Always lovely content

  • @markomarjanovic8348
    @markomarjanovic8348 5 месяцев назад +1

    Hi John,
    Do you have perhaps a Udemy or similar course that you are holding, or at least any learning material that you recommend for obtaining the exact knowledge that you have, ecomerce scraping and analysis. BTW you are amazing, and I honestly suggest investing your time in making a full on course. Id pay for it and Im sure many many more would too.

  • @ChristopherBruns-o7o
    @ChristopherBruns-o7o 5 месяцев назад

    Im a proxy noob and have no clue why async-ethics a proxy is essential. Rate limits?

  • @greendsnow
    @greendsnow 5 месяцев назад

    If the data has some structure, turn it into markdown and use your imagination.

  • @personofnote1571
    @personofnote1571 5 месяцев назад +1

    What happens after saving to mongo? That’s the part I was hoping to see 😂

    • @JohnWatsonRooney
      @JohnWatsonRooney  5 месяцев назад +1

      Haha sorry! Connect and pull the data out, parse what you need and move on! I’ll do a follow up full project with this method

    • @personofnote1571
      @personofnote1571 5 месяцев назад

      Awesome. Yeah conceptually I understand the separation of concerns. It would be great to see the second part where the data is consumed out of mongo. Looking forward to it.

    • @LionBlu2000
      @LionBlu2000 Месяц назад

      @@JohnWatsonRooney Makes no sense. Why not parse everything immediately and save money on storage?

  • @lahcenkhweb1912
    @lahcenkhweb1912 5 месяцев назад

    nice