What is Apache Spark?

Поделиться
HTML-код
  • Опубликовано: 27 июл 2024
  • Best Apache Spark Course: bit.ly/3Pi5VPB
    Thank you for watching the video! You can learn data science FASTER at mlnow.ai!
    Master Python at mlnow.ai/course-material/python/!
    Learn SQL & Relational Databases at mlnow.ai/course-material/sql/!
    Learn NumPy, Pandas, and Python for Data Science at mlnow.ai/course-material/data...!
    Become a Machine Learning Expert at mlnow.ai/course-material/ml/!
    Don't forget to subscribe if you enjoyed the video :D
    0:00 Introduction
    0:25 Surprise!
    2:06 Distributed Computation
    9:04 Intermission 1
    10:00 Introduction to PySpark and RDDs
    14:00 Filter (Narrow Operation)
    18:30 Map (Narrow Operation)
    21:00 GroupByKey (Wide Operation)
    31:00 ReduceByKey (Wide Operation)
    44:15 Intermission 2
    45:00 DataFrames (In-Memory Tables)
    53:24 Congrats on the Hard Work! Subscribe for More!
  • НаукаНаука

Комментарии • 26

  • @GregHogg
    @GregHogg  11 месяцев назад

    Take my courses at mlnow.ai/!

  • @arsheyajain7055
    @arsheyajain7055 3 года назад +3

    This pretty much sums up my BigData course at university! Another great video! Thank you.

  • @johnnyzou1070
    @johnnyzou1070 Год назад

    hey Hogg, Im a grad student trying learn spark by my own, thanks for the video. Clear and helpful video. The little clip really get me to subscribe, it reminds me of the day when i was back in undergrad, walking to classroom in snow.

  • @GhosT-op9eq
    @GhosT-op9eq Год назад

    Just recently found your channel and i gotta say that you're a big W. Amazing content, simplified, and very informative, thank you so much!

    • @GregHogg
      @GregHogg  Год назад +1

      Great to hear, thanks so much! :)

  • @noushinbehboudi5694
    @noushinbehboudi5694 3 года назад

    Such a great, useful material. Thank you, Greg!

    • @GregHogg
      @GregHogg  3 года назад

      Thanks so much! Very glad you enjoyed it :)

  • @ahmadsaad1888
    @ahmadsaad1888 3 года назад +2

    Beautiful explanation of Spark.

    • @GregHogg
      @GregHogg  3 года назад

      Thanks so much!!!

  • @mohitsonkamble5822
    @mohitsonkamble5822 4 месяца назад

    Hi Greg, Just recently discovered your channel. Amazing content. Thank you.

  • @salmanrasheed4366
    @salmanrasheed4366 Год назад

    Great 👍

  • @safakhan6348
    @safakhan6348 3 года назад +3

    So happy to officially declare myself as a data scientist! Thank you for the content!

  • @emresahindance
    @emresahindance 2 года назад +1

    You are an #amazing human being brother, wish you all the best :)

    • @GregHogg
      @GregHogg  2 года назад +1

      That's very sweet. Thank you very much Emre.

  • @hsoley
    @hsoley 2 года назад +1

    Greg, how are you? just want to thank you for sharing your khowlege and expertise with the world!

    • @GregHogg
      @GregHogg  2 года назад

      Great thank you how are you? You're very welcome.

    • @hsoley
      @hsoley 2 года назад

      @@GregHogg I am good, thanks for asking! your channel is fantastic, I hope you can keep creating content, All the best from NYC!

    • @GregHogg
      @GregHogg  2 года назад +1

      @@hsoley I'm very glad to hear that and really appreciate the kind words!

  • @AL3Ymir
    @AL3Ymir 3 года назад +1

    Professor, the lambda x+y has me stumped. what if there were three 1s at index 1? for example if the tuple were (2, [1,1,1])? What is the x and y that is being sent as a parameter to the lambda function?

    • @GregHogg
      @GregHogg  3 года назад

      x + y simply sums them.

    • @AL3Ymir
      @AL3Ymir 3 года назад

      @@GregHogg Hey professor, thanks for the reply

    • @GregHogg
      @GregHogg  3 года назад +1

      @@AL3Ymir The result is trivial, but no, it's actually kinda difficult to understand what's going on here. It's a loop. first time, x is the first 1 and y is the second 1. It adds them to get 2. Next time in the loop, x is 2 and y is the third 1. It sums them to get 3. More info here spark.apache.org/docs/latest/api/python/reference/api/pyspark.RDD.reduceByKey.html

  • @feroo44
    @feroo44 2 года назад

    Man, if you could just stop asking people to subscribe. The channel is great and people will subscribe if they think they will need more videos in the future.

    • @GregHogg
      @GregHogg  2 года назад

      Okay

    • @jaycobclavel7098
      @jaycobclavel7098 Год назад

      I agree. But i subscribed anyway.. great content

    • @amit4rou
      @amit4rou Год назад

      No, you need to tell people to subscribe because people forget sometimes, but it should not be in the face all throughout the video...