What Is RDD In Spark | Session 1 | RDD Basics | Resilient Distributed Dataset

  • Published: 17 Oct 2024
  • This session teaches the basics of RDDs (Resilient Distributed Datasets) in Spark. It covers the following topics:
    1) What is an RDD?
    2) How to create an RDD
    3) Operations supported by RDDs in Spark
    4) The lazy evaluation concept in RDDs
    Here is the complete RDD tutorial playlist:
    • Playlist
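
As a rough illustration of the lazy evaluation topic listed above, here is a plain-Python analogy using generators. It is not Spark code: `lazy_map`, `lazy_filter`, and `collect` are hypothetical helper names standing in for an RDD's transformations and an action. The idea it tries to show is that transformations only describe the work, and nothing is computed until an action asks for results.

```python
calls = []  # records each time the mapping function actually runs


def double(x):
    calls.append(x)
    return x * 2


def lazy_map(func, data):
    """Transformation stand-in: builds a generator, so func is not called yet."""
    return (func(x) for x in data)


def lazy_filter(pred, data):
    """Transformation stand-in: also lazy, it only wraps the upstream generator."""
    return (x for x in data if pred(x))


def collect(data):
    """Action stand-in: forces the whole pipeline to run."""
    return list(data)


numbers = [1, 2, 3, 4]
pipeline = lazy_filter(lambda x: x > 4, lazy_map(double, numbers))

calls_before_action = len(calls)  # nothing has been computed at this point
result = collect(pipeline)        # the work happens here
```

The analogy is limited: a Python generator runs on one machine and can be consumed only once, whereas an RDD is partitioned across a cluster and can be recomputed from its lineage.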

Comments • 18

  • @cinemass916 7 years ago +3

    Honestly, you are one of the best at teaching; most video clips on RUclips show no grasp of the fundamental principles of teaching! Please do more of this kind of work, especially on the massively confusing material about Spark. We need people like you on RUclips. Thanks so much, and please let me know how to find more of the clips you have done so far. Unfortunately, the link you provided is broken. Would you please leave a working link so I can get in touch with you?
    Thanks

  • @vanivadde734 5 years ago

    Nice. I listened to many videos and did not understand, but the way you explain the topic is very clear and simple. Thank you so much, you made RDDs easy for me.

  • @clarencesail 3 years ago

    Really good explanation. Thank you for simplifying this concept for me.

  • @Raghav54321 4 years ago

    Could you please help me with the query below?
    Suppose I need to create an application that loads multiple CSV files using DataFrames. If any file's structure is different from the structure we defined at the beginning, we need to redirect that file to an error folder and load only the files with the correct structure. How can we achieve this in Spark?
    Example: the file columns should be id,name,roll, but some files have id,name,city,subject. I need to load only the files that have the id,name,roll columns.
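
One possible way to approach the question above is to check each file's header before handing anything to Spark. The sketch below is plain Python and makes several assumptions: the files sit on a local filesystem that the driver can read, every file has a header row, and the column order must match exactly. The names `EXPECTED_COLUMNS`, `read_header`, and `split_by_header` are hypothetical and exist only for this illustration.

```python
import csv
import shutil
from pathlib import Path

# Expected structure, taken from the example in the question.
EXPECTED_COLUMNS = ["id", "name", "roll"]


def read_header(path):
    """Return the first row of a CSV file as a list of trimmed column names."""
    with open(path, newline="") as f:
        row = next(csv.reader(f), [])  # empty file -> empty header
    return [col.strip() for col in row]


def split_by_header(input_dir, error_dir, expected=EXPECTED_COLUMNS):
    """Move files with an unexpected header to error_dir.

    Returns the paths of the files whose header matches `expected`.
    """
    error_path = Path(error_dir)
    error_path.mkdir(parents=True, exist_ok=True)
    good_files = []
    for path in sorted(Path(input_dir).glob("*.csv")):
        if read_header(path) == list(expected):
            good_files.append(str(path))
        else:
            shutil.move(str(path), str(error_path / path.name))
    return good_files
```

The returned list could then be passed to PySpark, for example `spark.read.csv(good_files, header=True)`, since the CSV reader accepts a list of paths. For files on HDFS or object storage the listing and moving steps would need that storage system's own API instead of `pathlib` and `shutil`.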

  • @bhavaniv1721 3 years ago

    I really appreciate your explanation; it's very easy to understand. Please share DataFrame vs Dataset 👍

  • @bhavaniv1721 3 years ago

    Can you please post DataFrame vs Dataset? Your explanation was crystal clear.

  • @smritidey2942 7 years ago +1

    Best lecture on spark

  • @saugatsaxena 6 years ago

    Thanks for the explanation! Can an RDD be stored as a file or on a hard drive (and not just in memory)? If yes, a link to an article describing it would be very helpful.

  • @ponrenjith 6 years ago

    First, what is an RDD?
    You have explained how the data is stored in an RDD, but what is the architecture of an RDD and why do we need it?

    • @jayeshpatil2711 5 years ago

      spark.apache.org/docs/latest/rdd-programming-guide.html#resilient-distributed-datasets-rdds

  • @sateesh92004 6 years ago

    Very good explanation with an example. Thank you, bro.

  • @MrManish389 5 years ago

    Hi sir, please, please upload Scala tutorials for beginners.

  • @vasavikakkat8040 6 years ago +1

    Brilliant!

  • @JT-nq9vh 5 years ago

    Well done.

  • @vpreeti36 5 years ago

    nice

  • @kamaldeepverma4633 5 years ago +1

    You should first explain what exactly an RDD is rather than how it is made and other things. Please clarify the basics for the viewer first. Thank you :)