2.1 What is RDD and Partitions | Spark Tutorial

  • Published: Oct 17, 2024

Comments • 13

  • @moneymaker2328 5 years ago +2

    Could you please put the Spark interview question playlist in order? The videos are arranged very randomly.

  • @manishjaiswalable 6 years ago +1

    Waiting for next video....

  • @KaramveerSikarwar 3 years ago

    Hi sir, please answer my question. An interviewer asked me this:
    Q: If an RDD fails at stage 5, what will happen, and how do we handle this situation to recover the data and fix the problem?

    • @abhishek5643 3 years ago +1

      There are two types of problem that can occur: recoverable and unrecoverable. In the unrecoverable case, you have to go through your logic and fix it. In the recoverable case, you don't have to do anything, because an RDD is resilient: Spark will internally re-execute the stages needed to rebuild it from its lineage and finally recreate the RDD.
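The "resilient" part of this answer can be illustrated with a toy model in plain Python. This is only a sketch of the idea of lineage-based recomputation, not Spark's API: the `ToyRDD` class and its `partition` and `lose` methods are hypothetical names invented for this example.

```python
class ToyRDD:
    """Hypothetical stand-in for an RDD: it remembers how to rebuild each partition."""

    def __init__(self, num_partitions, compute, parent=None):
        self.num_partitions = num_partitions
        self.compute = compute    # function(partition_index, parent_rows) -> list
        self.parent = parent      # lineage: the dataset this one is derived from
        self.materialized = {}    # partitions currently held in memory

    def partition(self, i):
        # Rebuild the partition from its lineage if it is missing.
        if i not in self.materialized:
            parent_rows = self.parent.partition(i) if self.parent else None
            self.materialized[i] = self.compute(i, parent_rows)
        return self.materialized[i]

    def lose(self, i):
        # Simulate a recoverable failure, e.g. a lost executor.
        self.materialized.pop(i, None)


# Source data with 2 partitions of 3 numbers each, then a derived dataset.
source = ToyRDD(2, lambda i, _: list(range(i * 3, i * 3 + 3)))
doubled = ToyRDD(2, lambda i, rows: [r * 2 for r in rows], parent=source)

before = doubled.partition(1)
doubled.lose(1)
source.lose(1)
after = doubled.partition(1)   # recomputed from lineage, no user action needed
print(before == after)
```

An unrecoverable problem, such as a bug in the transformation itself, would fail again on every recomputation, which is why it has to be fixed in the logic.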

    • @KaramveerSikarwar 3 years ago

      @abhishek5643 thanks

  • @vinodmani3900 5 years ago

    Hi @Data Savvy,
    I have a doubt about the partition concept.
    By default, any file stored in HDFS is distributed. Say a 1 GB file on a cluster with a split size of 256 MB is stored as 4 different pieces internally. When I create an RDD from this file, are the partitions you mentioned here those same 4 pieces, or does Spark have another mechanism to define its own partitions?
    In simple words, are HDFS splits and Spark RDD partitions the same?
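The arithmetic behind the 1 GB / 256 MB example in this question can be sketched as follows. This is a simplified estimate, assuming one partition per input split and an optional lower bound similar to the `minPartitions` argument of `SparkContext.textFile`. The function name `estimated_partitions` is hypothetical, and the real split computation in Hadoop's input formats has extra rules (for example around a small trailing split) that are ignored here.

```python
def estimated_partitions(file_size_bytes, split_size_bytes, min_partitions=1):
    """Rough estimate: one partition per input split, but at least min_partitions."""
    # Integer ceiling division: a partial trailing split still counts as one split.
    splits = (file_size_bytes + split_size_bytes - 1) // split_size_bytes
    return max(splits, min_partitions)


MB = 1024 * 1024
GB = 1024 * MB

# The example from the question: a 1 GB file with a 256 MB split size.
print(estimated_partitions(1 * GB, 256 * MB))

# Asking for more partitions than there are splits raises the count.
print(estimated_partitions(1 * GB, 256 * MB, min_partitions=8))
```

Under these assumptions the partitions of a freshly read file line up with the HDFS splits by default, and they can be changed afterwards, for example with `repartition`.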

  • @SandeepKumar-op3xy 5 years ago

    Not all heroes wear capes

  • @bhargavhr1891 6 years ago +1

    Finished 27 videos.

    • @DataSavvy 6 years ago +1

      Awesome... Great dedication... I'm happy this was useful to you :)

    • @bhargavhr1891 6 years ago

      Very useful. I will now start following the other series that you have started.

    • @shyambollapalli4763 5 years ago +1

      @DataSavvy great knowledge... I appreciate you a lot

    • @DataSavvy 5 years ago

      Thanks Shyam... Your words are very encouraging :)

    • @shyambollapalli4763 5 years ago

      @DataSavvy I am looking for some support with Spark. Please let me know if you are interested. Thank you.