Apache Spark Interview Questions and Answer | Real Time Question | Using PySpark

  • Published: Jan 6, 2025

Comments • 16

  • @NiksTravelDiaries
    @NiksTravelDiaries 2 years ago +5

    Hi, why are you using rdd2=rdd1.flatMap(lambda x:x.split(','))? I don't see any comma in the input file.
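For readers with the same question, here is a minimal sketch of what the split does when the separator never occurs in the data. It uses a plain Python list as a stand-in for the RDD so that it runs without a Spark installation; the helper `flat_map` and the sample lines are hypothetical and are not taken from the video.

```python
from itertools import chain

def flat_map(func, items):
    """Plain-Python stand-in for RDD.flatMap: apply func to each element, then flatten."""
    return list(chain.from_iterable(func(x) for x in items))

# Hypothetical pipe-delimited lines; the real input file is not shown in this thread.
lines = ["1|Alice|30", "2|Bob|25"]

# No comma is present, so split(',') returns each whole line as a one-item list
# and flattening gives back the original lines unchanged.
by_comma = flat_map(lambda x: x.split(','), lines)

# Splitting on the delimiter that is actually present yields the individual fields.
by_pipe = flat_map(lambda x: x.split('|'), lines)
```

Under these assumptions, splitting on a comma is effectively a no-op on such lines, which may be why the step looks unnecessary.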

  • @thesadanand6599
    @thesadanand6599 1 year ago +1

    superrrrrrrrr da superrrrrrrrrrrrrr 👍

  • @mohitkumar-co7xt
    @mohitkumar-co7xt 8 months ago

    I have a doubt: how does the lambda work here? Does it take the whole df, or does it process the values row by row?
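On this question: in the RDD API, the function passed to map or flatMap is called once per element, not once for the whole dataset. The sketch below illustrates that with Python's built-in `map` standing in for `RDD.map`, an assumption made so the snippet runs without Spark. The records and the `parse` helper are hypothetical.

```python
# Hypothetical sample records; a plain list stands in for an RDD of text lines.
records = ["1|Alice", "2|Bob", "3|Carol"]

seen = []  # collects every argument the function receives

def parse(line):
    seen.append(line)       # called once per element, with a single line each time
    return line.split('|')

# Built-in map mirrors RDD.map here: the function never sees the whole collection.
parsed = list(map(parse, records))
```

After this runs, `seen` holds the three lines one by one, showing that the function worked on individual records.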

  • @praneethbhat6361
    @praneethbhat6361 3 years ago +1

    Hi bro. Your explanation was beautiful. I need a video on how to deploy a Spark job. Please make a video on this.

  • @DE-Py-Sq-Az-Db
    @DE-Py-Sq-Az-Db 2 years ago

    Nice One...

  • @divyadivya48
    @divyadivya48 6 months ago

    Nice explanation

  • @amiyaghosh6869
    @amiyaghosh6869 9 months ago

    You can read this file directly as a pipe-delimited CSV. Why use an RDD and make it complex?

    • @poojav2585
      @poojav2585 2 months ago

      Hey, the question set specified that it should be handled as an RDD.
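The two routes discussed in this thread can be compared without Spark. In the sketch below, Python's standard `csv` module stands in for a delimiter-aware reader (in PySpark that would roughly correspond to `spark.read.csv(path, sep='|')`), and a manual split stands in for the RDD route. The sample content is hypothetical, and the stand-ins are assumptions made so the snippet is runnable on its own.

```python
import csv
import io

# Hypothetical pipe-delimited content; the real input file is not reproduced in this thread.
raw = "id|name|age\n1|Alice|30\n2|Bob|25\n"

# Reader route: a delimiter-aware parser does the splitting.
reader_rows = list(csv.reader(io.StringIO(raw), delimiter='|'))

# RDD-style route: split every line by hand, as the interview question asks.
manual_rows = [line.split('|') for line in raw.splitlines()]
```

For simple data like this, both routes give the same rows; the reader route is shorter, while the manual route matches the constraint mentioned in the reply.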

  • @pavithrasrividhya7489
    @pavithrasrividhya7489 2 years ago

    Hi bro. I tried this code with the schema defined using a case class, but I am getting an error.

  • @dhivakarsathya3918
    @dhivakarsathya3918 3 years ago +1

    Why do you use RDDs, which are rarely used in industry and are not optimised?

    • @AzarudeenShahul
      @AzarudeenShahul 3 years ago +3

      This question was given to a candidate in an interview by one of the companies; it was not framed by me, buddy.
      Hope you are able to solve it.

  • @sanskarsuman589
    @sanskarsuman589 1 year ago +1

    So many ads in between the videos.

    • @AzarudeenShahul
      @AzarudeenShahul 1 year ago +1

      Ads are posted by RUclips, bro. Anyway, I will check and limit them if possible. Thanks.