Apache Spark Core Concepts 02 (Rdd /data frame/transformations/actions)

Поделиться
HTML-код
  • Опубликовано: 13 дек 2024

Комментарии • 17

  • @vishnuk-g1b
    @vishnuk-g1b Год назад

    Great work ! your explanation is clear and excellent . I feel you like your content is a hidden gem.

  • @mmp9371
    @mmp9371 Год назад

    very nice explanation, mam. Thank you.

  • @vikaskasaraneni6111
    @vikaskasaraneni6111 9 месяцев назад +2

    map()- Narrow
    mappartition() - Narrow
    groupbyKey()- WideSpread
    reduceByKey() - WideSpread
    Join()- Narrow
    distinct() - WideSpread
    intersect()- WideSpread
    flatMap() - Narrow
    filter() - Narrow
    Union() - Narrow
    Please correct me if I am wrong.

    • @codjawan
      @codjawan 7 месяцев назад

      Join is a Very Big Wide transformation in Spark, how come you mentioned it under Narrow

  • @vishalnasre1251
    @vishalnasre1251 Год назад +1

    Is this playlist focused on mainly on Scala ?

  • @kaushaldangi900
    @kaushaldangi900 2 года назад +2

    Hi Bhawna, very nice explanation, could you please share the notebook used during this exercise.

  • @raghavendrareddy4765
    @raghavendrareddy4765 2 года назад +2

    Nice content but bit confusion is there @Bhawna while explanation

  • @anithaanitha-g8b
    @anithaanitha-g8b 11 месяцев назад

    It is very understanding and great sessions , can you please provide the notebook for future reference purpose.

  • @TamizharasanL-sx9yn
    @TamizharasanL-sx9yn 7 месяцев назад

    Maam I have a question to you . When you say action has stages and tasks etc then, What happens really happening behind the transformation ? Is it just computing and storing it as a dataframe ?

  • @gurramvarunchowdary5735
    @gurramvarunchowdary5735 2 года назад +2

    I like your content and very informative. Thank you.
    Could you please share those ppt's if possible?

  • @jdisunil
    @jdisunil 2 года назад +1

    Great content and Great delivery:
    Question: if RDDs are immutable, and next RDD is created on basis of previous. What happens to the previous RDDs, how many such rdds are kept to until its freed? I know I should bother about the latest one. but still.

    • @venkatakrishnaprasadk1214
      @venkatakrishnaprasadk1214 2 года назад +5

      The previous RDDs are by default deleted after successful generation of new RDD- unless we use persist method, in which case the RDD we want will be persisted in cache

    • @codjawan
      @codjawan 7 месяцев назад

      Yaa that's true if it fail at any step it can go back to previous step to recalculate the step again after successful it will delete the previous Rdd's

  • @sunitabedi1230
    @sunitabedi1230 2 года назад +2

    👍👌

  • @caiyu538
    @caiyu538 Год назад

    👍

  • @cusukanya
    @cusukanya 2 года назад

    Ma'am do you provide the ppts for reference??