72. Databricks | Pyspark | Interview Question: Explain Plan

Поделиться
HTML-код
  • Опубликовано: 29 ноя 2024

Комментарии • 32

  • @omprakashreddy4230
    @omprakashreddy4230 2 года назад +5

    I requested this a few days ago and here you are with awesome explanation. This shows how much you care about the community :)

  • @premsaikarampudi3944
    @premsaikarampudi3944 Год назад +1

    Honestly, I was intimidated my the length of the video but then after watching it. It was very simple :) Thanks

  • @Aramakishore
    @Aramakishore 2 года назад +2

    very nice explanation raja..Thank you for your efforts to make this video..looking forward to more videos ...It is very useful

  • @amazhobner
    @amazhobner 11 месяцев назад

    Hands down best df.explain() video I've watched so far.

  • @sonuamrith9091
    @sonuamrith9091 2 года назад +2

    Wooow awesome super marvelous or if there are any other words in English which can praise in depth of concept you have explained.. Thank you very very much sir

  • @arpittapwal4651
    @arpittapwal4651 Год назад +1

    Great explanation sir, keep going😊

  • @umuttekakca6958
    @umuttekakca6958 Год назад +1

    Very clear explanation, thanks.

  • @sravankumar1767
    @sravankumar1767 2 года назад +1

    Nice explanation Raja 👌 👍 👏

  • @tanushreenagar3116
    @tanushreenagar3116 8 месяцев назад +1

    Awesome content sir

  • @pankajchikhalwale8769
    @pankajchikhalwale8769 8 месяцев назад

    I have a question - please tell me whether my question is right or wrong.
    .
    I have a pipeline in databricks and it runs 4 times in every 24 hours in production environment. 8am, 2pm, 8pm, and 2am.
    At those 4 times - there may be other different applications also running in the same databricks production cluster.
    Say - at 8am 10 other jobs are running, at 2pm no other job is running, at 8pm 5 other jobs are running, and at 2am 4 other jobs are running.
    If after all the logical and physical planning is done and if multiple physical plans are created and cost model is applied - then - will there be different physical plans, which will be executed at 8am, 2pm, 8pm, and 2am respectively ?
    In other words - will the choice of physical plan actually executed at 8am, 2pm, 8pm, and 2am depend on actual run-time work-load/volume of work at that time (i.e. 8am, 2pm, 8pm, and 2am) in production environment ?

  • @at-cv9ky
    @at-cv9ky 10 месяцев назад

    Why is exchange happening after sort-merge join ? I see both the tables are initially exchanged and then sorted which means same DEPT-ID records are on the same partition that enabled sort-merge join. So, I can't understand why exchange partitioning is done after sort-merge join.

  • @tinashechinyati6823
    @tinashechinyati6823 Год назад

    This is great content. Is it possible to version control data frame query execution plans?

  • @venkatasai4293
    @venkatasai4293 2 года назад +1

    Good one 👍

  • @ranjansrivastava9256
    @ranjansrivastava9256 10 месяцев назад +1

    Well Explained !!!!! 😄

  • @ajay_jangra
    @ajay_jangra 6 месяцев назад +1

    really nice, thankyou so much!!!

  • @shilashm5691
    @shilashm5691 Год назад +1

    Vera level thala!!(G*d level explanation)

  • @varun8952
    @varun8952 2 года назад +1

    Thank you!