Spark performance optimization Part1 | How to do performance optimization in spark

Поделиться
HTML-код
  • Опубликовано: 26 ноя 2024

Комментарии • 81

  • @sky-i8d
    @sky-i8d 16 дней назад +1

    Absolutely amazing, the way content is delivered!!

  • @darnasibujji
    @darnasibujji Год назад +1

    Earlier I watched some videos regarding this topic ,no one can explained in this way ,I am glad to see this video,now clearly understood spark optimization techniques

  • @durairaj4296
    @durairaj4296 2 месяца назад +1

    I am searching this kind of topics from the web in long time.. Thank you very much.. And i appreciate your stuff..

  • @harshmohan8419
    @harshmohan8419 2 года назад +1

    best video on internet for spark performace...

  • @sunnyd9878
    @sunnyd9878 6 месяцев назад +1

    This is excellent and valuable knowledge sharing... Easily one can make out these trainings are coming out of personal deep hands-on experience and not the mere theory ..Great work

  • @funnysatisfying426
    @funnysatisfying426 2 года назад +2

    Very informative 👏👍
    Keep such videos coming

  • @husnabanu4370
    @husnabanu4370 8 месяцев назад +1

    what a wonderfull explanation to the point... thank you

  • @gancan1654
    @gancan1654 Год назад +2

    perfectly went into my brain, what a clean explanation.
    can you please do videos on Pyspark from scratch.

  • @hemanthkumar9757
    @hemanthkumar9757 Год назад +1

    Very good explanation keep create more videos in spark

  • @SagarSingh-ie8tx
    @SagarSingh-ie8tx 2 года назад

    One of the best explanation on RUclips 😊

  • @thedarkknight579
    @thedarkknight579 3 года назад +2

    I only have one word for this video
    "Awesome!!"

  • @namratashinde9157
    @namratashinde9157 Год назад +1

    It's very very easy to understand whatever you explained, thank you so much

  • @sahilmittal7426
    @sahilmittal7426 2 года назад +1

    Very Good vedio, awesome work. To the point and one can understand easily

  • @rajnimehta5156
    @rajnimehta5156 3 года назад +2

    Great expectations Mam ... eagerly waiting for your upcoming video

  • @quadribrothers4396
    @quadribrothers4396 2 года назад +1

    Many thanks for making such informative video

  • @samk_jg
    @samk_jg 2 года назад +2

    All yours tutorials are too good!

  • @theanatomyofreliability2168
    @theanatomyofreliability2168 3 года назад +2

    Thanks for sharing. Very informative .

  • @ChethanKarur
    @ChethanKarur 2 года назад +1

    This is excellent maam. Looking forward to watching more videos from you

  • @kiranmudradi26
    @kiranmudradi26 3 года назад +1

    Another much needed video on Spark optimizations. point to point. Thank you very much for the video.

  • @gunasekaranr8029
    @gunasekaranr8029 3 года назад +2

    Neat and clean explanation. Looking forward to the videos on Spark Optimization.

  • @vibhad-cv4sf
    @vibhad-cv4sf Год назад +1

    Very well explained. Loving your videos!❤

  • @satviknaren9681
    @satviknaren9681 Год назад +1

    REALLY helped me get better at my work

  • @terrificmenace
    @terrificmenace Год назад +1

    Nice video it was really good 👍🏻 Thank you

  • @ramkumarananthapalli7151
    @ramkumarananthapalli7151 3 года назад +1

    Thanks a lot mam for making these videos. These are extremely useful. One of the best videos I have come across.

  • @ayushgour3984
    @ayushgour3984 3 года назад +1

    Amazing Job Shreya... Keep it Up..

  • @arjunkharat121
    @arjunkharat121 2 года назад +1

    Thanks for video ma'am., you made it very simple to understand.. waiting for more video's on this topic and spark

    • @BigDataThoughts
      @BigDataThoughts  2 года назад

      Thanks Arjun

    • @BigDataThoughts
      @BigDataThoughts  2 года назад

      ruclips.net/video/snPYj3TqM1g/видео.html you can chk this too. There are others videos on spark that i have posted

  • @abhisekhmishra4029
    @abhisekhmishra4029 3 года назад +1

    So nicely explained Shreya..

  • @sandeepchoudhary4900
    @sandeepchoudhary4900 2 года назад +3

    Awesome explanation of the optimisation techniques. If possible please create a video to cover the realtime challenges which you faced in your project and the solution you provided. That will be really helpful.

  • @khaderbasha7592
    @khaderbasha7592 3 года назад +3

    Awesome Shreya, if possible could you please upload realtime challenges which we faced in realtime environment

  • @chessforevery1
    @chessforevery1 Год назад

    where do I get practical session on this optimization technique of spark..?

  • @sathisha1702
    @sathisha1702 10 месяцев назад

    If count is not adviced, how can we count the number of rows in data frame?

  • @SpiritOfIndiaaa
    @SpiritOfIndiaaa 2 года назад

    thanks a lot , i have case where someother modules write parquet file , i need to process in my module by reading it, so how should i apply bucketing on that day ...can it be possible without writing ???

  • @Technology_of_world5
    @Technology_of_world5 Год назад +1

    Good explanation 😊

  • @puneetnaik8719
    @puneetnaik8719 3 года назад +1

    Great explaination !!

  • @Sharath_NK98
    @Sharath_NK98 11 месяцев назад +1

    Tnks Amigo
    It's very helpful

  • @harshalpatel555
    @harshalpatel555 3 года назад

    very well explained. but you are telling everything i know to exclude. ?? We need count, group,agg

  • @karthikvenkataram4790
    @karthikvenkataram4790 Год назад +1

    Ultimate 👏👏👏

  • @anirbanrc1
    @anirbanrc1 3 года назад +1

    Great explanation

  • @foodietraveller4591
    @foodietraveller4591 3 года назад +1

    Nice video mam

  • @mohitupadhayay1439
    @mohitupadhayay1439 Год назад

    Coalesce doesn't do Shuffle and that's why it's less expensive than repartition. I believe.

    • @BigDataThoughts
      @BigDataThoughts  Год назад +1

      It does but not as much as repartition. Repartition does entire data shuffle as it can reduce or increase no of partitions.

    • @mohitupadhayay1439
      @mohitupadhayay1439 Год назад

      @@BigDataThoughts thanks! Can you build an end to end project or some mini project where one can see how and where these properties arte getting implemented? Just watching these in silos only give half knowledge. Thanks.

  • @DASHTeknik
    @DASHTeknik Год назад

    Excellent.

  • @npl4295
    @npl4295 3 года назад +1

    good optimization tips

  • @tanushreenagar3116
    @tanushreenagar3116 2 года назад +1

    so nice it helped a lot

  • @shaileshc4994
    @shaileshc4994 3 года назад

    What are configurations file in spark ?
    Plz anyone can ans me please

  • @mdatasoft1525
    @mdatasoft1525 7 месяцев назад

  • @tolasebrisco6565
    @tolasebrisco6565 2 года назад

    Keep the good work #Prinetechs.
    I can clearly see all the good reviews about you man…I never believed my account can fixed after 7 months hahaha

  • @ahmedaly6999
    @ahmedaly6999 7 месяцев назад

    how i join small table with big table but i want to fetch all the data in small table like
    the small table is 100k record and large table is 1 milion record
    df = smalldf.join(largedf, smalldf.id==largedf.id , how = 'left_outerjoin')
    it makes out of memory and i cant do broadcast the small df idont know why what is best case here pls help

  • @tolasebrisco6565
    @tolasebrisco6565 2 года назад

    Keep the good work #Prinetechs.
    I can clearly see all the good reviews about you man…I never believed my account can fixed after 7 months hahaha

  • @tolasebrisco6565
    @tolasebrisco6565 2 года назад

    Keep the good work #Prinetechs.
    I can clearly see all the good reviews about you man…I never believed my account can fixed after 7 months hahaha