Great content, you are really filling the gap. I have some doubts and would really like some good advice from you. 1. I learned PySpark with Azure Databricks (no real-time experience) and am trying to build a portfolio project for interviews. I have some source data in Azure Blob in file format and am processing it through Databricks PySpark. Is it a must to put Hive somewhere in my pipeline? If so, how do I do that, I mean how do I integrate Hive with Azure? If possible, please create an end-to-end project video with Azure Databricks. Thank you in advance.
Well explained Gowtham, clear and understandable. Request you to prepare more questions and possible answers to them. Please make more videos on use cases.
Thanks for the video. Can anyone seeing this comment help me with exactly the difference between Parquet and ORC, and why we prefer one over the other in production?
Thanks for the explanation, very nicely explained.
Great explanation !!
Thanks Bro .. Awesome Explanations on ACID & Default PartitionAlgo
Really a good one. Thanks
Thank you, these questions are mind-blowing
Extremely useful, thanks!
Book is awesome bro. Thanks for the valuable share.
Please let me know when you will take the interview, I’ll attend it 😎 since I already know the answers.
Could you please provide some real-time use cases for choosing file formats? When to choose ORC and when Parquet?
Thanks Gowtham for great explanation.
But repartition can also be used to decrease the number of partitions, can't it?
Yes, you are correct.
Thanks for posting videos, it's so helpful for us.
Explained Well 👌🏻👍🏻... Thank You
Bro you are amazing :) Thanks for the video !
Thank you :)
If the application has shuffling and the default setting is 200, then 200 output part files will be written, right?
How are input partitions equal to output partitions?