Why the partition 0 is getting 5 keys , partition 1 - only 1 key and the other two partitions 2 & 3 got ZERO keys. What is the logic behind this or criteria for this kind of partition ?
That's my question too. I guess groupBykey still has a partitioner but it is a new one, there is some shuffling. But reduceBykey will keep the original partitions.
So many videos in other channel but this one after so many years still has best value content. Thank you !
This is such a brilliant series :) I can feel myself levelling up in spark while watching it :)
Awesome Explanation with every details in all aspects. Squared all corners nicely. Brilliant.
absolutely great .... please add some more videos on spark real time use cases ... thanks
superb! explanation covering all possible scenarios, hats off..
simply great
The best explanation I have ever seen.
Hey, that's pretty good!
This is incredibly explained
Well Explained !! Thanks
Excellent
so contentful!Thanks
excellent !!
Why the partition 0 is getting 5 keys , partition 1 - only 1 key and the other two partitions 2 & 3 got ZERO keys. What is the logic behind this or criteria for this kind of partition ?
It is Key%(number of partition). Example: key = 8 and partition = 4 , 8%4 = 0
I remember that in other video u said groupBykey causes shuffling .. then how come its has a paritioner ?
That's my question too. I guess groupBykey still has a partitioner but it is a new one, there is some shuffling. But reduceBykey will keep the original partitions.
great
the only sorry that i cant get english