Much waited video bro..waiting for more videos..please consider topics like Hive serde component,how to use Hive as OLTP system, Performance optimisation in Hive etc
Bro, in partition while reading it will directly skip unwanted records, goes to mentioned partition and returns required records. But bucket la how does it know in which bucket the record is in? In your example, 9876 phone number has hash value remainder 0, so it went to 0th bucket. so while querying select * ...where ph_num = 9876, how does it know the record is in 0th bucket? It will still scan all the buckets right?
Hi Bro, really a useful video on bucketing... I have a question, considering the same scenario in the video, I'm writing a query to select a record which is present in the 3rd bucket, will it directly hit the third bucket or it will search from 1 and 2 then comes to 3rd ? From your explanation I can say, yes it directly hits 3rd bucket, but bit confused after seeing sampling, since we are specifying the particular bucket to do with sampling... So just confused how the query hits the correct bucket for particular data we are looking for.. Thanks!!!
Yes your understanding is correct . But in sampling we are intend to choose the bucket without giving the value of the record , but in the 1st case we are giving the value and hive will check of the bucket it resides
Much waited video bro..waiting for more videos..please consider topics like Hive serde component,how to use Hive as OLTP system, Performance optimisation in Hive etc
Thala vera level explaination unexpected, keep rocking bruh I can see your experience in your explanation 🔥🤙
Hi Bro, Please put video about 1.Linage graph and DAG diffrence , 2. What is stage and task how to count these while running one word count pgm
Really helpfull do more videos regarding block chain too
Hi when we search by phone number , will hash value be apllied on it and only the respective bucket will be hit directly ... is tat corre
ct?
Bro, in partition while reading it will directly skip unwanted records, goes to mentioned partition and returns required records. But bucket la how does it know in which bucket the record is in?
In your example, 9876 phone number has hash value remainder 0, so it went to 0th bucket. so while querying select * ...where ph_num = 9876, how does it know the record is in 0th bucket?
It will still scan all the buckets right?
Really awesome explanation
Nice video bro..is there any option to learn directly from you..any courses you conducting?
Hi Bro, really a useful video on bucketing... I have a question, considering the same scenario in the video, I'm writing a query to select a record which is present in the 3rd bucket, will it directly hit the third bucket or it will search from 1 and 2 then comes to 3rd ?
From your explanation I can say, yes it directly hits 3rd bucket, but bit confused after seeing sampling, since we are specifying the particular bucket to do with sampling... So just confused how the query hits the correct bucket for particular data we are looking for.. Thanks!!!
Yes your understanding is correct .
But in sampling we are intend to choose the bucket without giving the value of the record , but in the 1st case we are giving the value and hive will check of the bucket it resides
@@dataengineeringvideos So you mean it directly hits the respective bucket without scanning each and every bucket until it finds?
Awesome bro.