How do indexes make databases read faster?

Поделиться
HTML-код
  • Опубликовано: 10 янв 2025

Комментарии • 209

  • @aayankhan5198
    @aayankhan5198 2 года назад +92

    People run behind influencers and left with this GoldMine! Loved ur content.. following you when you joined Unacademy. 1 of the Most knowledgeable and inspiring person in the Tech Field.

  • @harshitanand7349
    @harshitanand7349 Год назад +7

    Many people have superficial knowledge about indexing. All they know is that indexing makes querying faster. But they don’t know how. Glad I found this video through your twitter post. Will read about B+ trees soon.

  • @SR-we1vl
    @SR-we1vl 2 года назад +18

    Great content sir! 😄
    Just a summary:
    Whenever a record is read from the DB, the entire block is fetched which contains that particular record. Since block reads are expensive(takes more time to read) it is better to index the records based on the property you want to search. So now when you search the record it is first checked into the index table which is stored in quite a few blocks, so block read time will be very low. Since we know in which of the blocks, the exact record(which can be computed from index table using id attribute) is present, we can directly compute those blocks and return the record from them.

  • @nytlyf2085
    @nytlyf2085 2 года назад +3

    21:49 perfect example of a range query optimization.

  • @mridulmaurya8383
    @mridulmaurya8383 5 месяцев назад +1

    seriously blown out by how you explained such a complex thing in a simple manner. Thanks man!

  • @jspnser
    @jspnser Год назад

    I don't think even my professor in under grad or grad school has taught me like they way you're explaining the concepts so succinctly, thank you.

  • @kalpeshmali1476
    @kalpeshmali1476 Год назад +2

    Wow aaj pata chala 😂 literally Gold 🔥🔥🔥🔥 keep it up sir

  • @majid127x
    @majid127x 26 дней назад +1

    Last time I was watching his interview . Now I am learning from him. 🤗

  • @sheersendughosh
    @sheersendughosh 3 месяца назад

    Every college should mandate your videos as part of their curriculum, your content is just knowledge bomb! Wish had known this in college days. You have sparked my curiousness thanks a lot!

  • @jaskiratwalia
    @jaskiratwalia 11 месяцев назад +1

    Wonderful explanation of such an important topic. The way you explained it with practical examples made the concept quite clear to me. Thank you so much!

  • @singh.aadarsh
    @singh.aadarsh Год назад +1

    Bro You probably in google 1cr package. Because you have a huse knowledge abouts system. I watch your almost 20+ video and every video has nice and details information. So you do amazing work, to teach Deep concept about engineering side.
    👍👍👍

  • @ahirjoy
    @ahirjoy 2 года назад +1

    Waiting for your book on System Design one day. There is real need for such book in the market. Keep inspiring.

  • @ankitverma1790
    @ankitverma1790 8 месяцев назад +1

    Great content...what I understood before is that if you have index on certain column, db will make the record s sequential on the basis of that column and it will access the record like an element is accessed from an array in o(1). That made me doubt as well sometimes when I thought to add indexes on multiple columns.
    This video made the concept crystal for me

  • @kashyapsharma8721
    @kashyapsharma8721 2 года назад +1

    Indices explained in the most simplest of terms. Excellent video , thank you Arpit.

  • @maksadTohBhoolChuka
    @maksadTohBhoolChuka 5 дней назад

    Hey at 13:07 I didn't get the point of need to serialize index while storing in blocks. Can any1 please elaborate more ?

  • @agrittiwari5485
    @agrittiwari5485 7 месяцев назад

    Thanks for this video Arpit. I asked an EM about indexing, as it was only a buzzword to me at that point, couldn't wrap my head around jargons back then. I was able to relate this video as to how I used to search for words in my Oxford dictionary from the index written on the first few pages. Thanks for making super imaginable.

  • @codingwithkhurram
    @codingwithkhurram 4 месяца назад

    You did an excellent job with the explanation, and I really appreciate the clarity and detail you provided! Your effort in making the concepts clear and understandable is commendable.

  • @sarangbondre
    @sarangbondre 2 года назад +6

    Hello, Arpit.
    It would be fantastic if you could include the following two points in the video or answer them here.
    1. Can we index on the string (varchar/text), and if so, what happens?
    2. What are the drawbacks of indexing (will it take up more memory?) Where should we not do it?

  • @nimitkanani1691
    @nimitkanani1691 2 года назад +1

    Your videos are so simple to understand and the pace is also very easy to follow. Great work !!! Perfect for me, and you are also covering very relevant important topics. Keep up the work, man.

  • @sarthakvaish2520
    @sarthakvaish2520 Месяц назад

    The best explanation on Database indexes

  • @dhineshrathinam8756
    @dhineshrathinam8756 Год назад

    Key points to take:
    1. While reading the data it reads full blocks
    2. Making an indices will create the separate table like structure with sorted order along with its primary key mapped

  • @santosh_bhat
    @santosh_bhat Год назад +4

    Very good explanation on index, learnt a ton !! Thanks
    One doubt, what if the resultant IDs from the index were part of each of the 37 blocks ?
    Eg: IDs - 1,4,7,10,13,16,.....
    Then we would read all 37 blocks so totally
    37 sec + 2 sec( reading index ) > sequential read
    So when should we not use an index on a column ?

    • @venkateshnaga4550
      @venkateshnaga4550 11 месяцев назад

      can u pls help here @AsliEngineering

    • @victorsolomonmachinelearning
      @victorsolomonmachinelearning 3 месяца назад +1

      Exactly what I said to myself. Hahaha. By the way he explained it, the aim of indexing is defeated in this case

    • @maksadTohBhoolChuka
      @maksadTohBhoolChuka 5 дней назад

      Hey at 13:07 I didn't get the point of need to serialize index while storing in blocks. Can any1 please elaborate more ?

  • @chandruts.2579
    @chandruts.2579 5 месяцев назад

    Great video! The explanations were crystal clear and easy to follow. Thankyou Asprit Ji

  • @LeoLeo-nx5gi
    @LeoLeo-nx5gi 2 года назад +2

    Arpit bhaiya trust me this was really awesome (though I knew about indexes but still learnt many new things), Thanx ^_^

  • @bheemireddyvamsi7042
    @bheemireddyvamsi7042 2 года назад +1

    Best video ever on the indexing concept. Thanks, Arpit, expecting many more videos from you on system designing. By the way, your neat notes are enough to clear all doubts. It will be good if you could add a video on the time complexity of operations like updating and delete on millions of data. How to increase the performance of DB like partition, and segregation techniques. Yes, I read this on many blogs but feel like they are half explanatory so need to understand them in depth.

  • @Mr.Mishr4
    @Mr.Mishr4 5 месяцев назад

    Great Video ! But I just want to add one thing: when you mentioned adding age as an index and that it's sorted, this implies we can use binary search operations to find the required age's start and last values. This improves the time complexity from O(n) to O(log(n))for searching.

  • @shabnamhaque2003
    @shabnamhaque2003 4 месяца назад

    how wonderfully explained. sir you have no idea how helpful this video is.

  • @meeta890
    @meeta890 Год назад

    Thanks for a great video! It helped me gain understanding of how an index works on a table while querying. It would be great if you could also make a video on how indexes are stored eg. B+ tree and what happens behind the scenes when a query with indexed column is fired.

  • @israaezzat2353
    @israaezzat2353 11 месяцев назад

    you nailed it !! seriously its the best explanation I have ever seen for Index thank you 🙏

  • @SurajSingh-tz1wr
    @SurajSingh-tz1wr 6 месяцев назад

    A complex topic explained in such a simple words. Nice!

  • @RTXXONn
    @RTXXONn 11 месяцев назад

    wowww so simple yet so elegent.

  • @cur92lyone
    @cur92lyone 7 месяцев назад

    Arpit - you're truly talented, with such in-depth knowledge. No wonder, your career trajectory reflects that :)
    Keep rocking - onwards and upwards. ALso, Pleeeeease NEVER STOP MAKING SYSTEM DESIGNING VIDEOS! EXCELLENT! 😊

  • @adamyatripathi2743
    @adamyatripathi2743 2 года назад +1

    Maza aa gaya! Behtareen! 🔥

  • @shreyanshnanavati
    @shreyanshnanavati 2 года назад +1

    Brilliant Video Arpit :) , Thank you so much, clears alot of concepts around Indexes :)

  • @timmytesla9655
    @timmytesla9655 4 месяца назад

    Amazing breakdown of how index works.

  • @vamsireddy5051
    @vamsireddy5051 Год назад

    Really superb explanation, you not nailed normally but nailed it deeply 🙌

  • @saurabhsuman4960
    @saurabhsuman4960 2 месяца назад

    Thanks Arpit. These explanantions makes learning fun.

  • @kirankumargunturu8517
    @kirankumargunturu8517 Год назад

    How do we know or guarantee ( based on id column in this example) that our table is stored sequentially ( 17:37 ) which helped us to read first two blocks ? I am assuming id column marked as PK. You can correct me please

    • @Avinashkk360
      @Avinashkk360 Год назад

      I think the assumption is table stored in B+ tree structure.

  • @Sanjay-pu4sp
    @Sanjay-pu4sp Год назад

    Fantastic! You have a real flair for teaching mate!

  • @JardaniJovonovich192
    @JardaniJovonovich192 2 года назад +2

    Great Video, Arpit !!
    It'd be interesting if you could make a video on BTree Indexes and clustered BTree Indexes.

    • @AsliEngineering
      @AsliEngineering  2 года назад +5

      Soon :)

    • @kps4881
      @kps4881 Год назад

      Till Arpit makes one you can check out this video. ruclips.net/video/aZjYr87r1b8/видео.html

  • @kanikabhatia4171
    @kanikabhatia4171 7 месяцев назад +1

    Would love an answer on below query:
    If there are 34 records with age 23 in the entire db of 100 users and worst case spread across all 34 blocks; wouldn't indexing use more time - 2s for index + 34s for fetching 34 records from 34 block?

  • @vasiovasio
    @vasiovasio Год назад +2

    Great video. I just want to tell you that you have a typo on your thumbnail image - "indices" instead of indexes.

  • @97vaqasazeem
    @97vaqasazeem 2 года назад +1

    Thanks alot i learn alot from your content it is clear and to the point !

  • @shubhamkhatik2012
    @shubhamkhatik2012 9 дней назад

    this is called ... how to teach ... super....

  • @pratibhasharma4006
    @pratibhasharma4006 Год назад +1

    Inspiring content. Thanks a lot sir

  • @abhishekvishwakarma9045
    @abhishekvishwakarma9045 Год назад

    Really liked how you explain things so easily with that much detail, totally awesome arpit sir 😁

  • @Shivam-sl4sp
    @Shivam-sl4sp Год назад

    This is just too good! Thanks man.

  • @DemystifyFrontend
    @DemystifyFrontend Год назад

    I am lucky that I stumbled upon your channel it's really asli engineering in the true sense
    I really appreciate such content which touches the foundation concepts
    Thank you again, Asli me asli engineering 🙏🙏🙏

  • @varunkoranne6872
    @varunkoranne6872 Год назад

    The content and explanation is just too awesome

  • @mahimanbissa8308
    @mahimanbissa8308 Год назад +2

    Hi Arpit,
    I didn't understand the part after we get the IDs after applying indexes. How will it search the IDs (buffered from the indexing) in the non-indexed database?
    in the example you provided while calculating query time after indexing, the two IDs you searched for were 7 and 2. And in your example, 7th ID occured in the first block while the 2nd ID was in the second block. What if the 2nd ID was in the 4th block? Would the database search in every memory block i.e. 2nd and 3rd and then 4th until it gets the 2nd ID?

    • @AsliEngineering
      @AsliEngineering  Год назад

      Read about B+ trees and how they work. That will help you understand how rows are fetched.

  • @parambharti7095
    @parambharti7095 Год назад

    Very informative video. Thanks @Arpit Sir.

  • @raj_kundalia
    @raj_kundalia Год назад

    Thank you, Arpit!

  • @biswaMastAadmi
    @biswaMastAadmi Год назад

    Beautiful explanation brother!

  • @muskan_bagrecha
    @muskan_bagrecha 4 месяца назад

    This just piqued my interest. Guess I am going to dive further into these topics. Thanks Arpit.

  • @abhishek3835
    @abhishek3835 2 года назад +4

    Hi Arpit. When we used the age index table, we got row ids = {1,4}. Why did we only require 2 blocks search to find records with ids = 2 and 4? How did it know only those 2 blocks will contain rows with id 2 and 4, shouldn't all blocks be searched for this?

    • @AsliEngineering
      @AsliEngineering  2 года назад +7

      Using the index we narrowed down the row ids we are interested in. When we go through the table to look for the rowwith those IDs we can leverage the storage structure used to store the table. This is where the magic of B+ trees come in for the relational databases.
      By default all the tables/databases are structured to make the lookup by primary key very efficient and this is something that powered by the core storage structure (B+ trees for relational databases). So when we are looking for the relevant rows with particular IDs the database engine would know which all block to reads and how to read them [typically done by managing and maintaining the disk offsets].
      This makes are entire process super-efficient and not needing us to read entire table to find rows with specific IDs.
      Hope this helps.

    • @abhishek3835
      @abhishek3835 2 года назад

      @@AsliEngineering Thanks for the reply. What you said makes absolute sense but I had another doubt after reading this. What if a table doesn't have a primary key, can indexing still work in this case? If yes, what will the index table point towards now as there is no primary key to point towards?

    • @suraj-upadhyay
      @suraj-upadhyay 2 года назад

      Thanks for the explanation bro

    • @manchukondamanoj
      @manchukondamanoj 2 года назад

      Thanks for asking this, I had the same doubt.

  • @abde2849
    @abde2849 Год назад +1

    Hey Arpit - When db engine have to read multiple blocks, is that operation sequential or multi-threaded? Sequential can be too slow right? This is regarding example for reading 34 blocks taking 34 seconds...

  • @gulabsinghsikarwar7447
    @gulabsinghsikarwar7447 4 месяца назад

    Great in depth Explanation ! .

  • @myironlung9651
    @myironlung9651 3 месяца назад

    Thanks! You're a nice teacher!

  • @soodankit1993
    @soodankit1993 Год назад

    @AsliEngineering I have couple of questions here: 1. You mentioned that age index will be sorted in some order. Does that mean during insertion it'll try to move records to identify the correct place for the inserted record to maintain the sorting order. Will it have negative impact on performance. 2. In your example, ids with age 23 was exhausted by block 2 but what if there was one more record which is part of last block. In that case the response time will go much higher. Please help me with the answers in order to clarify all these doubts.

    • @AsliEngineering
      @AsliEngineering  Год назад

      that's why you have B+ trees. I have a video about it on my channel

  • @udayakumarsakthivel6285
    @udayakumarsakthivel6285 Год назад

    You made index content as so simple 🤪. Great explanation 🎉

  • @chandrachurmukherjeejucse5816
    @chandrachurmukherjeejucse5816 10 месяцев назад

    I don't know why channels uploading 100 videos on z-index/ basic js are more popular than this goldmine. As always great content sir

    • @AsliEngineering
      @AsliEngineering  10 месяцев назад

      Thank you for resonating. As you found the stuff I put out helpful, it would be great if you could spread a word across socials. It would help people become more curious and overall better engineers.
      Thanks again!

  • @SabyasachiMohanty-x3q
    @SabyasachiMohanty-x3q Год назад

    Thanks bro!! Loved the content... great one!!

  • @prashantgupta387
    @prashantgupta387 2 года назад +1

    12:51 great content 😁

  • @adwaithssundar1620
    @adwaithssundar1620 Год назад

    Superb explanation!!

  • @deepakkothari2850
    @deepakkothari2850 Год назад

    This was wonderfully explained 🙏

  • @praveenchouhan6388
    @praveenchouhan6388 Месяц назад

    amazing explaination, keep it up!!!!!!!

  • @prachiagrawalcipher
    @prachiagrawalcipher 2 года назад +4

    If id corresponding to age==23 were present in all the disk blocks,34 blocks then read query will become slow in this case. It would have taken 36 seconds. How to optimize that?

    • @AsliEngineering
      @AsliEngineering  2 года назад +5

      Excellent question.
      A database engine can optimize this by not iterating over index when it finds the cardinality of resultset is almost equal to the number of table rows. This is why every database has the query optimization phase.

    • @prachiagrawalcipher
      @prachiagrawalcipher 2 года назад

      @@AsliEngineering will spend some time reading about this query optimization phase

    • @sahil0094
      @sahil0094 2 года назад

      @@AsliEngineering but even to find cardinality of result set on index search, it would still can process all blocks, right? So anyhow time is wasted

    • @rohansharma8618
      @rohansharma8618 2 года назад

      @@prachiagrawalcipher But what about time consumed in maintaining index in ordered form

  • @rahulkumarmishra1638
    @rahulkumarmishra1638 2 года назад +3

    Hi Arpit, Great Content Thanks, I Have This Doubt What If Each Block Has A Entry Where Age==23, So Our result buffer will hold all the IDs and each id will be stored in a different block, so in that case, our total will be 34 sec for reading all blocks and 2 sec for index scan so total time is 36 sec but in previous approach without an index it was 34 sec, so how to deal with such situations.

    • @AsliEngineering
      @AsliEngineering  2 года назад +4

      During query evaluation, database runs an optimization and checks the best approach. It does this by maintaining the cardinality of each indexed attribute.
      Here as it would see a skew, it would not use index for evaluation.

    • @rahulkumarmishra1638
      @rahulkumarmishra1638 2 года назад

      @@AsliEngineering
      Got it Arpit Thank You 😊

  • @ayushjindal4981
    @ayushjindal4981 3 месяца назад

    Very nice explanation !!

  • @420lomo
    @420lomo 2 года назад

    Recently discovered your channel, very engaging content! Thanks

  • @PrdpPtl-j4b
    @PrdpPtl-j4b 7 месяцев назад

    I think it's bit a wrong
    1) when we find the age in physical table that time process not read all table like name, age, bio and totalblogs, read only things age. This means read by default 2 block mean 2 second.
    2) In above case they created non clustered index without primary key ( clustered index ) this means, in disk data stored in inserted order and create by default RID.
    And when create non clustered index on age that time it's map sorting age and RID of the record,
    Now when we use search age==23 than process go through B-tree (binary tree - root and index node make decisions and go to leaf node and give the RID - leaf node contain group of RID or clustered index ) that create on age (non clustered index) and process go through it's like binary search and find equivalent RID. Now we have group of RID actual pointer address now we go through physical address and retire data.

  • @sanjeevsingh-zz2hg
    @sanjeevsingh-zz2hg Год назад

    5:07 why it so? why full block size is read instead of the one which we are interested in?

    • @AsliEngineering
      @AsliEngineering  Год назад

      Because that is how disk IO works. When you read anything from the disk, your file system reads the entire block (that contains the intended region).

  • @roshnigupta9296
    @roshnigupta9296 2 года назад +2

    Hey Arpit, amazing explanation !!! I had a doubt that is it safe to assume that no matter what the Query is(considering only select queries with where conditions and not joins ), the time taken to execute the query will always be constant for a given table, in this case, 34 sec, without the indexes, or are there other parameters which can make the execution time to vary with significant differences, when queried different columns.

    • @AsliEngineering
      @AsliEngineering  2 года назад +6

      The query execution times depends on a lot of other factors but we can certainly say that the pointed queries will require the same number of Disk IOs.
      The query execution time will depend on the number of rows being fetched, current load on the DB, IO tuning of the DB, Cache settings, etc.
      One thing that would impact the lookup time the most is caching policy and buffer size which would mean a large part of index can be kept in memory leading to quicker execution.

  • @godbhaigod
    @godbhaigod 5 месяцев назад

    If Index is stored in separate block, then in MS SQL serve, Why Rows are sorted inside a table when we define clustered index and Rows remain unsorted when we define Nonclustered index?

  • @harshildoshi3299
    @harshildoshi3299 Год назад

    Loved the way you simplified the concept. Would you recommend any book(s) for the concepts includes Indexing, Cursor, Query Optimization etc?

  • @navinmittal4809
    @navinmittal4809 2 года назад

    Hi Arpit, I got little confused when you said index is on id column instead of age column at time 13:26. Later it got clarified that it's age column.

    • @AsliEngineering
      @AsliEngineering  2 года назад

      My bad. Flow flow me ho gaya. Will be extra careful going forward.

  • @aruprakshit7218
    @aruprakshit7218 5 месяцев назад

    Why don't you have DSA course? Your teaching is top-notch.

  • @RutwijGhadge
    @RutwijGhadge Год назад

    Nice and clear explanation 👍

  • @harshadommeti4936
    @harshadommeti4936 5 месяцев назад

    19:04 , how does it know which I’d Is stored in which block?

    • @AsliEngineering
      @AsliEngineering  5 месяцев назад +1

      Data is ordered by PK and the non lead nodes hold the range. A classic B+ tree implementation.

    • @harshadommeti4936
      @harshadommeti4936 5 месяцев назад

      @@AsliEngineering thanks for clarifying😊

  • @jpnr8
    @jpnr8 2 года назад

    Thank you for lesson.. its awesome..

  • @HowTo-bk2jn
    @HowTo-bk2jn 2 года назад

    Great explanation!

  • @navinmittal4809
    @navinmittal4809 2 года назад +1

    Do you have any suggestions on how to optimize writes/updates if there's an index column? You can assume you aren't updating index column value, but there are millions of rows to update.

    • @AsliEngineering
      @AsliEngineering  2 года назад +3

      If the updates does not change the values on the index column there will not be any write happening on the index. This evaluation is taken care by the query optimization stage.
      In general when a massive update is run, there is a possibility of table (if not rows) are locked in order to protect and give a consistent view of the data to transaction happening at that moment.
      One way to ensure your DB performance is not hampered much because of massive update is by breaking it into micro transaction if possible. This way the db will be locking a few rows at a time.
      Let me know if you have any other special use-case which I did not cover. Happy to answer.

    • @santoshbhatnagar2155
      @santoshbhatnagar2155 Год назад

      Can you share an example for the question being asked please.

  • @yusufmirkar6508
    @yusufmirkar6508 Год назад

    we used index for integer column so index fit in 2 blocks, if we have used string as index column (60 or 128 bytes), index would have been much larger, and hence take many blocks , right ? (60 + 4) * 100 = 6400 = 11 blocks

  • @gauravraj2604
    @gauravraj2604 2 года назад +1

    Hi Arpit,
    Amazing explanation. Thanks.
    I wa wondering how this block size (600B in your case) is decided?

    • @AsliEngineering
      @AsliEngineering  2 года назад

      Typical size if 4kb. I took 600 as an example. Not too small not too large is the only criteria

  • @its__aakashg_
    @its__aakashg_ 4 месяца назад

    Thankyou .. it was really informative

  • @abhijitmitra4665
    @abhijitmitra4665 Год назад

    Hi Arpit,
    Really thanks for creating such wonderful content and including how disk space getting managed for database.
    Curious to know what happens the blocks, when we add new column or change datatypes of column. Because in that case overall row size will change, and if data is stored in disk sequential order, then how databases internally manage that size change. If we consider a row which was earlier of size 200Byte and then later converted to 900Byte or so, I assume majority of the block reference which clustered index was holding will also change. That looks like huge write operation. Curious to know how that gets managed internally??

  • @ShubhamGupta3110
    @ShubhamGupta3110 Год назад

    for alter table scripts when we add columns..does block allocation gets redone ?

  • @HimanshuKumar-xz5tk
    @HimanshuKumar-xz5tk 2 года назад +1

    Lets say you got the ids from index - [1, 4]. You still need to iterate through all the blocks to check whether 1 or 4 is present in that row or not. Explanation seems incomplete to me. How do we get block from ids?

  • @gauravraj2604
    @gauravraj2604 2 года назад +1

    Maybe I missed that but I didn't get how system is deciding that ids (1,4) are available in block 1 and block 2? Are we checking 1 by 1 in a linear-search fashion?

    • @gauravgupta5530
      @gauravgupta5530 2 года назад

      I have also this same question, how the processor will decide which block it needs to read to get the data it founds from indexes. @Asli Engineering by Arpit Bhayani.

    • @kps4881
      @kps4881 Год назад

      If you look at B+ Trees you will get the idea. The B+Tree stores the location of the disk block(shown here as the id for easy illustration). So instead of 1 and 4 the address/offset to the disk is stored and then those blocks are read by the engine.

    • @gauravraj2604
      @gauravraj2604 Год назад

      @@kps4881 thank you for reply. But can you please explain a bit more in terms of address/offset and block 1 / block 2?

    • @kps4881
      @kps4881 Год назад

      @@gauravraj2604 The leaf node of the B-Tree or B+Tree is a pointer to the offset of the disk block which contains that actual data. So once you traverse the tree and reach the leaf, you get the pointer to the disk block that is used to read the data from the disk directly.

  • @karanbangia14
    @karanbangia14 Год назад

    Can you tell how you choose indexes optimally?
    If i have 3 columns
    Id, fistname, lastname and i want to query by firstname , bylastname,byfirstand lastname how many indexes and what index should i make?

  • @KHUSHI-zd2hr
    @KHUSHI-zd2hr 8 месяцев назад

    does this mean that dbms creates an index for the column when a query is run on that column? i understood how indexing works. i just dont understand at what point an index table for each of the columns in a db is created.

    • @AsliEngineering
      @AsliEngineering  8 месяцев назад

      You are required to create indexes on columns that you will be querying. You typically do this while creating the table, or after that by firing CREATE INDEX query.

  • @subasm2160
    @subasm2160 20 дней назад

    Hi sir, I am new to system design I have a doubt.
    Without indexing we have to search all the rows linearly to get the data which is not so efficient, but even with indexing the example u have given, to get the list of ids, we need to read 2 blocks and if the ids are like this 1,4,7,12 means if all the ids are present in different block then it is same as before , because now we are reading all 34 blocks and 2 blocks extra to get the correct ids, which seems worse. Please explain this. Thanks ❤

  • @soni.himansh
    @soni.himansh Год назад

    Brilliant Video :)

  • @shrutimath3330
    @shrutimath3330 Год назад

    Amazing content 🙌

  • @theoptimaldeveloper
    @theoptimaldeveloper 2 года назад

    one thing that is clear from this is indexing comes with the cost of making writes slower. But Can it make read slower also? Like In cases where we have to return almost entire table?

    • @Avinashkk360
      @Avinashkk360 Год назад

      In case where we have to return almost entire table, query optimizer may choose not to use indexes

  • @KK-kl7se
    @KK-kl7se 2 года назад

    Good explaination , btw what tool are you using in your explanation @Arpit

  • @niteshrawat576
    @niteshrawat576 9 месяцев назад

    Question: What if one of the fetched index is 1000, now fetch the data block where this row is present a sequencial scan till that data block happens or is there some other way?

    • @niteshrawat576
      @niteshrawat576 9 месяцев назад

      Ahh, with a little researching I got to know that index also holds the "rowId" which is not the primary key but the internal id to identify which diskstore the row data might be present. This make sense to me but what confirmation if that is indeed the case.

  • @abhishekpathania7093
    @abhishekpathania7093 2 месяца назад

    1. Are indexes sorting again & again on inserting new data.
    2. then does data also move from 1 block to other block after index resorting.

    • @AsliEngineering
      @AsliEngineering  2 месяца назад +1

      1. They use data structures that keep data ordered. No need to sort again and again. Check b plus trees
      2. Yes. And that's rebalancing phase of b plus trees

  • @rhul0017
    @rhul0017 4 месяца назад

    but how do you find that id 4 is in 2nd block 18:31??

    • @AsliEngineering
      @AsliEngineering  4 месяца назад

      Intermediate nodes hold the range and where in the child nodes they can find it.

    • @rhul0017
      @rhul0017 4 месяца назад

      @@AsliEngineering didint understood the point

  • @akashagarwal6390
    @akashagarwal6390 Год назад

    after getting row id, how does the DB engine know which row to fetch exactly (assuming its not a PK or even if its a PK, in both cases, if it makes a diff) & from which block on disk?