The Secret Sauce Behind NoSQL: LSM Tree

Поделиться
HTML-код
  • Опубликовано: 23 ноя 2024

Комментарии • 197

  • @Kxneki2433
    @Kxneki2433 9 месяцев назад +60

    IMPORTANT: Don't forget the Memtable is stored in memory, so if the system crashes, that data will be lost. To avoid losing data, we can maintain a separate log file on disk. Every time we write to the Memtable, we'll also append that write to the log file (no need to sort it as we just use it to restore after a crash). Then once the Memtable contents get written out to a SSTable file, we can erase the log file. That way the log helps us avoid losing writes stuck in memory when a crash happens.

    • @jerichaux9219
      @jerichaux9219 5 месяцев назад +1

      This seems like a fairly important detail.

    • @sujithmenonp
      @sujithmenonp 4 месяца назад

      Kafka is used as the logging system by default in most scenarios

    • @xijinping5064
      @xijinping5064 Месяц назад +2

      a write-ahead log.

  • @JoelBrubaker
    @JoelBrubaker 2 года назад +184

    This is the perfect amount of depth and overview I’m looking for. Great videos and visuals!!

  • @CommandantNOVA
    @CommandantNOVA 2 года назад +54

    LSM trees are actually used a lot in modern SQL databases as well. the idea is to represent a relational table as a series of packed records - [k: primary key v: field 1, field 2]. Indices can be created on other fields by creating additional k:v pairs - [k: field 1 v: primary key] so an index scan just becomes two NoSQL lookups for each result.

    • @romannasuti25
      @romannasuti25 2 года назад +11

      Spot on. One thing to keep in mind is that all databases, SQL or noSQL, are based on the principles of key value operations and the transaction consistency algorithms that allow for data integrity. The core calculus and proofs behind these systems are all done in the perspective of only two types of operation: read and write, each to some concept of a key.
      These algorithms vary wildly and come with varied pros and cons, some with very weird limitations that actually DO prevent SQL semantics (the FaunaDB and FoundationDB distributed algorithms require the W/R set to be known before starting the transaction, and SQL requires conditional and unbounded W/R sets to fully abide by the spec).

    • @greyowl3787
      @greyowl3787 2 года назад +2

      @@romannasuti25 what do you mean by “W/R set” ?

    • @romannasuti25
      @romannasuti25 2 года назад +1

      @@greyowl3787 write/read. Basically, Fauna and Foundation use a distributed transaction system that can only provide full ACID guarantees with exact read and write keys known before the transaction even starts. This means dependent querying is usually not possible, like "SELECT name IN users WHERE id = (SELECT friend IN users WHERE name = x);" simply because the result of the inner query changes the write set of the outer query based on its result. In some cases a good query engine can narrow down to a fixed "maybe W/R" set if the results are well bounded (either single element or known countable range), but generally they don't bother with SQL support for this reason.

    • @akash-kumar737
      @akash-kumar737 2 года назад +3

    • @ismailmo4
      @ismailmo4 Год назад +1

      @@greyowl3787 write/read i assume

  • @sealuke2724
    @sealuke2724 2 года назад +96

    Bruh, this is just awesome... keep going

  • @phanphong3533
    @phanphong3533 2 года назад +22

    This guy is better than my teamlead in term of explaining a concept on NoSQL, thank you , make my day

  • @mr_nature
    @mr_nature 2 года назад +25

    I appreciate your efforts. Thanks for making system design more palatable than ever.

  • @maxil122
    @maxil122 2 года назад +15

    That's the best system design content I have ever seen on youtube ! This channel is absolutely amazing. It must be tough to squeeze all that valuable knowledge into less than a 10 minutes video. Keep up the excellent work!

  • @bazoo513
    @bazoo513 2 года назад

    This is the most information-dense video on what's so special about NoSQL databases I ever saw. Not a word was superfluous, but the key concepts were clearly transmitted.

  • @nishantgoel769
    @nishantgoel769 9 месяцев назад

    One of the concise video to understand how elastic/lucene is using these things to fast write and read.
    Great work man!

  • @danielgospodinow
    @danielgospodinow 9 месяцев назад

    I can't recall the last time I stumbled upon such great material! Fascinating work!

  • @rafaelacioly3252
    @rafaelacioly3252 2 года назад +1

    This channel is by far the best channel that I've found on yt about tech

  • @BRBearUSA
    @BRBearUSA Год назад

    VERY informative without going into too much complexity. THANKS and congrats for a great video. I'm an MS SQL Server DBA, and the high level explanation you provided was awesome. Thanks again. Best, R.

  • @richsadowsky8580
    @richsadowsky8580 2 года назад

    Really awesome overview of how SQL and NoSQL differ. Agree with Joel below me just the right level of detail to provide value.

  • @Andrew-rc3vh
    @Andrew-rc3vh 2 года назад +11

    That's a cool trick. So what it is essentially doing is spreading out the computer's resources across time, where with a traditional database you will get lots of spikes in processing on the timeline when doing reads and writes. Mind you the attraction of SQL was that you could have multiple indexes and create custom views on the data in a highly relational way, but granted that was very expensive on resources. I think traditional databases were mostly optimised for the deficiencies of the mechanical hard disk drive. I think it may end up as redundant in future as we store our data more on chips.
    Thanks for the video. I like videos that don't waste your time with long BS intros.

    • @BRBearUSA
      @BRBearUSA Год назад +1

      You read my mind when it comes to "spreading the computer's resources across time"... Which for some use cases makes perfect sense, but not all use cases.
      Not sure I agree with the "deficiencies of mechanical HDDs" part of your comment though. But great comment overall.

  • @arthursoares610
    @arthursoares610 2 года назад +12

    The dark mode was awesome. I think it could be the default from now on

    • @guptaanmol184
      @guptaanmol184 2 года назад

      I came here to say this! +1 for dark more, finally!

  • @AminAramoon
    @AminAramoon 2 года назад +12

    These videos are superb man, keep up the good work

  • @RajinderYadav
    @RajinderYadav 10 месяцев назад

    I love these byte sized video, helps introduce new concepts.

  • @arno.claude
    @arno.claude Год назад

    This channel is such a gold mine!

  • @susingh2
    @susingh2 2 года назад

    I never experienced such a communicative video with such a simple and easy explanation.. Thanks Alex ..Please keep it up and upload more such videos. I have anyway bought both volume of System Design Book. Thank you so much !!!

  • @frederickbarbarossa2746
    @frederickbarbarossa2746 Год назад

    besides bloom filter, a sparse index is to help find a key quickly so we only look into a small number of sstables

  • @bardhan.abhirup
    @bardhan.abhirup 2 года назад +8

    These videos are incredible! Very well paced and presented!

  • @DK-ox7ze
    @DK-ox7ze Год назад +1

    Great explanation. Resolved all my doubts on how NoSql DBs work. However, I wanted to understand
    1) Whether the balanced tree and keys in sorted set is only the object key with pointer to data value or it also contains the actual data?
    2) Can a NoSql DB index multiple keys?
    3) Why can't SQL DB also implement flushing mechanism in order to speed up writes? I know that they are highly consistent so they need to persist data to disk, but they can simply append the entry in a log file just like NoSql DBs do, and in case of a network partition, first check the log file to sync data in actual database?

    • @bojandolinar1535
      @bojandolinar1535 Год назад

      Re 3 afaik that's what they already do. But sooner or later it has to write them to b-tree, which I guess is the real bottleneck.

  • @CppExpedition
    @CppExpedition 2 года назад +2

    LOVE THE DARK BACKGROUND! :D:D:D:D (Also the video of course!!)

  • @lechprotean
    @lechprotean 2 года назад +20

    great, you make it sound so simple, I'm writing my own nosql db this weekend ;)

  • @bobdinitto
    @bobdinitto 2 года назад

    I've often wondered how NOSQL databases can achieve higher write throughput than relational DB. Thank you for sharing the techniques involved. Your explanation is clear and the graphics are excellent!

  • @shoobidyboop8634
    @shoobidyboop8634 2 года назад

    Stuff like this is the future of many forms of education.

  • @anilkaliya3375
    @anilkaliya3375 11 месяцев назад

    Nicely explained. I read all this information in designing data intensive application book. But few topics like memorable and sstables were still bit unclear to me. Got the whole idea now. Great stuff. Keep going

  • @Konservator69
    @Konservator69 2 года назад +3

    Good video. For the further topic development it'd be interesting to see a LSM tree vs Redis RDM/AOF persistence schema comparison.

  • @AndyThomasStaff
    @AndyThomasStaff Год назад +1

    Great explanation, thank you, this helped reinforce my learnings about LSM-trees from reading. The graphics were especially helpful

  • @balaclava351
    @balaclava351 2 года назад

    Great video. I'm a junior dev that has to implement a chat feature in the next few weeks. This really helped me understand NoSQL. Thanks.

  • @pashachechehov3483
    @pashachechehov3483 Год назад

    Great visualization of "Designing Data-Intensive Applications" book

  • @LeoFuso
    @LeoFuso 2 года назад +1

    Great work! I wish I had watched this video before trying to learn the architecture behind RocksDB by only looking their documentation, hahaha! Awesome work!

  • @shakedrosenblat1925
    @shakedrosenblat1925 11 месяцев назад

    Thank you. great video, as always.
    I'd like if you guys could go into more detail

  • @modolief
    @modolief 2 года назад +1

    Sublime, superb, excellent ... again.

  • @alamelu85
    @alamelu85 2 года назад

    Alex - Your lectures and contents are great assets to software engineering community.

  • @GyroCannon
    @GyroCannon 2 года назад

    Not at all a question I had, but glad I watched because I extensively use Mongo for my own app

  • @arunkutube
    @arunkutube 8 месяцев назад

    great explanation for beginners

  • @tomtomsiesie5436
    @tomtomsiesie5436 2 года назад +1

    Another amazing video! This format is so vaiuable.

  • @AllOfMyWat
    @AllOfMyWat 2 года назад

    I can't wait for my next systems design interview!

  • @ibgib
    @ibgib 2 года назад

    Like many have said, this is a great dense video at a good beginner depth for people like me. I hope there will be a relational db video that complements this if at all possible, since the quality level is so high 🤞

  • @carolinegr
    @carolinegr 2 года назад +4

    One additional thing to mention (around ruclips.net/video/I6jB0nM9SKU/видео.html perhaps) is that the writes are written to memory AND a transaction log to ensure durability. Otherwise whatever was in the MemTable will not persist after a crash. The transaction log can be replayed to rebuild the MemTable.

    • @ByteByteGo
      @ByteByteGo  2 года назад +5

      Thank you for the feedback.
      We did have that initially, but decided to take it out to focus on the LSM tree itself.
      We knew someone would bring it up, but didn't think it would take this long. 😂
      We are glad you did.

    • @ibgib
      @ibgib 2 года назад +1

      I was actually wondering if this were a difference with what I understand of relational dbs. Thanks for pointing it out.

  • @obedgennius1401
    @obedgennius1401 2 года назад +1

    I really appreciated these videos !! Thanks you very much
    I would to know what software are you using to produce such presentation 🙏

  • @eliyahubasa9401
    @eliyahubasa9401 2 года назад +1

    Great Content, thank you very much.
    I'm already waiting for Tuesday for a new video, as much as I wait for a new One Piece episode.

  • @TheOnlyRealBreadIntheWorld
    @TheOnlyRealBreadIntheWorld 7 месяцев назад

    amazing content, thank you for all the hard work you do!

  • @quirkyquerty
    @quirkyquerty 6 месяцев назад

    in their book, bloom filters are stored on disk, but here they're shown to be in memory. Hopefully we'll get some eventual consistency

  • @doxologist
    @doxologist 2 года назад

    Perhaps the best educational systems content on the whole of RUclips right now. Great stuff

  • @sahilchadha9621
    @sahilchadha9621 3 месяца назад

    Please create a video on size tiered and level compaction strategy as well

  • @roct07
    @roct07 Год назад

    This is so high quality. Thank you :)

  • @BiggRanger
    @BiggRanger 2 года назад

    Excellent presentation!

  • @sampleshawn5380
    @sampleshawn5380 2 года назад +1

    Man this video is awesome, so much information loved it!

  • @plussin2760
    @plussin2760 8 месяцев назад

    LSM Tree에 대한 이해와 작동 방식에 대한 개요를 알 수 있었습니다. ㄳ 합니다

  • @faris_id_music
    @faris_id_music 2 года назад

    one of the best videos so far

  • @jiamingliu192
    @jiamingliu192 2 года назад

    Absolutely amazing. Thank you for making the video!

  • @singhsaubhik
    @singhsaubhik 2 года назад +2

    This is an awesome overview of LSM tree. If someone wants dig deeper read "Design Data intensive applications".

  • @paramvirsingh5640
    @paramvirsingh5640 2 года назад

    Looking at how beautifuly my tired brain understood this, this video deserves a noble prize.

  • @ThangNguyen-je8kv
    @ThangNguyen-je8kv 2 года назад

    The next step after watching this video is to … watch it again 😂. Awesome as always.

  • @jrabelo_
    @jrabelo_ 2 года назад

    perfect explanation, thanks

  • @thelonearchitect
    @thelonearchitect 2 года назад +4

    Thanks for the video. Your explanation rises a concern to me : since the memtable is in memory, what happens if the server crashes before flushing ?
    Is that memtable distributed or replicated ?

    • @lwfeagan
      @lwfeagan 2 года назад +2

      Cassandra, for example, still has a write ahead log.

    • @nitinagrawalbst
      @nitinagrawalbst 2 года назад +1

      Generally for the memory table write ahead log is maintained. Once the memory table is moved to create SSDTable write ahead log is deleted. In case of crashes write ahead log can be used to restore the memory table.

  • @javisartdesign
    @javisartdesign 2 года назад

    very well explained. Thanks

  • @sophiiisticated
    @sophiiisticated 2 года назад

    I think the movie is recommended to left 30 seconds instead of 10s for the ending credit because the next suggestion video covers (in the left bottom side) in the summary time

  • @aus10d
    @aus10d Год назад

    Very interesting! Loved this video

  • @jiankuang9890
    @jiankuang9890 3 месяца назад

    Question regarding the books: volume 1 and volume 2 are considered as two separate editions for the same content or the same edition for different contents? Another way to ask: Should I but both volumes or just volume 2?

  • @Marcus-yc3ib
    @Marcus-yc3ib Месяц назад

    Thank you very much.

  • @peepeepoopoo2243
    @peepeepoopoo2243 2 года назад +1

    Great video!

  • @galeop
    @galeop Год назад

    Amazing video
    1:15 what is that "object key" ? The row key that is being edited/added to the keyspace ?

  • @mnchester
    @mnchester 2 года назад +1

    amazing video!

  • @paddyd7642
    @paddyd7642 2 года назад +1

    Thank you! When you say sorted, is it by some object id or time?

    • @big0bad0brad
      @big0bad0brad 2 года назад +2

      Object ID, unless you design the system to use a high precision timestamp as the id, which could maybe be an interesting idea. If the Object ID is a timestamp, then object creations are already sorted which could further boost write performance, though there is probably no advantage in the case of updates or deletes.

  • @adamaiken00
    @adamaiken00 Год назад

    This is a great video. If I want ti go further is there any good reference for nosql?

  • @DarknessGu1deMe
    @DarknessGu1deMe 2 года назад

    What would be a good example of an application benefiting from "fast write slow read" property of NoSQL DB? Based on what's presented, I'd say most user-facing application, like a typical service a startup would build (e.g. personal calendar organization, etc) doesn't sound like a good fit given reads are pretty important in user-facing traffic.

  • @santozard
    @santozard 2 года назад +1

    Freaking intuitive talk.

  • @MostafaZeinali
    @MostafaZeinali 2 года назад

    Thank you for the great video. Keep up the good work. ❤

  • @ziakhan-tk7rk
    @ziakhan-tk7rk 2 года назад +4

    What software do you use for animation in your videos?
    I am very curious

    • @wrondonparticual5113
      @wrondonparticual5113 2 года назад

      I want to know as well. It is perfect!!!

    • @biwer-r
      @biwer-r Год назад

      @@wrondonparticual5113 Maybe this is something :-) ruclips.net/video/H5GETOP7ivs/видео.html

  • @aaronprindle385
    @aaronprindle385 2 года назад

    Awesome work, thanks for this!

  • @StephenGillie
    @StephenGillie 2 года назад

    Very cool. Could have
    - one process just taking DB writes and putting them in memory
    - another writes too-big variables to files on disk
    - next would go through files and flatten them (like continuous truncate/shrink on a transaction log)
    - last would take DB reads and go through the memory, bloom filters, and file structure to find and return the requested data.

  • @abhijit-sarkar
    @abhijit-sarkar Год назад

    How is a key-value actually stored and retrieved from disk? Although a SSTable is sorted within itself, since the keys in different SSTables are not related, how does a read request find a key? Surely not by trying all SSTables one by one, that'd be too slow.

  • @axa993
    @axa993 2 года назад

    Awesome overview.

  • @RandomNullpointer
    @RandomNullpointer 6 месяцев назад

    Any idea how Domino databases work in this regard? Do they follow the same concepts? Is the compaction process "Tiering" or "Leveling"?

  • @noahgsolomon
    @noahgsolomon Год назад

    awesome explanation

  • @chandragie
    @chandragie 2 года назад

    I don't really understand them, but it's cool!

  • @willl0014
    @willl0014 Год назад

    So much knowledge!!!

  • @rbelatamas
    @rbelatamas Год назад

    great explanation ❤

  • @seattle_bach
    @seattle_bach 2 года назад

    great explanation!

  • @wave9303
    @wave9303 Год назад

    hi , can you do a video on how Splunk is use for devops and how it storing its data ?

  • @VirgoBoy2991
    @VirgoBoy2991 2 года назад

    Hi, i just wonder why the second volume has not been available in kindle yet :( i'm from Vietnam and shipping is expensive

  • @wingforce8530
    @wingforce8530 Год назад

    Actually, many modern SQL are also LSM Tree based, it's not limited to no-sql

  • @adatalearner8683
    @adatalearner8683 2 года назад

    Is the concept of SS tables specific to a specific type of NOSQL database(Cassandra), or generic ?

  • @thomasrobin
    @thomasrobin 2 года назад

    Amazing Content! Does anyone know how these animations are made?

  • @allisonmachado
    @allisonmachado 2 года назад

    awesome video indeed! thank you

  • @chaoluncai4300
    @chaoluncai4300 2 года назад

    this is brilliant! I'm also wondering would this amount/level of knowledge for an advanced DS is enough for tech/system design interview? Obviously I think the interviewer won't ask for implementation so... ig im trying to know how deeper do we need to go than e.g. this channel's few minutes videos?

  • @swaroopas5207
    @swaroopas5207 2 года назад +1

    When searching why does it need to search at all levels? It needs to search at the latest level right? Can someone help me understand this.

    • @karolinaa9478
      @karolinaa9478 2 года назад

      The keys, when compacted, are deleted from their original level, and added to the next level.

  • @ethanmye-rs
    @ethanmye-rs 2 года назад

    Thanks! One of the things I find difficult to find good information on is structuring data. Given I want these x properties, how do so arrange the information to get them, and what technologies are required to do it.

  • @thisisnotok2100
    @thisisnotok2100 2 года назад +1

    yeah I freakin love this channel

  • @Chauhannitin
    @Chauhannitin 2 года назад

    Very good animation

  • @caseyspaulding
    @caseyspaulding Год назад

    Thank you!

  • @stackunderflow5951
    @stackunderflow5951 2 года назад

    The explanation of why SStables have to be sorted is not sufficient. In my opinion, the reason is that allowing the SSTables to be sorted allows faster merging and allows the index for the table to be sparse. It doesn't really allow faster table retrieval since running a binary search on disk is not so efficient.

  • @KostiantynKostin
    @KostiantynKostin 2 года назад +2

    6:56 correction regarding Bloom Filter
    If the Key doesn't exist, bloom filter will NOT return False Negative
    If the Key doesn't exist, bloom filter MAY return False Positive
    In other words.
    If bloom filter says there's NO key in page, there's definitely no key in page.
    If it says that there's the key in page, there MIGHT be the key in page with some probability.
    This probability data structure helps to reduce unnecessary reads.

    • @Winnetou17
      @Winnetou17 4 месяца назад

      That's exactly what is said in the video

  • @mohawkgwai
    @mohawkgwai 2 года назад

    Cassandra also has Leveled Compaction Strategy so that slide comparing it to RocksDB is a little misleading

    • @raphaelcarvalho4288
      @raphaelcarvalho4288 2 года назад

      Cassandra initially had size tiered only and later borrowed leveled from RocksDB to solve the space amplification problem, so it's not completely misleading.

  • @raviv5109
    @raviv5109 2 года назад

    Thank you so much!

  • @junkahoolik
    @junkahoolik 2 года назад

    relational databases don't use btrees, they use b+ trees. the only db i know of that uses btrees is actually mongodb.

  • @wrondonparticual5113
    @wrondonparticual5113 2 года назад

    What is the animation software used? It is beautiful