Choosing a Database for Systems Design: All you need to know in one video

Поделиться
HTML-код
  • Опубликовано: 20 ноя 2024

Комментарии •

  • @pieter5466
    @pieter5466 11 месяцев назад +34

    Two things make this video stand out for system architecture interviews:
    1) general knowledge of the available options, with arguments for and against
    2) enough in depth knowledge to go deep and impress

  • @NikitaUtkin-r7x
    @NikitaUtkin-r7x 9 месяцев назад +13

    One of the best videos of its kind.
    small inaccuracy: Hbase being wide-column store actually store column families together, not individual columns.

  • @ВалентинТ-х6ц
    @ВалентинТ-х6ц 2 месяца назад +3

    Dude, today I've passed an interview from a first try! Your videos are extremely helpful. I was just putting on a board all you talked about. I'd fail if I hadn't watched your videos. Thanks!

    • @jordanhasnolife5163
      @jordanhasnolife5163  2 месяца назад

      Congratulations!! Glad to hear the hard work paid off for you!

  • @cloudx1057
    @cloudx1057 19 дней назад +2

    finished this series, im proud of myself. you are so funny btw

  • @JeT2686
    @JeT2686 Год назад +7

    I gotta say, this summary video is great!
    As much as you dread redundancy here, I at least got a ton of value of out of it. The material is fantastic for reviews
    Kudos, and great stuff!

  • @ritwik121
    @ritwik121 2 года назад +12

    Glad you are back with system design videos😭😭

    • @jordanhasnolife5163
      @jordanhasnolife5163  2 года назад +3

      We'll see about that one buddy, these have been covered mostly

  • @mnchester
    @mnchester 2 года назад +7

    Thanks for this video man! While I agree with you that it'd be better to watch your more in depth videos, this compilation video works great for a quick recap right before going into your System Design interviews

  • @BRBallin1
    @BRBallin1 5 месяцев назад +3

    Since both SQL and NoSQL DBs are ACID compliant, the key reason to ever choose SQL over noSQL is if you want to join multiple tables with data and if your queries are more aggregate based (ex: SUM, COUNT, AVG) and records are used in a combined manner rather than to store rows of data for unrelated records.

    • @jordanhasnolife5163
      @jordanhasnolife5163  5 месяцев назад +1

      While I think you make some valid points here, I think by default everyone should want to use SQL. Not all NoSQL DBs are ACID compliant, especially in a distributed setting.
      I agree that tables that store unrelated records generally play nicely with NoSQL, but that doesn't even necessarily warrant using it unless the specific database that you choose gives you some performance improvement that you couldn't have had otherwise.

  • @luiscarlosricoalmada4296
    @luiscarlosricoalmada4296 2 года назад +4

    I finished the whole series :) , wish me luck on my System Design interview

  • @rajrsa
    @rajrsa Год назад +1

    I've gone through all your concepts and interview video and this video did a great job of summarizing everything!
    Thanks for everything, giga chad! :P
    All the best, y'all! Let's get this bread! 🚀

  • @_launch_it_
    @_launch_it_ 5 месяцев назад +5

    2.5x speed. Interview in 19 hours. Let's go

  • @prafullakh
    @prafullakh 6 месяцев назад +1

    Absolutely great work. Someday you should talk about the interview questions that you asked candidates and any interesting approaches they took and also about some interview questions that zapped you.
    PS: Towards the start of this video you asked us to get lotion and paper. What gives?

    • @jordanhasnolife5163
      @jordanhasnolife5163  6 месяцев назад +1

      1) I've never interviewed anybody, I'm a sham :)
      2) you need the paper to take notes and the lotion to keep the pencil from sticking to your otherwise sweaty hands

  • @FranckPachot
    @FranckPachot 6 месяцев назад +2

    You are sharing awesome content. Great to link to for short and acurate explanations.
    Would be great to see more on Distributed SQL (you did Spanner but there's also YugabyteDB, CockroachDB, TiDB, YDB). And on PostgreSQL compatible databases (you did Aurora but there's also AlloyDB, Neon, YugabyteDB)

  • @thktomska
    @thktomska Год назад +1

    This is what i'm look for! Great quality - thank you very much!

  • @lvmrjb
    @lvmrjb Месяц назад +1

    watched your video about why you left Google and you mentioned you're a new grad.. extremely impressive you know all of this already! any good books/resources you used? thanks for the videos!

    • @jordanhasnolife5163
      @jordanhasnolife5163  Месяц назад +1

      Thanks! I'd just recommend really reading and understanding designing data intensive application, and that should give you all the background that you need to go do more of your own research!

  • @aiman_yt
    @aiman_yt 3 месяца назад +1

    B-tree writes can go to memory too. It's called buffer pools.

  • @whirr9755
    @whirr9755 2 года назад +11

    huh, i subbed for day in the life vids 😒

  • @chits006
    @chits006 Год назад +2

    Great Video, One question, where can we learn about db schema design? Some basics and exercises would be good, any online course you recommend?

    • @jordanhasnolife5163
      @jordanhasnolife5163  Год назад +2

      I'd just look at database docs and existing engineering blogs from reputable companies!

  • @arshadhammad
    @arshadhammad 11 месяцев назад +1

    Thanks for the nice series. I really liked your videos

  • @pl5778
    @pl5778 Год назад +1

    this is an awesome video! thanks for such a great summary

  • @akhilkhubchandani2632
    @akhilkhubchandani2632 Год назад +1

    Great work and amazing video! Could you also make more low level design videos?

  • @roywastaken
    @roywastaken 2 года назад +4

    Thank you senpai 🙏🏽

  • @420_gunna
    @420_gunna Год назад +2

    16:30, I haven't heard of column compression being used for image data in the way that you describe here, any pointers on what you were talking about when you mentioned this?

    • @jordanhasnolife5163
      @jordanhasnolife5163  Год назад

      Hey so I don't actually mean to compress the images with column compression:
      I just mean having a column containing multiple images means that you only have to fetch the images themselves as opposed to potentially a lot of metadata that may come with them (if you were to fetch a row at a time)

    • @BenLernerOfficial
      @BenLernerOfficial Год назад +2

      ​@@jordanhasnolife5163 I paused the video at this point in confusion as well, because I'm afraid the example doesn't make much sense. In the query you described, you only want to get the thumbnails associated with a specific video, so you would either implement that with a relational table (full_video_id | thumbnail_id, where one full_video id is associated with one or more thumbnail_ids) or you'd store a list of the thumbnail_ids (pointing to the actual image data in, say, s3) on a document representing the full video. The only situation in which you would possibly want to store images in a column is if you'd want to somehow query ALL thumbnails across ALL videos, but that is not the situation you described - you described getting the thumbnails of a SINGLE video. That would be OLTP/row-based, not OLAP/column-based. Also, columns typically contain primitives (so you could, for example, perform an average across a column of floats)

    • @jordanhasnolife5163
      @jordanhasnolife5163  Год назад

      @@BenLernerOfficial Yes sorry, this is assuming that one video might have many thumbnails (e.g. to create one of those gifs that you see on RUclips now). Sorry this wasn't clear, everything that you've said is accurate.

    • @jordanhasnolife5163
      @jordanhasnolife5163  Год назад

      Another common use case is to load all thumbnails for a user's channel, such as if you were to click my channel page.

  • @tamarapensiero8048
    @tamarapensiero8048 Год назад +2

    Hey Jordan, just started watching every video you've created. I love them. I'm wondering how I could get in contact with you as soon as possible. Id like a couple minutes of your time if possible. Thanks x

  • @raj_kundalia
    @raj_kundalia 11 месяцев назад +1

    Thank you, Jordan!

  • @sohansingh2022
    @sohansingh2022 11 месяцев назад +2

    Thank you buddy!

  • @luli829
    @luli829 10 месяцев назад +2

    how do you gain some much knowledge in system design? really amazing!

    • @jordanhasnolife5163
      @jordanhasnolife5163  10 месяцев назад +1

      I have no life!
      No but actually, I just have optimized my knowledge specifically for the interview haha - I'm sure you all are better software engineers than me

    • @luli829
      @luli829 10 месяцев назад +2

      @jordanhasnolife5163 lol no. I'm trying to learn from you and get better :)

  • @amospan14
    @amospan14 2 года назад +1

    Really good one! Thank you Jordan! =)

  • @ameygoel1
    @ameygoel1 Год назад +2

    bro I watched your earlier videos in 1.25x speed and now your normal voice feels weird and slow. Nevertheless great and orderly content. Cheers! Would recommend others too :)

  • @andreybraslavskiy522
    @andreybraslavskiy522 7 месяцев назад +1

    Thank you for the great content

  • @zuowang5185
    @zuowang5185 10 месяцев назад +1

    why redis instead of just using the hashmap in your program? for cross process communication?

    • @jordanhasnolife5163
      @jordanhasnolife5163  10 месяцев назад +2

      Well sometimes you want many servers, sometimes you want replication, sometimes you want a writeahead log, sometimes you want database partitioning

  • @sarfrazz34
    @sarfrazz34 2 года назад +1

    Thanks for the video man! it was informative
    could you please create a video if possible on scenario-based database usage I am really confused about where to properly use sql db and nosql db
    I am little clear that if we need ACID properties then best is sql.
    but I am not completely aware of different other scenarios on where to perfectly use sql and nosql dbs. if you also have any resources please share I am not able to find a good one

    • @jordanhasnolife5163
      @jordanhasnolife5163  2 года назад

      I think you basically just expressed it yourself - "if you need acid properties use sql" - if data integrity is the most important part of your application, SQL is the way to go. Otherwise, NoSQL can offer greater speed while sacrificing some of these requirements.

    • @sarfrazz34
      @sarfrazz34 2 года назад

      ​@@jordanhasnolife5163 Thanks Jordan
      I am thinking of a scenario in case of storing product related things I see nosql is best suited as different product could have different properties, but how about managing the inventory for the product?
      in this case since it requires acid props to manage the inventory count properly, should we maintain the inventory count details alone in sql DB?

  • @hdrkn5247
    @hdrkn5247 6 месяцев назад +1

    which database is of choice when you need SQL database but the dataset is too large and you need to shard the data or the database needs to be distributed?

    • @jordanhasnolife5163
      @jordanhasnolife5163  6 месяцев назад

      A SQL database lol. You can still shard your data here, just be smart about how you do it.

  • @ravindranaths513
    @ravindranaths513 7 месяцев назад +1

    Could you please make a video on Wide column vs column family vs columnar vs column oriented DBs with some examples

    • @jordanhasnolife5163
      @jordanhasnolife5163  7 месяцев назад

      Hey! I think I probably mentioned this more in the 1.0 series but not sure that it deserves a full video, just look up images of the formats :)

    • @ravindranathsirisala6408
      @ravindranathsirisala6408 7 месяцев назад

      @@jordanhasnolife5163 , please give me link of that video

  • @franklinyao7597
    @franklinyao7597 Год назад +1

    What if you need a NoSQL store with strong consistency? You need Hbase or MongoDB. And if you need a db optimized for heavy reads, you may need MongoDb since it uses B tree.

    • @jordanhasnolife5163
      @jordanhasnolife5163  Год назад

      Mongo might be better for reading sure, but I caution you from saying it and HBase are strongly consistent. Hadoop has some weird writing thing that kinda makes it strongly consistent, and maybe you can configure mongo to do so, but Hadoop writes aren't like actually achieving consensus (and afaik mongo isn't either), so it's kinda just not great for that haha

    • @franklinyao7597
      @franklinyao7597 Год назад +1

      @@jordanhasnolife5163 what is that weird writing thing?

    • @jordanhasnolife5163
      @jordanhasnolife5163  Год назад +1

      @@franklinyao7597 You like write to multiple nodes at once and only get a success message if it's hit a certain amount of them, but the write still goes through on some of the nodes even if you don't meet the success threshold if I remember correctly

  • @mmfStudent
    @mmfStudent 5 месяцев назад +2

    B-Trees are not binary trees. The video itself is still quite good.

  • @jporritt
    @jporritt 4 месяца назад +1

    Whenever I mention Cassandra in a Systems Design interview, the interviewer always seems to have some horror story concerning it (often its performance!)

    • @jordanhasnolife5163
      @jordanhasnolife5163  4 месяца назад

      Interesting, I'd be curious if you pushed back on them a bit to ask them what the workload was and why the performance was poor what they'd say!

    • @jporritt
      @jporritt 4 месяца назад +1

      @@jordanhasnolife5163 It’s often that “We tried it and it was slower”. I’m guessing they were approaching it as some sort of vertical solution (so a faster RDBMS) than a horizontal solution and retaining an application model that was optimized for single-leader.

  • @ashwint959
    @ashwint959 9 месяцев назад +1

    What about distributed sql databases like spanner/cockrorachdb?

    • @jordanhasnolife5163
      @jordanhasnolife5163  9 месяцев назад

      I think these are probably worth knowing about from a software engineering perspective but probably not worth using in a design for an interview. Spanner (can't speak for cockroach) is great, but I think it may be too niche to be fair game here (since it doesn't exactly have a "dedicated" use case).

  • @chits006
    @chits006 Год назад +1

    Why no honorable mention of Dynamo & BigTable ?😀

    • @jordanhasnolife5163
      @jordanhasnolife5163  Год назад +1

      Mainly because bigTable = hbase and dynamo = Cassandra (it actually may not assuming you're talking about dynamodb but theres no docs on internal implementation afaik)

  • @danielvega-myhre4201
    @danielvega-myhre4201 Год назад +1

    Are your slides available to view/download somewhere?

  • @effy1219
    @effy1219 Год назад +1

    hahahah i just like how he call us , you lazy f**s and do it

  • @pavliv
    @pavliv Месяц назад +1

    Nice

  • @rydmerlin
    @rydmerlin Год назад +1

    Are trees with more than two children for a given parent still considered binary trees?

  • @Stella-se1lg
    @Stella-se1lg 6 месяцев назад +1

    Salute😊

  • @sachin_yt
    @sachin_yt 2 года назад +2

    Finalyyyyyyyyyyy

  • @prathamsinghal5261
    @prathamsinghal5261 7 месяцев назад +1

    Scylla DB ??

  • @piercef7343
    @piercef7343 Год назад

    Kudos!

  • @nishanthprince
    @nishanthprince Месяц назад +1

    Did the Goat just say he’s insecure ?

    • @jordanhasnolife5163
      @jordanhasnolife5163  Месяц назад

      You think a secure person would spend multiple years of their life lifting weights and studying systems design?? 😭

  • @ladyv890
    @ladyv890 2 года назад +5

    Yay for Women!

    • @jordanhasnolife5163
      @jordanhasnolife5163  2 года назад +7

      Just defended women against a mysognist on Xbox live the other day

    • @ladyv890
      @ladyv890 2 года назад

      @@jordanhasnolife5163 Yay Jordan! 🤗 lol

  • @mvp4gman
    @mvp4gman 8 месяцев назад +1

    No S3 🥲

    • @jordanhasnolife5163
      @jordanhasnolife5163  8 месяцев назад

      Not a database - though technically some cloud native data warehouses are being built using s3 as the storage layer and parquet files

  • @BlunderMunchkin
    @BlunderMunchkin 3 месяца назад +5

    Talking too fast.

  • @SreekantShenoy
    @SreekantShenoy 8 месяцев назад +1

    This guy stores! 🫣