Why are vector databases so FAST?

Поделиться
HTML-код
  • Опубликовано: 2 фев 2025

Комментарии • 75

  • @MrTulufan
    @MrTulufan 6 месяцев назад +6

    The actual discussion about vector database starts at 14:45. Before that, it is a just a review of embeddings and RAG framework

  • @JHBG1971
    @JHBG1971 8 месяцев назад +12

    How do you only have 40k followers? Amazing content. Been looking for this for over a year. Thank you!

  • @ah89971
    @ah89971 8 месяцев назад +11

    You are great. People who following you are the ones who care about understanding the root concepts which is rare to find nowadays because everyone copying and pasting without understanding

    • @deroace
      @deroace 2 месяца назад

      so true

  • @farrael004
    @farrael004 7 месяцев назад +17

    Underfitted? More like Underrated.

  • @dorins3787
    @dorins3787 7 месяцев назад

    Thanks!

  • @tee_iam78
    @tee_iam78 7 месяцев назад

    He is absolutely right. Unless you take course in vector database, it is not easy to find material on 'how vector database works at low level'. Thank you for your content.

  • @PuerinTheHunter
    @PuerinTheHunter 7 месяцев назад +7

    Hey Santiago, keep going with your choice of shirts!

  • @oseteg
    @oseteg 7 месяцев назад

    Thanks a lot, Santiago! You are one of two authors I follow in RUclips and mainly in LinkedIn. The content is just a gold.
    My question is about that serverless thing. You provide the cloud and region but don't provide your aws credentials. Does it mean that it is free? As far as I understood, the cloud provider in this case is used to store the data. What is we don't delete the database at the end? Will have the bills for storing the db?

  • @vinj98
    @vinj98 7 месяцев назад

    This video, from its content to your performance, is fantastic.

  • @emrahe468
    @emrahe468 8 месяцев назад

    @22:22 this really helps on understanding the efficiency of the vector search algorithms. and the drawing reminds me the SVM borders/boundaries.
    by the way, great shirt! :)

  • @LiebsterFeind
    @LiebsterFeind 7 месяцев назад +1

    Wonderful video. Any chance of a video comparing HNSW vs Faiss vs Annoy?

  • @toddroloff93
    @toddroloff93 8 месяцев назад

    Thanks for the lesson. Always good to understand how things are getting done in the background. Great Explanation!!

  • @justindressler5992
    @justindressler5992 8 месяцев назад

    Thanks you for explaining this, I had the intuition that this is how the indexing worked via clustering but you helped crystallise my thoughts on this. One thing I think might have been missed is the trigonometric functions used like cosine take into account the direction of the vector towards the next cluster. So the cosine function uses the vectors like a compass. When grouping the vectors your quantizing or approximately all related vectors to the centroid. So obviously reducing accuracy because your not pointing to the exact point in the cluster but to the centre. How are the results selected is there an attempt to research the selected related records using the original vector or is it simply random selection.

  • @mahmoudelaskare4982
    @mahmoudelaskare4982 5 месяцев назад

    great video as usual , love the energy 👏🏻❤

  • @nachoeigu
    @nachoeigu 7 месяцев назад +2

    Thank you very much for this amazing content!! It is so educative :)

  • @uwegenosdude
    @uwegenosdude 7 месяцев назад

    Thanks for the cool video to make me better understand this topic. If I do not want to put my data into a cloud, what other vector db could you recommend? ChromaDB?

    • @vedant_stone
      @vedant_stone 7 месяцев назад

      There's weaviate which is also open source I believe

    • @vedant_stone
      @vedant_stone 7 месяцев назад

      There's convex as well

  • @ns_the_one
    @ns_the_one 21 день назад

    subscribed. Awesome explanation

  • @DarkRaviForDeath
    @DarkRaviForDeath 8 месяцев назад +1

    top tier content as always

  • @KumR
    @KumR 7 месяцев назад

    Nice ...Can u do one on Graph Database too?

  • @alextiger548
    @alextiger548 7 месяцев назад +1

    fantastic stuff. thank you so much

  • @collinvelarde7473
    @collinvelarde7473 5 месяцев назад

    This was awesome. Thanks big guy.,

  • @riemannderakhshan1037
    @riemannderakhshan1037 7 месяцев назад

    Have nothing to tell, than You are fantastic!

  • @michaelduffy5309
    @michaelduffy5309 7 месяцев назад

    Beautifully done.

  • @kpm25
    @kpm25 5 месяцев назад

    Thanks a lot, subscribed

  • @ernestuz
    @ernestuz 8 месяцев назад +1

    In many ways, when you calculate the embeddings, and you reduce a fragment of data to a single vector, you are calculating a kind of hash.

    • @carterthaxton
      @carterthaxton 7 месяцев назад

      Yes, like a hash, it’s a compression and normalization of the data into a short common form. But better than a hash, because it’s comparable in multiple dimensions.

  • @lokeshsharma4177
    @lokeshsharma4177 7 месяцев назад

    Awesome as always. I live in Florida as well what are my chances to meet you in person AND how did you automate your responses to all comments you get as ♥ !!!!!! Please write something as well 🙂

    • @underfitted
      @underfitted  7 месяцев назад +1

      No automation. The RUclips Studio app on my phone gives me the option to ❤️ replies. 😃

  • @hasnainahmed7605
    @hasnainahmed7605 2 месяца назад

    Ahan!! How lucky these 48k, subscribers are... :)
    BTW, you look nice in this shirt!

  • @delvoneu
    @delvoneu 6 месяцев назад

    Love the shirt, where did you buy it?

    • @underfitted
      @underfitted  6 месяцев назад +1

      Can’t remember. Probably Dillard’s

  • @johnmarshall4_
    @johnmarshall4_ 7 месяцев назад

    Thank you for this

  • @DataPains
    @DataPains 7 месяцев назад

    Awesome!

  • @nope9310
    @nope9310 7 месяцев назад

    "ok so I'm going to execute this" "BOOM it's just that fast!"
    really?... really?.... You add a cut between those two sentences? I'm hoping this was unintentional. (thankfully the next search didn't have a cut)
    Great video otherwise. I'd love to see you dive into the actual indexing though so we can actually see how it works. This was quite high level.

    • @underfitted
      @underfitted  7 месяцев назад

      Sorry, the cut was unintentional. My goal is to show to to build things, not how fast the tech is because that won’t be relevant in your own hardware.

  • @domineia
    @domineia 8 месяцев назад

    Amazing content

  • @deroace
    @deroace 2 месяца назад

    Wonderfull video Im trying to make an AI personality with vector databases lets hope I will get in my head an idea how to make it useing the information form the video 😅

  • @rally_furymoments5294
    @rally_furymoments5294 7 месяцев назад

    This guy is creating amazing content and subscriber is 40k??

  • @Vivek2062
    @Vivek2062 4 месяца назад

    I didn't know Adam Sandlers is a VectorDB nerd!

  • @damonguzman
    @damonguzman 7 месяцев назад +1

    You didn’t explain the answer.

  • @Drackomass
    @Drackomass 8 месяцев назад +3

    I like the shirt.

  • @sorin202
    @sorin202 Месяц назад

    You've correctly pointed out that you don't understand how OpenAI works. You're also questioning whether it's definitely powered by a quantum chip."

  • @shahjahanmirza1616
    @shahjahanmirza1616 8 месяцев назад

    Im sad that you dont have any paid course. I'd buy any of your AI course.

  • @AtomicPixels
    @AtomicPixels 8 месяцев назад

    Vector dbs have ML indexing built in ha

  • @jtmuzix
    @jtmuzix 8 месяцев назад

    linear algebra. Orientation vs magnitude.

  • @johnini
    @johnini 7 месяцев назад

    The shirt is okay, and the content overall is good. However, the video could have been shorter, 15 minutes.
    It felt too redundant, with not-so-useful examples. There was no need to include an example of Voronoi diagrams for cities.
    Maybe I am not the target viewer of your content. For now I will follow :)

    • @underfitted
      @underfitted  7 месяцев назад +2

      Be honest: The shirt is awesome!

    • @johnini
      @johnini 7 месяцев назад

      ​@@underfitted I love your carisma! The shirt is awesome!
      Still following and looking your new video!

  • @raunaquepatra3966
    @raunaquepatra3966 7 месяцев назад +1

    you can compress the video into 10 mins video, would be a lot better

    • @underfitted
      @underfitted  7 месяцев назад +2

      Yup. Still learning how to do that.

    • @dorins3787
      @dorins3787 7 месяцев назад +1

      Not for a 5 years old to understand. The content is very good because it takes you from zero and grows the technical level. It is one of tge best i have found.

    • @ricardomahfoud
      @ricardomahfoud 7 месяцев назад

      I disagree. To make the video 10 minutes, alot of the information will have to be either redacted or simplified. I like having longer videos that I can watch at 1.5/2x to get the best of both worlds. What makes this channelvaluable to me is the fact that it is not just a 10 minute surface explanation, but an in-depth technical explanation.

  • @blindConjecture
    @blindConjecture 7 месяцев назад

    This was WAY too much background context.
    You have to think who your audience is here. If we're interested in knowing the inner working of fast vector database lookups it's because we already know the basics like "what is a vector" and "how do you load a csv file in Python".
    I gave up watching after 15min because the video still hadn't even begun explaining anything about vector databases.

    • @underfitted
      @underfitted  7 месяцев назад +1

      Thanks for the feedback!

    • @nope9310
      @nope9310 7 месяцев назад

      There are other videos that explain that.

  • @SonidosEnArmonia_1992
    @SonidosEnArmonia_1992 8 месяцев назад

    Hi

  • @cherniaktamir612
    @cherniaktamir612 8 месяцев назад +6

    why are you angry?

    • @teaman7v
      @teaman7v 7 месяцев назад +5

      He's foreign, not angry. Common mistake.

    • @underfitted
      @underfitted  7 месяцев назад +7

      I’m actually a very happy person.

    • @cosmicaug
      @cosmicaug 7 месяцев назад

      @@teaman7v, you wouldn't like him when he's angry.

    • @Hlfe0
      @Hlfe0 7 месяцев назад +1

      I like his style, he is passionate about what he shares 😁♥️

    • @subusrable
      @subusrable 6 месяцев назад

      is he? isn't it just his way of passionately explaining?

  • @timwake5830
    @timwake5830 8 месяцев назад

    Two minutes no info. Done w you

  • @mrsesh7364
    @mrsesh7364 7 месяцев назад +1

    @jamesbriggs has great videos