Machine Learning design: Search engine for Q&A

Поделиться
HTML-код
  • Опубликовано: 1 дек 2024

Комментарии • 20

  • @cskvarma6
    @cskvarma6 4 года назад +25

    We are working on doing a couple of public live sessions on the code-walkthrough of this system(ElasticSearch+BERT) using publicly available datasets. Stay tuned! We have just made an announcement for these public live sessions here: ruclips.net/video/6q_ez_Cfcik/видео.html

    • @prateeksingh812
      @prateeksingh812 4 года назад

      eagerly waiting sir..:)

    • @Aalekhrock
      @Aalekhrock 4 года назад

      Please make it publicly available... sir !!

    • @AppliedAICourse
      @AppliedAICourse  4 года назад +3

      As mentioned above, these will be public sessions.

    • @varunsaproo4120
      @varunsaproo4120 4 года назад

      Awesome Sir! It will help us a lot. Thank You!

    • @TheGoodboy26
      @TheGoodboy26 4 года назад

      @@AppliedAICourse Thank you

  • @chayanitbhateja7964
    @chayanitbhateja7964 3 года назад

    Amazing video! Looking forward to see more of these experience-driven videos!!!!!🔥🔥

  • @jeetendrakumar5964
    @jeetendrakumar5964 4 года назад +1

    Thanks AppliedAi for these informative sessions. Would be really helpful if you can come up with some good Cost structure & Curated course for your alumni Students who ended up doing all things till self driving cars. I mean special course that include all newly added material after self driving cars with Tf sessions.

    • @AppliedAICourse
      @AppliedAICourse  4 года назад +5

      Thank you Jeetendra. Thank you for the suggestion. We are brainstorming a few ideas on the same lines. We will let you know as soon as we have something in place.

  • @yoorooda9743
    @yoorooda9743 3 года назад

    Why not use KNN which runs with log(n) complexity? To further optimize, you can use a funnel approach with a simple linear model which focuses on recall and then apply BERT on the smaller output set of the funnel.

    • @AppliedAICourse
      @AppliedAICourse  3 года назад

      The Kd-tree based approach for k-NN which has a log(n) time complexity does not work well when you have high dimensional data. When applying the first linear model, what feature would you employ? How do you get all similar questions using a linear model which is often used for classification and regression.

  • @amitunemelody5612
    @amitunemelody5612 4 года назад

    sir i am thinking to build a project name - "Automatic sketch of a suspect or criminal using genetic algorithm " based on input given by the user such as eye color,face structure,etc. But don't know from where to start. Sir Please give me some suggestion like....which topics should i study more for this ?.....or which approach should i take to build this ?.....how can i build the optimization function ?.........i will be really thankful to you if you show some light on this.
    Thank you in advance sir.

  • @varungondu7053
    @varungondu7053 4 года назад

    Hi, sir I tried using elastic search + Bert but sometimes elastic search fails and when the elastic search fails bert is not able to extract the correct answer so is there any other idea?

    • @AppliedAICourse
      @AppliedAICourse  4 года назад

      Could you be more specific on what you means my ES "fails"? Does that mean ES faila to find relevant questions even when BERT vectors are similar? Are the BERT vectors themselves not similar? You will need debug and understand the source of the problem. Note that BERT is not always perfect. Like all ML models it will make errors. We just have to fine-tune these models to make them work better on our problem at hand.

  • @bharadwajchivukula2945
    @bharadwajchivukula2945 4 года назад

    Sir is whoosh better than elastic search

    • @AppliedAICourse
      @AppliedAICourse  4 года назад +3

      Whoosh is a Python package for search. We have not used it extensively. Hence, can't comment in depth about it. But, elastic search is much more mature and is used extensively in large scale production environments at scale at many larger internet companies.

  • @AppliedAICourse
    @AppliedAICourse  4 года назад +2

    This is a high-level system design of how to build a simple semantic Search engine for Q&A data
    References:
    1. ElasticSearch+ BERT: www.elastic.co/blog/text-similarity-search-with-vectors-in-elasticsearch
    2. FAISS: engineering.fb.com/data-infrastructure/faiss-a-library-for-efficient-similarity-search/
    3. DiskANN: www.microsoft.com/en-us/research/publication/diskann-fast-accurate-billion-point-nearest-neighbor-search-on-a-single-node/

    • @JaGWiREE
      @JaGWiREE 4 года назад

      Very nice. Been following your series for a while during quarantine from Toronto while I work on some complex and stochastic analysis, these videos are VERY good and some of the best applied videos I've seen yet on youtube - some theorists like myself certainly really appreciate this content, thank you