Build Semantic-Search with Elastic search and BERT vector embeddings. ( From scratch )

Поделиться
HTML-код
  • Опубликовано: 22 авг 2024

Комментарии • 68

  • @AbidSaudagar
    @AbidSaudagar  8 месяцев назад +1

    You can checkout New Advanced Semantic Search Video by using GPT 3.5 Turbo and ElasticSearch: ruclips.net/video/frnKiugyFRI/видео.html

  • @ganeshkumarmr1
    @ganeshkumarmr1 27 дней назад

    I appreciate your will and dedication to rerecord the video second time

  • @margipatel7768
    @margipatel7768 10 месяцев назад +1

    Thank you for sharing! Very clear, thorough, well-paced and learner centered. What an amazing educator!

    • @AbidSaudagar
      @AbidSaudagar  10 месяцев назад

      Thank you!! Glad you found it valuable!

  • @sulaiman3975
    @sulaiman3975 11 месяцев назад +3

    I think you should consistently upload videos .
    More video like this very helpful ✅
    ❤❤❤🙌🏻

  • @ephermalinsights
    @ephermalinsights Месяц назад

    That's very nice .. glad I searched and got your video

  • @judepops8015
    @judepops8015 3 месяца назад

    Incredible video, you are amazing! Thank you for the concise outline of the video and explanations of all the key concepts. You have a new sub :)

  • @bigm8872
    @bigm8872 10 месяцев назад

    Wow! This is amazing work you did here. Your work is very clear, the process well well explained 👏. I was looking into setting up vector embeding search with Elastic and you've clarified the whole process and fundamental understanding of such a solution.

    • @AbidSaudagar
      @AbidSaudagar  10 месяцев назад

      Thank You. I'm glad it was helpful!

  • @godfreyogbeide2340
    @godfreyogbeide2340 5 месяцев назад +1

    Very good tutorial

    • @godfreyogbeide2340
      @godfreyogbeide2340 5 месяцев назад

      Like a semantics search ,as chatting with your Json data privately?. I love ur tutorials

  • @dkobia
    @dkobia 8 месяцев назад

    Nicely done. Thanks for the walkthrough

  • @afederici75
    @afederici75 9 месяцев назад +1

    This was a great video! Thank you so much. I enjoyed the format (very much to the point, with a nice clarification at the beginning) and you presented it really well. I would love to see a video in the future that maybe explains how picking a custom model could help being charged a lot by ES's ML nodes. In other words, how do you think we could use custom models to be able to use smaller ES ML nodes (e.g. 1GB or 2GB, vs the 4GB ESRE) and be charged less? Thanks again

    • @AbidSaudagar
      @AbidSaudagar  9 месяцев назад +2

      Glad it was helpful.
      I'll need to look into the smaller node usgae stuff. Never used it before. Will try to make video about it future for sure.
      Thanks for such detailed comment. Really Appreciate it!

  • @syedikramuddin7218
    @syedikramuddin7218 11 месяцев назад

    GREAT ❤ WELCOME BACK ... BURHANIS...

  • @snehancoghosh7322
    @snehancoghosh7322 2 месяца назад

    Hey man great video. it would be amazing if you could teach us how to do the same thing using Opensearch and docker

  • @mehul4mak
    @mehul4mak 8 месяцев назад +1

    How to run this as different services for es and streamlit via docker compose? Also if I need to use any other vectordb is there any way to do it?

  • @jasleenkalra9009
    @jasleenkalra9009 11 месяцев назад

    Great Video 👏Nice to see you back

  • @souvikdas4941
    @souvikdas4941 2 месяца назад

    I have watched the video, need advice from you on a similar use case. So I have API log data and for that I want to create a similar search application. Is it possible for the search application to answer questions like the ones below? -
    "List all failed transactions between Merchant_B and any bank.",
    "Find transactions between Merchant_A and Bank_X." etc.

  • @ankurtyagi1440
    @ankurtyagi1440 7 месяцев назад

    Superb 😊

  • @waseem901
    @waseem901 11 месяцев назад

    Great video brother👍👌👌

  • @clouds187
    @clouds187 8 месяцев назад

    Any chance you would show us how to implement this but using cosine similarity? :), Amazing video, helped me learn a lot!!

    • @AbidSaudagar
      @AbidSaudagar  8 месяцев назад +1

      We just need to change the similarity parameter in indexMapping file.
      I have used l2_norm here. Just change it to "cosine".
      Here is reference link: www.elastic.co/guide/en/elasticsearch/reference/current/dense-vector.html#dense-vector-similarity:~:text=dimensions%20per%20vector.-,cosine,in%20this%0Acase.,-max_inner_product

  • @syednomaan877
    @syednomaan877 11 месяцев назад

    Great content

  • @eduardovarela1177
    @eduardovarela1177 8 месяцев назад +1

    At final part of code i have this error BadRequestError(400, 'illegal_argument_exception', 'Invalid type: expecting [_doc] but got [_knn_search]')

    • @MuhammadAdnan-tq3fx
      @MuhammadAdnan-tq3fx 7 месяцев назад

      same error in my side even i am using 7.17.17 version of es which support knn_search

  • @parvejkhanpathan7792
    @parvejkhanpathan7792 11 месяцев назад

    Nice to see you sir

  • @valentind.5398
    @valentind.5398 8 месяцев назад

    Thank you for sharing, it really was a pleasure to follow that tutorial. I just had a video recommendation about the latest improvements of ElasticSearch and now I am wondering if that video is still relevant as of today considering the improvements that have been made (ELSER) ?

    • @AbidSaudagar
      @AbidSaudagar  8 месяцев назад

      Glad you find the video helpful Valentin.
      Thanks for the suggestion. I'll surely look at ELSER.
      Today I'll be posting a video where I have used OpenAI Embedding in Elasticsearch 8.11 as they have updated the vector dimension limit to 4096.

    • @valentind.5398
      @valentind.5398 8 месяцев назад +1

      @@AbidSaudagar Hey Abid, thanks for coming back to me. Regarding ELSER, it requires a ml node so one has to have a licence that most likely costs thousands of dollars so I will stick to vector embedding.
      Bell activated, waiting for the video!

  • @thelocalguide4429
    @thelocalguide4429 3 месяца назад

    Hello abid Is it possible to convert a specific field in elasticsearch that already contains data into vector and then use semantic search in it. Great Content

  • @IIGrudge
    @IIGrudge 10 месяцев назад

    Phenomenal tutorial! Can you up the audio? It's low.

    • @AbidSaudagar
      @AbidSaudagar  10 месяцев назад

      Thanks for kind words and also feedback regarding audio. I have increased audio levels in later videos.

  • @geethumaria6055
    @geethumaria6055 7 месяцев назад

    great video

    • @AbidSaudagar
      @AbidSaudagar  7 месяцев назад

      Glad you found it helpful.

  • @leamon9024
    @leamon9024 6 месяцев назад

    Hi, thanks for the tutorial. Could you also make one to compare elasticSearch with other vector database solution in terms of semantic/vector search?

    • @AbidSaudagar
      @AbidSaudagar  6 месяцев назад

      Great suggestion! I'll surely put it in my next videos list.

  • @arpitsingh9198
    @arpitsingh9198 9 месяцев назад

    great tutorial, wanted to ask if elastic search is the best option for basic l2 or cosine vector similarity insted of some online or offline vector db based methods as it uses approximate nearest neighbours ( ANN ), also this approach cannot be used in cases where false positives are not acceptable such as cache implementation or medical data semantics, can you suggest something that resolves these problem statements 😢

  • @mm-rr2ei
    @mm-rr2ei 3 месяца назад

    awesome

  • @user-dm9vr6ln3b
    @user-dm9vr6ln3b 9 месяцев назад

    great work amigo
    i just had one question please :
    what should i do in order to search by all the fileds (price , brand , gender ...ect ) ???

    • @AbidSaudagar
      @AbidSaudagar  8 месяцев назад

      You can make all field you wanna search for as dense_vector field, then search on all of them separately and then combine the results manually. This is the simplest way.

  • @satyashah3045
    @satyashah3045 4 месяца назад

    ObjectApiResponse({'acknowledged': True, 'shards_acknowledged': False, 'index': 'series120'}) I am getting this error
    How can I solve this?

  • @devanshgoel3433
    @devanshgoel3433 6 месяцев назад

    thanks bro!

  • @AustinMark
    @AustinMark 8 месяцев назад

    great video. I was also wondering what vs code plugin you used that color coded your python indentations?

    • @AbidSaudagar
      @AbidSaudagar  8 месяцев назад +1

      Thank You Austin.
      Plugin name is indent-rainbow.

  • @AbidSaudagar
    @AbidSaudagar  11 месяцев назад

    Coding a privateGPT using LANGCHAIN, HuggingFace Embeddings and FREE LLM
    ruclips.net/video/kUxfr2i2zn8/видео.html

  • @seriesophile6577
    @seriesophile6577 11 месяцев назад

    you didnot provide the repository link to work with, or the dataset path . please share the repository in git. it would be quite helpful coz i am currently working on it

    • @AbidSaudagar
      @AbidSaudagar  11 месяцев назад

      Sorry I forgot to add the link. Here is the link.
      github.com/abidsaudagar/semantic-search-elastic-search-and-BERT-vector-embedding

  • @varunkalyan3135
    @varunkalyan3135 7 месяцев назад

    Totally loved it! Code access link for referral please?

    • @AbidSaudagar
      @AbidSaudagar  6 месяцев назад +1

      github.com/abidsaudagar/semantic-search-elastic-search-and-BERT-vector-embedding

  • @rajeevmishra7486
    @rajeevmishra7486 10 месяцев назад

    hi @abid, thanks for the video, can you help me in combining multiple column data for vector search instead of just one as somtimes the description does not contain everything and the customer may look for things from other column. i hope, you got it. please help.

    • @AbidSaudagar
      @AbidSaudagar  10 месяцев назад

      This can be done with multiple approaches.
      1. We can aggregate the text data from multiple field into one and then create the vector.
      2. If for some reason you do not want to aggregate the data in one field. We can use LTR plugin: chat.openai.com/share/01ef9614-af6e-47e4-be1d-c0697eee6bd7

  • @hemanthkumar4580
    @hemanthkumar4580 11 месяцев назад

    👏

  • @sameergaikwad222
    @sameergaikwad222 8 месяцев назад

    where is the Kaggle file link in description ??

    • @AbidSaudagar
      @AbidSaudagar  8 месяцев назад

      github.com/abidsaudagar/semantic-search-elastic-search-and-BERT-vector-embedding
      You can download from my Github.

  • @chinmayjena2600
    @chinmayjena2600 4 месяца назад

    Sir how to run multiple nodes

    • @AbidSaudagar
      @AbidSaudagar  4 месяца назад +1

      You can use docker to easily run multiple containers of ElasticSearch.
      Without docker is also possible.
      I'm gonna make video about it soon.

    • @chinmayjena2600
      @chinmayjena2600 4 месяца назад

      @@AbidSaudagar thank you sir ....plz upload that video

  • @ibrahimabolel1543
    @ibrahimabolel1543 9 месяцев назад

    Appreciate your effort Abid, but when i run the search query code at 28:42 as shown in the video it shows this error :
    "BadRequestError(400, 'search_phase_execution_exception', 'failed to create query: [knn] queries are only supported on [dense_vector] fields')"
    I'm using the same dataset you're using with the same mappings,
    Could you please assist, thank you in advance!

    • @AbidSaudagar
      @AbidSaudagar  9 месяцев назад +1

      Can you confirm that the field you are searching on, has field type as dense vector.

    • @ibrahimabolel1543
      @ibrahimabolel1543 9 месяцев назад

      @@AbidSaudagar Appreciate your response Abid, i managed to solve the snippet error and prepared the index properly but however the code runs without any trouble in the vs code and the Ai model is working but when i launch it using streamlit it pops up the same error regarding the [dense_vector fields] and i've made sure the field of the query is a type of dense vector,.