Choosing Indexes for Similarity Search (Faiss in Python)

James Briggs

23K views

Published: Jan 25, 2025

Comments • 19

@narayansharma8797 3 years ago ⁺¹
Thanks a bunch for this, James! It would be really great to see a couple of them explored in depth. Also, if you could benchmark FAISS against ScaNN, it would help a few of us noobs a hell of a lot.
Great content! Lovely command over your content. Really need more of this.
@jamesbriggs 3 years ago ⁺²
Hey Narayan, there is a video released already covering the 'traditional' version of LSH, and two more videos that will be released at 1200 ET today on the random projection version of LSH (used in Faiss) - and there are plenty more of these on the way ;)
I love the FAISS vs ScaNN idea too, will be working on it soon!
@narayansharma8797 3 years ago
@jamesbriggs Sold!
@Nick-vs1zp 2 years ago
Great explanations, especially for IVF - it's probably the best explanation for how it works that I've seen.
@jamesbriggs 2 years ago
thanks Nick!
@harshitjaitly6850 3 years ago ⁺¹
Super Informative Content!
Thank you so much for this.
@mohammadyahya78 1 year ago
Does the IVF algorithm work with high-dimensional data, like 100 dimensions, please?
@katehan9623 3 years ago
Thank you for your video. Most Valuable Channel. Do you use a GPU for indexing in these projects?
@haneulkim4902 1 year ago
Thanks for the amazing video! Do you know why simple k-means is not used for these MIPS problems?
@nareshsandrugu6057 2 years ago
Could you share a video on this? Assume I have binary train and test data and need to calculate the Hamming distance. I didn't find any videos doing this with Faiss; if you could share one, that would be very helpful.
@grayrigel7091 2 years ago ⁺¹
Hi James.
Thanks for such a wonderful tutorial. Really useful. A quick question: for a new query vector, is it possible to return the IVF cell/partition that it belongs to, instead of returning the neighbors? I think I can measure the distances to the centroids and return the closest centroid. However, I was wondering if there is a built-in way.
@caiolp4 2 months ago
I have exactly the same problem. How did you solve it?
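The centroid-distance approach described in the question above can be sketched in plain Python. This is only an illustration under stated assumptions: `nearest_cell` is a hypothetical helper (not a Faiss function), the centroids are made-up 2-D points, and squared L2 distance is assumed as the metric.

```python
# Hypothetical helper: brute-force nearest-centroid lookup (not part of Faiss).
def nearest_cell(query, centroids):
    """Return the index of the centroid closest to `query` (squared L2)."""
    best_id, best_dist = -1, float("inf")
    for cell_id, centroid in enumerate(centroids):
        # squared Euclidean distance between the query and this centroid
        dist = sum((q - c) ** 2 for q, c in zip(query, centroid))
        if dist < best_dist:
            best_id, best_dist = cell_id, dist
    return best_id

# Made-up centroids for three IVF cells, for illustration only.
centroids = [[0.0, 0.0], [10.0, 0.0], [0.0, 10.0]]
cell = nearest_cell([9.0, 1.0], centroids)
```

In Faiss itself, the coarse quantizer of an IVF index is an index over the centroids, so a call such as `index.quantizer.search(xq, 1)` should return the nearest centroid ID for each query row; it is worth checking that against the Faiss version in use.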
@mohammadyahya78 1 year ago
What is nbits at 10:21, please?
@basedscienxe6632 3 months ago
I believe this is the number of bits used for the precision of each component in the vector
@viorelteodorescu 1 year ago
What does IP stand for?
@itheenigma 3 years ago
Super useful! Thanks for this video James. For IVF, can we retrieve the clusters that each datapoint belongs to after training (also cluster centroids)?
@jamesbriggs 3 years ago ⁺²
Yes you can, there is info on it here: gist.github.com/mdouze/904e0b538ef7767c9e83a45ac1b57d1b
The code you need to write (after training and adding your data to 'index') is:
import faiss
# collect the IDs stored in each non-empty inverted list (IVF cell)
invlists = index.invlists
all_ids = []
for l in range(index.nlist):
    ls = invlists.list_size(l)
    if ls == 0:
        continue
    all_ids.append(
        faiss.rev_swig_ptr(invlists.get_ids(l), ls).copy()
    )
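To answer the "which cluster does each datapoint belong to" part of the question, the per-list IDs gathered above can be inverted into a point-to-cell lookup. The following is a minimal sketch with assumptions: `invert_cells` is a hypothetical helper, and the input is assumed to be a dict keyed by cell number (the loop above skips empty lists, so a plain list position would no longer match the cell number). The sample IDs are made up.

```python
# Hypothetical helper: turn {cell_id: [point ids]} into {point_id: cell_id}.
def invert_cells(ids_per_cell):
    point_to_cell = {}
    for cell_id, ids in ids_per_cell.items():
        for point_id in ids:
            # int() also accepts NumPy integers, such as those Faiss returns
            point_to_cell[int(point_id)] = cell_id
    return point_to_cell

# Made-up contents of three non-empty cells, for illustration only.
ids_per_cell = {0: [3, 7], 2: [1], 5: [0, 4]}
assignment = invert_cells(ids_per_cell)
```

With the loop above, such a dict could be filled by storing each copied ID array under its list number `l` instead of appending it to `all_ids`.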
@itheenigma 3 years ago
@jamesbriggs legend. Will give it a go. Ta!
@ChrisZuo 6 months ago
Thank you! The drawings are cute!
