Great explanation! Regarding the "Multiple negative ranking loss"; you say that a_i and p_j should be far away from each other. Don't we risk to create a lot of clusters of pairs? I mean how do we get that a_i and p_j are closer than a_i and p_k if p_k and p_j are rather similar just like your example at 07:00 We would assume that (a_1, p_1) and (a_1, p_2) is closer than (a_1, p_3) . How do we make sure, that hose 3 pairs just not just end up in each of their own corner?
I am doing a sentiment analysis task. I want to train a model on multiple negatives ranking loss. I am dealing positive and negative pairs. (x, (ai,bi),(ak,bk)). x -> query ai, bi -> positive pair ak, bk - > negative pair(I have multiple negative pairs for every (query, positive pair) combination. How can I use multiple negatives ranking loss in this case?
Hi Nils, awesome explanation. What happens to MNR if you have batches where there are two queries with the same answer? The CE matrix will be broken because there are extra ones. What’s the best approach in this case?
Hi Nils , question: if the embedding model is trained on let's say cosine similarly during inference other similarity fuctions also generate decent results why?
Thank for video. i ve a question. when I are trying to extract contextualized word embedding by Bert, always I get out of memory issue on collab. I have a twitter dataset 50000 rows. I couldn't find a solution for it. changing batch size or any other solution really doesn't work at all.
Thanks for the great talk. One Question: What do you mean when you say "for dot-product, longer documents can result in vectors with higher magnitudes"? If I understand correctly you are using mean pooling. Why would the mean of many embeddings (necessarily or probably) have higher magnitude than the mean of few?
Note that BERT produces contextualized word embeddings. So the output of each words depends on all other words in a paragraph. Just because we take the mean, we cannot conclude that the length of the paragraph has no impact on the magnitude of the embedding. The model can simply learn: Long document => word vectors with high magnitude => high magnitude sentence embedding.
Dear Nils, thank you very much for such interesting and rich information video. By the way, I have 2 questions about Sbert: 1 - The best input for Sbert is 2 sentences or we can give more than that? (As I see the output vector have 512 dimensions) 2 - The best comparison for the output vector is scaled-cosine-similarity and not cosine-similarity?
You can input longer texts, up to the sequence length of the model. Some models work will with text up to 512 word pieces. The scaling is just relevant for training, later it doesn't play a role anymore.
Hello sir, thank you for providing an interesting presentation, I am a final semester student and very, very confused cause I new in NLP, my final project is text summarization, and have not gotten any results, do you have any advice on how sbert is used for text summarization or how can I do fine-tuning with my own dataset to get embedding generated by sbert? And all text using my own language not English :). Really appreciate it if you give me an answer, I'm really stressed right now, thanks in advance!
Great explanation!
Regarding the "Multiple negative ranking loss"; you say that a_i and p_j should be far away from each other.
Don't we risk to create a lot of clusters of pairs? I mean how do we get that a_i and p_j are closer than a_i and p_k if p_k and p_j are rather similar just like your example at 07:00
We would assume that (a_1, p_1) and (a_1, p_2) is closer than (a_1, p_3) .
How do we make sure, that hose 3 pairs just not just end up in each of their own corner?
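For reference, the multiple negatives ranking loss is usually formulated as an in-batch softmax over the similarities (s being the similarity function, e.g. a scaled cosine similarity, and B the batch size):

loss = -(1/B) * sum_i log[ exp(s(a_i, p_i)) / sum_j exp(s(a_i, p_j)) ]

So a_i is only pushed away from each p_j relative to its own p_i within the batch; there is no absolute repulsion term, so pairs are ranked per anchor rather than forced into separate corners of the space.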
Thanks @nils, super helpful content as usual.
I am doing a sentiment analysis task. I want to train a model with multiple negatives ranking loss. I am dealing with positive and negative pairs: (x, (ai,bi), (ak,bk)).
x -> query
ai, bi -> positive pair
ak, bk -> negative pair (I have multiple negative pairs for every (query, positive pair) combination).
How can I use multiple negatives ranking loss in this case?
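In case a code sketch helps: in the sentence-transformers library, MultipleNegativesRankingLoss also accepts (anchor, positive, hard negative) triplets, where the third text is treated as an extra negative on top of the in-batch ones. A minimal sketch, assuming each (a, b) pair can be concatenated into a single text; the checkpoint name is just a placeholder:

```python
from torch.utils.data import DataLoader
from sentence_transformers import SentenceTransformer, InputExample, losses

# Placeholder checkpoint; any sentence-transformers model can be used here.
model = SentenceTransformer("all-MiniLM-L6-v2")

# One triplet per (query, positive pair, negative pair) combination.
# If a query has several negative pairs, create one triplet per negative;
# the other examples in the batch additionally act as in-batch negatives.
train_examples = [
    InputExample(texts=["query x", "positive pair a_i b_i", "negative pair a_k b_k"]),
    InputExample(texts=["query x", "positive pair a_i b_i", "another negative pair for the same query"]),
]

train_dataloader = DataLoader(train_examples, shuffle=True, batch_size=32)
train_loss = losses.MultipleNegativesRankingLoss(model)

model.fit(train_objectives=[(train_dataloader, train_loss)], epochs=1, warmup_steps=100)
```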
Hi Nils, awesome explanation. What happens to MNR loss if you have batches where there are two queries with the same answer? The cross-entropy label matrix will be broken because there are extra 1s. What's the best approach in this case?
Hi Nils, question: if the embedding model is trained with, let's say, cosine similarity, why do other similarity functions also generate decent results during inference?
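One way to see why (as a side note, assuming the embeddings are normalized to unit length, which many sentence embedding models do): for unit vectors, a · b = cos(a, b) and |a - b|^2 = 2 - 2 * cos(a, b), so cosine similarity, dot product, and (negative) Euclidean distance all produce the same ranking; they only start to differ when the vector magnitudes carry information.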
Thanks for the video. I have a question: when I try to extract contextualized word embeddings with BERT, I always get an out-of-memory issue on Colab. I have a Twitter dataset with 50,000 rows. I couldn't find a solution for it; changing the batch size or any other fix really doesn't work at all.
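A common cause of that (not necessarily yours) is building the autograd graph during inference or keeping all GPU tensors around between batches. A minimal sketch of batched extraction with the Hugging Face transformers API, using torch.no_grad() and moving results off the GPU right away; the variable tweets and the mean-pooling choice are placeholders for illustration:

```python
import torch
from transformers import AutoModel, AutoTokenizer

device = "cuda" if torch.cuda.is_available() else "cpu"
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased").to(device).eval()

tweets = ["example tweet one", "example tweet two"]  # replace with the 50,000 texts
embeddings = []

for start in range(0, len(tweets), 64):
    batch = tweets[start:start + 64]
    enc = tokenizer(batch, padding=True, truncation=True,
                    max_length=128, return_tensors="pt").to(device)
    with torch.no_grad():                           # don't build the autograd graph
        hidden = model(**enc).last_hidden_state     # (batch, tokens, hidden_size)
    # Mean-pool over tokens so only one vector per tweet is kept;
    # storing every full token matrix for 50k tweets would exhaust RAM as well.
    mask = enc["attention_mask"].unsqueeze(-1)
    pooled = (hidden * mask).sum(dim=1) / mask.sum(dim=1)
    embeddings.append(pooled.cpu())                 # free GPU memory for the next batch

embeddings = torch.cat(embeddings)                  # shape: (num_tweets, hidden_size)
```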
Thanks for the great talk. One question: what do you mean when you say "for dot-product, longer documents can result in vectors with higher magnitudes"?
If I understand correctly, you are using mean pooling. Why would the mean of many embeddings (necessarily or probably) have a higher magnitude than the mean of few?
Note that BERT produces contextualized word embeddings, so the output for each word depends on all the other words in the paragraph.
Just because we take the mean, we cannot conclude that the length of the paragraph has no impact on the magnitude of the embedding.
The model can simply learn: Long document => word vectors with high magnitude => high magnitude sentence embedding.
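To spell that out: with mean pooling, v = (h_1 + ... + h_n) / n over the contextualized token vectors h_i, and a dot-product score against a query q is q · v = |q| * |v| * cos(theta). Nothing constrains the norms |h_i|, so the model is free to output larger token vectors for long documents, which inflates |v| and hence the dot product, while the cosine similarity is unaffected.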
Hello sir, the slide URL is broken :)
Dear Nils, thank you very much for such an interesting and information-rich video. By the way, I have 2 questions about SBERT:
1 - Is the best input for SBERT 2 sentences, or can we give more than that? (As I see, the output vector has 512 dimensions.)
2 - Is the best comparison for the output vectors scaled cosine similarity rather than plain cosine similarity?
You can input longer texts, up to the sequence length of the model. Some models work well with text up to 512 word pieces.
The scaling is only relevant for training; later it doesn't play a role anymore.
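A small sketch of what that looks like at inference with sentence-transformers (the checkpoint name is just an example): inputs longer than the model's sequence length are simply truncated, and plain cosine similarity is used without any scale factor.

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")   # example checkpoint
print(model.max_seq_length)                       # word pieces beyond this limit are truncated

emb = model.encode([
    "A single sentence.",
    "A longer paragraph made of several sentences, up to the model's sequence length.",
])
print(util.cos_sim(emb[0], emb[1]))               # plain cosine similarity at inference
```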
Hello sir, thank you for providing an interesting presentation. I am a final-semester student and very, very confused because I am new to NLP. My final project is text summarization, and I have not gotten any results yet. Do you have any advice on how SBERT can be used for text summarization, or on how I can fine-tune it on my own dataset to get embeddings generated by SBERT? All of my text is in my own language, not English :).
I'd really appreciate it if you could give me an answer; I'm really stressed right now. Thanks in advance!