BERTopic: Topic Modelling with Transformer Embeddings, arXiv dataset Python demo

Rithesh Sreenivasan

8K views

Published: Dec 22, 2024

Comments

@prabhacar 2 years ago
Great demo! Very clear explanation and troubleshooting tips! Well done!
@RitheshSreenivasan 2 years ago
Thank You!
@parthrangarajan3241 7 months ago
Hi, great video!
Is it possible to map each topic to their respective documents?
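One possible way to approach this, as a minimal sketch: BERTopic's `fit_transform` returns one topic id per input document, in the same order as the documents, so the two lists can be grouped in parallel. The helper name `group_docs_by_topic` and the toy `docs` and `topics` values below are hypothetical stand-ins for real model output; topic `-1` is the label used for outlier documents.

```python
from collections import defaultdict


def group_docs_by_topic(docs, topics):
    """Group documents by their assigned topic id.

    `docs` and `topics` are parallel lists: topics[i] is the topic
    assigned to docs[i].
    """
    if len(docs) != len(topics):
        raise ValueError("docs and topics must have the same length")
    grouped = defaultdict(list)
    for doc, topic in zip(docs, topics):
        grouped[topic].append(doc)
    return dict(grouped)


# Hypothetical toy data standing in for real documents and model output.
docs = [
    "neural topic models",
    "stock market news",
    "clustering embeddings",
    "unrelated short text",
]
topics = [0, 1, 0, -1]  # -1 marks a document treated as an outlier

docs_by_topic = group_docs_by_topic(docs, topics)
for topic_id, members in sorted(docs_by_topic.items()):
    print(topic_id, members)
```

With a fitted model, the `topics` list would come from the model instead of being typed by hand.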
@mmishrafaculty 1 year ago
Very well explained. Can you suggest a source for text preprocessing before BERTopic?
@RitheshSreenivasan 1 year ago
Refer to this: github.com/MaartenGr/BERTopic/issues/40
@adityay525125 2 years ago
Very detailed explanation, thank you sir 🙏
@RitheshSreenivasan 2 years ago
Thank You!
@seemarani7314 2 years ago
1. It is mentioned that existing topic modeling methods such as LDA/NMF have too many parameters to tune, and this seems to be the major motivation for the BERTopic approach. How is this challenge solved in this approach? What is the difference in the number of parameters between LDA/NMF and BERTopic?
2. Why has UMAP been used for dimensionality reduction? Why is it the most effective clustering algorithm? What is the best way to balance the information lost through dimensionality reduction against poor clustering? How are the parameters tuned for this? Why has HDBSCAN been used?
@RitheshSreenivasan 2 years ago
Please have a look at the BERTopic paper
@seemarani7314 2 years ago
I have, sir... please help me understand this...
@seemarani7314 2 years ago
Can you please find answers to these questions for me? I will be highly obliged to you... 🙏
@RitheshSreenivasan 2 years ago
For your second question, from the paper: "Moreover, (Allaoui et al., 2020) demonstrated that reducing high dimensional embeddings with UMAP can improve the performance of well-known clustering algorithms, such as k-Means and HDBSCAN, both in terms of clustering accuracy and time"
For your first question, again from the paper: "Conventional models, such as Latent Dirichlet Allocation (LDA) (Blei et al., 2003) and Non-Negative Matrix Factorization (NMF) (Févotte and Idier, 2011), describe a document as a bag-of-words and model each document as a mixture of latent topics. One limitation of these models is that through bag-of-words representations, they disregard semantic relationships among words. As these representations do not account for the context of words in a sentence, the bag-of-words input may fail to accurately represent documents"
You will have to do your own research on the number of parameters in LDA/NMF topic modelling vs. BERTopic.
Also have a look at the discussion part of the paper, where the author explains the strengths and weaknesses.
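The bag-of-words limitation quoted from the paper can be illustrated with a small sketch. It is a simplified stand-in, not the representation LDA or NMF implementations actually build: the helper `bag_of_words` and the two example sentences are hypothetical, and tokenization is plain whitespace splitting. Two sentences with opposite meanings end up with identical word counts because word order and context are discarded.

```python
from collections import Counter


def bag_of_words(text):
    """Return word counts for a text, ignoring word order."""
    return Counter(text.lower().split())


# Hypothetical example sentences with opposite meanings.
first = bag_of_words("the dog bites the man")
second = bag_of_words("the man bites the dog")

# Word order is lost, so both sentences get the same representation.
same_representation = first == second
print(same_representation)
```

Contextual sentence embeddings, which BERTopic clusters instead, are meant to avoid this kind of collapse.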
@seemarani7314 2 years ago
Thank you so much, sir.
