Lecture 10: Evaluation of Language Models, Basic Smoothing

  • Published: Dec 10, 2024

Comments • 18

  • @lakshyakeshwani1676
    @lakshyakeshwani1676 1 day ago +1

    Where is lecture 11?

  • @pawanchoure1289
    @pawanchoure1289 2 years ago +1

    Traditionally, language model performance is measured by perplexity, cross-entropy, and bits-per-character (BPC). As language models are increasingly being used as pre-trained models for other NLP tasks, they are often also evaluated based on how well they perform on downstream tasks.
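
    A minimal sketch of how these three metrics relate, assuming we already have a model's probability for each token of a held-out text (the token probabilities and character count below are made-up numbers for illustration):

        import math

        # Hypothetical per-token probabilities assigned by a language model
        # to a held-out sequence (illustrative values only).
        token_probs = [0.2, 0.05, 0.1, 0.3]
        n = len(token_probs)

        cross_entropy = -sum(math.log2(p) for p in token_probs) / n  # bits per token
        perplexity = 2 ** cross_entropy                              # 2 to the cross-entropy
        num_chars = 20  # assumed character length of the same text
        bpc = -sum(math.log2(p) for p in token_probs) / num_chars    # bits per character

        print(cross_entropy, perplexity, bpc)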

  • @pawanchoure1289
    @pawanchoure1289 2 years ago +1

    One solution to probability density estimation is referred to as Maximum Likelihood Estimation, or MLE for short. It involves first defining a parameter, theta, that specifies both the choice of the probability density function and the parameters of that distribution.
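
    For an n-gram language model, the MLE solution reduces to relative-frequency counts. A small sketch for bigrams (the toy corpus and whitespace tokenization are assumptions for illustration):

        from collections import Counter

        corpus = "i love reading i read data i love nlp".split()

        unigram_counts = Counter(corpus)
        bigram_counts = Counter(zip(corpus, corpus[1:]))

        def p_mle(w2, w1):
            # Maximum likelihood estimate: P(w2 | w1) = count(w1, w2) / count(w1)
            return bigram_counts[(w1, w2)] / unigram_counts[w1]

        print(p_mle("love", "i"))  # 2/3, since "i" occurs 3 times and "i love" twice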

  • @pawanchoure1289
    @pawanchoure1289 2 years ago

    In information theory, perplexity is a measurement of how well a probability distribution or probability model predicts a sample. It may be used to compare probability models. A low perplexity indicates the probability distribution is good at predicting the sample.

  • @pawanchoure1289
    @pawanchoure1289 2 years ago

    A 2-gram (or bigram) is a two-word sequence of words, like “I love”, “love reading”, or “Analytics Vidhya”. And a 3-gram (or trigram) is a three-word sequence of words like “I love reading”, “about data science” or “on Analytics Vidhya”.
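
    A quick sketch of extracting such n-grams from a tokenized sentence (the sentence and whitespace tokenization are chosen just for illustration):

        def ngrams(tokens, n):
            # Slide a window of size n over the token list.
            return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

        tokens = "i love reading about data science".split()
        print(ngrams(tokens, 2))  # bigrams: ('i', 'love'), ('love', 'reading'), ...
        print(ngrams(tokens, 3))  # trigrams: ('i', 'love', 'reading'), ...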

  • @pawanchoure1289
    @pawanchoure1289 2 years ago

    Perplexity is a measurement of how well a probability model predicts a sample. In the context of Natural Language Processing, perplexity is one way to evaluate language models.

  • @pawanchoure1289
    @pawanchoure1289 2 years ago

    The term smoothing refers to the adjustment of the maximum likelihood estimator of a language model so that it will be more accurate. ... When estimating a language model based on a limited amount of text, such as a single document, smoothing of the maximum likelihood model is extremely important.
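
    One basic scheme is add-one (Laplace) smoothing, which adds 1 to every count so unseen n-grams no longer receive zero probability. A sketch for bigrams (the toy corpus and vocabulary are assumptions, not the lecture's numbers):

        from collections import Counter

        tokens = "i love reading i love nlp".split()
        V = len(set(tokens))  # vocabulary size

        unigram_counts = Counter(tokens)
        bigram_counts = Counter(zip(tokens, tokens[1:]))

        def p_laplace(w2, w1):
            # Add-one smoothing: (count(w1, w2) + 1) / (count(w1) + V)
            return (bigram_counts[(w1, w2)] + 1) / (unigram_counts[w1] + V)

        print(p_laplace("love", "i"))       # seen bigram
        print(p_laplace("reading", "nlp"))  # unseen bigram, still nonzero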

  • @pawanchoure1289
    @pawanchoure1289 2 years ago

    The Shannon Visualization Method
    1. Choose a random bigram (<s>, w) according to its probability.
    2. Now choose a random bigram (w, x) according to its probability.
    3. And so on until we choose </s>.
    4. Then string the words together (see the sketch after this list).
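
    A rough sketch of this sampling procedure over a bigram model (the training sentences, the <s>/</s> markers, and the random seed are illustrative assumptions):

        import random
        from collections import Counter, defaultdict

        random.seed(0)
        sentences = [["<s>", "i", "love", "reading", "</s>"],
                     ["<s>", "i", "love", "nlp", "</s>"]]

        # Collect bigram counts, then treat each row as a conditional distribution.
        following = defaultdict(Counter)
        for sent in sentences:
            for w1, w2 in zip(sent, sent[1:]):
                following[w1][w2] += 1

        def generate():
            word, output = "<s>", []
            while True:
                counts = following[word]
                # Sample the next word in proportion to its bigram probability.
                word = random.choices(list(counts), weights=list(counts.values()))[0]
                if word == "</s>":
                    return " ".join(output)
                output.append(word)

        print(generate())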

  • @pawanchoure1289
    @pawanchoure1289 2 years ago

    Perplexity is the inverse probability of the test set, normalized by the number of words. In the case of unigrams, once you have constructed the unigram model, you have the relevant probability for each word, and the perplexity follows directly (a worked sketch follows below).
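
    As a sketch, the unigram case is the N-th root of the inverse product of the word probabilities, PP(W) = P(w1 ... wN)^(-1/N); the unigram probabilities below are made-up illustrative values:

        # Hypothetical unigram probabilities for the 4 words of a tiny test set.
        probs = [0.1, 0.2, 0.05, 0.1]
        N = len(probs)

        inverse_product = 1.0
        for p in probs:
            inverse_product *= 1.0 / p

        perplexity = inverse_product ** (1.0 / N)  # PP(W) = P(w1..wN)^(-1/N)
        print(perplexity)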

  • @pawanchoure1289
    @pawanchoure1289 2 years ago

    What is extrinsic and intrinsic evaluation?
    In an intrinsic evaluation, the quality of an NLP system's outputs is evaluated against a pre-determined ground truth (reference text), whereas an extrinsic evaluation assesses a system's outputs based on their impact on the performance of other NLP systems.

  • @louerleseigneur4532
    @louerleseigneur4532 4 years ago +2

    Thanks sir

  • @pawanchoure1289
    @pawanchoure1289 2 years ago

    unigram prior smoothing

  • @divyanshukumar2605
    @divyanshukumar2605 3 years ago +5

    Never goes in depth on any concept; he just says a bunch of technical words without explaining them explicitly, and even the explanations are copied word for word from Dan Jurafsky's lectures.

  • @sumonchakrabarty6805
    @sumonchakrabarty6805 2 years ago +3

    Worst teacher I have ever seen in my life. He doesn't even know English properly. His vocabulary is worse. These kinds of professors should be fired from IITs immediately. They are polluting the teaching process...

  • @divyanshukumar2605
    @divyanshukumar2605 3 years ago +1

    A third-grade teacher; he should be teaching 5th graders.