Session 7: RAG Evaluation with RAGAS and How to Improve Retrieval

AI Makerspace

14K views

Published: Dec 3, 2023
What you'll learn this session:
- How and why to evaluate RAG systems using best-practice open-source tooling
- RAG Assessment with RAGAS, including Context Precision, Context Recall, Answer Relevancy, and Faithfulness
- How to improve RAG system outputs using advanced retrieval
Speakers:
Dr. Greg Loughnane, Founder & CEO AI Makerspace.
/ greglough. .
Chris Alexiuk, CTO AI Makerspace.
/ csalexiuk
Apply for one of our AI Engineering Courses today!
www.aimakerspace.io/cohorts
Science

Comments • 29

@enceladus96 2 months ago ⁺¹
incredibly informative, not like clickbait or anything like other channels. real 37mins worth of knowledge. Thank you 🙌
@bdoriandasilva 4 days ago
great video, thanks a lot!
@lespaceman 4 months ago
Great presentation guys, full of valuable knowledge 🎉
@mansoorbaig9232 2 months ago ⁺¹
Great job guys. 👏
@AI-Makerspace 2 months ago
Thanks Mansoor!
@micbab-vg2mu 7 months ago ⁺¹
thank you:)
@AI-Makerspace 7 months ago ⁺⁸
Colab Notebook: colab.research.google.com/drive/1TZo2sgf1YFzI4_U-tGppg_ylHAR3MXF_?usp=sharing
Slides: canva.com/design/DAF13fk63Ps/oKNCJf_Oez21fkf0KRW9eA/edit?DAF13fk63Ps&
@someshfengade9623 3 months ago ⁺¹
The slides link is not valid ?
@AI-Makerspace 3 months ago
@someshfengade9623 it looks like the permissions were set to "anyone can edit" and someone went ahead and did that! We've restored the previous version and it should work now!
@RaviPrakash-dz9fm 1 month ago ⁺¹
Can anyone tell me how ragas actually calculates these numbers. Like manually I get it, but what do the algorithms or functions look like? Like how does it measure faithfulness?
@AI-Makerspace 1 month ago ⁺¹
Hey Ravi great question! We go a bit deeper into this in our more recent event with the creators! ruclips.net/user/liveAnr1br0lLz8?si=UG6vRnSY9oVtAuAT
We'd recommend reading through the docs and digging into the source to go EVEN deeper! e.g., docs.ragas.io/en/stable/concepts/metrics/faithfulness.html
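As a rough illustration of the faithfulness question above: the RAGAS docs describe the score as the share of claims in the generated answer that can be inferred from the retrieved context. The sketch below only shows that final ratio. The `ClaimVerdict` type and the hand-written verdicts are hypothetical stand-ins; in the library itself an LLM extracts the claims and judges each one, so treat this as a simplified picture and not the actual implementation.

```python
from dataclasses import dataclass


@dataclass
class ClaimVerdict:
    """One claim extracted from the answer (hypothetical helper type)."""
    claim: str
    supported: bool  # assumption: in practice an LLM judge decides this


def faithfulness_score(verdicts):
    """Fraction of answer claims that are supported by the retrieved context."""
    if not verdicts:
        return 0.0  # no claims extracted: define the score as 0 for this sketch
    supported = sum(1 for v in verdicts if v.supported)
    return supported / len(verdicts)


# Hand-labelled example: one of the two claims is backed by the context.
verdicts = [
    ClaimVerdict("Einstein was born in Germany.", True),
    ClaimVerdict("Einstein was born on 20 March 1879.", False),
]
print(faithfulness_score(verdicts))  # 0.5
```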
@andybrown8438 6 months ago
Thanks for the great video. When did context relevance get broken out into context precision and context recall? The RAGAs paper of 26 September 2023 still refers only to relevance and I'd find it useful to have a source to explain why it was broken into two components. Intuitively it makes sense though.
@AI-Makerspace 6 months ago ⁺¹
Hey @andybrown8438 we're planning another event soon on RAG eval, and are in contact with the RAGAS creators - we'll ask them!
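To make the intuition behind the two retrieval metrics concrete, here is a minimal sketch, assuming the definitions given in the RAGAS documentation: context precision rewards relevant chunks that are ranked near the top, and context recall is the share of ground-truth statements that can be attributed to the retrieved context. The function names and the 0/1 input flags are illustrative assumptions; in the library those judgments come from an LLM.

```python
def context_precision(relevance):
    """Mean of precision@k taken at each rank k that holds a relevant chunk.

    relevance: 0/1 flags for the retrieved chunks, in ranked order.
    """
    total_relevant = sum(relevance)
    if total_relevant == 0:
        return 0.0
    hits = 0
    score = 0.0
    for k, rel in enumerate(relevance, start=1):
        if rel:
            hits += 1
            score += hits / k  # precision@k at this relevant position
    return score / total_relevant


def context_recall(attributed):
    """Share of ground-truth statements that the retrieved context supports.

    attributed: one boolean per statement in the ground-truth answer.
    """
    if not attributed:
        return 0.0
    return sum(attributed) / len(attributed)


# Relevant chunks at ranks 1 and 3: (1/1 + 2/3) / 2
print(context_precision([1, 0, 1]))
# Two of three ground-truth statements are covered by the context
print(context_recall([True, True, False]))
```

Seen this way, the split is natural: precision asks whether what was retrieved is useful and well ranked, while recall asks whether everything needed was retrieved at all.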
@user-wf4mo5kc1y 1 month ago ⁺¹
This is a really great explanation. I have one question: if I want to improve performance by focusing on Faithfulness or Answer Relevance, which RAG optimization techniques should I follow to increase Faithfulness, and which techniques can improve Relevance, Precision, etc.?
@AI-Makerspace 1 month ago
The answer is, unfortunately, it depends! The whole system needs to work together (from data quality, to retrieval quality, to model performance, to prompting), and it needs to work for your use case. What is the best metric to use for your use case? That also depends. It all comes down to metrics-driven development: docs.ragas.io/en/stable/concepts/metrics_driven.html , but you need to decide which direction to drive!
There are some simple things to do after you set up RAG like reranking, but for any given use case the details really matter with regards to what steps you should take.
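For readers unfamiliar with reranking, the idea is to retrieve a generous set of candidates first and then re-score them against the query with a second, usually stronger, scorer before passing the top few to the LLM. The sketch below is a toy version under stated assumptions: `overlap_score` is a hypothetical word-overlap scorer standing in for a real reranking model (for example a cross-encoder), and the function names are illustrative, not taken from the notebook.

```python
def overlap_score(query, doc):
    """Toy relevance score: fraction of query words that appear in the doc."""
    q = set(query.lower().split())
    d = set(doc.lower().split())
    return len(q & d) / len(q) if q else 0.0


def rerank(query, docs, score_fn=overlap_score, top_n=2):
    """Re-score first-stage candidates and keep the top_n highest scoring."""
    scored = [(score_fn(query, doc), doc) for doc in docs]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in scored[:top_n]]


# Candidates as they might come back from a first-stage retriever.
candidates = [
    "home run derby highlights",
    "measure recall",
    "ragas measures faithfulness with claims",
]
query = "how does ragas measure faithfulness"
print(rerank(query, candidates))
# ['ragas measures faithfulness with claims', 'measure recall']
```

Whether a change like this helps is exactly what the evaluation metrics are for: run the same test set before and after, and compare the scores.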
@cynogriffin6678 5 months ago
Hi Chris, very informative video. Can you please tell me how I can generate a test set using Azure in RAGAs?
@AI-Makerspace 4 months ago
You'd want to use a LangChain adapter for Azure - so we can use that to create the test set.
@supergaulig 20 days ago ⁺¹
Good video but one question: Why did you choose to create the testset step-by-step yourself and not use the provided TestSetGenerator from Ragas? Was it not available back then?
@AI-Makerspace 20 days ago ⁺¹
That's right! They had just rolled it out when we had them on for this more recent event: ruclips.net/user/liveAnr1br0lLz8?si=_wIYqsL4vcVM5QDq
@kamalyadav4259 4 months ago
Hi chris
I have a use case for text-to-SQL with RAG using LangChain. Is there any example or guide to evaluate the SQL result? Is the metric the same as regular text RAG? Thanks in advance
@AI-Makerspace 4 months ago
The E2E metrics would likely be the same - and you could create a dataset that lets you compare the intermediate results as well, the same as you saw here.
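One possible way to compare intermediate results in a text-to-SQL pipeline is to run the generated query and a reference query against the same database and check that they return the same rows. This is a sketch of that idea only, not something shown in the video: the `results_match` helper, the `orders` table, and the queries are all hypothetical, and it uses Python's built-in `sqlite3` module with an in-memory database.

```python
import sqlite3


def results_match(conn, generated_sql, reference_sql):
    """Return True if both queries produce the same rows, ignoring row order."""
    generated = conn.execute(generated_sql).fetchall()
    reference = conn.execute(reference_sql).fetchall()
    # Sort by repr so rows containing NULLs can still be ordered.
    return sorted(generated, key=repr) == sorted(reference, key=repr)


# Tiny in-memory database standing in for the real one.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, amount INTEGER)")
conn.executemany("INSERT INTO orders VALUES (?, ?)", [(1, 5), (2, 20), (3, 30)])

# Differently written queries that select the same rows count as a match.
print(results_match(
    conn,
    "SELECT id FROM orders WHERE amount > 10",
    "SELECT id FROM orders WHERE amount >= 11",
))  # True

# A generated query that returns extra rows does not match.
print(results_match(
    conn,
    "SELECT id FROM orders",
    "SELECT id FROM orders WHERE amount > 10",
))  # False
```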
@HosselBossel 6 months ago ⁺¹
Chris I love your explanations and notebooks! But you shouldn't be singing while Greg is talking at 16:49
@AI-Makerspace 6 months ago
😆
@AdamPippert 3 months ago
Why did nobody laugh at Greg’s durag joke?
@AI-Makerspace 3 months ago
😆🤣
@nirash8018 3 months ago
Dude you're over 30 years old. Take the cap off if you want to be taken seriously
@AI-Makerspace 3 months ago
Thanks for the tip @nirash! The h/t, that is. Cheers!
@nirash8018 3 months ago
@AI-Makerspace You're welcome bro. Carry that bald head with pride
@AI-Makerspace 3 months ago
@nirash8018 ✊
