Eugene Yan on Using LLMs as Judges: Insights, Challenges, and Best Practices

  • Published: Nov 15, 2024

Comments • 2

  • @MatijaGrcic • 2 months ago

    Great discussion, thanks.

  • @calmcode-io • 2 months ago

    Interesting. From my experience with annotators, I found it was less about "firing people" who performed badly and more about "rewriting the guidelines". Sometimes an annotator takes the guidelines literally (actually not a bad thing) and as a result generates annotations that the guideline designer did not have in mind. This is also partially why it makes a tonne of sense for the folks who write the guidelines to also annotate on the task themselves.
    It can also help to have an annotation interface where folks are able to flag a task/example as confusing, so that it's easy to reflect on later.
    I have not tried it with LLMs, but my gut says that allowing the LLM to flag an example/task combo as confusing can also really help in designing a few solid prompts.
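
    A minimal sketch of that last idea, assuming the OpenAI Python client; the prompt wording, the "confusing" field, and the model name are illustrative choices, not something from the talk or the comment. The judge returns a verdict plus a flag it can raise when the guidelines don't cover the case, mirroring the "flag as confusing" button suggested for human annotators.

    ```python
    import json
    from openai import OpenAI

    client = OpenAI()

    JUDGE_PROMPT = """You are grading a model response against the guidelines below.
    Guidelines:
    {guidelines}

    Task: {task}
    Response: {response}

    Return JSON with keys:
    - "verdict": "pass" or "fail"
    - "confusing": true if the guidelines are ambiguous or do not cover this case
    - "note": one sentence explaining the verdict or the confusion
    """

    def judge(guidelines: str, task: str, response: str) -> dict:
        # Ask the judge model for a structured verdict; model name is an assumption.
        completion = client.chat.completions.create(
            model="gpt-4o-mini",
            response_format={"type": "json_object"},
            messages=[{
                "role": "user",
                "content": JUDGE_PROMPT.format(
                    guidelines=guidelines, task=task, response=response
                ),
            }],
        )
        return json.loads(completion.choices[0].message.content)

    # Examples where the judge sets "confusing": true point at gaps in the
    # guidelines, and are good candidates to review when revising the prompt.
    ```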