Structured generation hurts LLM reasoning performance (Paper Explainer)

Elvis Saravia

2K views

Published: Nov 17, 2024

Comments • 7

@ThePuddu2 3 months ago ⁺²
I would be curious to see a variation of NL-to-Format tested in a single generation instead of two consecutive ones. That is: reply to this (thinking step by step, plus all the other instructions), then include a JSON version of the response in the following format: { ... }.
From what I'm seeing, it seems to improve overall reasoning quality while keeping JSON for parsing in industrial applications. It would be nice to have it formally tested and properly benchmarked.
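The single-generation variant described in this comment can be sketched roughly as follows. This is a minimal illustration, not something taken from the paper or the video: `build_prompt` and `extract_last_json` are hypothetical helper names, the prompt wording is an assumption, and the model call itself is left out (a hand-written sample response stands in for it).

```python
import json


def build_prompt(question: str, json_format: str) -> str:
    """Ask for free-form reasoning first, then a JSON copy of the answer.

    The wording is an assumption made for illustration only.
    """
    return (
        f"{question}\n\n"
        "Think step by step and explain your reasoning in plain text.\n"
        "Then, at the end, include a JSON version of your answer "
        f"in the following format: {json_format}"
    )


def extract_last_json(text: str) -> dict:
    """Return the last parseable top-level JSON object in the response."""
    decoder = json.JSONDecoder()
    last = None
    pos = 0
    while True:
        start = text.find("{", pos)
        if start == -1:
            break
        try:
            # raw_decode parses one JSON value starting at `start`
            # and returns it with the index where it ended.
            obj, end = decoder.raw_decode(text, start)
        except json.JSONDecodeError:
            # A brace inside the reasoning text that is not valid JSON.
            pos = start + 1
            continue
        last = obj
        pos = end
    if last is None:
        raise ValueError("no JSON object found in response")
    return last


# Hand-written stand-in for a model response (not real model output).
sample_response = (
    "Step 1: 3 + 4 = 7.\n"
    "Step 2: 7 * 2 = 14.\n"
    "The set {3, 4} is only used once.\n"
    '{"answer": 14}'
)

result = extract_last_json(sample_response)
print(result["answer"])  # prints 14
```

The reasoning stays unconstrained, and only the trailing JSON object is parsed, so stray braces in the reasoning text do not break downstream code. Whether this actually matches or beats the two-step NL-to-Format setup would still need the kind of benchmark the comment asks for.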
@bastabey2652 3 months ago ⁺⁴
My understanding is that Gemini 1.5 Pro and GPT-4o/4 were specially trained on constrained structured output (CSO). Furthermore, Gemini Flash doesn't support JSON mode, and when I tested OpenAI structured output with ChatGPT 3.5, it didn't work. I haven't tested Claude's JSON support enough to comment. So the paper's results don't apply to the latest state-of-the-art CSO models like GPT-4o and Gemini 1.5 Pro. I agree with the paper's conclusion in the case of the low-end models; it's a result that anyone who has built an AI application with these models has witnessed.
Thank you for the informative video.
@elvissaravia 3 months ago
Thanks for sharing your experience. You definitely need to monitor performance constantly for this specifically. The benchmarks are also not representative of all real-world tasks, and they mention that in the discussion section.
@sfilkin 3 months ago ⁺³
You choose papers well.
@elvissaravia 3 months ago
I try to, based on audience interest. It's hard to choose sometimes, with so many papers coming out every day.
@gr8tbigtreehugger 3 months ago ⁺¹
Many thanks!
@k0b0yash1 1 month ago
Of course the JSON-restricted prompt performed worse, as it removed chain-of-thought.
