NEW CriticGPT by OpenAI: RLHF + FSBS

GraphRAG or SpeculativeRAG ?

The moment we stopped understanding AI [AlexNet]

Grocery Store Full of Undiscovered Priceless All Original Cars | Barn Find Hunter

백현 (BAEKHYUN) 'Pineapple Slice' MV

BROOKE IS BACK ON THE CANCELLED PODCAST - Ep. 95

Improve AGENTIC AI (Princeton)

code_your_own_AI

Просмотров 3,5 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 6 сен 2024

Комментарии • 13

@Kriss-studios Месяц назад ⁺⁴
StoneAge, IronAge, ModernAge, AgenticAge❤
@davidwynter6856 Месяц назад
Thank you, I read a lot of papers working full time on GenAI projects, but missed the ground changing paper you presented. A comment on the economics of GenAI, it is clear to me that new models like Jamba, with linear complexity, with their equivalent performance to Transformer based LLMs, with quadratic complexity, will come to the fore. I have experience using Ray Tune, so that will be my optimizer :)
@user-vs3tt8xc6j Месяц назад
Regarding the comparison of complex agents and retry.
Did the agents provide 1 answer or a choice from the top 10?
It is incorrect to compare the top 1 with the top 10.
I would like to see a comparison of the top 1. After all, in practical tasks, I most often need one specific correct answer, not a bunch of answers among which there is a correct one.
Also, the agent explains its actions. They are divided into stages. It's easier to find errors in its reasoning. All else being equal, this can be an extremely important criterion for solving the task.
@code4AI Месяц назад
Some commercial agents can be black boxes. And it is not uncommon, that agents perform internal majority voting to present the "correct" answer to you, an answer with the highest probability score. As with the example of SWE, I can't follow several hundred of thousand tokens for a $4 run.
@user-vs3tt8xc6j Месяц назад
@@code4AI It seems that the agent-based approach does not improve the reasoning capabilities of networks, BUT:
It allows for the decomposition of reasoning into stages, the correctness of which can be verified by instrumental means (checking the validity of the logical construction, code compilation, passing tests, etc.).
It allows for an increase in the length of the correct reasoning chain, i.e., to improve the perplexity of the response in a long context. For example, to write a coherent, logically, and stylistically correct book.
And the complexity of real tasks lies precisely in their multi-stage nature. This involves a long context of reasoning and actions, the correctness of which needs to be maintained. Are agent systems evaluated by the right benchmarks?
However, I do have questions about the feasibility of agent systems. Won't they be eventually overtaken by LLMs that can maintain a very long context and independently generate requests for various actions?
Are there any fundamental reasons to consider the agent-based approach as something unique and irreplaceable in the near future?
@user-vu4or4ih8p Месяц назад
Thanks
@user-de9hv2gu9z Месяц назад
very insightful! thanks
@code4AI Месяц назад
Thank you.
@ProgressRobotics Месяц назад
Can I do optimization on langgraph agents?
@code4AI Месяц назад
You can run an optimization on almost any system ...
@christopherc168 Месяц назад
Get out of my bubble
@code4AI Месяц назад
See you.

Следующие

Автовоспроизведение

NEW CriticGPT by OpenAI: RLHF + FSBS

NEW CriticGPT by OpenAI: RLHF + FSBS

GraphRAG or SpeculativeRAG ?

GraphRAG or SpeculativeRAG ?

The moment we stopped understanding AI [AlexNet]

The moment we stopped understanding AI [AlexNet]

Grocery Store Full of Undiscovered Priceless All Original Cars | Barn Find Hunter

Grocery Store Full of Undiscovered Priceless All Original Cars | Barn Find Hunter

백현 (BAEKHYUN) 'Pineapple Slice' MV

백현 (BAEKHYUN) 'Pineapple Slice' MV

BROOKE IS BACK ON THE CANCELLED PODCAST - Ep. 95

BROOKE IS BACK ON THE CANCELLED PODCAST - Ep. 95

The Hidden Engineering of Landfills

The Hidden Engineering of Landfills

AI Game Theory explained for Multi-Agents

AI Game Theory explained for Multi-Agents

🔴 This Agentic AI Workflow Will Take Over 🤯 Algorithm + Papers Explained

🔴 This Agentic AI Workflow Will Take Over 🤯 Algorithm + Papers Explained

"Don't Learn to Code, But Study This Instead..." says NVIDIA CEO Jensen Huang

"Don't Learn to Code, But Study This Instead..." says NVIDIA CEO Jensen Huang

AI Pioneer Shows The Power of AI AGENTS - "The Future Is Agentic"

AI Pioneer Shows The Power of AI AGENTS - "The Future Is Agentic"

What are AI Agents?

What are AI Agents?

AI Deception: How Tech Companies Are Fooling Us

AI Deception: How Tech Companies Are Fooling Us

What's next for AI agentic workflows ft. Andrew Ng of AI Fund

What's next for AI agentic workflows ft. Andrew Ng of AI Fund

Andrew Ng On AI Agentic Workflows And Their Potential For Driving AI Progress

Andrew Ng On AI Agentic Workflows And Their Potential For Driving AI Progress

#science #факты #animal #commonraven #воронгоша #birds

#science #факты #animal #commonraven #воронгоша #birds

Каким женщинам мужчины дарят подарки?

Каким женщинам мужчины дарят подарки?

Прохожу маску ЭМОЦИИ🙀 #юмор

Прохожу маску ЭМОЦИИ🙀 #юмор

Fixing Plastic with Staples

Fixing Plastic with Staples

Самый лучший Обзор Space Marine 2 / Булджать

Самый лучший Обзор Space Marine 2 / Булджать

Популярные сквиши 🐾😱 #виола #шортс

Популярные сквиши 🐾😱 #виола #шортс

ОХОТА НА БОРОВУЮ ДИЧЬ У ИЗБЫ ГЛУХАРИНОЙ. РАЗВЕДКА БОЛОТНОГО ОЗЕРА.

ОХОТА НА БОРОВУЮ ДИЧЬ У ИЗБЫ ГЛУХАРИНОЙ. РАЗВЕДКА БОЛОТНОГО ОЗЕРА.