Do you have another channel that you publish your experiments on? Like for your work with robots?
I would love to see videos like that either on this channel or a similar channel that you run.
Very interesting! I can't wait to see this in practice. Can you make a video of its practical applications and how to implement them?
That's why I like the Swarm framework: as the model gets better, it can use those powers to pick the best agents and then the best tools/functions, rather than doing the extended work itself. The tools get everything done, and the agents just help integrate the fuzzy communication between processes.
I tend to agree. SWARM demonstrates how this "fuzzy communication" can be realized recursively through function calling. I wrote about this in an article titled "SWARMing Conversational AI: Integrating No-Code and Code in Agent-Based Workflows," which you can find on LI.
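The handoff pattern described above can be sketched in plain Python. This is an illustrative stand-in, not the actual Swarm API: the names `Agent`, `run`, and `transfer_to_math` are my own, and the "model" step is simulated. The core idea it demonstrates is real, though: a tool call that returns another agent is treated as a handoff, so the deterministic work lives in ordinary functions and the model's only job is picking which function to call.

```python
# Minimal sketch of Swarm-style handoff via function calling (illustrative
# stand-in, not the real Swarm API): a tool that returns an Agent triggers
# a handoff; concrete tools do the actual work.
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class Agent:
    name: str
    functions: list = field(default_factory=list)


def add(a: int, b: int) -> int:
    # A concrete tool: deterministic work lives here, not in the model.
    return a + b


math_agent = Agent("math", functions=[add])


def transfer_to_math() -> Agent:
    # Returning an Agent is the handoff signal.
    return math_agent


triage = Agent("triage", functions=[transfer_to_math])


def run(agent: Agent, picked: Callable, *args):
    """Stand-in for the model loop: assume the model already 'picked' a tool."""
    result = picked(*args)
    if isinstance(result, Agent):
        # Handoff: continue the conversation with the new agent.
        return result
    return result


# Triage hands off to the math agent, which then uses its calculator tool.
current = run(triage, transfer_to_math)
answer = run(current, add, 2, 2)
```

The point of the pattern: as models improve, only the `picked` step gets smarter, while the tools and handoff plumbing stay fixed.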
Alice isn't wozz, but Scott most definitely is.
AI models don't actually "understand" reasoning? News flash: neither do humans. It's all about efficacy. If it walks like a duck and quacks like a duck, guess what?
Oh MAN! I love your stuff but you REALLY dropped the ball here. Ok, all they've done is say "Hey! The model is bad at math! Give it a calculator!" There is no reasoning here. This EXCLUSIVELY applies to consistent mathematical reasoning suitable for programming. Why are they even using a model here? All it is doing is coding. No one is testing the model's logic AT. ALL.
Look, if you can parameterize and regularize it, USE A COMPUTER! LLMs are good at stuff Turing machines aren't. You want to impress? Do it WITHOUT an external tool. Now, I like your logic test. THAT'S good. Until you run it on a non-model environment.
No, they ask "How do we evaluate and improve the model's reasoning?" and their answer is "Outsource the reasoning to something that isn't a model."
LAME.
I mean, even their basic test that came up with the 20% error rate: that didn't show a failure of _logic_. It could much more easily be explained by basic innumeracy. The model can have the logic cold, but if 2 + 2 = 5 because it had a brainfart, it will still fail.
bootiful