What are AI Agents?

AI Pioneer Shows The Power of AI AGENTS - "The Future Is Agentic"

AI Agents Every Business Needs to Skyrocket Efficiency and Cut Costs

Pacific Championships 2024 | Fiji Bati v PNG Kumuls | Extended Highlights

Stuck In A Car With Jiji Wonder (first kiss)

I Bought A Haunted House!

Evals for AI Agents, the right way!!!

1littlecoder

Просмотров 1,5 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 28 окт 2024
Наука

Комментарии • 10

@MindForeverVoyaging 2 месяца назад ⁺²
I have a similar agent setup that takes a similar approach and have found that Anthropic's claude-3-5-sonnet-20240620 model (not shown in the table) seems much better than OpenAI's GPT-4o model at determining what function tool to use in a given context. The approach I took was not to provide information in the main agent's system prompt about the functions that it has available to it, but instead, the agent should be able to 'associate' which function to call from the OpenAPI definitions which are part of each available function in the agent's tools.
This is all subjective, but in my conversations with the main agent I found that when I asked for something to be done, the 3.5 sonnet model would use the correct function and arguments the majority of the time but the Gpt4-o model would quite often have to be reminded that it had the function available to it, having been reminded, the agent would then make the correct call. As the paper pointed out about the open source models, their context to function 'association' is much lower and as a result, cannot be relied upon and are therefore mostly useless for this type of approach.(I was using llama3.1 thru groq)
@1littlecoder 2 месяца назад
This is a great validation of the paper
@GNARGNARHEAD 2 месяца назад ⁺¹
oh heck yeah, love some paper reviews 👍
@1littlecoder 2 месяца назад ⁺¹
@@GNARGNARHEAD glad there's some interest still 😁🙏🏾
@MichealScott24 2 месяца назад
❤
@AI-Wire 2 месяца назад ⁺¹
When do you think we will have computer-using agents?
@1littlecoder 2 месяца назад
Open interpreter kind of tried. People have tried the same with GPT-4o but nothing is quite yet there
@dhruvmehta2377 2 месяца назад ⁺²
💯❤️‍🔥
@1littlecoder 2 месяца назад ⁺¹
🔥

Следующие

Автовоспроизведение

What are AI Agents?

What are AI Agents?

AI Pioneer Shows The Power of AI AGENTS - "The Future Is Agentic"

AI Pioneer Shows The Power of AI AGENTS - "The Future Is Agentic"

AI Agents Every Business Needs to Skyrocket Efficiency and Cut Costs

AI Agents Every Business Needs to Skyrocket Efficiency and Cut Costs

Pacific Championships 2024 | Fiji Bati v PNG Kumuls | Extended Highlights

Pacific Championships 2024 | Fiji Bati v PNG Kumuls | Extended Highlights

Stuck In A Car With Jiji Wonder (first kiss)

Stuck In A Car With Jiji Wonder (first kiss)

I Bought A Haunted House!

I Bought A Haunted House!

Gucci Mane & Sexyy Red - You Don't Love Me [Official Music Video]

Gucci Mane & Sexyy Red - You Don't Love Me [Official Music Video]

Google's secret AI Agent....more AI news!!!

Google's secret AI Agent....more AI news!!!

How to Improve Blender's UI - Andrew Price - Blender Conference 2024

How to Improve Blender's UI - Andrew Price — Blender Conference 2024

SpaceX Secrets Leaked By Diablo Player - Deep Space Updates October 28th

SpaceX Secrets Leaked By Diablo Player - Deep Space Updates October 28th

This intense AI anger is exactly what experts warned of, w Elon Musk.

This intense AI anger is exactly what experts warned of, w Elon Musk.

Has Generative AI Already Peaked? - Computerphile

Has Generative AI Already Peaked? - Computerphile

Google Releases AI AGENT BUILDER! 🤖 Worth The Wait?

Google Releases AI AGENT BUILDER! 🤖 Worth The Wait?

How AI 'Understands' Images (CLIP) - Computerphile

How AI 'Understands' Images (CLIP) - Computerphile

The Future Of AI Agents With Dharmesh Shah | INBOUND 2024

The Future Of AI Agents With Dharmesh Shah | INBOUND 2024

BabyAGI is back!!! 💥Self-Building Agents Framework💥

BabyAGI is back!!! 💥Self-Building Agents Framework💥

ЧТО? И это ЛУЧШИЙ РЕДМИК в 2025-м? Redmi Note 14 Pro Plus - сравнил с Realme 13 Pro+

ЧТО? И это ЛУЧШИЙ РЕДМИК в 2025-м? Redmi Note 14 Pro Plus – сравнил с Realme 13 Pro+

XIAOMI 14T - Наконец-то Народный ФЛАГМАН 2024 Года Без Переплат? ЧЕСТНЫЙ ОТЗЫВ

XIAOMI 14T – Наконец-то Народный ФЛАГМАН 2024 Года Без Переплат? ЧЕСТНЫЙ ОТЗЫВ

Месяц собирал ПК. И все ещё не собрал…

Месяц собирал ПК. И все ещё не собрал…

Я купил Android - а он точно лучше моего iPhone?

Я купил Android — а он точно лучше моего iPhone?

Creepy Samsung Alarm 008 tutorial #shorts

Creepy Samsung Alarm 008 tutorial #shorts

Первые тесты Core Ultra 9 285K, 7 265K, 5 245K. Теперь официально. "Фуфыкс с патанцевалом" от Intel?

Первые тесты Core Ultra 9 285K, 7 265K, 5 245K. Теперь официально. "Фуфыкс с патанцевалом" от Intel?

Many people do not know the secret of the safety pin.

Many people do not know the secret of the safety pin.

Самый быстрый интернет #связь #казахстан #5g

Самый быстрый интернет #связь #казахстан #5g