Yeah, no basement dweller dev's are gonna be messing with that API until the costs drop by at least 100x, which I honestly only see as a near term incentive for Meta to get a Llama Voice model cookin'
I'll use it, but can't wait for an uncensored open source version. Text only is too boring. I lack the patience to use text only for too long for the tasks I want, like learning languages.
Well said. 3 tests today, ~2 mins each conversation: $1.50. Yikes!
can you share the url to the repo?
where can i find the code? pls help
Looking forward to seeing your alleged "spaghetti" code! (Right now your latest repo is from two weeks ago.)
Great work! You must have had a busy couple of days getting it working
you can really implement it in a few hours
I don't think this is the same model as advanced voice mode.
The Realtime API cost is high. I suggest a cheaper way: 1. Use Google STT to transcribe the user's speech. 2. Send the text to GPT. 3. Get the response from GPT. 4. Send the response to Google TTS. 5. The user gets the AI response as both text and voice. The response time is longer, but it costs less.
In that case, you don't need the Realtime API; I think the OpenAI chat completions API works just fine.
I think the key point is that the Realtime API doesn't miss any information in your voice (tone, intonation, or accent), which means it can feel like a real person, or at least it's trying to.
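The cheaper pipeline described above can be sketched as one round-trip function. This is just a data-flow sketch: the three backends are plain callables, and the stubs below are placeholders for real Google STT/TTS and OpenAI chat completions calls (none of the names here come from those libraries).

```python
# Sketch of the STT -> GPT -> TTS pipeline; swap the stubs for real API calls.

def voice_chat_turn(stt, chat, tts, audio_bytes, history):
    """One round trip: audio in, (text, audio) out. Appends to history."""
    user_text = stt(audio_bytes)                           # 1. speech -> text
    history.append({"role": "user", "content": user_text})
    reply_text = chat(history)                             # 2-3. send to GPT, get reply
    history.append({"role": "assistant", "content": reply_text})
    reply_audio = tts(reply_text)                          # 4. reply text -> speech
    return reply_text, reply_audio                         # 5. user gets both

# Stub backends just to show the flow; replace with real STT/LLM/TTS clients.
stt = lambda audio: "hello"
chat = lambda msgs: f"echo: {msgs[-1]['content']}"
tts = lambda text: text.encode()

history = []
text, audio = voice_chat_turn(stt, chat, tts, b"\x00\x01", history)
```

The trade-off is as the comment says: each turn is three sequential network calls, so latency is higher than the Realtime API's single streaming connection.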
Happy to be the first to comment. Kris you are always up to date. Once again cool stuff from you. Spaghetti code... 🤣. Great that you did talk about the costs as well. I like your creative and often real funny ideas. Please keep up the great work! Regarding your phone call: saw a video from a guy in the US weeks ago (no Realtime API) - he did let his AI order a Pizza and it worked great. Latency even back then was good enough - should work perfectly. Maybe try it with an italian accent 😉. Thx from Tom!
I just integrated it on Twilio, it changes everything, but it took me a bit of time.
Great video, thanks Kris! I'm interested in function calling and structured output from the voice websocket responses. Can you use agents or agentic flows with constrained and structured outputs in voice mode 🤔
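Function calling does work over the Realtime websocket: you register tools with a `session.update` event. A minimal sketch of that event follows; the field names match the beta docs as I recall them, and `get_weather` is a made-up example tool, so verify against the current API reference before relying on it.

```python
import json

# Hedged sketch: register a function tool on a Realtime API session.
# The event would be sent as JSON over the open websocket.
session_update = {
    "type": "session.update",
    "session": {
        "modalities": ["text", "audio"],
        "tools": [{
            "type": "function",
            "name": "get_weather",  # hypothetical tool, not a real API
            "description": "Look up the weather for a city.",
            "parameters": {         # JSON Schema constrains the arguments
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        }],
        "tool_choice": "auto",
    },
}

payload = json.dumps(session_update)
# ws.send(payload)  # on an open websocket to the Realtime endpoint
```

The JSON Schema in `parameters` is what gives you the constrained, structured arguments when the model decides to call the tool.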
Which function controls the interruption?
VAD
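To expand on that reply: interruption is driven by server-side voice activity detection, configured in `session.update`; when the server reports the user speaking again, the client cancels the in-flight response. A sketch of both events, with field names from the beta docs (double-check the current reference, and the numeric values here are illustrative defaults):

```python
import json

# Hedged sketch: enable server VAD so the API detects when the user
# starts/stops talking, which is what makes interruption possible.
vad_config = {
    "type": "session.update",
    "session": {
        "turn_detection": {
            "type": "server_vad",
            "threshold": 0.5,            # speech-probability cutoff (illustrative)
            "silence_duration_ms": 500,  # silence that ends the user's turn
        }
    },
}

# When the client sees an `input_audio_buffer.speech_started` event while
# the assistant is still talking, it stops the current response:
cancel_event = {"type": "response.cancel"}

payload = json.dumps(vad_config)
# ws.send(payload); later: ws.send(json.dumps(cancel_event))
```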
I don't quite understand what realtime means here, especially in the text version.
In the voice version, yes, you can interact with it like you're really talking to a person: you can interrupt the conversation, and maybe OpenAI can pick up extra information from your tone, intonation, or accent.
But in the text version, I don't see any difference from just using the OpenAI chat completions API.
Could you achieve these results in an app using just the native iOS text-to-speech and speech-to-text features alongside the OpenAI NON-realtime APIs?
Would love to bankrupt myself with your code, I won't judge spaghetti. Tried for 20 prompts with the new Claude to get it up and running, no dice. Examples would be much appreciated :)
Where to find code?
can we have speech/voice as input to this app using websockets and get result as text as output?
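Yes, that combination should be possible: you stream audio in over the websocket but restrict the session's output modalities to text. A sketch of the two client events involved, with event names as I recall them from the beta docs (confirm against the current reference):

```python
import base64
import json

# Hedged sketch: voice in, text out on the Realtime websocket.
session = {
    "type": "session.update",
    "session": {"modalities": ["text"]},  # ask for text-only responses
}

# Audio is appended to the input buffer as base64-encoded PCM chunks.
pcm_chunk = b"\x00\x00" * 160  # fake 16-bit PCM audio for illustration
append_audio = {
    "type": "input_audio_buffer.append",
    "audio": base64.b64encode(pcm_chunk).decode(),
}

payload = json.dumps(session)
# ws.send(payload); ws.send(json.dumps(append_audio))
# The reply then streams back as text delta events instead of audio.
```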
This is mindblowing...
Can it work for other languages such as Urdu or Hindi?
Why wouldn't you share the repo?
Does everyone have access to this beta? Anything we have to do?
where can i get the repo?
Can't you just prompt it to be less talkative so you don't have to interrupt its response that often? That would make a big difference and everything more seamless :)
where is the code?
Thanks :)
can you provide a code ? please
I still don't have access to it :/
What is the code used?
It's not out yet
@khalifarmili1256 How long will it take?
What is AVA?
Doesn't it have emotions?
No one is going to be even able to develop at these prices other than those with deep pockets. Just testing and figuring things out would be too expensive to even try.
I tested yesterday, but:
Error connecting: 403
Access denied. Check your API key and your permissions to use the Realtime API.
try checking your api key or just making a new one
@elprox1290 again, thanks
By telling it it is playing a game with the user, it might be failing on purpose to let you win!
I'm waiting to hear the Irish accent to be sure
How to get the repo ?
Can you share the repo link ?