Your content is top notch man. It’s really easy to digest.
the #1 video I look forward to each week, sent you an email as well
That first one may as well have called you senpai. 😂
Amazing, so much value in a PA. Can't wait for speech-to-speech models to come to the market, super natural convos.
As we say in the UK - this is the dog's bollocks! Fantastic work as always, Dan.
Would it be possible to further reduce latency by streaming both the transcription and the response?
That’s what I was thinking too
Amazing content - keep up this combo of practical + high level videos 💪
Same thing I’ve been playing with this week. I want to carve out some time to take chunks from the stream to create a loop of sending and receiving from the TTS API for super low latency.
I think the biggest problem is maintaining continuity of the voice, so perhaps render the first sentence locally and, while it plays, assemble the next audio file or two server-side, then send them back? Something like the sketch below.
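A minimal sketch of that loop, assuming hypothetical synth_local / synth_remote / play stubs you'd swap for a real local engine, your TTS API of choice, and an audio player:

```python
import queue
import threading

def synth_local(sentence: str) -> bytes:
    return b""  # hypothetical: fast, lower-quality local render

def synth_remote(sentence: str) -> bytes:
    return b""  # hypothetical: higher-quality TTS API render

def play(audio: bytes) -> None:
    pass  # hypothetical: blocking audio playback

def speak(sentences: list[str]) -> None:
    clips = queue.Queue()

    def renderer():
        # Render the remaining sentences via the API while the first plays.
        for s in sentences[1:]:
            clips.put(synth_remote(s))

    threading.Thread(target=renderer, daemon=True).start()
    play(synth_local(sentences[0]))  # first sentence starts immediately
    for _ in sentences[1:]:
        play(clips.get())  # later clips should already be queued
```

The voice-continuity worry is real though: a local first sentence followed by API-rendered clips will sound like two different speakers unless both ends use the same voice.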
You can get faster calls using Groq's new LARGE Whisper and Llama 3.1 70B. If you stream the audio ASAP and keep the chunk sizes tiny, you can get down to sub-1-second responses. Roughly the shape sketched below.
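Something like this with the groq Python client; the model IDs are assumptions based on Groq's catalogue at the time and may have changed:

```python
import os
from groq import Groq

client = Groq(api_key=os.environ["GROQ_API_KEY"])

# Speech-to-text on a small audio chunk.
with open("chunk.wav", "rb") as f:
    transcript = client.audio.transcriptions.create(
        file=("chunk.wav", f.read()),
        model="whisper-large-v3",
    )

# Stream the reply token by token so TTS can start before it finishes.
stream = client.chat.completions.create(
    model="llama-3.1-70b-versatile",
    messages=[{"role": "user", "content": transcript.text}],
    stream=True,
)
for chunk in stream:
    delta = chunk.choices[0].delta.content
    if delta:
        print(delta, end="", flush=True)  # hand off to TTS here instead
```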
Do you have any repos on hand that showcase this setup?
Many thanks!
@zipaJopa I can dig one up and put it on Git, gimme a few hours. I have an iOS version but it's faffy; will reduce it to Python and write a simple readme.
@zipaJopa Just rewrote the script, will post it to Git in a few mins and share the link here. (y) Might make a video on it actually... Thanks for asking.
@6lack5ushi my hero! 💕
@6lack5ushi where's the link?
That's neat. I've been trying to get TTS to work for open-interpreter, which I use a lot with gpt-4o-mini; something like the sketch below is what I'm aiming for.
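A minimal sketch using OpenAI's TTS endpoint, with tts-1 and the alloy voice as placeholder choices; the open-interpreter wiring is left out:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

reply = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Say hi in one sentence."}],
).choices[0].message.content

# Stream the synthesized speech straight to a file.
with client.audio.speech.with_streaming_response.create(
    model="tts-1", voice="alloy", input=reply
) as speech:
    speech.stream_to_file("reply.mp3")  # play back however you like
```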
do you know what the word "own" means?
What would be the main difference between the custom voice assistant and the OpenAI voice mode?
This was so great. I left you a PR on the repo, waiting for your feedback. Would love to demo it for you.
Is there a way to run this on your mobile device and have an open, call-style conversation with it?
Amazing job!!!
Love the content but I think it would be helpful to at least mention relative costs. You've said in the past that the API costs are worth the investment. You are likely correct but probably still worth mentioning. Thanks again for the content.
Does anybody know an STT model that could be run locally for live transcription?
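faster-whisper is one commonly used local option; a minimal sketch, with mic capture and rolling chunking (e.g. via sounddevice) left out:

```python
from faster_whisper import WhisperModel

# Small English model quantized to int8 runs comfortably on CPU.
model = WhisperModel("base.en", device="cpu", compute_type="int8")

# Transcribe one short chunk; repeat on a rolling buffer for "live" output.
segments, _info = model.transcribe("chunk.wav", vad_filter=True)
for seg in segments:
    print(f"[{seg.start:.1f}s -> {seg.end:.1f}s] {seg.text}")
```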
The top area for this will be cybersecurity, hands down.
In years' time there will be entire cyberwars waged and run by AI systems.
perfect
ElevenLabs' pricing structure is prohibitive. Very prohibitive.
"Personal AI is TOO valuable to leave in the hands of Big Tech"... proceeds to build his AI assistant with APIs hosted by "big tech". Kinda expected to see whisper, llama, mycroft or sth.
This is just a Voiceflow-type chatbot; it's not your own assistant.
Good point. I don’t really understand the point of this.
cringe lol