Gemini Flash is SURPRISINGLY Good for Agents and Function Calling

TUTORIAL on How to Make a Roblox Game | Seribical

What is Retrieval-Augmented Generation (RAG)?

IDK HOW (MUSIC VIDEO) KARAN AUJLA | FOUR ME EP | LATEST PUNJABI SONGS 2024

CAN OFFLINETV CUT THE PERFECT HALF?

Remble - Colors (ft. Mozzy & Stoneda5th) [Official Music Video]

Creating J.A.R.V.I.S.

Prompt Engineering

Просмотров 3,8 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 15 май 2024
A sneak peek of voice-to-voice chat assistant.
🦾 Discord: / discord
☕ Buy me a Coffee: ko-fi.com/promptengineering
|🔴 Patreon: / promptengineering
💼Consulting: calendly.com/engineerprompt/c...
📧 Business Contact: engineerprompt@gmail.com
Become Member: tinyurl.com/y5h28s6h
💻 Pre-configured localGPT VM: bit.ly/localGPT (use Code: PromptEngineering for 50% off).
Signup for Advanced RAG:
tally.so/r/3y9bb0
All Interesting Videos:
Everything LangChain: • LangChain
Everything LLM: • Large Language Models
Everything Midjourney: • MidJourney Tutorials
AI Image Generation: • AI Image Generation Tu...
Наука

Комментарии • 50

@MeinDeutschkurs Месяц назад ⁺²
Wooohooo!! Yeah, can‘t wait for it! ⭐️
@barackobama4552 Месяц назад ⁺²
Impressive, thanks!
@comfyuiadrian Месяц назад
Wahooo..really looking forward to your new project!
@engineerprompt Месяц назад
thank you!
@Techonsapevole Месяц назад ⁺¹
it's fast which TTS and STT did you use ?
@engineerprompt Месяц назад
All openai
@Thorin632 Месяц назад
Please make beginner friendly tutorial, step by step guide on how to integrate this with localgpt 🙏🙏
@brianpereira7757 Месяц назад ⁺²
That doesnt sound like Jarvis, I want the real Jarvis voice!!!
@engineerprompt Месяц назад ⁺¹
Good point, I think elevanlabs have that. Will try to integrate that :)
@sayantandas7544 Месяц назад
@@engineerprompt How about you add a little UI also? And maybe add a button to take continuous screenshot with a regular interval as well. In that way, you will be releasing the OpenAI's demo app before OpenAI.
@aa-xn5hc Месяц назад
Great looking forward
@engineerprompt Месяц назад
thanks
@RickySupriyadi Месяц назад
yes please is it going open source?
@user-jq1gc8lt7s Месяц назад
I LIKE IT GREAT JOB
@engineerprompt Месяц назад
thank you :)
@joepropertykey3612 Месяц назад
Right on Bro, RIGHT ON. ......... but we need the voice of Cortana for this, for when we are sitting around in our Mark V Armor and coding...:)
@engineerprompt Месяц назад
:)
@3choff Месяц назад
Very interesting project! Do you use any VAD to detect the end of the request?
@engineerprompt Месяц назад
At the moment no.
@GetzAI Месяц назад
EXCITED!
@engineerprompt Месяц назад
:)
@GroqSummarizer Месяц назад
Nice!
@RickySupriyadi Месяц назад
also i request a video about this vs gpt-4o
@themax2go Месяц назад
should edit title to add "using openai"
@im-notai Месяц назад
Idk know, why there is a folder on my desktop named Jarvis-v6 since 5 months and surprisingly that's also doing the same job 😮
@engineerprompt Месяц назад
Would love to see what's in the folder :D I am v0 now
@im-notai Месяц назад
@@engineerprompt it's gonna become interesting. I thought I was the one who was able to crack speech while streaming to reduce the latency.
@KiyotokaAyanakoji-ss1gn Месяц назад ⁺²
What TTS are you using and is it running locally
@engineerprompt Месяц назад ⁺³
Whisper but via the api. Nothing is running locally in this video. Local version will be coming soon.
@KiyotokaAyanakoji-ss1gn Месяц назад
@@engineerprompt loved it 👍
@Gun_ForFun Месяц назад ⁺¹
@@engineerprompt but Whisper is ASR, not TTS??
@snapman218 Месяц назад
Gross.
@themax2go Месяц назад
someone already made a fully local version and works w/ little latency and with voice training. there already exist projects on github for continuous speech using a keyword to trigger recording, and a version with a ptt implementation instead of keyword
@borisrusev9474 Месяц назад
I don't get it, how's that different from GPT-4o?
@engineerprompt Месяц назад ⁺¹
You are right, very similar in functionality. In fact, this version is using GPT-4o for text generation. But the voice functionality is not available in GPT-4o yet.
@smoofwah3552 Месяц назад
Is there a way to speed it up?
@engineerprompt Месяц назад
Yes, Groq has whisper support now. Going with that but the issue is the rate limit!
@alx8439 Месяц назад
To use rhasspy3 as a base. It streams audio directly to asr model
@Soniboy84 Месяц назад
how it's different than gpt4o voice?
@engineerprompt Месяц назад
that is not available yet :)
@danieldjinishiandebriquez1858 Месяц назад
What apis are being used?
@engineerprompt Месяц назад
currently everything is openai. Just got access to whisper from Groq, will update it and hope will be much faster!
@danieldjinishiandebriquez1858 Месяц назад
@@engineerprompt great! Looking forward the tutorial or git repo. Literally yesterday I was searching about Jarvis haha
@temp911Luke Месяц назад
Nice but would be great without that annoying 2-3 sec delay.
@engineerprompt Месяц назад
I agree, I just got access to Groq Whisper. Will be interesting to see how that works.
@fontende Месяц назад
@@engineerpromptGeorge Hotz on stream called groq a scam...
@themax2go Месяц назад ⁺²
not local. not the jarvis voice. misleading title. disappointed
@javiergimenezmoya86 Месяц назад
Why do you think that is not local? The only bad thing is that he do not use voice streaming for make it faster (I did it so)

Следующие

Автовоспроизведение

Gemini Flash is SURPRISINGLY Good for Agents and Function Calling

Gemini Flash is SURPRISINGLY Good for Agents and Function Calling

TUTORIAL on How to Make a Roblox Game | Seribical

TUTORIAL on How to Make a Roblox Game | Seribical

What is Retrieval-Augmented Generation (RAG)?

What is Retrieval-Augmented Generation (RAG)?

IDK HOW (MUSIC VIDEO) KARAN AUJLA | FOUR ME EP | LATEST PUNJABI SONGS 2024

IDK HOW (MUSIC VIDEO) KARAN AUJLA | FOUR ME EP | LATEST PUNJABI SONGS 2024

CAN OFFLINETV CUT THE PERFECT HALF?

CAN OFFLINETV CUT THE PERFECT HALF?

Remble - Colors (ft. Mozzy & Stoneda5th) [Official Music Video]

Remble - Colors (ft. Mozzy & Stoneda5th) [Official Music Video]

LISA - ROCKSTAR (Official Music Video)

LISA - ROCKSTAR (Official Music Video)

Marker: This Open-Source Tool will make your PDFs LLM Ready

Marker: This Open-Source Tool will make your PDFs LLM Ready

3 Levels of Vim Refactoring

3 Levels of Vim Refactoring

New LLaMA 3 Fine-Tuned - Smaug 70b Dominates Benchmarks

New LLaMA 3 Fine-Tuned - Smaug 70b Dominates Benchmarks

Creating JARVIS - Your Voice Assistant with Memory

Creating JARVIS - Your Voice Assistant with Memory

Nvidia Nim: Deploy Open Source LLMs with 1 click

Nvidia Nim: Deploy Open Source LLMs with 1 click

Why I Switched from Python to Rust for AI Deployment

Why I Switched from Python to Rust for AI Deployment

Two GPT-4os interacting and singing

Two GPT-4os interacting and singing

GPT-4o Deep Dive & Hidden Abilities you should know about

GPT-4o Deep Dive & Hidden Abilities you should know about

Better Searches With Local AI

Better Searches With Local AI

Какие телефоны запрещены в разных странах мира ?(Часть 2) 📱

Какие телефоны запрещены в разных странах мира ?(Часть 2) 📱

Что взять до $400 (до 40000 РУБЛЕЙ)? | ТОП-10 смартфонов в 2024

Что взять до $400 (до 40000 РУБЛЕЙ)? | ТОП-10 смартфонов в 2024

Лучше купить PSP || Anbernic RG405m и Retroid Pocket 2

Лучше купить PSP || Anbernic RG405m и Retroid Pocket 2

Охлаждаю Мини ПК БАШЕННЫМ КУЛЕРОМ...

Охлаждаю Мини ПК БАШЕННЫМ КУЛЕРОМ...

Gizli Apple Watch Özelliği😱

Gizli Apple Watch Özelliği😱

Самый СТРАННЫЙ смартфон!

Самый СТРАННЫЙ смартфон!

478 СОКЕТ НА СТЕРОИДАХ / ЧТО СМОЖЕТ В 2024 ГОДУ?

478 СОКЕТ НА СТЕРОИДАХ / ЧТО СМОЖЕТ В 2024 ГОДУ?

Из одного выключателя ДВА! КАК сделать? #выключатель #светильник #ремонтквартир #diy

Из одного выключателя ДВА! КАК сделать? #выключатель #светильник #ремонтквартир #diy