What are the use cases you want to see me building with Groq?
Are you working for groq now?
I need an AI girlfriend
Personal agent that 'sees' what you do on your computer / phone and helps with it. (By sending a screenshot to it)
Doing literally anything requiring intelligence beyond a basic best-case simple script.
I have worked with a non-profit that helps with fair housing problems. I think a good use case would be receiving calls for a business and helping the callers understand whether they have a real fair housing problem.
Would Groq be able to receive phone calls?
My first thought is how can we use this for scam baiting? We just need an elderly person's voice option to make the call and then prompt the AI to waste the scammer's time talking about gift card activation codes.
Until the AI conjures up real credit card information from within its data and then some unfortunate person's life savings are gone 😢
@@venim1103 yeah nah mate.
i like this idea!
I suppose it is the other way around^^ natural-sounding "people" will now scam old persons
It still costs tokens tho
🎯 Key Takeaways for quick navigation:
00:32 *🧠 Introduction to Groq's LPU (Language Processing Unit)*
- Introduction to Groq's LPU architecture designed specifically for AI inference.
- Explanation of the need for LPU in large language model inference.
- Comparison between LPU and other processing units like CPU and GPU.
05:37 *🔍 Comparison between CPU and GPU*
- Description of CPU as the central processing unit and its limitations in parallel computing.
- Explanation of GPU architecture, parallel computing power, and its expansion beyond gaming.
- Illustration of the difference between CPU and GPU through a painting demonstration.
06:05 *🔄 Limitations of GPU in Large Language Model Inference*
- Discussion on the limitations of GPU in handling large language model inference.
- Explanation of the complexities in achieving sequential execution on GPU.
- Overview of the latency issues and the need for complex control mechanisms.
09:47 *🚀 Groq's LPU Architecture and Performance Benefits*
- Introduction to Groq's LPU architecture designed for sequential tasks and low latency.
- Explanation of the simplified architecture and shared memory advantages.
- Discussion on the predictability and performance gains achieved with Groq's LPU.
11:37 *🗣️ Applications of Fast Inference Speeds*
- Exploration of potential applications such as real-time voice AI for natural conversations.
- Discussion on the reduction of latency enabling smoother interactions.
- Demonstration of real-time voice AI and its impact on user experience.
13:17 *🖼️ Utilization in Image and Video Processing*
- Highlighting the effectiveness of Groq for real-time image and video processing.
- Demonstration of image processing capabilities for various applications.
- Discussion on unlocking consumer-facing use cases with fast inference speeds.
14:40 *🤖 Building Real-time Voice AI with Groq*
- Discussion on building outbound sales agents using real-time voice AI.
- Introduction to platforms like Vee for integrating voice AI into applications.
- Demonstration of setting up a real-time voice AI assistant using Groq's model.
00:00 *📞 Setting Up Real-time Voice AI Cold Call Agent*
- Setting up a real-time voice AI cold call agent using Groq technology.
- Integration of voice AI capabilities into existing agent systems.
- Configuring API calls and server URLs for seamless communication between systems.
19:18 *🛠️ Integrating Real-time Voice AI with Existing Agent Systems*
- Demonstrates how to integrate real-time voice AI with existing agent systems.
- Setting up agent tools for making phone calls and receiving transcriptions.
- Configuring metadata and webhooks for seamless communication between platforms.
20:41 *📞 Configuring Call Functionality and AI Assistant*
- Configuring call functionality within agent systems for real-time voice AI interaction.
- Setting up dynamic message generation and personalized interactions.
- Defining schemas, URLs, and metadata for effective communication between systems.
Made with HARPA AI
Thanks, Jason for the great work!
Thanks a lot mate!
@@AIJasonZ use the ai to order pizza
This is one true gem of a video that focuses more on the use case. Thank you for breaking down the concepts really well and showing us a demo of its capabilities
Yes because we all want more cold calls from sales bots.
came here to also say this. Yech... Leave the calling to the humans, everything automated should have been an email.
Sure but what about more cold calls from better sales bots?
@@hiandrewfisher
Sales bot or human, whatever company still thinks in our time that cold calling is the way to go is beyond saving, and it should go bankrupt for its own stupidity. The bots will just speed up that process.
@@nikolaizaicev9297 I make 100k a year off of cold calls
@@nikolaizaicev9297 amen to that
Creating a UI questionnaire for non coder types to build applications to solve problems. Mostly business applications that might otherwise require a developer or consultant.
great video, Jason! thank you for the insights on how to build these flows
It's easy to see this will replace all call centers very soon. I assume they originally developed this chip for the new Tesla Autopilot software, which is mainly AI/video based.
They even added vocal fry to the woman’s voice for realism. * slow clap *
2:55 "In every frame 2 million pixels have to be generated"
This guy broke down graphics in a way that made sense, for the first time in 20 years.
Good for you ✌
Isn't true though, it just needs to update the pixels that are changing. And you don't render every pixel alone, but object by object.
@@danielchoritz1903 In graphics you are rendering every pixel. You're talking about video codecs, whole different ballgame.
These are amazing use cases!! Lowering the barriers of entry to do high quality business associated with big companies!!
Thanks Jason
The phone number thing is interesting... makes me fantasize about being able to have this as a replacement for the "leave a message after the beep" answering machines for your mobile if you don't pick up a call. A lot of people find leaving a message without having a conversation really awkward, so if you could instead connect to an AI assistant like this that actually talks to you, you could leave better messages, and the AI can summarize the conversation and send you a text message of the contents, or just leave its own summarized voice message.
nobody listens to answerphone messages, not since about 2007 I'd say haha
That’s a super amazing idea. Build it! You will become rich lol
You just described an AI secretary and yes this would be an amazing tool. Build it !!
With all this current technology it is possible to create a really cool AI girlfriend. And highly customizable.
@@abandonedmuse Launched it today and I'm still not rich lol
This is really interesting. Thanks for sharing, Jason.
I can't trust anything anymore! The demo at the end is very impressive.
This is so powerful but also scary. What will the world look like in 12 months, when all communication is driven by AI?
you would be busy scratching your balls, while AI does everything else.
You're incredible. Thanks for this Demo, Jason Sensei.
How wonderful, this is bound to improve trust among people and all of our lives. This is the best thing that science has wrought since industrialized warfare. Thank you, technology.
Many thanks for never bothering to define what LPU is actually an acronym for.
hey great video - can you do a full walkthrough of Relevance AI and how you set that agent up? It's not possible to follow from your video as it looks like you had some pre-defined steps in there. Or drop a link to the code you used to build this? thanks
17:17 That is so fast and seamless. Super cool.
The Sales Agencies after watching this video: „Ah f*** this sh*t, let‘s learn some new skills“
😂😂😂😂😂😂😂😂😂
I'll have to try this. I managed to get very fast, close to realtime speech with the ChatGPT API using a few queues and a local text-to-speech. The slowest part was the actual speech-to-text processing, I believe. I was using Whisper before they added all the new upgrades to the GPT API (this was when GPT-3.5 had just come out, basically).
It just processed two sentences to speech and put out the audio while it processed the next sentences. The issue was that Twilio made it very difficult to work with this since I needed to make it a stream, and that required some realtime communication protocol that worked over phone, so I just stopped and had my own little chat assistant. I'm a weeb. It was an anime girl AI assistant.
We did this too, some of the audio engines even give an output that tells you the realtime factor -> if it's less than one, it means you can generate the sentences faster than they can be spoken! Basically we used a queue and pipelining to reduce the mean time to first output.
I don't think you need these LPU things unless you're trying to use an online service that just bulk process a bunch of sentences.
super @@ultimape
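For anyone curious, here's roughly what that sentence-queue / pipelining idea looks like in Python. It's a minimal sketch: `synthesize()` and `play()` are hypothetical placeholders for whatever TTS engine and audio output you actually use. The point is that playback of one sentence overlaps with synthesis of the next, so time-to-first-audio drops to roughly one sentence's worth of TTS.

```python
# Minimal sketch of sentence-level TTS pipelining with queues.
# synthesize() and play() are placeholders for your own TTS engine / audio backend.
import queue
import threading

def synthesize(sentence: str) -> bytes:
    """Placeholder: call your TTS engine here and return raw audio."""
    raise NotImplementedError

def play(audio: bytes) -> None:
    """Placeholder: send audio to your output device / phone stream."""
    raise NotImplementedError

def tts_worker(text_q: queue.Queue, audio_q: queue.Queue) -> None:
    # Synthesize sentences as soon as they arrive, ahead of playback.
    while True:
        sentence = text_q.get()
        if sentence is None:          # sentinel: no more text
            audio_q.put(None)
            break
        audio_q.put(synthesize(sentence))

def playback_worker(audio_q: queue.Queue) -> None:
    # Play each clip while the next one is still being generated.
    while True:
        audio = audio_q.get()
        if audio is None:
            break
        play(audio)

def speak(sentences: list[str]) -> None:
    text_q: queue.Queue = queue.Queue()
    audio_q: queue.Queue = queue.Queue(maxsize=2)  # small buffer keeps latency low
    workers = [
        threading.Thread(target=tts_worker, args=(text_q, audio_q)),
        threading.Thread(target=playback_worker, args=(audio_q,)),
    ]
    for w in workers:
        w.start()
    for s in sentences:               # in practice, stream these from the LLM as they arrive
        text_q.put(s)
    text_q.put(None)
    for w in workers:
        w.join()
```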
the thumbnail of this video is really cool, the text looks like it sticks out.
If we can get ai bots to do this, surely we can get them filling up the comments sections of yt videos too with sensible yet gratuitous, meaningless, insincere comments. Get after it guys!
They have been filling it up, along with all social media, especially Twitter, oh and dating sites for a long time. Twitter is probably the worst. But sorry fellas, you're paying for a dating membership because of the AI girls they gave you. Happy for all the good relationships found from dating sites.
I wonder how many "Nigerian Prince" this thing could run in parallel? 🤔🤭
😂😂😂😂😂😂
1:31 "I haven't do exercise at all for the past 3...or 6 months..." 😂
Another awesome video with great presentation and overview. I give your videos as an example to many people to show them how to educate viewers about a particular thing: tell them what, why, and how, and then implement things in the easiest way possible.
Keep feeding us quality content buddy :-))
Thank you for covering this, we are building AI applications using Groq. Fast, cheap, and reliable.
Please check your video titled "okay, but I want GPT to perform 10x for my specific use case" - Here is how.
A lot of people, including me, can't work with the code. There are mistakes in it.
Just choose a T4 GPU and you'll see it for yourself.
Thanks for your attention
One effective use for voice agents, for now, is incoming calls from leads coming in from ads or website/social media. They have a real interest in the product/service beforehand. This is a viable use. As for cold calls - I do not think they are ready yet.
Loved this Jason!!! Thank you
Hi Jason, great content. I just have one remark concerning the demo: the video is being cut. It would be really nice if it was left intact, just to have an idea of the latency. Otherwise, nice video.
Hesu, Jason
The best channel
You grew so much
Since the first video
I love this moment
Where I am like,
Opening the feed,
Oh okay, Jason released a new video,
"Well, it's probably _Good As Always_".
...
Proceed to watch
...
ABSOLUTE PERFECTION
HANDS DOWN
MAJESTIC
INFORMATION BOILED DOWN LIKE A
BOOSTED MONKEY ANIMAL YOU ARE
NEVER HAVE I SEEN
THINGS PUT IN THAT MANNER TOGETHER
MUCH HARMONY
STRONG BALANCE
RESONANCE LEVEL?
DEEeeeeee
eeeeeeeeeee
eeeeeP.
From the Bottom of my heart,
With Love & Respect
Ivan
Can't wait to try this on some use cases I have in mind :D Great video as usual ;)
So not quite there yet or reliable enough but getting closer. Thanks for these insights!
This is awesome. I've seen a bunch of Voice AIs and all of them have terrible latency issues as well as obvious AI voices. Using Groq to get the latency way down and custom voices with PlayHT solves both issues. Thanks for sharing!
Thanks for this awesome content, first time on your page but this is great and simple to follow and understand!
As far as I know from the All-In Podcast, "Groq" wasn't particularly made to be the LPU, or language processing unit. It was built as a very parallel processor and had little use case until it turned out to be a perfect fit for LLMs.
The brown-skinned dude from the podcast who owns a stake in the "Groq" company also explained that they didn't have a compiler like Nvidia's CUDA, so they built one over the last year, as the company had been working on the idea for a while. It is more like the use case fits the product.
LLMs definitely haven't existed long enough for the chip to have been made specifically for them.
So even if LPU might be an adequate description right now, it rather looks like the chip picked up that profession while growing up/maturing.
Perfect timing interval for success:
- Later, and we would see another chip taking the spotlight, even if only a little later.
- Earlier, and the company might have gone bankrupt, if no use case were to be found.
The company wasn't built for LLMs, but mostly for providing processors specifically for machine learning use cases. The LLM wave was just something they were uniquely in a strong position to pursue, so they made a small, natural pivot.
It will be something when AI can interrupt a conversation correctly.
wow man this is incredible... holy molly!
Thanks Jason for the good work.
Great video, but just to clarify: GPU is Graphics Processing Unit not General Purpose Unit
Even if you are misleading with the idle cut times on the demo, it's impressive.
I'd love to know how much it cost for the demo you created in the video. There were a lot of parts there.
Twilio is free to set up, signup even gives you test $. A buck a month for a number. Vapi is probably free to get going, very cheap to keep up. WhatsApp looked local, GPT->API powered maybe.
God my manifestation skills went through the roof this time. Only 7 minutes from process start until this video magically materialized.
As a non-dev, I am _so_ looking forward to tools like these.
Looked like there was some cuts between when you finished speaking and when the bot starts speaking. Can we see the actual unedited version? I've had issues with groq getting to the first token.
Why are there cuts every time before the agent answers in the final demo? Was she perhaps taking more time to respond than the video shows?
The highly-anticipated tool use (aka function calling) feature for Groq API was released last week!
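For reference, a minimal sketch of what calling it might look like, assuming Groq's OpenAI-compatible chat completions interface - the model id, tool schema, and example function here are assumptions for illustration, so check the current docs:

```python
# Hedged sketch of tool use / function calling via the Groq Python SDK
# (OpenAI-compatible chat completions). Model id and tool definition are assumptions.
import json
from groq import Groq

client = Groq()  # reads GROQ_API_KEY from the environment

tools = [{
    "type": "function",
    "function": {
        "name": "book_call",
        "description": "Schedule a follow-up call with a lead",
        "parameters": {
            "type": "object",
            "properties": {
                "phone": {"type": "string"},
                "time": {"type": "string", "description": "ISO 8601 time"},
            },
            "required": ["phone", "time"],
        },
    },
}]

resp = client.chat.completions.create(
    model="llama3-70b-8192",  # assumed model id; pick whatever Groq currently hosts
    messages=[{"role": "user", "content": "Book a call with +1 555 0100 tomorrow at 10am"}],
    tools=tools,
    tool_choice="auto",
)

# The model returns tool calls instead of free text when it decides to use a function.
for call in resp.choices[0].message.tool_calls or []:
    print(call.function.name, json.loads(call.function.arguments))
```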
It's not a new concept, LPUs have existed since GPUs... the thing with Groq is they have dedicated chips for the LPU. You train with GPUs, and execute LLMs with LPUs.
I'm excited to share my thoughts about Sora, Pika Labs, Runway ML and other amazing tools like Synthesia, Speechify, Suno plug-in, Copilot, Grok-1, Claude Opus, Gemini Ultra, ChatGPT, and more. Stay updated!
WOW! Amazing tutorial. Top 3 I've watched ever! Keep up the great work! 🎉
what other two?
Tell us what the other two are asap! Why are you treating us like that.
I would say Trelis Research has good content youtube.com/@TrelisResearch?si=oM1o4NaE30h2nI4y and learning wise all of Lex Fridman youtube.com/@lexfridman?si=yHJb1O-mzDYqS6c1
Seems like my replies to the questions were deleted by YouTube 😑
Really great synopsis
This makes what Vedal achieved with Neuro-sama even more impressive. He did all of that with pure code without any LLM or LPU
what do u mean ?
@@amrdeabes6338 search up Neuro-sama, it's an AI VTuber that finished 2023 as the most popular female streamer. Even though she's an AI, the way she talks and responds is insane, and her creator Vedal put a lot of work into the code. And I'm almost certain he didn't use a large language model.
@@amrdeabes6338 search Neuro-sama, she’s an AI vTuber
It will not be intelligent, just logical, there is a difference. Read up on AI 101.
I need one acting as my office assistant answering my phone calls.
I can build it for you for some money. Would you like that?
Great share. Seriously grateful for creators like you!
I think it'd be very cool to use this for on-demand mini language lessons. Imagine before you go into any situation where you will be able to use your target language you can set up a quick call with the AI and have it role-play a conversation with you. And you could iteratively improve your language skills per situation. And have transcripts to further work on with your flesh and blood language teacher.
Good stuff! Keep it up
As exciting as Groq is in voice AI, the LPU may also help video-generating AI models like Sora. Since Sora uses the same structure as an LLM, I believe it will help create videos faster and longer.
12:30 The ai voice sounds amazingly natural. All the lonely people will soon have a friend to talk to.
It doesn't tho, I don't agree, they all sound fake, every one of them... even tho we use them in various forms
@@PazLeBon they sound fake now. They won't in a year or less.
@@RhumpleOriginal oh for sure but 2 years of that numbnuts sound? oh dear :)
@@PazLeBon I don't mind waiting. Been wanting to make 3 games for 20 years. I can wait a bit longer for AI to get to that point. Will be fun choosing voices for my characters. Hell, Google has Genie now. Can't be much longer.
@@RhumpleOriginal there will be 56 billion games out, so gl
Awesome tutorial! The output seems to be conversation-aware. How can I train the voicebot so it will handle questions, and scripted answers the way I want it to? Would this be done in Groq? Your fitness caller did a great job and asked relevant questions to qualify you and give her an idea of where to go with the conversation...and the focus was on helping you and sales. Keep up the great work! I'm going to watch your video on how you built AI Agents for Research.
FYI AI cold calling is illegal in the US per the FTC. You WILL get fined into oblivion if you use any automation or AI to make AI-generated calls.
I can't wait for this technology to get better. I need AI agents for sales 😊
It's good enough now, why wait.
This is great but I just saw you cut the latency between your voice and the AI voice
Why did you edit your final demo to make responses appear faster than they actually are?
Great video, would be awesome if you could make one video of building a wrapper like this from scratch 😀
Excellent video! Keep up the good work.
Thank you for detailed, informative content 10/10
If we are utilizing AI for anything related to cold callers, it should be working on how to eradicate them.
can you also use AI to not have double audio when playing videos? would help. but right, what the world now needs is more cold robocalls to sell sh1t :D nah, that sucks.
that intro was gold
*This is a godsend for the Indian 🇮🇳 economy, now we will be able to 200x our call centres and the calls will sound a lot more professional.*
Okay, let's break this down step-by-step:
Given information:
- Usage: 240 hours
- Transcription provider: Deepgram ($0.01/min or $0.60/hr)
- Voice provider: ElevenLabs ($0.04/min or $2.40/hr)
- Model provider: GPT-3.5 ($0.02/min or $1.20/hr)
Step 1: Calculate the total minutes of usage.
Total minutes = 240 hours x 60 minutes/hour = 14,400 minutes
Step 2: Calculate the cost for transcription.
Transcription cost = $0.01/min x 14,400 minutes = $144
Step 3: Calculate the cost for voice.
Voice cost = $0.04/min x 14,400 minutes = $576
Step 4: Calculate the cost for the model.
Model cost = $0.02/min x 14,400 minutes = $288
Step 5: Calculate the total cost.
Total cost = Transcription cost + Voice cost + Model cost
Total cost = $144 + $576 + $288 = $1,008
Therefore, the total cost for 240 hours of usage with Deepgram for transcription, ElevenLabs for voice, and GPT-3.5 for the model will be $1,008.
so... $1,008 a month for 8 hours per day, is it cheaper than hiring someone?
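For what it's worth, the same arithmetic as a tiny script - the per-minute rates are the comment's assumptions, not verified pricing:

```python
# Reproduces the cost breakdown above; rates are the comment's assumed per-minute prices.
HOURS = 240
MINUTES = HOURS * 60  # 14,400 minutes

rates_per_min = {
    "transcription (Deepgram)": 0.01,
    "voice (ElevenLabs)": 0.04,
    "model (GPT-3.5)": 0.02,
}

costs = {name: rate * MINUTES for name, rate in rates_per_min.items()}
for name, cost in costs.items():
    print(f"{name}: ${cost:,.2f}")
print(f"total: ${sum(costs.values()):,.2f}")  # $1,008.00
```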
thx man
fu.....king yes by the way..
of course its cheaper wtf
@@N7Tonik well... that was the previous way of doing things... you don't know how much cheaper it is today (October 2024), with streaming APIs we don't have to use other models, with almost 100 ms latency too...
cheaper, faster, grounded to a knowledge base, able to do function calls...
Do another application that shows real-time speed doing something that nobody expects. Maybe like a super fast poker bot / trading analysis / etc., something that is massive but done in 1 sec
amazing stuff 💯
grok grok groq, groq groq groq. how did you build the WhatsApp integration on Relevance AI? I don't see the option
groque :)
I loved your video!
Eleven labs conversational voice is so good for this. You should do it.
The jump cuts at the end make me question how responsive this is. After every question, time was trimmed to not show the lagged AI response.
Well done. It could be helpful for customer support actions
Amazing!
Awesome Video
Great explanation and example. Thank you very much.
Interesting AI, gonna give it a whirl on Monday with my turbo API keys
This workflow is insane for CRM.
That sounded JUST like you were talking to a real person! 😮
did it f
lol no it didn't
No
@@fredfred2363😋
25:42 watch the clock on the phone - several seconds of delay was edited out.
it's an interesting setup, but this is still far from the experience that's currently possible with an LLM.
I don't think LLMs are useful for this use-case, and probably won't be in the near future.
if you want something that's going to be natural to talk to, it needs to be trained and optimized specifically for real time conversations - Google had some tech some years back and briefly released some videos demonstrating real time conversations with an AI that was actually built for it, it was very convincing, would even interject "uhm" and "hmm" like a person. It was never released, so either they realized this was too likely to get abused, or they faked the videos.
I think it could be done - but not with an LLM. It's not as simple as just making them faster - people interrupt each other in real conversations, they make sounds of acknowledgement just to let you know they're listening, lots of behaviors that would throw off an algorithm... LLMs were just not designed to work this way.
Love this Jason, keep'em coming !!
I loved the Crysis reference hahaha
At 25:17 you can see the video is trimmed, which means the AI is kinda slow, so you cut some frames from the video. You betrayed me Jason
Please don't skip the part where you wait for a response on the call...
good demo
Wait until someone hooks the “Nigerian Prince” AI model. That's going to revolutionize scamming.
For functions, you can just write your own “AI function” similar to Marvin AI like we did in the Rust Auto GPT udemy course. So even though it’s not supported yet, we should be able to take a “hacky” approach
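Roughly, that "hacky" approach could look like the sketch below: describe the available function in the prompt, ask the model to reply with JSON only, then parse and dispatch it yourself. The function, prompt wording, and `call_llm()` placeholder are all made up for illustration, not taken from Marvin AI or the course.

```python
# Rough sketch of prompt-based "function calling" when an API has no native tool use.
# call_llm() is a placeholder; any chat-completions-style client would work behind it.
import json

FUNCTIONS = {
    "get_weather": lambda city: f"22C and sunny in {city}",  # stand-in implementation
}

PROMPT_TEMPLATE = """You can call exactly one function.
Available functions:
  get_weather(city: str) -> str

Respond with JSON only, in the form:
  {{"function": "<name>", "arguments": {{...}}}}

User request: {request}
"""

def call_llm(prompt: str) -> str:
    """Placeholder for whatever LLM endpoint you use (Groq, OpenAI, local, ...)."""
    raise NotImplementedError

def run(request: str) -> str:
    raw = call_llm(PROMPT_TEMPLATE.format(request=request))
    spec = json.loads(raw)                 # may need retries if the JSON comes back malformed
    fn = FUNCTIONS[spec["function"]]       # dispatch to your own implementation
    return fn(**spec["arguments"])
```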
do you know any AI service that can fully interact with a browser, meaning doing Amazon research for me
Still does not quite sound human. Needs more variable pacing, volume, and emotion.
Have you spoken to some call centre people, eleven labs api..
Hey Jason, thank you for sharing! Any resources on connecting Relevance AI to WhatsApp Business?
The problem with Groq is that none of the LLMs it supports can handle real-life situations. They are inconsistent and generate mixed results, especially if you ask for the output in JSON. Even if you break down the problem into multiple steps, getting consistent results on each step is difficult.
If anyone has a suggestion, let me know.
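One common mitigation, sketched below with the model call left as a placeholder: validate the JSON against the keys you expect and retry, feeding the error back into the prompt. It doesn't make the model consistent, but it catches bad outputs before they propagate through your steps.

```python
# Sketch of validate-and-retry for JSON output; call_llm() is a placeholder for your model call.
import json

def call_llm(prompt: str) -> str:
    """Placeholder for the Groq/LLM call you already have."""
    raise NotImplementedError

def get_json(prompt: str, required_keys: set[str], max_tries: int = 3) -> dict:
    last_error = ""
    for _ in range(max_tries):
        hint = f"\n\nPrevious attempt was invalid: {last_error}" if last_error else ""
        raw = call_llm(prompt + hint)
        try:
            data = json.loads(raw)
            missing = required_keys - data.keys()
            if missing:
                raise ValueError(f"missing keys: {missing}")
            return data
        except (json.JSONDecodeError, ValueError) as e:
            last_error = str(e)            # feed the failure reason back into the next attempt
    raise RuntimeError(f"no valid JSON after {max_tries} tries: {last_error}")
```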
Nobody picks up unknown cold calls anymore lol. The real use-case is in fast-food orders, where ppl prefer AI over Migranteese. E.g. Carls Jr.
The demo at 13:29 is a bit silly; there's nearly 30 seconds between when the image is uploaded and the outputs being shown, during which the server could be preemptively generating them. Not saying it's faked but it's the type of thing a company might do to make their method seem more impressive.