Part 1: How to Build an AI Voice Agent using OpenAI Realtime API

Поделиться
HTML-код
  • Опубликовано: 9 фев 2025
  • WATCH PART 2: • Part 2: How to Build a...
    WATCH PART 3: • Part 3: How to Build a...
    In this video, I will show you how to build and deploy an AI Voice Agent using OpenAI's new Realtime API (takes 10 min!). This agent will take bookings and send data to Make.com where you can then run any of your other automations. I give you the full code in from my Github Repo. I also show you step-by-step how to set up Replit and how to deploy on Replit so it's always live. I also show you how to plug in Twilio so you can have a phone number that calls your AI agent. I also show you how to connect Make.com. This is a beginner friendly tutorial.
    🚀 Sign up to Replit using my link: replit.com/ref...
    📺 Watch the ENTIRE series: • OpenAI Realtime API Vo...
    📺 AI SMS Assistant: • How to Build an Advanc...
    📋 Take This Quick Survey: forms.gle/otAr...
    🛠️ Need this built? Contact: bart@supportlaunchpad.com
    🗂️ Github repo: github.com/Bar...
    👉 LinkedIn: / bartlomiejslodyczka
    Learn AI & Coding:
    Try Scrimba's AI Engineer course (20% off Pro plan with my link):
    v2.scrimba.com...
    Other related videos: • Exploring OpenAI's New...
    #openai #realtimeapi #maketutorial #replit
    Note: Affiliate links support this channel through commissions.

Комментарии • 184

  • @BartSlodyczka
    @BartSlodyczka  4 месяца назад +3

    📺 Watch Part 2: ruclips.net/video/ffDm4HVGuTM/видео.htmlsi=W1nfLYgj3zsQ0RWW
    📺 Watch Part 3: ruclips.net/video/oQtBwhRLrT4/видео.htmlsi=o56i5609Zp8Ko3eG
    🗂 Github repo: github.com/Barty-Bart/openai-realtime-api-voice-assistant
    5x NEW VOICES just released: ruclips.net/video/PTCpw1Y9HOQ/видео.htmlsi=roHjjllMKNHNzLGu
    📺 AI SMS Assistant: ruclips.net/video/HYPw8TfL2Pg/видео.htmlsi=CVAzhuQzsXH5T2Wa
    📋 Take This Quick Survey: forms.gle/otAr1xUamgyYZE5y7

  • @PlayQuest
    @PlayQuest 4 месяца назад +1

    Yes Please! Looking forward to the next episode of your AI Voice Agent Build! Thanks for your effort in making this Vid, Bart!

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      thank you :) appreciate that! next vid will be out by end of this week :)

  • @WillCousin
    @WillCousin 4 месяца назад +3

    This is a good demo - looking forward to part 2.

  • @fernandomendes1177
    @fernandomendes1177 4 месяца назад +1

    Thank you for sharing, Bart! Amazing! I'm already waiting for part2! Keep going.

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад +2

      Thank you my man 🙏 Will make part 2 soon!

  • @Educationsupport
    @Educationsupport 16 дней назад

    Greay video. I was doing a similar thing here with VAPI. VAPI is more complicated, but it sounds way more realistic. This one sounds very robotic. It was eye opening for me that you created an assistant in a completely different way.

    • @BartSlodyczka
      @BartSlodyczka  9 дней назад

      So many possibilities out there I'm also often surprised :)

  • @DJSemme
    @DJSemme 4 месяца назад

    Really good, looking forward to PT2, hopefully we get it soon :)

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      thanks! coming out end of week :)

  • @victorvanvas
    @victorvanvas 3 месяца назад

    Superb demo Bart, you're one of the best in the game right now

  • @gt6148
    @gt6148 4 месяца назад

    This works great and the set up was a breeze, Thank You!

  • @minasmarioskontid
    @minasmarioskontid 4 месяца назад

    Thank you so much! learning to code, and got it hard with intergrating twilio. You're video created my day!

  • @aymane.bencheikh
    @aymane.bencheikh 4 месяца назад

    Can't wait for part 2 !

  • @SP-js4gf
    @SP-js4gf 4 месяца назад +1

    yes we would love to see more of these kind of videos

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      you got it! will work on more :)

  • @lakergreat1
    @lakergreat1 4 месяца назад

    So good, thank you for sharing. Subbed and looking forward to rag and function call future videos!

  • @lingfeizhang9411
    @lingfeizhang9411 4 месяца назад

    This is so extremely useful. Thank you!

  • @didacfergir
    @didacfergir 4 месяца назад

    Legend. Can wait to see more about it!

  • @stevebim000
    @stevebim000 4 месяца назад +1

    Amazing content, man! Please do another one with RAG and function calling

  • @ilanelhayani
    @ilanelhayani 4 месяца назад +1

    amazing stuff. waiting for v2

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      thanks! will be out by end of week :)

    • @taiwoakeem2602
      @taiwoakeem2602 3 месяца назад

      I have openAI accounts with credits i can sell you at 50% off the credit value.

  • @tuaitituaiti1565
    @tuaitituaiti1565 4 месяца назад +1

    As instructed, I liked this content, turned me into a new Sub. Standing by for a part 2-100 🙂Thank you, sir, for the education and value bombs you are dropping.💪🗿🔥🦅👊

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      my good man, thank you for the support 👊

  • @bartunma
    @bartunma 4 месяца назад

    Great project. Thank you for that! It would be great to see a part 2 with bidirectional connection to any calendar. I'm also waiting for better version of real API since this version cannot be used at least for czech language (making a lot of mistakes).

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      Díky! Interesting about czech not being so good yet, but yeah I bet it will improve soon. Keep at it legend :)

  • @montaex.893
    @montaex.893 3 месяца назад

    Thank you. Great video. Super helpful.

  • @DuryabAziz
    @DuryabAziz 3 месяца назад

    Thank you very much, good stuff and very helpful!

  • @jeanchindeko5477
    @jeanchindeko5477 4 месяца назад +1

    This is quite interesting to see OpenAI releasing in 2024, a technology Google demonstrated in Google IO 2017, and it was called Duplex, where an AI was at that time able to pass a phone call and was sounding so real. Google never released that API to the masses and is again late to the show in 2024.

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      wow I didn't even know this. Lucky there are other companies bringing out cool stuff and releasing to the public 💪

  • @AnilKumar-g2t1p
    @AnilKumar-g2t1p Месяц назад

    Nicely explained. Thank you

  • @bilalghumman
    @bilalghumman 3 месяца назад

    Excellent video. Looking forward to enhancements

    • @BartSlodyczka
      @BartSlodyczka  2 месяца назад

      thank you very much :) I have Part 2 and Part 3 out on my channel that you can watch 💪

  • @calixtera
    @calixtera 2 месяца назад

    Awesome man! Thank you!

  • @paullopez_ai
    @paullopez_ai 3 месяца назад

    Great video!

  • @Noirteclabs
    @Noirteclabs 4 месяца назад

    Thank you bro, great content! subscribed ✅

  • @ETMGroup
    @ETMGroup 3 месяца назад

    Thanks, its super helpful.

    • @BartSlodyczka
      @BartSlodyczka  3 месяца назад

      my pleasure :) you should watch the part 2!

  • @shikharsinghal5171
    @shikharsinghal5171 4 месяца назад

    Thanks! this is a good demo of capabilities. When is part 2 coming out?

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      thanks! Part 2 coming out later today :)

  • @ValchyGaming
    @ValchyGaming 2 месяца назад

    Would be awesome to see RAG and function calls! Please do this!

    • @BartSlodyczka
      @BartSlodyczka  2 месяца назад

      I've got a series on this that you should check out: ruclips.net/p/PLi7jtY2ZZqRYE8Lvw4MuLHTZPYTA4jZHQ

  • @enthogenesis
    @enthogenesis 4 месяца назад

    onya mate, node, webhooks, whisper transcripts, logging, right URLs, deploying, live! boom! we're already in your debt... sweet as! I think most RAG implementations are in python may not need if less tha n 250 pages of text just need a large context window for an outfit like Bert's automotive! I did RAG: Beyond Basics from Prompt Engineer I strongly recommend it!

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад +1

      thank you legend! excellent recommendations, hooroo 💪

  • @hickam16
    @hickam16 4 месяца назад

    thank you! I want to see more!

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад +1

      wicked - will whip something up soon 💪

  • @saedsaify9944
    @saedsaify9944 Месяц назад

    Nice work, thanks. What do you suggest to change for a different input / output language communication?

    • @BartSlodyczka
      @BartSlodyczka  29 дней назад

      I would edit the main prompt. If you watch the later videos in this series (go to the playlist in video description) you'll see a video where you can add a custom first message into the agent upon starting a new call. You can also set language here if you like

  • @fernandofortini4743
    @fernandofortini4743 23 дня назад

    What about the privacy matters on an integration like this? Do you have any details at all? Like for instance, this service is a paid service, that means my data will be held or not?

    • @BartSlodyczka
      @BartSlodyczka  22 дня назад

      Always safe to assume it will be held!

  • @XxX-mb2tg
    @XxX-mb2tg 4 месяца назад

    Great video! Is it possible to also use the OpenAI voice assistant for the initial greetings message? I don't like the switch between the twilio tts voice and the openai realtime voice.

    • @XxX-mb2tg
      @XxX-mb2tg 4 месяца назад +1

      Found it:
      - remove the `` tag from the twilio stream connection
      - change openAi ws open listener:
      ```
      openAiWs.on("open", () => {
      console.log("Connected to the OpenAI Realtime API");
      setTimeout(sendSessionUpdate, 250);
      setTimeout(() => {
      openAiWs.send(JSON.stringify({
      type: "conversation.item.create",
      item: {
      type: "message",
      role: "user",
      content: [
      {
      type: "input_text",
      text: "Hello!",
      },
      ],
      },
      }));
      openAiWs.send(JSON.stringify({ type: "response.create" }));
      }, 500);
      });
      ```

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      Yeah it is, figured out a golden nugget for this, video 2 coming end of this week!

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      ah you got it anyway!!! nice

  • @MontyChicola
    @MontyChicola 4 месяца назад

    Unbelievable great code

  • @AIPartners-q1j
    @AIPartners-q1j 3 месяца назад

    Thanks for such a wonderful tutorial ❤️.
    I am wonder if it is possible to check the availability on google calendar before booking. Is this going to replave vapi or we can use real-time api within vapi or other similar platforms❤

    • @BartSlodyczka
      @BartSlodyczka  3 месяца назад

      thank you legend :) In the Part 2 and Part 3 videos I explain how to connect the AI caller to make.com. And from within make.com you can connect to google calendar modules. If you watch those vids I also give you the make.com blueprint to get you started 💪 I think Open AI maybe wont replace platforms like Vapi, but it will be a great alternative options. Good to learn it :) Keep it up man!

  • @rockstarcomputerhelp
    @rockstarcomputerhelp 4 месяца назад

    Great video. Thanks very much! I'm super curious how it would be possible to add RAG support and how well it would work with getting high quality output and low enough latency.

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      thanks! check out part 2 with RAG here: ruclips.net/video/ffDm4HVGuTM/видео.htmlsi=zyOJMMPYuiY2rdSZ

  • @fredericherrera
    @fredericherrera 4 месяца назад

    This is very impressive

  • @barisbesorak
    @barisbesorak 4 месяца назад

    thanks man highly appreciated

  • @DUBOURGIA
    @DUBOURGIA 4 месяца назад +1

    Hey man thanks for the video, I would like to know if we can use a platform other than twilio to do this, because Twilio does not support many countries?

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад +1

      great question, I think so but I haven't looked into it yet. What other platforms do you know that support more countries?

  • @IdkJustCookingDude
    @IdkJustCookingDude 4 месяца назад

    Super cool brother, i am making such cool things with chat gpts text api, I can't wait you try this! I don't even know how to code and i can do this!

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      thank you my man, this comment makes me so happy 💪

  • @MohamedKhalil-tr4pi
    @MohamedKhalil-tr4pi 4 месяца назад

    Great video, the arrow pointer for demonstration is very cool, how do you do that?

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад +1

      thanks man! It's a mac app called "DemoPro - Screen Annotation"

  • @xSneakybeast
    @xSneakybeast 4 месяца назад

    thanks for sharing. i was wondering instead of it being a phonecall, how can the realtime api be accesed by pushing a button on a app like thats made with react native? That way it also can serve other usecases and the audio is better.

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      very interesting idea 🤔

    • @xSneakybeast
      @xSneakybeast 4 месяца назад

      @@BartSlodyczka yeah, i found swift and kotlin implementations of realtime api but im still searching for react native implementation. do you know how to do that? the component that needs to be changed is the websocket to make it compatible with mobile

  • @wongr643
    @wongr643 4 месяца назад

    Great content.

  • @mikew2883
    @mikew2883 4 месяца назад

    Very cool! Quick question. Were you able to get the barge in to work in your version. The Twilio version I tried I was unable to and the Twilio author stated it was a know issue and they are looking into it.

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      thanks my man! I haven't tried to do barge in yet, but if twilio said its a known issue then maybe it's not possible just yet? but I imagine they'd fix it quickly considering they are the main partner for voice integration into the realtime api. I'll probably make another video with more features in the coming days and I'll suss out the barge stuff too 💪

    • @mikew2883
      @mikew2883 4 месяца назад +1

      @@BartSlodyczka I'm hoping so. Looking forward to your functions and rag videos! 👍

  • @AlfredNutile
    @AlfredNutile 4 месяца назад

    Nice work! I have been wondering how to have a phone number be used for stuff like this! Thanks

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      this is awesome to hear :) thank you!

  • @cybersec9345
    @cybersec9345 3 месяца назад

    Hi Bart,
    How are you doing?
    this project is amazing and I badly trying to connect that with my Yeastar S20 sip device.
    But as now no luck (
    Do you have any suggestion?

    • @BartSlodyczka
      @BartSlodyczka  2 месяца назад +1

      Hey my man, sorry I don't have any suggestions, I don't know what a Yeastar S20 sip device is and I don't use one 🙏

  • @DIY4Profit
    @DIY4Profit 2 месяца назад

    Many thanks!
    I have bought a local(Israeli) phone from twillo, i try to connect it to Vapi but i wont come through...had you encountered in some issues like this?

    • @BartSlodyczka
      @BartSlodyczka  2 месяца назад +1

      I haven't experienced anything like this, but also I don't use Vapi, sorry man!

  • @njjax2005
    @njjax2005 4 месяца назад

    Would love to see function calls - trying to call a fine tuned model

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      Done! Part 2 with function calls coming out today :)

  • @muhammadibrahimabdullahi3840
    @muhammadibrahimabdullahi3840 4 месяца назад

    Hello, everyone it has been a long time, and I have experienced an AI voice conversation, which is very good for children in learning, and they are young onces.

  • @MohammedAffanAhmed-l2s
    @MohammedAffanAhmed-l2s 2 месяца назад

    Ive got one doubt Ive followed all steps to install the application IVR but its not executed. And i don’t have premium subscription for Replit And Open AI is that giving the problem? Can you please help me out

    • @BartSlodyczka
      @BartSlodyczka  2 месяца назад +1

      I don't think you need a premium subscription to Replit, but you might need one for OpenAI. Go to your OpenAI account and check in your settings which models you have available, look for realtime api. The next problem will be if you are Tier 0 or Tier 1, you might have usage restrictions which will stop this from working too. Hope this helps :)

    • @MohammedAffanAhmed-l2s
      @MohammedAffanAhmed-l2s Месяц назад

      I am trying to test the IVR it shows that error on chat gpt and chat gpt is not responding is it states that i need premium version

  • @felipesuaya5646
    @felipesuaya5646 4 месяца назад

    Hi Bart! Excellent tutorial. I have a question about Replit. How does the pricing work? I'm currently working with the Assistants API. Thank you!

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      thank you :) So you pay $25 a month (month to month plan) and you $10 in credits each month. If you're just starting out with Replit, I don't think you'l go over this limit. I've been using Replit for like a year now and have deployed lots of things, lots of testing, and have not yet gone over. I think if you get lots and lots of users then you'll use those credits up quickly. Hope this helps legend!

    • @santosh0011
      @santosh0011 4 месяца назад

      @@BartSlodyczka Any alternatives to Replit?

  • @radoslav07
    @radoslav07 4 месяца назад

    I would like to use local microphone or iphone app to talk to the local PC server so that way we can skip calling/using Twilio? Any recommendation how?

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      haven't played around with local mic yet, but i have seen other tutorials where they might be doing this. let me know how you go?

  • @neelkanani7230
    @neelkanani7230 2 месяца назад

    Is there a way to add a custom voice ?

    • @BartSlodyczka
      @BartSlodyczka  2 месяца назад

      I don't think so at this stage

    • @neelkanani7230
      @neelkanani7230 2 месяца назад

      @ is there any other way by using any library? I have an saas app idea in mind

  • @Itay-Zerem
    @Itay-Zerem 3 месяца назад

    Can I use AWA Lambda instead of replic?😊

    • @BartSlodyczka
      @BartSlodyczka  3 месяца назад +1

      Yes 100% you can! I haven't used AWS before but I'm sure you can relatively easily convert to AWS

    • @Itay-Zerem
      @Itay-Zerem 3 месяца назад

      @@BartSlodyczka thanks bro I’ll try it there!

  • @muhammadazfar6361
    @muhammadazfar6361 4 месяца назад

    Hy Bart . What About Outbound Calls , Can We Also Handle Outbound Calls Using RealTime API ?

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад +1

      I haven't tried yet but I feel like yes, I'll look into it and make a follow up vid if i figure it out :)

  • @thompsonlaw1
    @thompsonlaw1 Месяц назад

    Awesom!

  • @claudioagmfilho
    @claudioagmfilho 4 месяца назад +1

    🇧🇷🇧🇷🇧🇷🇧🇷👏🏻, Great video!

  • @beatrizpintosoares3650
    @beatrizpintosoares3650 3 месяца назад

    How do you manage to end the phone call? And close the websocket connection.

    • @BartSlodyczka
      @BartSlodyczka  3 месяца назад

      when you hang up the call the websocket will close :)

  • @Itay-Zerem
    @Itay-Zerem 3 месяца назад

    Great video!!
    Is this can work on more languages than english?

    • @BartSlodyczka
      @BartSlodyczka  3 месяца назад

      thanks! yes, just make the prompt in your chosen language and speak in your chosen language -- your responses will be in that language too :)

  • @lingfeizhang9411
    @lingfeizhang9411 4 месяца назад

    Would you please consider making a video like this but in python as well?

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      I'll keep this in mind, thanks for the recommendation 🙏

  • @nhtna4706
    @nhtna4706 4 месяца назад

    What would be the avg api cost? With an assumption that the calls could be in 100000 of min in a day/ month?

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      I haven't done any cost tests yet, but openai says roughly 30c per minute

    • @nhtna4706
      @nhtna4706 4 месяца назад

      @@BartSlodyczka smart pricing , very expensive , not affordable by small biz..

  • @РусланНагимов-д7д
    @РусланНагимов-д7д 4 месяца назад

    thank you! this is my first JS code and it is working. Tried to rework it in russian) works pretty well but first message read wtih heavy accent) how can i change system message? i guess it takes it from my accont - default message

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад +1

      nice work man! i haven't looked intot he default message yet, but the system prompt does the trick for me atm. In the next vid I will upgrade my system prompt. keep up the good work man!

    • @РусланНагимов-д7д
      @РусланНагимов-д7д 3 месяца назад

      @@BartSlodyczka will it work with other services webhook?

  • @alanzou7677
    @alanzou7677 4 месяца назад

    can you add function call to the bot?

  • @angeloh-u1q
    @angeloh-u1q 4 месяца назад

    Does the AI agent have the ability to remember returning callers?

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      this is a 10/10 suggestion holy shmoly. Will look into this for the next vid. WOW

  • @vengeshop
    @vengeshop 4 месяца назад

    This is great! Any ideas how to use phone numbers for other countries? I have an online store located in Ukraine. It would be great to receive incoming calls when no one is in the office.

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      Thanks! Great question, I'll suss it out and see if I can have some solutions for my next vid :)

  • @alanzou7677
    @alanzou7677 4 месяца назад

    can you make it let OpenAI bot to talk first, twilio's greeting sound is different openai sound

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      great idea! will look into this :)

  • @micbab-vg2mu
    @micbab-vg2mu 4 месяца назад

    thanks :)

  • @gurindersingh1713
    @gurindersingh1713 4 месяца назад

    how much does it cost per minute on average?

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      On average it costs around $0.06 per minute for audio input and $0.24 per minute for audio output, so $0.30 per minute if you're using both audio input and output

    • @gurindersingh1713
      @gurindersingh1713 4 месяца назад

      @@BartSlodyczka i know the openai website says that. But i was asking how much did it cost you in your demos. 0.30/minute doesn't seem realistic as you will not have 2 person speaking at same time. I mean at any given time the ai will be either listening or speaking. Not doing both. What do you say

  • @chrisburgdorff588
    @chrisburgdorff588 3 месяца назад

    I copied this step by step but my assistant just does the welcome message and then hangs up. No error messages. Anyone else?

    • @BartSlodyczka
      @BartSlodyczka  3 месяца назад

      Were you able to sort this out? If you copied step by step then the code should be all good. I would check (1) do you have funds in your twilio account and does your twilio number allow calls (2) did you pass the correct replit URL into the Twilio webhook configuration? IE if you deploy your replit code, it is a different URL to when you test the replit code in development mode. LMK how you go!

  • @trygverefvem6158
    @trygverefvem6158 4 месяца назад

    Great tutorial! I am trying to get it to work better with interruptions (I want to be able to cut into the reply or correct something that was wrong), but it does not seem to respond to that? What am I missing? Found this in a forum : "To whoever is reading this in the future, I found a solution. Here is how I implemented it into my code: if response["type"] == "input_audio_buffer.speech_started": print('Speech Start:', response['type']) # Clear Twilio buffer …"

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      hey legend, someone posted a comment in my new Part 2 video and it had this code:
      "Here is a simple update i made, to make the ai stop talking when the user is talking you need to add this:
      if (response.type === "input_audio_buffer.speech_started")
      {
      console.log("Speech Start:", response.type);
      // Clear any ongoing speech on Twilio side
      connection.send(
      JSON.stringify({
      streamSid: streamSid,
      event: "clear",
      })
      );
      console.log("Cancelling AI speech from the server");
      // Send interrupt message to OpenAI to cancel ongoing response
      const interruptMessage = {
      type: "response.cancel",
      };
      openAiWs.send(JSON.stringify(interruptMessage));
      }"
      You can prob throw the full code into chatGPT, then give it the above snippet, and ask it to insert it. Hope this helps :)

  • @jalengonel
    @jalengonel 4 месяца назад

    Idk why it does this but why, no matter how much I try to prompt/tweak parameters, does the API voice sound so monotone and bad at taking speech directions compared to the ChatGPT voices?

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад +1

      Yeah I agree, right now it's not the best sounding, but I'm sure in time it will get better. when it does, we will be ready 💪

    • @jalengonel
      @jalengonel 4 месяца назад +1

      @@BartSlodyczka fr. In reality this will likely birth an entirely new protocol/ web framework. Feels like an early days of the internet era where things are being bootstrap established for the first time ever

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      @@jalengonel such an exciting time man, such an exciting time

  • @yazanrisheh5127
    @yazanrisheh5127 4 месяца назад

    Can you please do in python as well

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      interesting! I might do this in the coming weeks :)

  • @elpablitorodriguezharrera
    @elpablitorodriguezharrera 4 месяца назад +2

    If I may ask, so for the openai API, it costs $3 / 10-minute of call?
    Imagine a business handling on average 10-minute inbound call with 1,000 of people🤦‍♂️

    • @thomasjamesbailey1209
      @thomasjamesbailey1209 4 месяца назад

      Today, tomorrow it will be cheaper, and the day after cheaper than a person.

    • @elpablitorodriguezharrera
      @elpablitorodriguezharrera 4 месяца назад

      @@thomasjamesbailey1209 Thank you for your answer my grandma knows. I was just clarifying the pricing "in the moment", not tomorrow, the day after, or hundred of years later.

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад +1

      I think $3 per 10-minute call is still cheap, considering all the costs and operations that go into hiring someone. Costs: salary + medical/ salary taxes + subscription costs (ie the business uses SAAS products and each person needs a seat) + sick days + etc. Operations: hiring + training + need a team manager + etc. From a cost and operations POV - I think business owners would be happy to pay considering how easy it is and how little overhead they have. Hope this kind of context helps :)

    • @elpablitorodriguezharrera
      @elpablitorodriguezharrera 4 месяца назад

      @@BartSlodyczka $3K will pay 1000 customer service in my country Indonesia for 10 hours. And Indonesia is even #16 in gdp with income per capita around $5K.
      You can literally can pay $3 for 100 human customer service for talking for 10-minute in some poor country.
      When I say expensive, it means globally. Not in the US with #1 GDP.

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      @@elpablitorodriguezharrera Very good points. At $3 per 100 human * 10 min this is 16.67 hours. Or 18c per hour. Now I see your point. I guess it then comes down to the business and where the employees are located. Either way, appreciate the time taken to explain your point, I learned something new 🤝

  • @localloop
    @localloop 4 месяца назад

    More pls

  • @Shubham-rf2bs
    @Shubham-rf2bs 3 месяца назад

    🤗

  • @DeEU13
    @DeEU13 3 месяца назад

    Hey Bart! Great Video, but unfortunately it is not working for me. I am using replit free plan. My OpenAI Account allow Realtime API requests and i already bought a number on twilio. Every time when i try to call it, it is busy. Can you help me out?

    • @rafaychaudry320
      @rafaychaudry320 2 месяца назад

      Same here, I have a free openai account and got the api key from there, do tell me if you found a way to tackle this. Thanks

    • @BartSlodyczka
      @BartSlodyczka  2 месяца назад

      Hmm, I think I would look at your usage limits for the realtime api model in your account. I recently worked with a client who had a Tier 0 or Tier 1 account and their usage was so low that the caller wouldn't work. Only after they went to Tier 2 or Tier 3 did it work. So give that a go, upgrade your account to allow more usage and that should be it. Hope this helps 🙏

  • @Numi2003
    @Numi2003 4 месяца назад

    gj

  • @isaiahgomez1215
    @isaiahgomez1215 29 дней назад

    Got it working great, and modified the code to create a HAZMAT advisor for my fire department. Has anyone hooked this up to MS teams or a zoom number?

    • @BartSlodyczka
      @BartSlodyczka  29 дней назад

      So awesome! I haven't hooked up to MS teams or Zoom so will be interested to see if others have :)

  • @joseduarte1240
    @joseduarte1240 4 месяца назад

    You have discord? if i want to learn a little more?

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      yes but email is better - bart@supportlaunchpad.com

  • @JohnDoe-rk7ex
    @JohnDoe-rk7ex 4 месяца назад

    Thats a similar tutorial that twilio posted a few days ago but its going to cost some money to be run in production

    • @BartSlodyczka
      @BartSlodyczka  4 месяца назад

      Yeah Twilio had a great tutorial and this is very similar :)

  • @willwill-io4rf
    @willwill-io4rf 15 дней назад

    Hello Bart. This is very good. I'm getting the bot to answer and only says the greeting . I'm getting this error Starting transcript processing for session session_1737809170453...
    Starting ChatGPT API call...
    Disconnected from the OpenAI Realtime API
    ChatGPT API response status: 404
    Full ChatGPT API response: {
    "error": {
    "message": "The model `gpt-4o-2024-08-06` does not exist or you do not have access to it.",
    "type": "invalid_request_error",
    "param": null,
    "code": "model_not_found"
    }
    }
    Raw result from ChatGPT: {
    "error": {
    "message": "The model `gpt-4o-2024-08-06` does not exist or you do not have access to it.",
    "type": "invalid_request_error",
    "param": null,
    "code": "model_not_found"
    }
    }
    Unexpected response structure from ChatGPT API

    • @BartSlodyczka
      @BartSlodyczka  11 дней назад

      Looks like the error is saying the model you are using doesn't exist or you don't have access to it. ("message": "The model `gpt-4o-2024-08-06` does not exist or you do not have access to it."). Make sure on your openai account you have access to this model. Or, if this is outdated, find the model that supports realtime api. And then update the code to use that model :)