OpenAI Realtime API: The future of Voice AI?

  • Published: 28 Dec 2024

Comments • 53

  • @patrickzupanc1795
    @patrickzupanc1795 2 months ago +1

    Great video, thank you, Jannis!

  • @mikearmstrong-ai
    @mikearmstrong-ai 2 months ago

    Very informative, will start jumping in, thanks for the free resources.

  • @LucasMarquesAI
    @LucasMarquesAI 2 months ago +1

    Great video as always Jannis, let's go 🔥

  • @clairedubiel1
    @clairedubiel1 2 months ago

    Thanks for the helpful video Jannis!

  • @BrockMesarich
    @BrockMesarich 2 months ago

    Was waiting for you to release this!

  • @HenrykAutomation
    @HenrykAutomation 2 months ago

    Love its speed, unmatched by anything else out there right now!

  • @mohammedzihan7382
    @mohammedzihan7382 2 months ago +3

    For developers, I feel voice providers like VAPI won't be required in the near future. Directly integrate the OpenAI API and implement components like WebRTC, real-time streaming, client-server connection mapping, DB connections, and data mapping. For workflow and state management, you could integrate frameworks like LangGraph on top.

    • @jannismoore
      @jannismoore 2 months ago +2

      Those platforms are already not required anymore, but I believe the realtime API will be the reason they become even more popular. Will share more on that soon.

  • @7_Tom
    @7_Tom 2 months ago +4

    Great video as always! Since you are probably in contact with the Vapi team... Can you estimate how long it will take until this is implemented? Thanks.

    • @jannismoore
      @jannismoore 2 months ago +2

      I’m not quite sure, but I assume we should see something being released soon.

  • @radoslav07
    @radoslav07 2 months ago

    Can you share your Replit link? Thanks

    • @jannismoore
      @jannismoore 2 months ago

      I did! It’s in my resource hub which you’ll find in the description

  • @_arav_patel_
    @_arav_patel_ 2 months ago +1

    Great video. I wonder what the future will be like with Voice AI becoming this realistic. How long do you think it will take for Vapi to implement this? (few days, weeks, months?)

    • @jannismoore
      @jannismoore 2 months ago +1

      I expect weeks max. :)

  • @naryanzaninja7367
    @naryanzaninja7367 2 months ago

    What are your plans, Jannis? Run the agency long term, switch completely to SaaS, go into voice AI education, or something else?

    • @jannismoore
      @jannismoore 2 months ago

      I haven’t even started with voice AI education.
      Honestly, for now I’m happy helping others build out extremely powerful systems, but the educational route might certainly be interesting once I see the need for it.

  • @pjm17
    @pjm17 2 months ago

    So could I build a conversational chat app? Basically, give someone a person to talk to and chat with as they walk around? Are prices too limiting right now?

    • @jannismoore
      @jannismoore 2 months ago +1

      You can do that, but yes, prices are still limiting as of now.
      I do believe that those will come down quite rapidly.

  • @greendsnow
    @greendsnow 2 months ago +9

    It's just way too expensive. Some people paid $3 for 5 minutes, even though the pricing catalogue says it's around 30 cents a minute... Simply unacceptable.

    • @alexxandermedeiros
      @alexxandermedeiros 2 months ago +4

      Cost will go down soon just like other API costs

    • @jannismoore
      @jannismoore 2 months ago +5

      You can achieve the same with Vapi by dropping 50k tokens into your master prompt :)
      Anyways, API costs will definitely come down, so that isn’t a concern in my opinion

    • @dazdazfzf
      @dazdazfzf 2 months ago

      @@jannismoore exactly. Just a way to raise the bar of the value of their product because they cannot scale yet.

  • @lakergreat1
    @lakergreat1 2 months ago

    Could it work with Microsoft Teams Phone? I would like to use it in an IVR setup.

    • @jannismoore
      @jannismoore 2 months ago

      We haven't tried that yet, but if you have a number, you can most likely make calls to it through a provider like Twilio. There are also other approaches that you might be able to leverage long term, such as daily.co

  • @moatazelkersh6129
    @moatazelkersh6129 2 months ago

    What a great video! Thanks so much for doing the work and providing us with the template for free. If you don’t mind me asking, how can I reduce my costs with Twilio and set up an open-source phone system to act as the call gateway? Another thing: I was planning to implement WebRTC, as it offers echo cancellation and noise reduction in case someone calls from a loud environment!

    • @jannismoore
      @jannismoore 2 months ago

      I think OpenAI handles the noise reduction part by themselves. If you're referring to SIP trunking, you most likely need to see how you can do the connection. Not every platform allows you to add a SIP URL to it, sometimes it's the other way around.
      If you want to try it, use something like Zoiper

  • @8888-u6n
    @8888-u6n 2 months ago

    How do we get access to the code you made? 👍

    • @jannismoore
      @jannismoore 2 months ago

      Via my resource hub - the links for that are in the description :)

  • @angeloh-u1q
    @angeloh-u1q 2 months ago +2

    I'm surprised that Vapi isn't on top of this already.

  • @pauledam2174
    @pauledam2174 2 months ago

    Can anyone suggest how this could be used for real-time translation? Actually, it doesn't need to be voice-to-voice, just voice-to-text.

    • @jannismoore
      @jannismoore 2 months ago +1

      In that case you might just want to look at Deepgram

  • @thereviewer5562
    @thereviewer5562 2 months ago

    You are, as always, authentic in your opinion. It is an exciting thing for someone who is being introduced to this voice stuff with AI for the first time. What do you think is the basic thing a beginner can learn in low-code development? What is the skill that moves the needle?

    • @jannismoore
      @jannismoore 2 months ago +1

      Understanding the concept and foundations.
      I think that’s the most important thing.
      Try some of my examples so you have a working solution, and then try to understand how it’s done.
      That’s a great point to start. 👍🏻

    • @thereviewer5562
      @thereviewer5562 2 months ago

      @@jannismoore that is good to hear.

  • @tuaitituaiti1565
    @tuaitituaiti1565 2 months ago

    Hey there. Thank you for the value bombs you are dropping... Heads up, the link to the resource seems to be broken... Thanks again.

    • @jannismoore
      @jannismoore 2 months ago

      Appreciate it! Both of the links work when opening them. What do you see once you click on them?

  • @jamesballantyne9214
    @jamesballantyne9214 2 months ago

    This seems as slow as Vapi. What advantages does this have, or will it have, if it’s the same speed without any of the features of Vapi?

    • @jannismoore
      @jannismoore 2 months ago

      Are you sure you watch your videos on normal playback speed? :D
      I've mentioned some of the benefits in the video. If that's not enough, I'll drop a more detailed one soon.

    • @rarf2142
      @rarf2142 2 months ago

      Bro this is not slow at all… You do realise it should sound human and not respond in 0.005 milliseconds? The delay makes it sound human smh

  • @Kevinsmithns
    @Kevinsmithns 2 months ago +1

    How can we use it for AI call bots?

    • @jannismoore
      @jannismoore 2 months ago

      You can use the custom example I showed for Twilio, or you can give it another couple of days and Vapi will most likely have something available too

  • @jeelanshahtlyr6076
    @jeelanshahtlyr6076 2 months ago

    Jannis is the ONLY way to go when it comes to AI Voice and Automations.

  • @shanes.6227
    @shanes.6227 2 months ago

    Can't wait till this kills customer service phone jobs. Calling my wireless carrier for something is often a big hassle that takes hours!

  • @NeuralDev
    @NeuralDev 2 months ago

    The cost is way too high, we need open source models

    • @jannismoore
      @jannismoore 2 months ago

      I don't think the price will be that high for long

  • @gslvqz8812
    @gslvqz8812 2 months ago

    You need to change your thumbnail. It looks evil

    • @jannismoore
      @jannismoore 2 months ago

      Seems like you clicked on it nevertheless

  • @SzamBacsi
    @SzamBacsi 2 months ago

    Laughable. It works in English or German, with simple Indo-European languages. But it dies with Hungarian, instantly.

    • @jannismoore
      @jannismoore 2 months ago +2

      I can see what causes your disappointment.
      You'll always see major languages being implemented at a faster pace. Honestly, I'm already impressed that it handles multilingual conversations as smoothly as it does now, as this was already incredibly hard with the orchestration layers we've seen so far.
      We should be happy about those advancements and help them with enough input to make it even better, which on the other hand will also increase your chances of having better results for other languages.

    • @rarf2142
      @rarf2142 2 months ago

      @@jannismoore I hope Dutch works already, I really need a Dutch agent. VAPI starts hallucinating on Dutch and speaking half German after a while lol

    • @SzamBacsi
      @SzamBacsi 2 months ago

      @@jannismoore Indeed, I am disappointed, as I have experience applying language models in IVR systems since the 2000s, and I understand that implementing a new model in 2024 should not pose a problem. The underlying issue seems to be a lack of concern for anything outside a specific "cultural" circle. In summary, they simply don't care.
      But I do hope I am mistaken.
      I truly appreciate your videos; they bring a refreshing perspective to this emerging area.