OpenAI Realtime Voice API: A 7-Minute Getting Started Guide

Поделиться
HTML-код
  • Опубликовано: 26 дек 2024

Комментарии • 31

  • @vcarrascoring
    @vcarrascoring 2 месяца назад +1

    Love it, I am still waiting for the production like video :)

  • @MaliRasko
    @MaliRasko 2 месяца назад +6

    Talked with it for 5 min in the playground today. The cost was $2.35. Not too shabby.

    • @i2Sekc4U
      @i2Sekc4U 2 месяца назад +4

      That’s pretty expensive. Especially if you wanted to build something with this for consumers, think about how pricy it would get. Monthly subscriptions would have to be like $50

    • @ChrizzeeB
      @ChrizzeeB 2 месяца назад +2

      That's a 1990s sexline...
      What service would work at that price?

    • @yurijmikhassiak7342
      @yurijmikhassiak7342 2 месяца назад

      The price is $20/hour. Like a junior sales rep.

    • @danacarvey
      @danacarvey 2 месяца назад

      What I want to know is if you can interrupt it?

    • @yurijmikhassiak7342
      @yurijmikhassiak7342 2 месяца назад

      @MaliRasko yes you can interrupt it, and it has automatic voice detection. So you pay only for the time you speak, not for silence. Still $20 for an hour of conversation requires a solid use case.

  • @BrianDevJourney
    @BrianDevJourney 2 месяца назад +3

    Great tool, if this was cheaper I would develop with it. Also, just emailed you about a sponsor opportunity. Cheers!

    • @DevelopersDigest
      @DevelopersDigest  2 месяца назад +1

      Cheers - I’ll have a look. Agree, I think as the price comes down it will be much more viable for more apps

    • @BrianDevJourney
      @BrianDevJourney 2 месяца назад

      @@DevelopersDigest Hey developer digest, following up here. Did you see my email? Thanks!

  • @adityakale55
    @adityakale55 Месяц назад

    how to end call , how do we know if last audion has been played

  • @nhtna4706
    @nhtna4706 2 месяца назад

    What would be the cost of the api usage given a scenario where there calls volume goes between 200000 min in a given month?? On an avg. cos it involves calls that goes on for hours n 10000 of calls,.

  • @manoharants
    @manoharants Месяц назад

    When i give phonenumber as voice input, numbers gets mixed up. Could you help me?

    • @SirHelios
      @SirHelios Месяц назад

      I have the same issue, also difficulties understanding the last name. Twilio was more accurate

  • @kelvindimson
    @kelvindimson 2 месяца назад +1

    This is crazy!!

  • @seecmellikew
    @seecmellikew Месяц назад +1

    Any luck deploying?

    • @DevelopersDigest
      @DevelopersDigest  Месяц назад

      I haven’t had a chance to circle back to this yet! I did see cloudflare had a really nice looking relay for this though that I have been meaning to try!

  • @micbab-vg2mu
    @micbab-vg2mu 2 месяца назад +1

    thanks :)

  • @jaysonp9426
    @jaysonp9426 2 месяца назад

    What was the latency? Also is there a way to have it await the function call return via the websocket? Def a non starter if we just have to deal with it coming back in pieces

  • @ibrahimaba8966
    @ibrahimaba8966 2 месяца назад +5

    This API is too expensive; I think we should avoid sending all chunks. We need a local VAD (Voice Activity Detection) to send only the chunks that contain voice; otherwise, it could become costly.

  • @nastastic
    @nastastic 2 месяца назад

    can you make a cartoon character voice with it?

  • @ivan3584
    @ivan3584 Месяц назад +1

    Always on 429 *to many req

    • @DevelopersDigest
      @DevelopersDigest  Месяц назад

      Oh interesting - I hadn’t thought about the rate limit for this offering. I haven’t run into any issues yet

  • @johnnylarue3933
    @johnnylarue3933 2 месяца назад +3

    Crazy expensive @3.6 min cost $12.

    • @AI_Escaped
      @AI_Escaped 2 месяца назад

      Yup, just tinkering around to figure out how things work will drain your account. I don't see many people using this unless they have huge funding. Guess most of us will have to wait for open source or when openai drops the price later. Horrible pricing OpenAI.

    • @AI_Escaped
      @AI_Escaped 2 месяца назад

      And the voice sounds like crap

    • @johnnylarue3933
      @johnnylarue3933 2 месяца назад

      @AI_Escaped I'm sure it's going to drop in price in a year from now... but I was hoping to start using this today for many usecases... like many others, I cobbled together a version of this using VAD, STT and TTS to/from GPT Chat Completions which wasn't overly fast to initial response (3-6 seconds), but otherwise a decent two-way conversation. I am going to try handling VAD and STT (send as text is 1/10th the cost) to see if this balances the tradeoff of converting to text to lower cost to use.

    • @ibrahimaba8966
      @ibrahimaba8966 2 месяца назад

      It’s normal; this system sends everything to the model, even if you’re not saying anything. It keeps filling the buffer, so we need to add a local VAD.

    • @TéonMèhta
      @TéonMèhta 2 месяца назад

      @@johnnylarue3933 This is the way.