OpenAI GPT-4o API Explained | Tests and Predictions

  • Published: Jun 19, 2024
  • OpenAI GPT-4o API Explained | Tests and Predictions
    👊 Become a member and get access to GitHub and Code:
    / allaboutai
    🤖 Great AI Engineer Course:
    scrimba.com/learn/aiengineer?...
    🔥 Open GitHub Repos:
    github.com/AllAboutAI-YT/easy...
    📧 Join the newsletter:
    www.allabtai.com/newsletter/
    🌐 My website:
    www.allabtai.com
    Explaining the OpenAI GPT-4o API. My predictions and some tests of what I think we can expect from GPT-4o API and the multimodal model.
    00:00 OpenAI GPT-4o API Intro
    03:23 OpenAI GPT-4o Explained
    06:47 OpenAI GPT-4o Exploration
  • Science

Comments • 36

  • @elliotanderson1585 1 month ago +9

    It's going to be a game changer if OpenAI can actually deliver all the functions they demonstrated.

  • @Ginto_O 1 month ago +22

    I just realized that the new OpenAI model killed your voice assistant projects, just as they did last time with GPTs

    • @acllhes 1 month ago +4

      It's a pattern I've noticed and expected since GPT-4 dropped.

    • @josephtilly258 1 month ago +1

      Or it can make it easier to build one with low latency. In the GPT app you don't really have long-term memory or function calling, so there is still a lot of room to make a great, personalized local assistant imo. Plus this opens the way to omniscient LLMs; maybe in the future local LLMs will have voice and vision natively.

    • @alexanderrosulek159 1 month ago

      @josephtilly258 Maybe, but I doubt it'll be this year. I can't even run the 7B text models, and adding vision and audio understanding would need them to be bigger.

    • @AllAboutAI 1 month ago +5

      yes haha, but hey, that's technology and why we love it

  • @Ms.Robot. 19 days ago

    This was very educational. Your instructions were clear and concise. ❤🎉

  • @YunusDogan-yc4lx 1 month ago +2

    Hi Kris, can you also do a vision version with a camera? Some things like helper use cases.

  • @Ou8y2k2 1 month ago +1

    4:59 It's not horribly wrong, but I'd combine the and Voice IN in one graphic in the GPT-4o Voice API Now section and Voice OUT under the LLM RESPONSE. GPT-4o is going to be a game changer for education.

  • @elliotnyberg9332 1 month ago +1

    I wonder how they manage to handle interruptions during the voice output, like in their demo for the API

  • @alirezasheikh8797 1 month ago

    I think you're right. Maybe minutes after OpenAI presentation was done, I posted on their developer forum if voice in / voice out will be available to developers soon. They said only to a small group of "trusted" partners. So yea, I'm not sure when we gonna get access to this. You gotta be in that special circle. 😅

  • @mwkoti 1 month ago +1

    What we need ASAP is an open source alternative to GPT-4o realtime speech-to-speech (as in the demos). I'm pro open-source and I want full control of the application flow, preferably offline. Has anyone tried to use XTTS streaming capabilities successfully, for example by extending AllAboutAI-examples?

  • @francycharuto 1 month ago +1

    Great content!

  • @ionutownprint4198 29 days ago

    Does the streaming audio function help with the latency?

  • @brianpoillucci1805 1 month ago +2

    You’re the best man.

  • @OliNorwell 1 month ago +1

    That "Performance Scores" table - nice! If that is all correct then that's pretty impressive.
    Though I did a screenshot test myself and it mistook a 3 for an 8 so it might not be flawless.

    • @AllAboutAI 1 month ago +1

      yeah, I expect it to just improve over time until it's perfect tho

  • @zhenfu4556 13 days ago

    How can I get this code?

  • @RICHARDSON143 1 month ago +1

    ❤❤❤

  • @ziadnahdi4343 1 month ago

    If you could give it IQ tests visually, that would be great, since this is difficult for it: things like how many triangles there are or what the next number in a series is.

  • @athemis1180 1 month ago

    Hi, great video! I am curious how much those APIs cost. On their website I found text pricing in tokens and it is pretty cheap and understandable. However, the image or "vision" function seems to be so expensive. I calculated it, and with full HD on low settings it is going to cost about $5 for a minute at 15 fps. That's crazy, not even mentioning their TTS, which costs $15 per 1M tokens, which is pretty hilarious
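
An estimate like the one in this comment depends mostly on how many input tokens each frame is billed as. A minimal sketch of that arithmetic, assuming every low-detail frame costs a flat number of input tokens: the 85 tokens per frame and the $5 per 1M input tokens used here are assumptions for illustration, not figures from the comment, and `vision_cost_per_minute` is a hypothetical helper name.

```python
def vision_cost_per_minute(
    fps: float, tokens_per_frame: int, usd_per_million_tokens: float
) -> float:
    """Estimate the input cost in USD of sending one minute of video as single frames."""
    frames = fps * 60                      # frames sent in one minute
    tokens = frames * tokens_per_frame     # total input tokens billed for those frames
    return tokens / 1_000_000 * usd_per_million_tokens


# Assumed values (not from the comment); check them against the current pricing page.
cost = vision_cost_per_minute(fps=15, tokens_per_frame=85, usd_per_million_tokens=5.0)
print(f"${cost:.2f} per minute")
```

The total scales linearly with the per-frame token count, so that is the number worth checking first when an estimate looks surprisingly high or low.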

  • @RaysAiPixelClips 1 month ago +1

    I made a script just like yours and now with GPT-4o it kinda defeats the purpose... 😅 At least we can still use it with local models.

    • @gulludiscord 1 month ago

      Hey bro, is this model free to use? I mean, can we explore this?

  • @squiddymute 1 month ago +2

    Multimodal or multimodel? Does anyone really believe it's a single model?

  • @learnwithyan 1 month ago

    My thought is that we already have a good chance to be so-so devs and still write perfect code 🎉😅

  • @Gamez4eveR 1 month ago

    GPT5 may finally be able to tell how many r's are in the word "strawberry", but 4o will suffice with its ability to write a bigint from scratch in C in just a couple minutes of telling it to try again

    • @mirek190 1 month ago

      Really?
      Llama 3 8B even answers it...
      How many r's are in the word "strawberry"?
      The word "strawberry" has 3 R's.

    • @Gamez4eveR 1 month ago

      @mirek190 ask it where they are. Also, which Llama 3 8B model? Mine failed on the first attempt, both Meta and Groq

    • @Gamez4eveR 1 month ago

      @mirek190 here's Llama 3 70B failing: How many r's in strawberry
      There are 3 R's in the word "strawberry".
      Where are they?
      I apologize for the mistake! There are actually 2 R's in the word "strawberry". They are consecutive, appearing as "rr" in the middle of the word.

    • @Gamez4eveR 1 month ago

      @mirek190 Llama 3 70B on Groq also failed this

    • @Gamez4eveR 1 month ago

      @mirek190 and here's Claude 3 Opus:
      The three r's in "strawberry" are located as follows:
      1. st*r*awberry (after the first "t")
      2. stra*w*berry (after the "w")
      3. strawbe*r*ry (near the end, before the final "y")
      You be the judge lmao
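
For reference, the answer the models in this thread are being graded against can be checked deterministically. A minimal sketch, where `letter_positions` is a hypothetical helper name and positions are counted from 1:

```python
def letter_positions(word: str, letter: str) -> list[int]:
    """Return the 1-based positions of `letter` in `word`, ignoring case."""
    target = letter.lower()
    return [i for i, ch in enumerate(word.lower(), start=1) if ch == target]


positions = letter_positions("strawberry", "r")
print(len(positions), positions)  # 3 [3, 8, 9]
```

So the three r's sit at the 3rd, 8th, and 9th letters: one after "st" and a consecutive pair just before the final "y".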

  • @rodrigov.9252 1 month ago +3

    god the number of men who will fall in love with that voice hahaha

    • @Ou8y2k2 1 month ago

      It will make the movie _Her_ seem like an underestimate.