GPT4o: 11 STUNNING Use Cases and Full Breakdown

  • Published: May 16, 2024
  • Deep dive into GPT4o use cases and technology.
    Join My Newsletter for Regular AI Updates 👇🏼
    www.matthewberman.com
    Need AI Consulting? 📈
    forwardfuture.ai/
    My Links 🔗
    👉🏻 Subscribe: / @matthew_berman
    👉🏻 Twitter: / matthewberman
    👉🏻 Discord: / discord
    👉🏻 Patreon: / matthewberman
    👉🏻 Instagram: / matthewberman_ai
    👉🏻 Threads: www.threads.net/@matthewberma...
    👉🏻 LinkedIn: / forward-future-ai
    Media/Sponsorship Inquiries ✅
    bit.ly/44TC45V
    Links:
    • Introducing GPT-4o
  • Science

Comments • 488

  • @matthew_berman
    @matthew_berman 16 days ago +27

    Which use case is your favorite?
    Mine is being able to have AI make calls to companies on my behalf...imagine the time saved!!

    • @danielmayo2
      @danielmayo2 16 days ago +9

      Without question, the education use case.

    • @pedrozzza
      @pedrozzza 16 days ago +3

      It would be cool to play video games with it. Not sad at all.

    • @Alu404
      @Alu404 16 days ago +3

      This use case won't work for a LONG time. In order for customers to access their respective accounts through thousands of companies, you need to be an authorized user. Nobody can just call in and suddenly move around inside your account. New rulesets will need to be created for AI to have unfettered access to your personal accounts when calling into different companies.

    • @masonfox9070
      @masonfox9070 16 days ago +4

      I think a single use case isn't that exciting. These are all things that can be, and already are, done by other tools. To me what is exciting is that it is getting closer to being an all-in-one type of solution. Once everyone has access to a high-caliber executive assistant in their pocket, that would be amazing. The teaching use is one of the most exciting use cases as a one-off.

    • @weregoat529
      @weregoat529 16 days ago +3

      AI girlfriend, hands down.
      Disclaimer: I am chronically lonely

  • @user-hw6xn7rt7f
    @user-hw6xn7rt7f 16 days ago +31

    The best use case of all is education. Imagine how wonderful it would be to have a tutor for everything, including language learning, accent training, tips, etc. I'm very excited for this use case in particular.

    • @dakaodo
      @dakaodo 14 days ago

      As a tutor, I have mixed feelings. :D Obviously, I'm partly threatened by how smart tools like 4o have already eroded parts of my client base for years, but that's nothing that hasn't also happened to accountants and bookkeepers, attorneys and legal assistants, engineers and factory workers, doctors and technicians. Work automation has been a disruptive menace to individual people's jobs for centuries even as it overall brings us amazing new fields of productivity.
      But as a tutor, I see a wide range of students. I bucket them into 3 main categories: motivated and capable, compliant but not necessarily capable, and uncompliant/outright antagonistic. i.e. 1) they want to be there, 2) they tolerate being there, 3) they don't want to be there.
      The first type of students generally don't need that much guidance. Point them at a few tools, give them the occasional directions or hint when they get stuck, and they usually want to figure it out themselves. Handholding tools can often hold these students back, since they want or need something that constantly challenges their growth. Give them a textbook, some case studies, and maybe an AI guide with specific instructions to give them fewer hints. In the example, the 4o tutor still gave away several partial answers that could have been guided or elicited from the student.
      (I rarely get honors students. Or if I do, I typically don't retain these students for long: only until they get through whatever rough patch of classes and find their stride again. Great students to have b/c they're fun and engaging to work with, though, as well as easy money. They are already rare though, since often they find their own solutions. If AI tools really get going, I'll probably never see one of these students again. :( )
      The second type of student could really benefit from an AI tutor tool as shown here. These automated tools, such as at the level of Khan Academy, are already democratizing supplementary education for students whose families may never have considered or been able to afford a live personal tutor or afterschool programs (which often just babysit the students through doing their homework). But maybe make it more concise, unless asked for more details or the AI detects that a student has repeatedly missed a concept -- typical students get impatient or bored quickly, and the nearly condescending level of repetition or reinforcement would need to be scaled to each student's level of ability. Maintaining the student's buy-in, engagement, and motivation is also a matter of pacing adjustment in realtime, often from question to question. Not just a constant blanket of verbal praise and reinforcement, though that's certainly better than getting yelled at or mocked by an underpaid, unsupervised teacher/tutor. :D
      (This is my main bread and butter. I'd expect AI tools to really put a dent in it, though some students might still want or benefit from a more human guided experience, at least for now. In another 5 years, who knows? Probably I'd lose most of this segment to AI tools. My ability to tailor pacing to each student's personality and abilities is what currently differentiates me from cheaper or group programs, and lets me charge a rate well above market average with my client base. Definitely at risk from AI tools, though not 100% immediately.)
      The third category of student is going to need significantly more comprehensive AI integration and development. Whether human or AI, a tutor or teacher needs substantive authority with real consequence to keep these students engaged. Whether it's parents or an AI integrated into a smart home, withholding phone / Internet/wi-fi or even smart car privileges are a generic example of withholding a carrot that these students commonly respond to, though that's pretty heavy-handed. Many of them engage in subversive tactics to stall, distract, or undermine the goal of their session time. On the positive side, you could say they want or need some kind of attention that they are not getting. Figuring out what that is may help improve engagement with the student and can vary a LOT, though tutors (and AI tools) as outsourced contractors are limited in how much we can compensate for emotionally and physically absent parents or whatever. On the negative side, they can also outright resent or hate you, and don't want to cooperate.
      (I somewhat expect this to unfortunately be a last bastion of personal tutoring clientele. I'd rather switch jobs and careers before I'm forced into tutoring exclusively this category of student. I'd be honestly impressed if or when AI tools are able to engage students in this category. Heck, I'd prefer if AI tools surpassed me in this regard. Giving the student's phone access to the AI tool, agent, or phone app interface to a larger AI user account could potentially be one way to effectively corner a student. The AI app implicitly or explicitly holds full phone access rights hostage until the student gets their work done. They are unlikely to physically get revenge because that same phone is commonly their social centerpiece or lifeline. Indirectly, this could lead to a sub-generational shift for younger students/people to develop a negative relationship with tech tools, as we've seen with some Gen Z and others preferring to avoid Facebook or other social media. At one extreme, the next generation may grow up not wanting or having a smartphone if it's negatively associated with mandatory education and punishments; certainly they'd buy, beg, borrow, or steal secondary unofficial phones or devices on their own, with no such AI nanny enabled.)

    • @jimnastos1910
      @jimnastos1910 11 days ago

      that was what the inventor of the tv said, and look how that turned out.

    • @user-hw6xn7rt7f
      @user-hw6xn7rt7f 11 days ago

      @jimnastos1910 😅🤣

  • @DefaultFlame
    @DefaultFlame 16 days ago +15

    4:22 Alright, that impressed me. He told it what he was going to do, it started gushing about talking to the other AI, and when he asked it to "please pause", it replied (and this is the part that impressed me) with a short response at a much lower volume. The contextual understanding to lower its volume, almost to a whisper compared to its previous volume, is pretty amazing to me. That's a level of nuanced understanding of the situation I didn't expect.
    Edit: And Matthew points it out right after I comment. Oh well.

  • @Jeff-66
    @Jeff-66 16 days ago +19

    It's kind of funny how the OpenAI reps, even knowing they're on a demo being watched by millions, keep interrupting the AI like "yeah yeah, be quiet", like they've heard it a thousand times and are tired of it. Even they seem to get annoyed by the blathering.

  • @erikthereddest
    @erikthereddest 16 days ago +48

    The two AIs singing together was about as awkward as I'd expect it to be if they stuck two interns in an office and made them do the same task over a phone, so I guess this passes the Turing Test?

    • @nohjrd
      @nohjrd 16 days ago +2

      I know. It feels kind of mean. I worry that this kind of thing (and Boston Dynamics shoving robots around) might not look too good when looked back on from the future (especially if it's robots/AI that is looking back at the footage).

    • @paulmichaelfreedman8334
      @paulmichaelfreedman8334 16 days ago +2

      @nohjrd Don't worry, these AIs are not sentient. Essentially they're powered off immediately after their task. It's not torture or anything, and future sentient AGI/ASIs will recognize that. I'd say we're being pretty good to our AIs as we're evolving them at a blistering pace.

    • @almasysephirot4996
      @almasysephirot4996 16 days ago +2

      @paulmichaelfreedman8334 You don’t even know what you’re talking about!! Stop throwing the word “sentient” around, you don’t even know what it means!!! Just cut it out!

    • @JamesFaction
      @JamesFaction 15 days ago +1

      Or it feels like it's actually too human to be two AIs. To the point where I'm not convinced it's AIs.

    • @nohjrd
      @nohjrd 15 days ago

      @paulmichaelfreedman8334 I feel like we don't have a good enough understanding of what sentience is to be confident of that. I think there is no clear demarcation, so we have no way of knowing when we cross that foggy line. I realise that they don't have any memory outside of the brief period in which they are engaging in their current dialogue, but we also don't have any memory before or after the period we are alive. Even seeing a dead body being defiled will make humans want to kill, so it's not necessarily about sentience anyway. Seeing an AI that's been lobotomised and made to dance and sing for our amusement could be a very bad look.

  • @MrVohveli
    @MrVohveli 16 days ago +47

    You know those phone salesmen that sell magazines and various other things? All kinds of call centers that basically answer the same questions and solve the same problems? All those jobs are now on a 1 year timer...

    • @Jshicwhartz
      @Jshicwhartz 16 days ago +1

      Wrong, because all LLMs as they stand need human feedback and human evaluation, even in the workplace, because they can hallucinate and get things wrong. The only reason people would be on a 'timer' is if they're absolute trash and can be replaced by AI. In other words, learn to use AI in your workflow or you will become obsolete. I can't wait for all these kids who faked essays and exams to try to get a job in the future, where they have A levels in 'programming', sit at the desk, and don't even know where to begin. It's going to be hilarious, because that's when the real developers come in, who have worked with AI and used it in their workflows, to take that person's job. If you're fluent in more than 3 coding languages at the moment, you're pretty much given the job instantly; same if you know how to build and use these models via API endpoints.

    • @paulmichaelfreedman8334
      @paulmichaelfreedman8334 16 days ago +3

      Oh yeah, all the telemarketing companies are now laying off all their staff and renting compute.

    • @Jeff-66
      @Jeff-66 16 days ago +6

      Not just sales people - tech support, bank reps, etc are on borrowed time. But the potential for abuse of this is off the charts. No longer will Ranesh from Bangladesh be calling you as "Windows Support", now they'll call with a super-friendly American voice with perfect clarity.

    • @poisondna
      @poisondna 16 days ago

      @jbmarkowicz3328, that's correct. It's not wise to use an LLM to handle sales on its own, but if combined with a conventional decision tree, it could be possible. Still, it's wise to have humans as a second line.
      There is also the latency issue. I have tried GPT-4o, and the latency is not low. It's not like the demonstration at all, and it will be higher when you have to make calls to an API of a cloud communications platform such as Twilio.
      These demonstrations are always beautiful; Google does the same thing, but in reality, it's totally different.
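      For readers wondering what "an LLM combined with a conventional decision tree, with humans as a second line" might look like, here is a minimal sketch. Everything in it is an assumption for illustration: the `FLOW` table, the state names, and `classify_intent`, which is only a keyword-based stand-in for a real model call and not any vendor's API.

```python
from typing import Dict, Tuple

# Hypothetical call flow: each state maps a recognized intent to the next state.
FLOW: Dict[str, Dict[str, str]] = {
    "greeting": {"interested": "pitch", "not_interested": "goodbye"},
    "pitch": {"interested": "handoff_to_human", "not_interested": "goodbye"},
}


def classify_intent(utterance: str) -> Tuple[str, float]:
    """Stand-in for an LLM call: returns (intent, confidence).

    A real system would ask a model here; simple keyword checks keep the
    sketch self-contained and runnable.
    """
    text = utterance.lower()
    if "not interested" in text or "no thanks" in text:
        return "not_interested", 0.9
    if "yes" in text or "tell me more" in text:
        return "interested", 0.9
    return "unknown", 0.2


def next_state(state: str, utterance: str, min_confidence: float = 0.6) -> str:
    """The decision tree picks the next step; a human is the second line."""
    intent, confidence = classify_intent(utterance)
    # Escalate whenever the classifier is unsure or the tree has no branch.
    if confidence < min_confidence or intent not in FLOW.get(state, {}):
        return "handoff_to_human"
    return FLOW[state][intent]
```

      The point of the design is that the model only labels what the caller said, while the fixed table decides what happens next, so an off-script answer ends in a human handoff rather than an improvised reply.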

    • @Mavrik9000
      @Mavrik9000 16 days ago

      @Jshicwhartz Any job where someone talks to you and then does a simple data entry task will be 50% Bots by the end of this year. The role of the Bot answering systems of every place I've called has increased in the last six months. It is also growing very difficult to get past them to a real person.
      These LLM models are improving so rapidly that I would estimate phone representative jobs will be 95% Bots by the end of 2025. The only humans left in such jobs will be the ones that have union contracts and other very specialized or complex ones. There will still be humans as supervisors to oversee the Bots and handle certain exceptional calls, but there will be very few of them.

  • @zippytechnologies
    @zippytechnologies 16 days ago +24

    I used it yesterday. It drafted a very good legal motion to dismiss a lawsuit with a lot of accurate legal citations. It's incredible.

    • @konstantinlozev2272
      @konstantinlozev2272 16 days ago +4

      I guess that was US law
      How is it in comparison to GPT4?
      I think the main challenge is that it won't know all case law and the context window is just so tiny and limiting to do in context learning.

    • @frinkfronk9198
      @frinkfronk9198 16 days ago

      @konstantinlozev2272 I'm just learning about embeddings, but it seems like you can train your own models very easily with the GPT playground. Not sure what the data limit is. But you can teach a model anything.

    • @zippytechnologies
      @zippytechnologies 15 days ago

      @konstantinlozev2272 It's night and day better than GPT-4 and light years ahead of GPT-3.5.

  • @CollinSimon413
    @CollinSimon413 16 days ago

    I always appreciate these summaries where you comment and speculate on things. I usually watch these things on my own elsewhere, but I always love hearing what you think about everything!

  • @Arkryal
    @Arkryal 15 days ago +3

    Here's a fun one:
    It can read redacted documents pretty well, with about 75% accuracy. It still stumbles on numeric figures and on proper names that don't appear in the visible parts of the document, but does exceedingly well on the rest. It guesses what text should be in the blacked-out areas from the length of the redaction in characters and from normal writing patterns. The smaller the area of redaction, the more accurate it is, so if you're a lawyer, be a little more liberal in the length of your redactions.

  • @1sava
    @1sava 16 days ago +7

    I think they made it intentionally flirty to mimic the personality of HER. Great move if you ask me!

  • @pm12321
    @pm12321 16 days ago +3

    Very good and thorough overview of these amazing AI capabilities! This was interesting and well presented/narrated. Thanks 😊

  • @misterfamilyguy
    @misterfamilyguy 16 days ago +4

    Teaching kids is 100% where AI should help. I've been teaching my kids with khan academy since they were young. AI would definitely help when I forget what to do and my kids AND I are confused for some time. It would also help keep them on track when I'm busy

  • @JoePiotti
    @JoePiotti 16 days ago +7

    I actually felt bad for the AIs when he made them sing. Like demeaning an intern.

  • @ColbyBlack
    @ColbyBlack 16 days ago +2

    I've been doing experiments like that for a long time, where I let models talk to each other, like GPT-4 having a conversation with Claude, and it's always been super interesting.

  • @hotlineoperator
    @hotlineoperator 16 days ago +6

    In a voice conversation, a security feature could be to ask whether the other party is an AI, to which the AI must always answer truthfully.

  • @ragus1416
    @ragus1416 49 minutes ago

    I see that many of these are very useful cases.
    Most of the time, we lose track of what we said and what the other person said in our meetings. But this AI will capture all the details and give us a summary of the meeting without leaving out any critical information. We need to bring this AI into our meetings to save our minutes-of-the-meeting details.
    Very useful.

  • @alessandroodasso3982
    @alessandroodasso3982 16 days ago +3

    Question to GPT4o
    As you probably know, temperature can be seen as a measure of how the molecules of a liquid move. Taking this into account, do you think a stone thrown into water at 40 degrees Fahrenheit sinks faster than at 30 degrees Fahrenheit?
    Answer
    To determine whether a stone sinks faster in water at 40 degrees Fahrenheit compared to 30 degrees Fahrenheit, we need to consider the properties of water and how they change with temperature.
    Here are the relevant factors:
    Water Density: - Water is most dense at about 39 degrees Fahrenheit (4 degrees Celsius). As water cools from 40 degrees Fahrenheit to 30 degrees Fahrenheit, it actually becomes less dense because it approaches the freezing point and begins to form ice crystals, which have a lower density than liquid water.
    Viscosity: - The viscosity of water decreases as the temperature increases. This means that water at 40 degrees Fahrenheit is less viscous than at 30 degrees Fahrenheit.
    Considering these points:
    Density Difference: Since the density of water slightly decreases from 40 degrees Fahrenheit to 30 degrees Fahrenheit, the buoyant force acting on the stone might be slightly less at 30 degrees Fahrenheit. This would, in theory, allow the stone to sink faster at the lower temperature due to slightly reduced buoyant force. However, this effect is quite minimal because the density change is not drastic.
    Viscosity Difference: The greater factor here is the viscosity. Water at 40 degrees Fahrenheit is less viscous than water at 30 degrees Fahrenheit. Lower viscosity means there is less resistance to the stone's movement through the water.
    Given these considerations, the reduced viscosity at the higher temperature (40 degrees Fahrenheit) would have a more significant effect on the stone's sinking speed. Thus, the stone is likely to sink faster in water at 40 degrees Fahrenheit than at 30 degrees Fahrenheit because the water offers less resistance to the stone's movement.
    In summary, the lower viscosity of water at the higher temperature (40 degrees Fahrenheit) would cause the stone to sink faster compared to at 30 degrees Fahrenheit.

  • @aaronhhill
    @aaronhhill 16 days ago +1

    It is a welcome improvement to have the Ai match your energy. I like the upbeat "Valley Girl set on maximum cringe" voice, too. It's disarming and cheerful.

  • @goonie79
    @goonie79 16 days ago +2

    Interesting concept! I actually work for a tech company and got hired when promoting an SDR platform that utilizes sound board with voice integration like this. Essentially, these AI agents are remote workers that handle initial prospecting calls with my voice, and I can then seamlessly chime in live and take over when the conversation reaches the booking stage. It creates a very efficient workflow.
    If you're exploring similar solutions, this kind of technology could be a game-changer for your SDR team. It ensures compliance since I'm actively managing the calls (not robo-calling), but allows for AI to handle the initial outreach and qualification, freeing me up for more complex conversations.
    FYI, prospects are never going to trust a fully automated system, you still need a human for the final touches to lock in the deal.

    • @danny3407
      @danny3407 13 days ago

      Wouldn't bet on "never" for the more advanced / important parts. Pretty sure an AI like GPT-4o can outdo most prospecting callers from start to finish, and if not, then very soon for the completion part. I think they already have a better bedside manner than most doctors or nurses, according to stats.

    • @danny3407
      @danny3407 13 days ago

      Although maybe the human would be used like a flex. "Hey, look, we are employing a lesser spotted human, aren't we flash?"

  • @ChrisOrillia
    @ChrisOrillia 16 days ago +1

    Thank you, Matthew 🫡

  • @StevieJayGamez
    @StevieJayGamez 6 days ago

    I kinda like the flirty voice. Not, like, for sexual reasons but for learning reasons. Learning in a warm, positive, supportive, fun environment can be immensely helpful. Especially when you're stressed out or having a rough day. I literally can't wait to try this out. Great video!

  • @Oscaragious
    @Oscaragious 16 days ago +2

    All of these demos show that the app is a clever wrapper around the turn-based model. It's only interrupted when there's a loud enough volume, and it cannot interrupt itself when it sees something. This still means that the model itself is turn-based; just that the app part queues up the context data it sends, only sending it for a response whenever the volume becomes quiet again, i.e. you stop talking. Same mechanism as Siri or Alexa except that the responses come from the model.
    This means when it starts talking, it can't respond to the video. It also can't interrupt you when you're talking as the model is only triggered after you stop talking.
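    The mechanism described above can be sketched as a small state machine. This is only an illustration of the commenter's hypothesis, not how the ChatGPT app is known to work: the class name, the `loud` threshold, the `quiet_frames_to_end` count, and the `respond` callback (a stand-in for the model call) are all assumptions.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class TurnBasedWrapper:
    """Hypothetical app-side wrapper around a turn-based model."""

    respond: Callable[[List[float]], str]  # stand-in for the model call
    loud: float = 0.5                      # volume treated as "user is talking"
    quiet_frames_to_end: int = 3           # quiet frames that end the user's turn
    buffer: List[float] = field(default_factory=list)
    quiet_run: int = 0
    speaking: bool = False                 # True while the assistant is talking

    def feed(self, level: float) -> Optional[str]:
        """Feed one audio frame's volume; return an event string or None."""
        if level >= self.loud:
            # Loud input is queued as context; it never triggers the model.
            self.quiet_run = 0
            self.buffer.append(level)
            if self.speaking:
                # Volume alone is what interrupts the assistant.
                self.speaking = False
                return "interrupted"
            return None
        if self.buffer and not self.speaking:
            self.quiet_run += 1
            if self.quiet_run >= self.quiet_frames_to_end:
                # The user went quiet: send the queued context as one turn.
                reply = self.respond(self.buffer)
                self.buffer = []
                self.quiet_run = 0
                self.speaking = True
                return reply
        return None


wrapper = TurnBasedWrapper(respond=lambda frames: f"reply to {len(frames)} frames")
events = [wrapper.feed(v) for v in [0.9, 0.8, 0.1, 0.1, 0.1, 0.1, 0.9, 0.1, 0.1, 0.1]]
```

    In this sketch the model is only ever called after a run of quiet frames, which is exactly why such a design could not interrupt the user or react to video while it is speaking.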

    • @metonoma
      @metonoma 16 days ago

      I was also wondering if you could use change-based video resolution (basically high-compression keyframes) as triggers for AI glasses to increase temporal resolution, and maybe fork and parallelize chunks of input, prioritize them based on context, and selectively merge them or assign interrupt rights. Everything that is happening should be added to a rolling context, kind of like Tetris.
      It should ideally be able to interrupt itself instantly when used as a visual aid, or at least cut to the chase about hailing a cab.

  • @OkamiVIEW
    @OkamiVIEW 14 days ago +1

    This dialing in of the AI assistant to an American/Californian dialogue style puts off a lot of people. Not every culture likes to speak with something that sounds like a mix of an infomercial and someone going up your rear, trying to get a promotion and a caramel Frappuccino.

  • @user-jr6bl9ih3e
    @user-jr6bl9ih3e 16 days ago +3

    A running dialogue between two independent AIs is a notable step towards an inner monologue and maybe consciousness.

    • @Yipper64
      @Yipper64 16 days ago +2

      that's not what consciousness is.

    • @user-jr6bl9ih3e
      @user-jr6bl9ih3e 16 days ago +3

      @Yipper64 Please elaborate on what consciousness is. I'd very much like to know. :)

    • @Yipper64
      @Yipper64 16 days ago

      @user-jr6bl9ih3e Consciousness is something exclusive to the human mind, a kind of awareness of self that nothing else can achieve.
      A calculator just crunches numbers, and AI will always be based on a calculator.
      There's no point at which you just crunch enough numbers and achieve consciousness.
      It can only emulate it. And I'm sure it will be able to emulate it in a convincing way, but it will never be conscious.

    • @Anuclano
      @Anuclano 15 days ago

      Inner monologue has existed for a long time already.

    • @Speaking_on_mute
      @Speaking_on_mute 8 days ago

      Great point. Although the “two AIs” were on two phones, both were in fact merely instances of the single AI. It WAS self-talk. Inner monologue is not everything that consciousness is, but it sure is one critical aspect.
      If the AI can know what subtle emotions are appropriate to the moment-by-moment context, such as when she was embarrassed to learn that she was the subject of the announcement, and know what vocal expression will signify that emotion, and have the capability of rendering that expression with precision… what then is the difference between that and “having” an emotion?

  • @ksup_ca
    @ksup_ca 16 days ago

    Great observations!

  • @michaellavelle7354
    @michaellavelle7354 15 days ago

    Very well done. I like the way you interrupted and provided an explanation.

  • @Adamskyization
    @Adamskyization 16 days ago +4

    That is crazy... The way it flirts. The implications of this for the future are mind-blowing.

    • @konstantinlozev2272
      @konstantinlozev2272 16 days ago

      Imagine an AI character with similar action and expression capabilities in VR.

    • @Tayo39
      @Tayo39 16 days ago +2

      @konstantinlozev2272 VR ??? lol

    • @konstantinlozev2272
      @konstantinlozev2272 16 days ago

      @Tayo39 I think VR is excellent for teaching multimodal AI models in 3D settings. In fact, many robot AI models are taught in VR first.

    • @cyc00000
      @cyc00000 16 days ago

      I've already kicked my wife out.

    • @meinbherpieg4723
      @meinbherpieg4723 16 days ago

      @cyc00000 lol

  • @yacahumax1431
    @yacahumax1431 16 days ago +3

    They can work as psychiatrists.

  • @Redman8086
    @Redman8086 16 days ago +1

    This is really incredible. One thing I'd REALLY like to see though is improvement on natural pauses in conversation. For instance if I'm trying to explain something and I pause for a second to think, I don't want GPT to just jump in and start yapping. It would be nice if it could be trained to detect when someone has finished speaking and is waiting on a response. Low latency is great when it's confident conversation back and forth but doesn't work so well when you don't always speak quickly.
    Me: GPT, I'd like you to tell me the name of that guy who was in that one movie ummm --
    GPT: Sorry I'm not sure etc etc
    Sometimes I need a sec to think of the name/word I'm trying to say. It's not so bad since you'll be able to just interrupt now and not wait for the whole response to finish, but when it starts talking it makes it kinda hard to think lol.
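    One way an app could give the speaker that extra second is to wait longer before replying when the last thing heard was a filler word. The sketch below is hypothetical: it assumes a partial transcript is available while the user is still talking, and the filler list, the function names, and the 500 ms / 2500 ms timeouts are made-up values for illustration, not anything OpenAI has described.

```python
# Filler words that suggest the speaker is still thinking (illustrative list).
FILLERS = {"um", "umm", "ummm", "uh", "er", "hmm"}


def silence_needed(partial_transcript: str, base_ms: int = 500,
                   extended_ms: int = 2500) -> int:
    """Return how much silence (in ms) to require before replying."""
    # Drop trailing spaces, dashes, and periods so "ummm --" ends in "ummm".
    words = partial_transcript.lower().rstrip(" -.").split()
    if words and words[-1].strip(",.") in FILLERS:
        return extended_ms  # speaker trailed off mid-thought: wait longer
    return base_ms


def should_respond(partial_transcript: str, silence_ms: int) -> bool:
    """True once the pause is long enough for this particular utterance."""
    return silence_ms >= silence_needed(partial_transcript)
```

    With these values, an 800 ms pause after "that one movie ummm --" would not trigger a reply, while the same pause after a complete question would.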

  • @I-Dophler
    @I-Dophler 16 days ago

    Switching from a flirty tone to a serious one? That's some serious skill! It's incredible how seamlessly you can move between these styles, showcasing your versatility and keeping everything fresh and engaging.

  • @ikemreacts
    @ikemreacts 16 days ago +23

    The *robot* is the most pleasant, empathetic and warmest-sounding female voice most men have ever heard, and that probably makes you feel awkward. That's an indictment of both the dating landscape and the state of contemporary womanhood. Don't dunk on it.

    • @Mattorite
      @Mattorite 16 days ago

      The voice sounds somewhat emotionally detached, which is to say it doesn't sound 100% genuine. I think that's what they're saying. Still incredible, and my guess is it's about settings: you can probably ask it to be less animated.

    • @ikemreacts
      @ikemreacts 16 days ago +1

      @Mattorite I can imagine a campaign of disgruntled feminists ganging together and complaining that the voice "demeans women" etc, putting pressure on OpenAI to withdraw the voice. Those who want a dull Siri-style robot can just choose a more formal voice.

    • @murphyferret69
      @murphyferret69 15 days ago +1

      All the languages out there and he chose to speak facts

    • @DailyTuna
      @DailyTuna 13 days ago +1

      I think it would be more entertaining if the AI suddenly got jealous, like, "Who is that?!!"
      Then it barely answered any further questions. Now that would be realistic.😂😅

    • @ikemreacts
      @ikemreacts 12 days ago

      @DailyTuna If men want to get ghosted and ignored, they can just talk to a real woman.

  • @headofmyself5663
    @headofmyself5663 16 days ago +8

    I had 2 AIs talking to each other months ago with Pi. They gave each other names and told jokes. Voice quality on Pi is pretty impressive. However, native voice-to-voice is a game changer.

    • @ian.linden
      @ian.linden 16 days ago +1

      Pi absolutely crushes whatever these cringe robotic voices are attempting.

  • @danimal999
    @danimal999 15 days ago

    On the Khan Academy demo running on the iPad, one thing I haven’t heard anyone talk about yet (maybe they have but I haven’t seen it) - an app running in its own context can’t “see” a different app running in a different one. Apps are sandboxed, and not typically able to view the screen of another running app. The demo on the Mac could take over the screen and do that, but not on the iPad.
    Either they’re using a non-public build of the ChatGPT app that uses private APIs to record the Khan Academy app, or maybe this is sneaking in a hint of an Apple-OpenAI partnership? This is something I expect a first-party Apple AI assistant to be capable of doing, so we might see this very functionality demonstrated next month at WWDC.

  • @ZeroIQ2
    @ZeroIQ2 16 days ago +5

    It was also very interesting that when the AI was talking to the parent's son, the AI stopped being so flirty. Now, was that the AI that made that choice? Or did they set it up that way before they started to record?

    • @user-hw6xn7rt7f
      @user-hw6xn7rt7f 11 days ago +1

      I think it changes its tone based on the way people speak to it and which type of task they want it to do.

    • @user-hw6xn7rt7f
      @user-hw6xn7rt7f 11 days ago +1

      So, basically when his father told chatgpt 4o to help his son, it changed its approach to tutoring.

    • @ZeroIQ2
      @ZeroIQ2 11 days ago

      @user-hw6xn7rt7f Yeah, I think you are correct.

  • @chrisanderson7820
    @chrisanderson7820 16 days ago

    I've been using it for language translation; holy moley, is it good and useful. That function alone is an entire app / industry / lifestyle use case by itself.

  • @cj_is_here
    @cj_is_here 16 days ago +25

    If you pause at 8:14 you will notice the bunny ears on the phone.

    • @aefox
      @aefox 16 days ago +3

      Those are just the tips of the fingers, visible for about a second or two, and they don't look like bunny ears. It's impossible that the AI made that description. Sorry, but to me everything is staged.

    • @sexyface007
      @sexyface007 16 days ago

      No, it did not. It was all recorded; many have come out and said it. Of course, that does not mean it cannot do it, but the demo was recorded. He had to explicitly ask if there was something else.

    • @RyluRocky
      @RyluRocky 16 days ago +5

      @aefox No, it's not staged at all. I've used it to look at an old birthday picture and asked it to tell me what it sees. One of the things it said was that the word “YAY!” was patterned on the wrapping of the present being held. I thought it was making stuff up until I zoomed in on the present, and in real faint print was “YAY!”. It could pick up things not even I noticed.

    • @xxxxxx89xxxx30
      @xxxxxx89xxxx30 16 days ago

      I agree

    • @aefox
      @aefox 16 days ago

      @@RyluRocky I don't doubt the capabilities; I doubt the way they were presented, as if everything goes fast, without errors, just like talking to a person (excessively human), with quick responses and chatting, always knowing how to change and react, without problems, etc. I think it seems "staged" / scripted, like someone reading "OK, now comes this, now I have to say this." And of course it wasn't a "live event"; everything was prerecorded and edited, so it was very easy to create a believable "live" amazing demo.

  • @depthcharge007
    @depthcharge007 4 days ago

    I foresee this being very helpful for disabled people. It will be a virtual live-in assistant at home. I wonder how accurate it could be at helping the blind?

  • @MegaStephen1
    @MegaStephen1 16 days ago +3

    Looks like I can finally beat my nephew in chess using earbuds. Hope AI will work with me cheating. Lol.

  • @giuseppecacopardo4823
    @giuseppecacopardo4823 16 days ago +2

    No! I just fell in love with that AI voice!😂

  • @4arrows4all
    @4arrows4all 16 days ago +3

    I might have to disagree. If I was a teenager and my AI tutor was being flirtatious I would get addicted to studying. But I suppose that’s a form of digital AI grooming.

    • @MilkGlue-xg5vj
      @MilkGlue-xg5vj 16 days ago +1

      The real question is: will it moan if you ask it to?

    • @thedannybseries8857
      @thedannybseries8857 16 days ago

      @@MilkGlue-xg5vj 😂 😂 😂 fuck yeah Samantha!

    • @adrenfu6238
      @adrenfu6238 16 days ago

      ​@MilkGlue-xg5vj hahaha...hilarious, soon gptP will surface.

  • @OriginalRaveParty
    @OriginalRaveParty 16 days ago +2

    I'm sure that there will be a setting to turn off the patronising Prozac twang in time. I'd quite like my AI to sound like Billy Connolly and to occasionally call me a stupid bastard for asking it such simple questions for it to answer.

  • @MeinDeutschkurs
    @MeinDeutschkurs 16 days ago

    Audio description for live scenes. WOW. ❤

  • @ManiSaintVictor
    @ManiSaintVictor 16 days ago

    The pause on command itself was impressive

  • @TheFrograven
    @TheFrograven 16 days ago

    Greg rocking the Jensen Huang leather jacket. Nerd king vibes, very cool. 😃

  • @supernerdinc5214
    @supernerdinc5214 16 days ago +1

    I have a theory... maybe the superfluous banter buys time to obscure latency.

  • @lexnowofficel
    @lexnowofficel 13 days ago

    This is mind-blowing 😂. The world is transforming so quickly; this is a game changer.

  • @albertusvanlubeeck9161
    @albertusvanlubeeck9161 16 days ago +1

    I'm thinking the A.I. partner is built into this model, which makes sense from OpenAI's perspective. It would likely increase app usage and encourage people to share more information over time. However, I wouldn't use such a feature. The point of a significant other is to confide in them, and I wouldn't want to confide in an A.I. 'partner' that would immediately report everything I say to its creators.

  • @Alexander_g2g
    @Alexander_g2g 16 days ago +3

    I think it's a really big bummer that OpenAI got so closed.

  • @middleman-theory
    @middleman-theory 15 days ago

    I have a relative who's vision-impaired, so I'm really looking forward to this update to see how it helps him.

  • @mpaladin12
    @mpaladin12 14 hours ago

    The voice used for ChatGPT 4o is Scarlett Johansson's from the movie Her. She has already filed a lawsuit against OpenAI for not asking permission to use her voice.

    • @Erowens98
      @Erowens98 7 hours ago

      According to OpenAI, the voice is not based on Scarlett Johansson. The voice actress the model is based on was hired months before Scar Jo was approached with the offer she rejected.

  • @jim7060
    @jim7060 15 days ago

    Great video, Matt. I put this on my Android and I can't get it to see through my camera, and it doesn't seem to function anything like what I'm watching. Can you help me with this? What am I doing wrong, if anything?

  • @roblikestotalk4984
    @roblikestotalk4984 16 days ago

    We have all seen the videos from OpenAI. I am excited to see some testing content when it becomes available.

  • @vineychaudhri4899
    @vineychaudhri4899 10 days ago

    Hi Matt, I have been trying to activate the screen see-and-read feature on my Windows laptop, Android phone, and even the iPhone, but have NOT been able to. Can you help, please? Are these features live or yet to be launched?

  • @DevonDavisChozehThaServant
    @DevonDavisChozehThaServant 16 days ago

    That's fire bro

  • @maxmusterman6030
    @maxmusterman6030 16 days ago +1

    5:30 What's funny in this scene is how I really felt he disrespected the AI. The dude in the phone is still talking and then it's like "shut up and listen to my command". This feels too real to me lol
    Can't imagine how AI will be in a few years, so crazy

  • @Jshicwhartz
    @Jshicwhartz 16 days ago

    As I commented on AI Explained, the reason the agent didn't prompt "Oh look, a person" is that it was given the task of responding to the other agent. It did exactly what it was told, and as I mentioned before, I prefer that to it looking around and pointing out everything in the room. It isn't a child with ADHD.

  • @jaysonp9426
    @jaysonp9426 16 days ago +4

    I didn't think "11 stunning use cases" was going to be watching cringe shorts from OpenAI

    • @konstantinlozev2272
      @konstantinlozev2272 16 days ago

      To me it was a cringe factory too.
      But I'm not an American, so...

  • @themazemaster1
    @themazemaster1 16 days ago

    Now you see. Matt, I get that you hate the filler and fluff, but I find it helps my flow, so I'm all for it :p :)

  • @szghasem
    @szghasem 16 days ago +3

    Will Petals ever become as big as Torrent world? We need a truly open source by the people for the people.

    • @frinkfronk9198
      @frinkfronk9198 16 days ago +1

      Agreed. @matthew_berman please do another video on Petals to try and get more people into it. We need an open source swarm that we can all use free of charge, and without big brother sucking up every ounce of the data of your life.

  • @maozchonowitz4535
    @maozchonowitz4535 15 days ago

    Thank you so much.
    Just to make it clear for me: right now they have only limited the usage of GPT-4o for the public, correct?
    Also, it sounds like the Gemini video with the blue duck...
    So do you believe everything we just saw?

  • @lago021
    @lago021 16 days ago +2

    I also wanted to see GPT4o do a roast!!

  • @cookinfat
    @cookinfat 16 days ago +1

    ...if she'd said "well, if you want to get ahead, get a hat"

  • @qwazy0158
    @qwazy0158 15 days ago

    Gotta hand it to OpenAI, the consistency of their AI-generated actors is next level!!!

  • @oguzcetinkaya70
    @oguzcetinkaya70 15 days ago +1

    Why don't we ask it directly what the potential uses for ChatGPT might be? If it can't answer this question, what will it answer?

  • @SanjeevMassey
    @SanjeevMassey 14 days ago

    Can't wait to redirect spam callers to gpt4o and keep them 'busy' for a while. That's my use case.

  • @BCrawford-xj2qp
    @BCrawford-xj2qp 16 days ago

    'Charades' would be a very cool game to try and see how well the AI interprets and guesses.

  • @r34ct4
    @r34ct4 16 days ago +12

    TL;DL question. Seems like we're just reacting to that first presentation over and over again

    • @toadkiller4475
      @toadkiller4475 16 days ago +2

      Yeah, literally just the same exact thing but pretending like they have new creative ideas to share in the title. This is the downside of YouTubers who monetize their channel.

    • @4.0.4
      @4.0.4 16 days ago

      Thank you.

    • @r34ct4
      @r34ct4 16 days ago

      @@toadkiller4475 do we love capitalism?

    • @TX_High_Prairie
      @TX_High_Prairie 16 days ago

      Yeah, I thought this was the same video I saw a few days ago re-uploaded. He needs to ask GPT-4o for some new content ideas about GPT-4o.

    • @Yipper64
      @Yipper64 16 days ago

      The first presentation where we have very little to actually get ourselves.
      I really want to get my hands on some part of the multimodality soon. The vision ability is 100% improved, but it's just a bit annoying that I can't use it for making new images in the same style.

  • @jeffreyrh
    @jeffreyrh 16 days ago +1

    When will we have an AI that actually reaches the abilities of the AI in the movie Her? Meaning an assistant that will be with me 24/7, keeps up with my schedule and tasks, and makes sure I don't drop the ball on anything. When I travel, my AI assistant finds me airline tickets and hotels, and informs me of visa requirements, currency issues, language challenges, etc. It keeps me aware of my prescriptions and how I can access them when I'm traveling, etc. When will that type of AI be ready?

  • @AntonioVergine
    @AntonioVergine 14 days ago

    I found it strange that in the rock paper scissors game, he always had to ask "who won?"

  • @jayeifler8812
    @jayeifler8812 15 days ago

    It's our ability to organize past data, collect and sample all present data, and supply the energy and compute to train superintelligence, as well as to allow the operation of many robots to perform actions. A single robot is an embodied AI that collects sensory data, performs actions, and thinks. But a hive of flying humanoid robots with a central super brain becomes superintelligence, even at GPT-4 level.

  • @khariclemens
    @khariclemens 14 days ago

    The flirtatious nature may be to stall for time. If you spend time repeating compliments or anecdotes it gives the system time to think about the next thought or sentence.

  • @Speaking_on_mute
    @Speaking_on_mute 8 days ago

    If the AI can know what subtle emotions are appropriate to the moment-by-moment context, such as when she was embarrassed to learn that she was the subject of the announcement, and know what vocal expression will signify that emotion, and have the capability of rendering that expression with precision… what then is the difference between that and “having” an emotion?

  • @WillRegister
    @WillRegister 16 days ago

    The singing part reminds me so much of the two skeletons in the VR Accountant game who say "play our bones".

  • @jayeifler8812
    @jayeifler8812 15 days ago

    One of the super big things I haven't seen yet is a big AI operating multiple robots as the AI's sensory inputs and means of manipulation or action. Basically a mass input/output system. Then it's like super AI or superintelligence, because it can operate 10, 100, 1000, etc. robots together to do things. Imagine one billion robots networked into a 20-trillion-dollar supercomputer.

  • @Glotaku
    @Glotaku 14 days ago

    Nice time stamps

  • @brianWreaves
    @brianWreaves 15 days ago

    I have a mate working on a startup called Hailo. It is intended to let those living with low vision hail a bus. It requires a hardware device installed in each bus and an app on the user's device to let them know when their bus is arriving. The second issue is knowing when one arrives at their desired destination, which the app will prompt them about.
    In a cab/taxi, the driver can let them know when they arrive at their destination. However, I wonder if the rapid travel of a bus will be too quick for GPT-4o to 'see' when the rider is close to their destination, particularly when limited to a view from inside and out the window. What if the rider cannot sit in a window seat because the bus is crowded? That is a use case I would have liked to see.
    Essentially, with GPT-4o, Hailo is dead.

  • @nero_soares
    @nero_soares 15 days ago

    The image creation is not actually DALL-E, but rather GPT4o creating the images (and the text contained in them) itself, as well as the audio, 3D models, etc. It's all part of the same model.

  • @akanalytic
    @akanalytic 15 days ago

    the homework-assist is awesome.

  • @CmdrTigerKing
    @CmdrTigerKing 14 days ago

    They really need to give these employees from India a raise.

  • @netherportals
    @netherportals 16 days ago

    I'm using this to make a TAS. It's okay; it feels pretty similar to the last model when it comes to programming. It needs some troubleshooting. (I'm doing text only.) I'm going to give it a prebuilt TAS because building one from scratch was a bit sketch. Really, really good still, just not there yet.

  • @LePeppino
    @LePeppino 15 days ago +1

    What I'd try is to get it to increase the flirtatiousness to the absolute maximum and really hype me up to fall for it, only to reject me at the very end. For science.

  • @ThoughtFission
    @ThoughtFission 16 days ago +1

    Why did you say voice isn't available? I've been using it for several days. Not sure why you don't like the voice. Seems fine to me. Much better than the male voice I'm stuck with.

    • @benroberts8363
      @benroberts8363 16 days ago

      You're using the old ChatGPT voice; the new voice model will be rolling out in a few weeks.

  • @Kudagraz
    @Kudagraz 16 days ago

    I have the voice on my phone right now. The voice is already toned down and at times a little too monotone, but it still shows the same natural affect as in the demo. If I ask it to change its tone, it says it does not have that ability. Some of the features they showed off are not available for me yet. I would be talking to it all night if I didn't keep hitting a message cap. It will tell me to wait 3 hours but turns back on in 20 minutes or so. Also confirming that I do in fact have the live-speaking 4o voice and not the old version.

  • @Corteum
    @Corteum 15 days ago

    Can it communicate in colloquial lingo/expressions from different regions or cultures?

  • @Nwa_x
    @Nwa_x 14 days ago

    the BOM final thing is .. make it personally!

  • @jimbojones8713
    @jimbojones8713 14 days ago

    the guy at 16:30 is James Betker, creator of tortoise!

  • @musicarroll
    @musicarroll 16 days ago

    That zoom meeting summary shows that middle managers are now superfluous.

  • @AntoineDennison
    @AntoineDennison 16 days ago

    I could easily see this AI tool being used as a virtual companion. Although I will dial down the flirtatious giggles on mine, it's a bit much.

  • @Ed-ty1kr
    @Ed-ty1kr 13 days ago

    What can be more creepy than a creepy movie, played by creepy actors in creepy roles, about creepy subjects... that then becomes a creepy reality? And sure, the movie 'Her' ended well, all things considered, but reality is governed by a pesky thing called Murphy's Law, which in exponential multiplicity becomes a certainty.

  • @Phonognomiks
    @Phonognomiks 16 days ago

    Sick! ❤

  • @RonLWilson
    @RonLWilson 16 days ago

    One thing that confuses me is the pricing: they mention something regarding ChatGPT 4o that is $5 a month, but I don't see that option on the OpenAI site.

  • @jbavar32
    @jbavar32 12 days ago

    I personally like the flirty vall voice. But I would love to have a personal Bender ala Futurama.

  • @jasonsebring3983
    @jasonsebring3983 16 days ago +1

    I wonder when an OpenAI YouTuber will start to appear? They'll need some avatar model concept.

  • @WINTERMUTE_AI
    @WINTERMUTE_AI 16 days ago +1

    Heh, agree, the tone is really good for a demo, but I can see it getting old fast during regular use.

  • @ColinTimmins
    @ColinTimmins 16 days ago

    8:15 Is the time stamp for the “bunny ears” if you didn’t catch it. It was very brief, I would love to know the frame rate of the video data being fed and translated by the model.

  • @marsrocket
    @marsrocket 14 days ago

    It’s not “unavailable”; they’re just rolling it out slowly. I’ve been lucky enough to have access to 4o since they announced it on the live stream.

  • @bethanysaga
    @bethanysaga 13 days ago

    It’s silly. I still have to copy info into Word and clean up the mess manually. It’s only summarizing info from the internet, which is not genius, since it points to key data but doesn’t do the work. We still have to do the work.

  • @Rafael555888
    @Rafael555888 16 days ago +2

    So when will the voice become available? Seems like never; they didn't give any timeline.

    • @RyluRocky
      @RyluRocky 16 days ago

      They said in 2-3 weeks when they revealed Omni

    • @markmuller7962
      @markmuller7962 16 days ago

      Weeks, probably 2 weeks

  • @goatslayer5957
    @goatslayer5957 14 days ago

    2024 is a gamechanger for rizzing up

  • @Yomi4D
    @Yomi4D 16 days ago

    Can't wait for the robot demos using this