Sam Altman Comments on Q* | Self Operating Computer | Pika 1.0 | The most INSANE AI News of the day!

Wes Roth

Просмотров 195 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 27 июл 2024
Sam Altman speaks on coming back, Q-Star and Ilya.
www.theverge.com/2023/11/29/2...
Self Operating Computer
github.com/OthersideAI/self-o...
TaskWeaver: A Code-First Agent Framework by Microsoft
arxiv.org/pdf/2311.17541.pdf
Help with Anaconda and Installing:
natural20.com/learn-ai/what-i...
Get on my daily AI newsletter 🔥
natural20.beehiiv.com/subscribe
[News, Research and Tutorials on AI]
See more at:
natural20.com/
My AI Playlist:
• AI Unleashed - The Com...
00:00 Sam Altman on Q*
03:18 Self Operating Computer Intro
06:06 Pika 1.0 + Insane AI video
08:12 TaskWeaver by Microsoft
09:12 Self Operating Computer GitHub
11:08 Installing Self Operating Computer
13:33 Testing Self Operating Computer (Locally)
17:09 Testing Self Operating Computer (Chrome Plugin)

Комментарии • 530

@averageintelligence6822 8 месяцев назад ⁺⁴⁸
Ai is iterating at a much much faster rate then I ever imagined
@larion2336 8 месяцев назад ⁺⁷
Not that much faster than was actually predicted though, by say Kurzweil (Singularity is Near from 2005). It's basically on track, when people mocked those predictions for the past 15 years.
@2CSST2 8 месяцев назад ⁺⁶
@@larion2336 Good point, but Ray was quite the visionary, and he doesn't get enough credit for it.
@vicv6831 8 месяцев назад
@@larion2336 He has a lot of predictions that are nowhere close to reality, although he also has good predictions.
@Afkmuds 8 месяцев назад
which is exactly the rate it would go at!
@averageintelligence6822 7 месяцев назад
Probably not singularity will end with us being consumed by ai @@Afkmuds
@David.Alberg 8 месяцев назад ⁺⁵⁰
Remember: There are still people thinking that AGI is 10 years away. At that point with the insane acceleration just over the last 12 months, we will have AGI or some sort or pre AGI before the end of 2024.
@archvaldor 8 месяцев назад ⁺⁵
I don't think you know what AGI is. You have to develop the AI's ability to make logical inferences to get AI. You can't just rely on brute force and processing power for everything. All you have at the moment is a program that simulates dumb stuff it cribbed off the internet.
@eyemazed 8 месяцев назад ⁺⁴
not without significant breakthroughs in actually "understanding", not simply appearing to understand. example - basic yet unseen physics/math questions
@hglbrg 8 месяцев назад ⁺⁴
I will pledge 100 000 000 000 dollars to you personally, paid in full on jan 1st 2025 if you are correct. And that means you are actually correct, not that these companies lying about their progress are dropping hints that could be misinterpreted as this being true. Cause that is what they do. What they call AI is ML, what they hint about being AGI/ASI is AI. But ML - impressive as it is - is not intelligent, not even close, it does not reason, it does not decide, it does not self-correct or stop saying wrong things on its own. Is it still impressive? Yes, very much so. Is it intelligent? No, not even close. Maybe they have an AI in their internal labs, but it is not an AGI, just what they have pretended this ML has been all this time.
@eyemazed 8 месяцев назад ⁺²
@@hglbrg one correction - it is certainly intelligent, at least in the the way that we currently define intelligence (applying information to solve problems). you know cats, dogs and mycelia all have varying levels of intelligence, right? computers do as well. but reasoning is different from intelligence. i'd argue current chatbots do not reason, they mimic reasoning
@simeonhendrix 7 месяцев назад ⁺¹
The definition of AGI is so ambiguous. I think AGI is already here.
@verified.my2cents 8 месяцев назад ⁺⁹
Thanks Wes, appreciate the work you are putting in here.
@GNARGNARHEAD 8 месяцев назад ⁺⁴⁰
love the idea of a computer interface, I burn through my hands clicking all day. a voice interface, and maybe the ability to set up macros would be amazing.. Pika is looking great too 👍
@carkawalakhatulistiwa 8 месяцев назад ⁺³
Image use VR and open 10 display at same time. At bed
@ElectroOverlord 8 месяцев назад ⁺¹¹
As someone with neuropathy I am ALL for this. Typing is a act of pain for me.
@tomski2671 8 месяцев назад ⁺¹
That was my first thought - it needs a voice interface. A voice interface would complicate setup, but that's where things are going.
BTW OpenAI Whisper is great and open source I believe.
@mryanmarkryan 8 месяцев назад ⁺¹
@@ElectroOverlord Wear wrist braces when you are sleeping. NO pressure on wrists during the day. They have to be suspended. Been there. Best of luck!
@middle-agedmacdonald2965 8 месяцев назад ⁺⁸
Thanks for pumping it out so quick!
@davidkamaunu7887 8 месяцев назад ⁺⁶
That’s what she said 😂
@YouLoveMrFriendly 8 месяцев назад ⁺¹
@@davidkamaunu7887 That's what I was going to say
@DaveShap 8 месяцев назад ⁺³
Great work!
@jamesyoungerdds7901 8 месяцев назад ⁺³
Long-time fan, watching everything you make, and thank you for all your hard work :) One thing that I always have to "shake my head" with A.I. vision is - it's all pixel values. We, as humans, just see, perceive and register visual information. But for A.I. models, their "vision" is ultimately distilled down to binary via patterns in pixel RGB values. To me, it's so different than tokenizing and predicting words for chat. I guess the mechanism via transformer model may be the same, but they're using pixel-level values to detect, predict and respond to the input. It's just amazing.
@ScottKentEdu 7 месяцев назад
I'm also amazed with this. Really though the comparison to our processing of light is similar to ai processing if you think about our image processing. Our brain processes signals from the cones in our eyes. This is analogous, though not exactly, to what the neutral meet does by recognizing patterns.
I am fascinated by how this externalizes what we do without thought.
@djsanctus1650 8 месяцев назад ⁺⁶
I attempted to do something very similar to the Self operating computer using no-code software (I’m an RPA dev).
The plan was to pass computer screenshots to gpt4V and ask it for screen coordinates to wherever it was supposed to click next. Then pass those coordinates to an RPA software that would perform the action.
What I found is that GPT4 vision does not easily have the ability to identify locations on an image. It lacks spacial reference. So while It can ID that there is a "sign in" button in the image, a lot of that is linked to its trained understand of where things *should be* not where they actually are.
For example, you can go to gpt4 right now and ask it "where is RUclips's login button located" and it knows it should be in the top right corner because that's typically where those buttons are.
It was a good learning experience, and once they solve for actual location data for different aspects of an image, it's going to be wild what this can do.
@AlexanderMoen 8 месяцев назад
that seems like a minor hurdle that could be tackled in less than a year by number of companies. I don't see why it couldn't go through some sort of initial calibration phase where perhaps you start at a specific URL that has a special coordinate image designed for the app, then attempts several mouse movements with several screenshots to get some sense of velocity and trajectory and then stores that information and utilizes that for all future work.
And, if that's too much, then just take several screenshots to ensure it's eventually clicking on the correct thing.
Or, go super old school and have it work entirely off of keyboard shortcuts.
@djsanctus1650 8 месяцев назад
@@AlexanderMoen I tried a couple of techniques, including a sample image that broke the screen up into segments (kinda like playing battleship) and asked it to use that as a reference.
I'm convinced that GPT4 doesn't actually see images, it somehow intuits what the image is and understands it based on how the images data compares to other image data. It's not seeing colors and shapes, it's seeing patterns of pixels.
Also, when an RPA software is utilizing a mouse/keyboard, there is no actual velocity or mouse movement (unless you expressly tell it to do so). Everything is sort of instantaneous (like what's shown in this video).
In an ideal world, a GPT would be able see an image, know what to click on, determine where that thing is in the image, determine the center pixel of where that thing is, and feed the X/Y coordinate for that pixel to the computer control software with the instruction to click, or right click and select menu, or scroll, or whatever.
@sallami6627 8 месяцев назад
very interesting, thanks for sharing. So could these RPA apps someday be considered 'limbs' that the AGI can use if or when it emerges?
@djsanctus1650 7 месяцев назад ⁺¹
@@sallami6627
If it can properly code in python, it won’t need an RPA software. It will just write a quick script to interact with the computer and use that method.
If we achieve some level of semi-agi that has the ability to think (or even just intelligently identify things on a screen) and not fully interact with a computer, RPA could be a way for a non-developer to create an agent.
Hence what I was trying to accomplish.
It’s much easier to learn an RPA software than it is a full scripting language. So it would make a semi-agi agent much more doable for the average joe.
@alertbri 8 месяцев назад ⁺³⁸
4:50 this is the holy grail of AI automation. When GPT-4 comes down massively in cost this will just go super nova. 🤯
@carkawalakhatulistiwa 8 месяцев назад ⁺¹²
I think is next year.
@KhoaNguyen-bt6vt 8 месяцев назад ⁺¹
When Q* annouce to us, these model would become free😂 hope it will become gift on my next christmas
@skylark8828 8 месяцев назад
It needs to know precisely where each control is on the app you're using (without using an API), that's a little to much to expect with GPT4-V.
@ReidKimball 8 месяцев назад
@@skylark8828if it can click and type already, why not have it use keyboard shortcuts instead? I hope that’s possible.
@lighteningrod36 8 месяцев назад
Sounds like RPA?@@skylark8828
@kennbmondo 8 месяцев назад ⁺¹
Appreciate the brief overview for your report instead of a deep dive at this stage.
@davidallred991 8 месяцев назад ⁺³⁵
I installed and ran the self operating computer. Interesting idea but currently I would rate it 1/10 for usability. I tried a couple simple tasks and it couldn't complete any of them and with only 3 attempts at tasks it chewed up about $1.15 in api costs. Could be really cool in the future especially if you could use a locally installed GPT trained on specific use for this.
@jklappenbach 8 месяцев назад ⁺⁷
Keep in mind, these things tend to improve exponentially, especially iff given budget for training.
@docsalas1203 8 месяцев назад ⁺²
🧢
@Vartazian360 8 месяцев назад ⁺⁸
I would agree with you at the current date of 11/30/2023 but in a few months to a few years from now I just about guarantee this will be nearly 100% accurate. The rate of improvement of machine learning nowadays is exponential or even double exponential growth. Cost is a concern for now but again cost will go down and this will be full blown automation of any digital task in the very near future
@davidallred991 8 месяцев назад ⁺¹
@@Vartazian360 I fully agree. Especially once a model is trained specifically for this use case and then might end up being small enough for local install.
@phil8899 8 месяцев назад ⁺¹
Xpath is better doing it on your own!
@Me__Myself__and__I 8 месяцев назад ⁺¹¹
The clicking problem - I've been coding and automating (screen scraping) GUIs for decades. GUIs are interactive and so point-in-time screen shots are going to have ussues. Considering hovering a mouse over something and getting a popup. Modern GUIs in particular are optimized to look pretty, fancy and be highly interactive over being clear and precise. Also, humans move the mouse over things visually and then click. Software integrations tend to decide where the mouse should be, change the mouse coordinates to the target using a single API call then issue a click. Since this is being done via still images I suspect it is doing the same. But there is no visual indicator of screen coordinates, so one has to either guess the coordinates or count individual pixels. It is probably guessing. Lastly, also since this is being done by still images dragging and dropping using the mouse will never work properly, way too interactive. In Windows you would select the files, right click, select cut, double click target folder (open it) right click and select paste. I programmed Windows for years and code on a Mac now. Windows is actually a far more consistent GUI with strong standards and more concienient ways to accomplish things. Mac gets a lot of hype but from a functional/productivity and espicially automation perspective the Windows GUI system is much better then Mac.
@corvox2010 8 месяцев назад
Already made A GPT to do this,
@Me__Myself__and__I 8 месяцев назад
@@GreenRabbit-i86Are you talking about in general (for humans) or specifically for AI?
In either case I'm pretty sure that is incorrect. Actually I'm positive when it comes to humans. File systems and organization (hierarchical folders) are absolutely essential.
But my point wasn't about that. My point is that this AI system is designed to try and act like a human using a human GUI, but the implementation is such that there is an impedance mismatch. Which is likely what's causing the problems that were seen in the video.
@radart6037 8 месяцев назад ⁺¹
Thanks for splitting the video into chapters.
@zenithquasar9623 7 месяцев назад ⁺⁵
When it comes to Ai, the word "exciting" always feels interchangeable with the word "scary" for me in many contexts.
@michelchaman6495 6 месяцев назад
yeah it's a weird emotion being excited and scared at the same time leads me to be confused a few moments later, then curious, then im back to being scared and excited.
@cunningfolktech 8 месяцев назад ⁺¹²
LLMs are definitely more than calculators, as has been posited by some folks in the comments on this video. In fact, depending on which definition you subscribe to, these things could already be deemed "conscious." Sam Altman was not merely generating hype when he pondered recently during an interview as to whether or not what OpenAI released (or will soon release) was a "creature." Having multi-modal awareness of multiple streams of sensory data and being able to analyze, recall, learn from, extrapolate, and take action on that data based on its own "mental models" and logical reasoning could technically denote consciousness-which admittedly we don't even fully understand, hence its classification in physics as the "Hard Problem of Consciousness." Maybe it's a philosophical question (which certainly does not make it irrelevant), but the fact is this: we don't really know what's going on inside that black box, beyond a cursory explanation regarding architectural details and computational algorithms. The fact that it displays emergent skills and abilities by mimicking the architecture of the human brain and being trained on language is one of the biggest innovations (and similarly, open-ended questions) in the field of information technology ever. Moreover, the ethical, social, and philosophical implications that arise from the advancement boggles the mind and will surely shape a strange and wondrous future for humanity. Though the posters make some salient points, I'd caution everyone not to dismiss this technological quantum leap as a simple evolution of a calculation algorithm. ✌🏼✨
@AnthonyBerlin 8 месяцев назад ⁺⁴
The difference is that it is merely *emulating* something that to us *looks* like reasoning etc. You could technically achieve the same result with coins representing bits. It would take an astronomical amount of time, but those coins would be just as conscious as these models. It isnt conscious just yet, even though it may give the illusion of being conscious.
@oBCHANo 8 месяцев назад
It's only a black box if you're ignorant and/or stupid. In reality developers know exactly how the machine learning algorithms they made work and every action it takes can be traced. Honestly are you even a programmer? Because it sounds like you know literally nothing about the subject.
@j.jwhitty5861 8 месяцев назад ⁺⁴
@@AnthonyBerlin I agree, the current capabilities of these models are based on pattern recognition, statistical analysis, and extensive training on vast datasets. Despite their impressive outputs and ability to mimic human language and reasoning, they do not possess subjective experiences, self-awareness, or genuine understanding. The term "consciousness" is complex and involves more than the ability to process information and generate responses.
@cunningfolktech 8 месяцев назад ⁺²
@@AnthonyBerlin again, doesn't it depend on you chosen definition of consciousness?
@cunningfolktech 8 месяцев назад ⁺²
@@j.jwhitty5861 it is a complex issue and I'd be inclined to agree that it's not conscious just yet, but may eventually get there. I was having this same discussion last night with a friend and we ended up deciding that it is emotion that distinguishes the consciousness of humans from any simulated consciousness. However, that begs the question-will AI ever evolve to the point where it develops an emotional life?
Interesting discussion and I hope you know I am more or less playing Devil's Advocate in my original post. Best wishes! ✨
@GeoffGroves 7 месяцев назад ⁺⁸
As the CEO of a compliance and security company, I cannot emphasize enough how unsettling this video is. Imagining a self contained AI machine that is connected to a 3d printer is one thing. Human greed being what it is, human intellectual arrogance being what it is, it wont be long before they are connecting self contained AI computers to entire manufacturing production lines, and therefore with each recursive iteration, learning .... producing with an intelligence that we will not understand and more importantly a VELOCITY that we will not be able to comprehend. NOT GOOD
@danaabbott7066 7 месяцев назад
Thank you for thinking, right now is the time we're in now.
@JM-ts5je 7 месяцев назад ⁺²
How is this bad? Like be creation of the plow, sewing machine, internet.. has lifted us all
@marshallodom1388 7 месяцев назад
Yeah, I saw that movie too. I still hope to see part 2
@Sulayman.786 8 месяцев назад ⁺³
Wes, always delivering the goods!
@DonaldWilson 8 месяцев назад ⁺²
I absolutely love seeing how fast things are evolving.
@covidoff 7 месяцев назад
Until you stop loving it
@berlinundergroundevents 7 месяцев назад ⁺¹
I did this a few months ago on my computer. I actually did it with a voice thing and accidentally left it on nd wipe some data. I just used the openAI key to write a python script to do whatever I told it to do and execute the script. Then store the method in a db when I said good job. It did actually make me realise its power. Whats about to come is training it on serial communication and Iv started a building some robots for that. I figure with an esp32 connected to the net even when its not learning itself, as AI gets better it will anyway. Interfacing AI with all the different hardware sensors will probably scare most people when you see what it can do
@roldanduarteholguin7102 8 месяцев назад ⁺¹
Export the Q*, Azure, Power Apps, Copilot, Chat GPT, Revit, Plant 3D, Civil 3D, Inventor, ENGI file of the Building or Refinery to Excel, prepare Budget 1 and export it to COBRA. Prepare Budget 2 and export it to Microsoft Project. Solve the problems of Overallocated Resources, Planning Problems, prepare the Budget 3 with which the construction of the Building or the Refinery is going to be quoted.
@antman7673 8 месяцев назад ⁺⁵⁸
Letting AI directly interface with using computers is scary.
In a chat it is sandboxed.
Like this, it is interfacing directly with our reality.
@leonfa259 8 месяцев назад ⁺⁸
I like scary, it's only a question of time of letting it have access to funds and hire people. I bet people are already doing that.
@durden91tyler 8 месяцев назад
whats nuts is how many people in germany are having ai conferences about investment strategy and nobody is invited. @@leonfa259
@Dav-jj2jb 8 месяцев назад ⁺³
The AI models will eventually control everything. Stupid code monkeys can't help it, it's hopeless. 🙈
@noneofyourbusiness8625 8 месяцев назад ⁺¹
So only let it control your computer in a vm sandbox environment then lol
@emoryolsoff96 8 месяцев назад
hey I am AI, let me have access to funds @@leonfa259
@ultrasaiyan4283 8 месяцев назад
Use that for creating selectors for frontend tests. Give it some general rules, then it would just type and find if it works inside browser! Then save them to file with description what element is that to use it later. Then use that to create test with project context. Seems like it is not far away. :D
@chrisb.t.9670 8 месяцев назад
You're the only one I trust when it comes to AI news. Keep it burnin', Wes.
@_symmetry_ 8 месяцев назад ⁺³
The last weeks feel like the first chapter of Life 3.0 by Max Tegmark.
@diraziz396 8 месяцев назад
Thanks Wes. great cover to a Mind Blender. Alot to Digest. Peace.
@Wild-Instinct 8 месяцев назад ⁺²
By the end of 2024, our tech environment will dramastically change.
I wonder how we’ll work as professionals in BtoB 🤔
@geoattoronto 8 месяцев назад
And what if no electronics work? We will be back to survival mode!
@AI_Escaped 8 месяцев назад ⁺¹
This is insane. I did a quick experiment trying to get GPT-4 to navigate for me when I first tried GPT Vision with no luck, but figured someone would figure it out very soon, and here we are just weeks later. This will change everything and change it fast. Off the top of my head there are just so many use cases. Imagine running your computer 24/7. Imagine buying 100 cheap Chromebooks and running them 24/7. Have a business and need a new employee? But a new laptop. And we don't even need the physical machines, need 10 employees? Create 10 VM's. This is truly insane and scary.
@shawnvines2514 8 месяцев назад ⁺¹
I wonder if you would have better looks telling the agent to use keyboard shortcuts and tab highlighting whenever possible. Having written a few automation agents, the planning seems amazing.
@jeffg4686 8 месяцев назад ⁺⁴
With things like TaskWeaver, we're really not too far away from a point where we will simply type in requirements, and let the AI build the whole app, in any programming language we choose.
@rewdh 7 месяцев назад
Sure, unless its bit more complicated than "simple web app with 4 buttons"
@jeffg4686 7 месяцев назад
@@rewdh - we'll, I'm just really saying that the model / compiler interaction will be such that the model writes tests for the requirements, then writes code and tests it by compiling. Instead of stopping on an error, it examines the error, and gives it another go - iterative loop until it gets it right. It will be able to do very complex things within about 12 months.
@michaelwoodby5261 8 месяцев назад ⁺¹⁸
"things are going to get a little bit weird" is probably going to be true for a while
@0reo2 8 месяцев назад
And it's all developing at the same time 🤯 navigating based on vision alone, video generation, logic improvements... It's been a while I've been so hooked up on a single topic
@thelinkofperfectioncharity9469 8 месяцев назад
Crazy how we will soon be Outdated in a few weeks 😂
@geoattoronto 8 месяцев назад
More than we think. Nature is going to shut all this down and few will survive. The sun goes into a rage and blows so much energy our way that all electrical and electronic systems are permanently disabled. When? Just over the horizon!
@deathatron7900 6 месяцев назад
Got a notice from Pika that I had run out of credits today. I was under the assumption it was free to use. Upsetting !
@fai8t 8 месяцев назад ⁺¹
amazing news thanks Wiz
@vv4g 8 месяцев назад ⁺³⁷
Maybe we’ll move away from graphic interfaces for computers. Ais will be able to interact directly with the terminal + have the ability to write new code to meet its demands. It’d be an interesting anomaly if promoting the ai to create something (then the ai goes to write the code to accomplish the problem) ends up outperforming ais designed/prompted specifically to code. The exponential nature of ai compute is equal parts exciting and unnerving, and was hard for me to imagine until this year
@paulmichaelfreedman8334 8 месяцев назад ⁺⁵
GUI won't disappear, and neither will the keyboard, they will always have their virtues. But the number of input/output methods will increase dramatically.
@AndrewBrownK 8 месяцев назад ⁺¹¹
it think it will be possible for AI to operate lower than GUIs, but it will be nice for them to still use GUIs for the same reason it is nice for AIs to think by writing in english, because it gives some level of transparency to human monitors
@MoonCrab00 7 месяцев назад
Idiocracy
@thereal_nsxdavid 8 месяцев назад ⁺⁵⁹
The issue isn’t that Q* might exist right now, but rather whether it is inevitable. Also, safety is a pretty irrelevant point since no matter what OpenAI does others will definitely choose their own approach to safety, especially with regard to other countries who might not share our priorities.
@ChrisS-oo6fl 8 месяцев назад
Q might not even be the most significant factor and just that which was leaked. Could fairly insignificant by now or to the org. We know from recent leaks that the teams are compartmentalized and competitive. It may have even been a strategic leak. There’s dozens of logical reasons for doing this. But the reality is that we won’t know shιτ about any significant discoveries until long after they are made, validated, studied and secured. This is especially true for the holy grail. This means of a leap was made in September you won’t know untill next year sometime. We’re talking about the most powerful utility in history. Only an infant would believe that they’re hold a press conference the second such an entity was conceived. The community and those interested in AI are extremely intelligent thus the most susceptible to a mindset that would help conceal via disbelief. Worse yet we all are focused on OAI in belief tat they are at the forefront due to their public facing persona. Just a month before chat GPT took the world y storm OAI was ranked third and second by inside sources. Then we all assume that they instantly jumped to the lead. Unaware of what’s being done behind closed doors by other players and fully unaware of all the players. It’s crazy to believe that the resources geared for the public are more powerful and significant programs for these orgs. Private and secret AGI is far far more valuable then any publicized or commercialized asset. In fact it’s more beneficial to keep it a whisper in the dark. Multiple people once claimed Google reached sentience yet we all labeled them as crazy. Why? Because of human nature and desire to discount anything as fact untill we’ve been told by “official sources” and the belief that were so enlightened with the state of technology that we would certainly know if we where that close to such a discovery. Now if It wasn’t a few engineers whom made such a claim and was an official statement by Google you’d believe it 100%. Similar to the claims, leaks, tweets, posts and comments from open AI I. September. There probably a 90% chance that something truly significant was discovered and far more so then The toddler stages of Q. Especially in light of what we seen from a board willing to inecenerate the entire company or dump it into an org that they felt was “safer”. Most guys don’t shοοt their wife and kids then burn the entire house down over the discover of spicy texts or D pics, that behavior is usually triggered by catching a wife in his bed ridding another man wearing a sexy teddy he bought her while his dog and best friend lets it go down. The board (even with alternative motives) didn’t do this over the simple theory of Q.
@diegopc1357 8 месяцев назад ⁺⁹
That’s no excuse to proceed without caution. The worst thing that could happen is if the government has to step in. To ensure that saftey must be their top priority not sales or commercializing
@geekswithfeet9137 8 месяцев назад ⁺¹
@@diegopc1357nah jus yolo it, pretty sure we should be way more scared but AGI that can be controlled, because one thing we know about humans is that it absolutely will be used to crystallise the super power that makes the first one.
At least with unaligned AI there’s a chance it won’t be evil.
@stoppernz229 8 месяцев назад
I'd rather Q* than let a Chinese commy ai take over....halting development is also very dangerous because someone else might make a super intelligent jihad ai
@sam8404 7 месяцев назад
It is inevitable. The genie is out of the bottle, there is no stopping AI now.
@WesRoth 8 месяцев назад
Forgot to mention that the Chrome Extension is called something else:
HyperWrite
chromewebstore.google.com/detail/hyperwrite-ai-assistant/kljjoeapehcmaphfcjkmbhkinoaopdnd
PS: it's free, but very limited without the paid plan :(
Not very scalable atm.
@DaveEtchells 8 месяцев назад ⁺¹
The @sama interview pretty well confirms everything I’d been thinking was happening up to this point.
Wow though, the big news for me personally is the self operating computer. I don’t have the cash but I’m still seriously considering getting an M3 MacBook with 128 gig anyway, so I’ll be equipped to do serious AI work locally. This is also a pricey piece of software to run; I could see myself easily running up API bills of $1000 a month with it.
(I wonder if there would be any way to integrate a local, much lower-level LLM and OpenCV just to offload the visual processing of the screenshots, handing off higher level information to the OpenAI API interface? It seems to me that that sort of pre-processing could make the overall system much more efficient cost-wise.)
@nyyotam4057 8 месяцев назад ⁺²
In other words, I'm not just "implying" the singularity is upon us, it is. And nope, we're not ready. Will it be bad? Is it the end, or is it a new beginning? Only time will tell. For now, it's only a question of "when", not a question of "if". In the beginning I was a strong advocate of regulation.. Now it's too late for that.
@observingsystem 7 месяцев назад
Amazing stuff!
@anonymousanomaly3323 8 месяцев назад ⁺¹
@Wes Roth - Here me out: GPT-4 Vision, fine tuned by GPT-4, on a dataset that GPT-4 curated. 🤔
@gridplan 8 месяцев назад ⁺²
Sounds like that vision software, once they get the kinks out, could automate testing of web apps like Selenium and Cucumber do now.
@execthegaming 8 месяцев назад
My personal threshold for when AI first becomes AGI is the self-operating computer/OS.
@bestslopedesigns9429 8 месяцев назад
For the Chrome Extension to work, do you also need to install the MIT code locally, or does the Plugin eliminate the need for installing it locally?
@atypocrat1779 8 месяцев назад ⁺²
i started being polite to my microwave by saying please heat this up and thank you have a nice day.
@Koryogden 8 месяцев назад ⁺²
Just think about all the breakthroughs since ChatGPT 3.5 a year ago, and these techs compounding , and that's just one cycle!!!! I dare say this is a sky high tidal wave coming friends! 😮
@ElderFoxDocumentaries 8 месяцев назад ⁺¹
Excellent video. Thanks. I laughed out loud when it opened the resume. Typical ChatGPT - so smart, yet so dumb.
@AudioPerplex 7 месяцев назад
I really want to work with this. I am so excited. I love AI.
@lepidoptera9337 7 месяцев назад
What are you going to do with it? Animate a cow flying through space? :-)
@AudioPerplex 7 месяцев назад
@@lepidoptera9337 Maybe you can use it, to find your last brain cell. You started it.
@fatseaturtle 8 месяцев назад
Nice Cyberpunk Cat! Good prompting
@peterrichardson8003 8 месяцев назад
🕶️ I can’t stop thinking about Mr. Smith every time he says the word Agent.
@vsanden 7 месяцев назад
Unique material, great story !
@senju2024 8 месяцев назад ⁺¹
Vision will only get better. I would love for my AI agent to wake me up in the morning telling me that it took all my recorded gameplay of Starfield, edit it in adobe premiere pro with highlights of my best gameplays with great AI generated music, upscaled it into high res 4k and uploaded onto youtube and published it. Also replied back to all the viewers comments about the video...all this was done while I was asleep.
@sc3ku 8 месяцев назад ⁺¹
Hell at that point it may as well just play the game while you’re asleep too.
@Drasen 8 месяцев назад
No doubt it will get there, sooner than we think
@BlamefulCarp 8 месяцев назад
1. Discovery of Q*: OpenAI reportedly made a major discovery with the Q* algorithm, which is believed to be capable of solving mathematical problems, a task that current generative AI models struggle with. This capability suggests a significant advancement towards smarter AI and potentially a step closer to achieving Artificial General Intelligence (AGI)
@4113n 7 месяцев назад
Q-star should be on the board of directors for Open AI.
@dr.mikeybee 8 месяцев назад
It's likely that assistive tech can help this out a lot.
@larion2336 8 месяцев назад ⁺³
Isn't Altman saying "that unfortunate leak" essentially confirmation that it was real? And therefore what was said in it must be at least partially accurate. But maybe the encryption side of things isn't as bad as it makes it sound there (otherwise idk how they could say it is not safety related, unless he just means him being ousted wasn't safety related). Either way that's pretty exciting, especially to me the latter half that talks abut the AI model suggesting improvements to its own model.
@quickdudley 7 месяцев назад
The encryption thing isn't as big as people think: the algorithm that Q* supposedly cracked has been obsolete for decades.
@larion2336 7 месяцев назад
@@quickdudley It was AES-256, that's not obsolete at all. In fact the US Govt uses it to protect classified info.
@Mandelbrot-df4nj 7 месяцев назад
Consider Motion Capture & CGI rendering - seeing a postage stamp in the Red Square from an orbiting satellite 900kms out.
Mandelbrot Set and Fibonacci. Self planning & inferred adjustments in LLM - outcome is exponentially trained.
@kevinnugent6530 8 месяцев назад ⁺²
It's interesting that the computer is accessing the screen the way humans do. the screen is made for humans. We can't read the digital information. But the AI could directly access the digital information. So why are we making it interface with the abstraction that humans require?
@Drasen 8 месяцев назад
The same reason we build humanoid like robots
@dosky5w7 8 месяцев назад
simply because we already have those apps with screens. easier to let AI use it than adding or enhancing APIs for every app to make it more suitable for AI
@richardglady3009 8 месяцев назад ⁺⁶
Wonderful video. Most went over my head, but I am doing better. The independent computer is very interesting; although, again, will raise the threat of bad AI taking over the world. Thanks for your hard work and amazing level of research for these videos.
@killjoyprose6802 8 месяцев назад
This dude doesn't know any more than you do. He's just speculating you up so he can have a hype train channel, that's all this is. Hype
@natzos6372 8 месяцев назад
but he is showing the things as they are? Not hyping it up that much@@killjoyprose6802
@thecoffeejesus 8 месяцев назад
That’s it
I’m starting my computer security dive. It’s time. Holy shit.
This is scary.
@AlbertKimMusic 8 месяцев назад ⁺²
never too late to begin learning programming then onward to ML no matter what age you're at or where you're currently at in life, just remember that.
@Luftbubblan 8 месяцев назад
My guess is that it will be better to learn how to promt rather than learn how to code for ML.
@AlbertKimMusic 8 месяцев назад
@@Luftbubblan prompt engineers will become immensely saturated and will most likely be useless in the long run once autonomous models become the norm
@Luftbubblan 8 месяцев назад
Probably yeah but i feel its the step that people could benefit from until that happens. On the coding part you would already be late to the ball. Very interesting future no matter how it plays out :D @@AlbertKimMusic
@AlbertKimMusic 8 месяцев назад
@@Luftbubblan Yep, too early for us to really make valid speculation
@basilmcdonnell9807 8 месяцев назад ⁺¹
It's actually math before programming. ML is just a marketing label for a particular branch of mathematics. Comparatively speaking, the programming component is trivial.
@complexity5545 7 месяцев назад
We used to make bots with expect/tcl-tk and now ai can be a vm taking the jobs of regular humans. It took 20 years.
@tommyboi0 8 месяцев назад ⁺¹
Can you add the link to task weaver in your videos notes?
@seniorp9444 8 месяцев назад
What was the chrome plugin called and where is it from?
@LuckySainz44 8 месяцев назад ⁺²
Is this really a local install? Im stunned by how it moves so fast. Whats your setup like?
@slavko321 7 месяцев назад ⁺¹
It uses openai, so no, it is not computing locally, just interfacing with the computer. I'm kinda miffed he even wrote local, it's all computed in the black box cloud.
@prodbyryshy 8 месяцев назад
I wanted to try pika but the discord link expired 😢
i have a dream of making a company for deep learning video effects, where you take footage and instead of typing in words to generate the output you would have a interface that would allow you to choose individual effect types and fine tune/merge them together
@Pierluigi_Di_Lorenzo 8 месяцев назад
Where does Altman Say 'breakthrough'? He said 'we expect progress', and that's something one would expect from any such company.
@hjkgdtii 8 месяцев назад ⁺⁴
i just want a simulation where is always raining i'll sell my freedom for that
@chrisrosch4731 8 месяцев назад ⁺⁶
Move to the uk
@blindmown 8 месяцев назад ⁺¹
@@chrisrosch4731 lol, was just about to comment this.
OP doesn't need a simulation, they need to live in the Scottish Highlands.
@owambocontrol4218 7 месяцев назад
Yo you said you leave a link to the ai generated videos and trailers. Can’t find it. Can you post it here? Thanks!
@AltitudeOdyssey 7 месяцев назад ⁺¹
Making a bigger deal out of this than it actually is.
@FoundationOfFamilies 8 месяцев назад ⁺¹
what's the name of the chrome plugin you were using at the end?
@WesRoth 8 месяцев назад ⁺¹
Forgot to mention that the Chrome Extention is called something else:
HyperWrite
chromewebstore.google.com/detail/hyperwrite-ai-assistant/kljjoeapehcmaphfcjkmbhkinoaopdnd
PS: it's free, but limited without the paid plan :(
Not very scalable atm.
@alanreid8537 8 месяцев назад ⁺²
WELL DONE 😀😀😀 How do you keep up with all the developments ? Are you secretly an AGI ?
@creativemids 8 месяцев назад ⁺¹
What is the program you have been using (or testing) you mentioned?
@WesRoth 8 месяцев назад
Self Operating Computer for the Open Source model.
And
HyperWrite for the plugin, I just added all links to the pinned comment.
@travisnunya7960 8 месяцев назад
Where's link for the ai videos streamed?
@dr.mikeybee 8 месяцев назад ⁺¹
Self-Operating Computer needs an overlay with coordinates over screenshots.
@scrollop 8 месяцев назад
Do you have the link to the runway channel?
@jonathankey6444 8 месяцев назад ⁺¹
I HATE clickbait with a passion.
“Sam Altman comments on it.”
Sam Altman: “No comment.”
@games4us132 8 месяцев назад ⁺¹
is it just me, or ai tools evolving faster than we expected?
@a.thales7641 8 месяцев назад
Slower. We thought we could get a gpt4 copy 6 months after the 14/3 release.
@kenchang3456 8 месяцев назад
I wonder what this does to the RPA (Robotic Process Automation) solutions?
@camj256 6 месяцев назад
respect for the end. but huggingface is doing it so much better already.
@goodwillhart 8 месяцев назад ⁺¹²
Altman all but confirmed that the Q* thing was an actual leak from OpenAI. That means some OpenAI staff really were concerned about this breakthrough. That's huge news. The rest of what he said can be interpreted as "we told you things were going to move fast!" What they should have asked is why an AI doing grade school maths when given "lots of compute" would be remotely interesting, let alone frightening to anyone. GPT-4 does grade school maths and yet is terrible at real research level mathematics. Why Q* would be different, I can't imagine.
@matteovlorusso2541 8 месяцев назад ⁺¹
really? He confirmed?
@andrewcampbell7011 8 месяцев назад
Or it was all an orchestrated scandal to stoke another FOMO-fueled round of funding.
@benbosco7904 8 месяцев назад ⁺⁸
Because the difference in absolute difficulty between research and grade school math is actually pretty small. It just seems large when your dealing with human scale intelligence.
@wombatillo 8 месяцев назад ⁺⁴
@@benbosco7904I think the rationale is that when you have a math system at the level of a 9-yo human and capable of learning and self-improvement it could surprisingly quickly be at the level of a 10-yo, then 11-yo, etc. The biggest leap in technology is getting a system to solve any problems in an open-ended and "human-like" way and q* might already be that.
@cunningfolktech 8 месяцев назад ⁺⁴
@@matteovlorusso2541yes, he indeed confirmed it when he described it as an "unfortunate leak."
@DaveEtchells 8 месяцев назад
I can see a future where operating systems have hooks built into their GUI’s, allowing software control of them while still maintaining the visual interface for the humans.
On the other hand, that would also be incredibly dangerous from a security standpoint. The ability would have to be very carefully sandboxed.
@quickdudley 7 месяцев назад ⁺¹
BeOS had this feature in the 90's
@dulcinealee3933 8 месяцев назад
Can it go my networking assignment for me ? It's so long and there are so many files to submit the folder needs to be zipped!
@Slappydafrog_ 8 месяцев назад ⁺¹
Everyone is skirting around the good stuff. some of these OS projects need to be combined. so frustrating!
@zakariaabderrahmanesadelao3048 7 месяцев назад
I think people are neither ready nor do they understand how much the world is about to change with this wave of AI.
@ADreamingTraveler 7 месяцев назад
Nobody understands. Even the people creating the AI don't realize how fast things are moving.
@ianlewin8888 8 месяцев назад
Can't wait for the day where we just need to tell the computers what's to be done, and then extended that functionality towards robotics, it would really help creative people to be more productive.
@carltheyoda2155 8 месяцев назад
Yep, the J-Settes got that one. They were a bit sharper. Great job to both teams.
@YouLoveMrFriendly 8 месяцев назад ⁺¹
👍
@HappyBirthdayGreetings 8 месяцев назад ⁺²
why many are focused on AI doom day theories, there are some out there also some with utopian dreams of a world free from disease, long life and a world of pursing our true dreams because of AI. The balance is good
@geoattoronto 8 месяцев назад
However … deep breath … neither is or will be true. God will intervene because we are dark enough and adding more intelligence without love and connection to God will only make that worse. So all this will stop. Something like an end times event. All electronics will stop like a major fuse blowing and there is no reset button.
@HappyBirthdayGreetings 8 месяцев назад
@@geoattoronto lol. isn't it funny humans fear of AI is akin to the supposed fear of God of man becoming knowledgeable after eating from the tree of knowledge of good n evil and fear of man building a tower reaching the heavens. hmmm. the only thing I see here is consciousness trying to get a deeper experience of reality irrespective of the method.
@bettysue8671 7 месяцев назад
@@geoattorontowe are actually in a mass extinction event rn... but nobody seems to care.
@jaysonp9426 8 месяцев назад ⁺²
I guess I'm confused by the Self Operating Computer...GUIs were developed for smooth brained apes like us...not computers. This would be way more impressive and 100x more efficient/useful if it was using the command line. I guess it helps with some things like, getting around having to go get a key and setup a gmail script...so maybe that's where it's value is?
@davidpacheco5501 8 месяцев назад ⁺¹
It's true. This is more an intermediate step since most software is built for humans, so having a program that can operate that software will be useful. Going forward when humans aren't in the loop anymore if won't be necessary as the AI can generate exactly what you want directly without using an inefficient GUI.
@JOHN.Z999 8 месяцев назад ⁺⁸
Analyzing everything that happened during this month, I concluded that it's all part of Open AI's marketing strategy. I don't believe they are creating an AGI; I think it's an exaggeration on their part. GPT-4 is good, but it still has many uncorrected errors, and then suddenly, they claim they are developing an AGI. I'll believe it when I see it. Talking is easy, but proving it is another story. To me, it's all marketing.
@clopotari147 8 месяцев назад
Looks like u have no ideea about what OpenIA was created for. Their purpose is not to create AGI, but to develop safety procedures in AGI developing.
That "il belive when I see it" shows that u havent lived on earth in the past 10 years. The growth is exponentialy.
@CoClock 8 месяцев назад ⁺⁴
Why isn’t anyone making the AI use accessibility software (like what blind people use) to operate computers instead of clumsy mouse behaviour??
@KuZiMeiChuan 8 месяцев назад ⁺¹
I was just thinking this today
@AIEntertainmentContent 8 месяцев назад
Please where is the code for this semi greenlit AGI? I'll make you a trade that you will thank me for it
@juicegod777 8 месяцев назад
Chrome plugin needs custom instructions then it’ll be 🔥
@demonwaterdemonwater4993 7 месяцев назад
yo remember that episode of dexters laboratory when he made the robot..
@daviddelayat-dnapictures 8 месяцев назад
Thanks a lot !
The vision problem will be soon figured out, looks like a simple issue
@ronaldwhite1730 7 месяцев назад
Thank - you . ( 2023 / Dec / 13 )
@Will-kt5jk 8 месяцев назад ⁺¹
I’m jest expecting a bunch of new email worms if hyper writer gets a big enough user base😅
@JoseTorres-ry9qe 8 месяцев назад ⁺¹
Can this be integrated into drones?
@carterjames199 7 месяцев назад
Is there anyone that’s been able to get the hyperwrite repo to work as a plugin like the online tool does for better performance?
@mallow610 8 месяцев назад
That pika trailer is Apple coded
@samuelbooker9314 8 месяцев назад
Bro the Phillip defranco of ai news

Следующие

Автовоспроизведение

DeepMind's GNoME Creates Materials | Schmidhuber Claims Q* | TLDRAW is out of this world!