The 517,431 Emails That Trained Siri
HTML-код
- Опубликовано: 16 май 2024
- Use code HAI50 to get 50% OFF your first Factor box plus 20% off your next box at bit.ly/48fuIf7
Get a Half as Interesting t-shirt: standard.tv/collections/half-...
Suggest a video: halfasinteresting.com/suggest
Follow Sam from Half as Interesting on Instagram: / sam.from.wendover
Follow Half as Interesting on Twitter: / halfinteresting
Discuss this video on Reddit: / halfasinteresting
Video written by Amy Muller
Check out our other channels: / wendoverproductions
/ jetlagthegame
Imagine someone casually watching this video, suddenly recognizing one of their emails (like at 3:00) and only now finding out that person hated them after all.
If you wrote emails like these and still don't think this would make people uncomfortable, maybe it IS high time for some reflection
It was you, wasnt it?
@@TotalDrganMania no lol, I just thought of it XD!
I like the idea of him waking up after a long night of eating dino nuggets like "oh god what did i do last night?" covered in crumbs and fried nugget bits
E
E
them dino nuggets were dipped in alcohol huh
Imagine getting dumped via email 20+ years ago, only to have that memory unlocked while harmlessly watching a RUclips video... poor Jason. 🤣Shout out to Kyle, too.
Lol one of the uses for the Enron emails is "fraud detection". Really couldn't have found a better dataset there...
I love how sam has been carefully training us to watch longer, better quality videos over the past few months.
3/4 as interesting
At least longer
I have several times had the job of reading through emails like this. I am a lawyer and hae worked document review a bunch of times. Assume that strangers WILL read your corporation's emails at some point. Companies get involved in litigation and then send massive quantities of emails to teams of young lawyers fresh out of law school to look for anything that is relevant to the case. In the process, they read EVERYTHING (depending on the budget, smaller budget cases may rely on searches, but usually that isn't considered sufficient).
I've seen adult material, discussions of affairs, and all kinds of other things. I wasn't particularly the type to talk about the juicy emails with my colleagues... but some people were. They take confidentiality seriously, so the emails don't leave the secure setting. But people will discuss them in that setting and discuss them without names attached in other places.
Do people still do casual informal talks by email?
@@srpenguinbrYes. And now, lawyers get all slack or teams chats now too.
Legit leaked the office gc 😭
@@jamiemeicheng6589 Sometimes we get entire hard drive images.
I'm a lawyer working in legal tech. We use the Enron data to demo our AI doc review product to show how effective it is at sentiment analysis lol.
Hey Siri how's the weather tomorrow?
-What the f*** is The Lion King?
Excuse me!?
-The movie is a movie where a kid bullies ants.
What...
E
E
Ugh.
Really?
LMAO 🤣😂
I still have more unread emails
If you train a large language model on them and then ask it questions, you get a gist of what's in them.
Hey Siri, what would be a good comment
first good comment that is on topic and not a bot, insane
@@ezikhoyofrrr they’re getting rarer
Here’s what I found on the web for ‘what would be a good comment’.
I’m not sure I understand
Siri: "FIRST!!!"
I've uploaded this data set (or a portion of it) thousands of times to run performance tests. Shoutouts to Andy Zipper.
E
E
E indeed.
GI/GO isn't just an AI idea. It's been around pretty much since the beginning of computer science.
Babbage is famously quoted as having been asked "Pray, Mr. Babbage, if you put into the machine wrong figures, will the right answers come out?" so the concept literally predates computer science a as discipline.
Just because he said it's a principle of AI doesn't necessarily mean it's an idea _from_ AI. Unless I'm wrong, which is possible.
@@NightytimeExtras I think you're completely correct, isn't that the whole basis for 'high quality ingredients' and the like? High quality products from high quality resources.
Although no doubt that nowadays AI has taken quite a shine to the term and you'll hear a lot about the quality of the data they're using to train them.
He never said is was just an AI idea, or that it was new...
“Hey dale I found another novelty nappy”
Siri in 2024:
E
E
Nappy as in diaper?!?! Is that a real Enron email?!?!
Dale Gribble?
My old house mate was doing an internship when ENRON went down. He was on all front pages of newspapers with a a box of his office supplies.
I don’t know what he does now, but when Enron collapsed his face was everywhere. Smart guy but just at the wrong company at the wrong time.
Just a note from a sociolinguist, for us at least vernacular speech is the gold standard and a holy grail and we don't pay undergrads to just talk. We go to speech communities we want to study and conduct sociolinguist interviews where we try to basically elicit stories about people's lives and experiences because they're much more likely to have a relaxed, normal speech that they aren't self monitoring.
That guy had two guesses for his question "What the fuck is Lion King?", which were "like disney on ice" or "some kind of porn thing". Idk how the hell he went from some sort of Disney thing to thinking it might be some kind of adult entertainment lol
To be fair, there was a time when Disney was the largest distributor of porn in the world. No joke.
@@benjaminlynch9958 huh, TIL
Loin King
@@mirzaahmed6589 that.... actually makes a lot of sense lol
"Re: I Love You" got a good laugh out of me.
I taught English in Tbilisi for a year. "Siri" means "penis" in Georgian! 🤣🤣🤣
And an ass in Japanese, yeah
siri gotta be one of the worst assistants ever made
When it first came out, it was one of the most advanced. It's just heavily outdated.
It's rumored OpenAI is in talks with apple to replace Siri with a chatGPT based system so it's plausible that in a few years Siri might finally be somewhat competent
When I was 8 years old it was the best assistant ever made
It's better than Clippy ever was.
@@lmpeters no
The videos are getting longer, they are no longer Half as Interesting, they are atleast 3 Quarters as Interesting
"If I eat dino nuggets for half my meals in a given week, I feel like a dinosaur..."
I've always been blown away by how much 'personal use' company email is used for. I've know very intelligent people use their work email for a variety of things that in hindsight are just a very bad idea. I've never got that.
Things were different in the 1990’s. A lot of people back then got their home Internet from AOL. Facebook, Twitter, and Google has yet to be created. Amazon was a niche online seller of books and CD’s, and most websites didn’t even use basic secure encryption methods that are standard today. From a behavior standpoint, lots of people used their work email as their personal email as well because their work email was literally the only email account they’d ever had. It was a different time back then…
So that’s why Siri keeps trying to tell me about mark to market invoicing, and how I should defraud Pacific energy companies whenever I ask for directions to Chipotle.
Seeing the emails used as examples in this video make me realize how realistic the terminal entries in The Outer Worlds are
5:29
Heyy thats the place I work at now!
SRI's rarely mentioned anywhere despite the technology they've invented, I'm just surprised to see Sam bringing it up.
"Garbage in garbage out" has been a programming phrase for decades. As always, love your videos, love your channel
Hi Sam!
I'm so proud of Amy for reading each and every one of those 517,431 emails for the juicy details. She deserves a raise!
"Boring nonsense"? WTF, that email about "Canadian's Testicles Torn-Off, Girlfriend Charged" boring? I think not.
So what you’re saying is, we need to look out for accounting and securities fraud from Siri
This is actually very interesting and funny. Good work :))
How do you not know what the Lion King is by 2002???
From what we can see of the e-mail being responded to, I'm thinking Enron (or whoever "Kevin" worked for) got their hands on a big stack of advance tickets to the _stage musical_ version, which was having a big cross-continental tour in 2002... but the stage version came out in _1997_ and is the _single highest-grossing production in Broadway history,_ so I suppose that doesn't really explain how anyone could hear Kevin yell "Hey guys! The company saved us some tickets for _The Lion King_ if you wanna buy some!" and seriously think "The company reserved us tickets to watch a _cartoon?!_ That can't be right! What do you mean?"
I always enjoy HAI videos, but this one was a cut above the rest. Very fun and interesting.
This video has just made me *really* want to download all of these emails to sort through them for that hot goss.
The e at the end of in Deutsche Bank is not silent. It never is in German. Straße, Hase, Name, etc.
still waiting for that logistics of hello fresh video, a real happy episode
Oh god i have not slept yet, stayed up long enough for a new HAI video
What a weirdly interesting video!
Would love to see this on Brain Blaze in the Whistlerverse
Sam mocking Dino nuggets and saying they'd make me feel bad is what made me unsubscribe. It's clearly the 5G waves not the 317 Dino Nuggets i ate since the beginning of May.
That's really interesting.
90's office work was wild.
3:14 "Canadian's Testicles Torn off, Girlfriend Charged" Wait wat?
Another classic video. This was about half as interesting as any other!!
Citizen Kane is a cartoon of a man torturing ants.
Never heared of those emails, well, now I have^^
I could never imagine sending anything half as personal as these emails through my work email 😂
Absolutely loved all the subtle Texan references in the emails due to their being in Houston. I do wish I could’ve heard more about their escapades on 6th St and how Sam thinks “Gruene” is pronounced.
The term "garbage in garbage out" isn't new and it isn't from AI, it just also applies to AI as it applies to any data processing system.
He never said it was new, nor did he say it originated from AI.
no longer bored
Fun fact: Blockbuster invented streaming before streaming was a thing. Unfortunately, they partnered with Enron for their internet services and, well, we all know what happened with Enron.
Let us all thank the youtube algorithm, that HAI videos are now 8 to 9mins instead of 4 to 5
Hey Siri, how should i explain how good the animations of "Sam" are getting really good
garbage in, garbage out was one of my CS professors favorite thing to say
5:40 just gonna look at this in silence for a few minutes
How does hai even come up with such good ideas?
Goddamn the people at Enron were freaky
7:14 Glad to see you are hanging out in Vegas now
I own the Enron ethics manual from 2001.
why would you have an empty book?
This is WILD on so many levels! What a cool thing to learn how computers learned. Also didn't know Siri was a separate entity before she was bought by Apple. Also I guess a lot of these ex Enron employees are finding out how others in the company hated them LOL
And this is exactly why you don't use your work email (or any work tech) for anything else other than work... 😅
2:05 This will go into the next "All mistakes made" video, lol, unless "Enronyees" was an intentional wordplay
So who do I have to pay to get that version of Enron Siri? 😂
6:27 "... but is not very reflective of what happens in Citizen Kane"
But maybe is for "Them!"?
and now Google uses the entirety of Reddit for their dataset which results in things like "one Reddit user suggested jumping off the Golden gate Bridge"
yoo homer simpson's email on thumbnail
I only watch on here and not nebula to see how good I am at guessing the sponsor.
Holy shoot, poor Susan. That e-mail was cringey af.
“I searched the web, and Sam, is indeed, a los- *YEET*
Love your videos
Oh, the privacy violations. Hate to be one of those email senders/receivers.
Personally I really want an assistant that talks like an unethical energy exec in the 90s
I saw a guy with an Enron Polo earlier today. This is an insane coincidence.
The real ones know why I can never look at a serious factor at again
Why didn't google just train smart compose on their own corporate emails?
Garbage in, garbage out
Hey everyone working in a company. You as an individual, don't break law rule one. Do not write down your crimes. Don't ask for advice on your crimes in an email.
Whenever you're writing an email, ask yourself if this is an email you'd like being read aloud in court.
In the same vein, if anyone ever asks or implies you should break some laws, make them write it in an email to you. A good old "okay sure I can do that, can you send me an email to remind me?" This way your company can be taken down without hurting you.
I kept hearing “Enron Porpoise” 🐬
Is thst why Siri used to send you to the pier to get rid of the body
I really don't understand why people haven't figured out that, if you're planning something illegal, immoral, or fattening, you don't do it over text or email. That seems obvious to me.
You could use strong encryption but, to defeat traffic analysis, you'd have to encrypt _everything,_ even the most banal and innocuous messages. If you only encrypt the good stuff, it will stick out like a sore thumb.
Now, Siri gets trained on the writings of underpaid Nigerian crowdworkers instead! A great improvement.
fun fact, the first fee lines will break carplay
Sad that a lot of people at Enron who weren’t breaking the law had their personal lives exposed to the world. Good vid.
now my phone tells me the weather {actor looks out window seeing the weather}, cheers!
The amount of times this video set off my Siri was record setting
Sam’s voice tricked my husband’s Siri!
4:44 Oh the AI sparkles haha!
Let’s go new video dropped
5:42 green shorts guy 😏
So Siri is GLaDOS now
Good morning!
Finally, Enron got something right.
Lab grown conversation. 😂
The intro 😂😂😂
0:25 I'd joke about the US government using torrents, but on second thought, I'm absolutely sure some department somewhere actually does and has a valid reason for it
watching this before i go to school
why is hai getting more production value this is scaring me
I guarantee that Siri and modern text AIs are absolutely trained on data scraped from public information online, or private posts from sites owned by the company that owns the AI, and only a small part of the training set is taken from existing text databases, free or otherwise. This was something I worked on about ten years ago for a targeted marketing company and it was shockingly easy to train a neural net on this stuff. As much as I want to hope that's changed--come on, of course not.
Hey Siri, what would be a boring first liner for a video?
I found this on the web
YAYYY HAI morning video
Genuinely curious: how is releasing private individual emails to the public legal? I thought it was a basic protection even for convicted criminals, let alone suspected ones.
This suddenly explains a lot about AI.
That is one year on the day before my birthday
Houston mentioned!
0:15 YEET