How Roblox Went Down For 73 Hours

  • Published: Aug 7, 2024
  • A look into what happened behind the scenes during the longest outage in Roblox history.
    Sources:
    blog.roblox.com/2022/01/roblo...
    www.hashicorp.com/resources/h...
    roblox.fandom.com/wiki/2021_R...
    roblox.fandom.com/wiki/Timeli...
    news.ycombinator.com/item?id=...
    raft.github.io/
    www.lmdb.tech/media/20130329-d...
    www.lmdb.tech/doc/
    db.cs.cmu.edu/mmap-cidr2022/
    • Free To Use Gameplay |...
    Chapters:
    0:00 Intro
    0:33 HashiStack Explanation
    4:47 Outage Investigation
    8:20 Root Causes Found
    11:30 Return to Service
    12:19 Slow Leaders
    15:56 Resolution
    Corrections:
    - At 9:44, the default unbuffered channel in Go does not hold any items and has a buffer size of 0. Sends to such a channel are blocked until another goroutine is ready to receive the value. The illustration in the video shows a *buffered channel of size 1* however the overall point still stands.
    Music Credits:
    - Firecracker by LEMMiNO ( • LEMMiNO - Firecracker ... )
    - Impact Prelude by Kevin MacLeod
    - We're Finally Landing by Home
  • Science

Comments • 544

  • @sirgamsay3596
    @sirgamsay3596 29 days ago +3396

    Imagine just doing a hobby project to understand a piece of software, and suddenly the complete Roblox infrastructure is built on it.

    • @AjaxGb
      @AjaxGb 29 days ago +928

      Open source developers: "Hey guys check out this thing I built in my spare time! It's not perfect but I'm making it freely available so other people can learn from--"
      Large corporations: "FREE?? 👀👀🥵🥵👀👀"

    • @JusticeNDOU
      @JusticeNDOU 29 days ago +62

      i thought this was a joke,

    • @xelspeth
      @xelspeth 29 days ago +121

      xkcd 2347

    • @shantilkhadatkar1195
      @shantilkhadatkar1195 29 days ago +4

      @@xelspeth what is xkcd

    • @michael_betts
      @michael_betts 29 days ago

      @@shantilkhadatkar1195 A webcomic. Type that text into Google and you'll get the comic.

  • @c1ph9r
    @c1ph9r 29 days ago +1383

    the negative 900 million dollars hits hard 😭

    • @_GhostMiner
      @_GhostMiner 29 days ago +12

      Why? Trash game has trash income

    • @Luna5829
      @Luna5829 29 days ago +126

      @@_GhostMiner its not a game tho
      its a game engine and hoster

    • @_GhostMiner
      @_GhostMiner 29 days ago

      @@Luna5829 **H O S T*

    • @_GhostMiner
      @_GhostMiner 29 days ago +7

      @@Luna5829 ackhually

    • @ducksongfans
      @ducksongfans 29 days ago +100

      @@_GhostMiner plenty of trash games hosted on roblox, plenty of great ones

  • @agnor9978
    @agnor9978 29 days ago +1593

    Every time I hear that someone's hobby project caused a major outage somewhere, I get the feeling that big corporations should check what software they are built on and support its development and maintenance.

    • @fisch37
      @fisch37 29 days ago +88

      XKCD 2347

    • @humza890
      @humza890 29 days ago +45

      Except that software relies on another software, which then relies on another software, which then relies on another software....
      It can turn into an endless loop

    • @agnor9978
      @agnor9978 29 days ago +50

      @@humza890 It can't. Circular dependencies are usually rare, and you can stop following a dependency once you've already seen it. I also didn't mean that every company has to look through all of its dependencies and maintain them all, but picking a few, or auditing some of them every now and then, would benefit not only them but the world as a whole.

    • @GoogleDoesEvil
      @GoogleDoesEvil 29 days ago +11

      The Unix philosophy of "do one thing" and link against a ton of dependencies was a mistake.

    • @bookle5829
      @bookle5829 29 days ago

      It's what FUTO stands for.

  • @k4l1hm4n
    @k4l1hm4n 29 days ago +896

    Turns out, this video could be a great introduction to modern backend architecture and development.

    • @SrKinko
      @SrKinko 29 days ago +103

      I think all of his videos are a good resource for understanding different architectures and subsequently how fragile they can be lol

    • @wangjiefan8939
      @wangjiefan8939 29 days ago +14

      I worked at a global e-commerce company a year ago and their platform infrastructure is pretty similar, down to their use of etcd and go channel spaghetti 💀

    • @百合仙子
      @百合仙子 28 days ago

      and a great counter-example for troubleshooting....

    • @bounceysteve
      @bounceysteve 19 days ago

      the leaks are too

  • @erie7452
    @erie7452 19 days ago +52

    Crowdstrike video when?

  • @ratm0
    @ratm0 29 days ago +1581

    "A massive company with ... -$924 million net income" 💀

    • @klafbang
      @klafbang 29 days ago +202

      "Each minute of downtime costs us negative $1750, this must be fixed ASAP!"

    • @kosmonautofficial296
      @kosmonautofficial296 29 days ago +7

      @@klafbang lolol

    • @Mempler
      @Mempler 29 days ago +3

      That is absurd lmao

    • @zaper2904
      @zaper2904 29 days ago +75

      @@klafbang So does that mean they were earning money when they were down? 🤔

    • @LibertyMonk
      @LibertyMonk 29 days ago

      @@zaper2904 No, because they still had expenses (developers trying to fix the servers) but reduced income (no microtransactions available).

  • @yeetyeet7070
    @yeetyeet7070 29 days ago +334

    github repo: "it was a toy project never meant for production"
    multibillion dollar company: "YAYEET"

    • @Mihacappy
      @Mihacappy 23 days ago +3

      "multibillion dollar company"
      _-$924 million net income_

    • @bruhmoment5974
      @bruhmoment5974 19 days ago

      L pfp

  • @mrdabup
    @mrdabup 29 days ago +270

    I still remember the day that it went down, people were blaming Chipotle (american fast casual chain) because they had an event that same day where you could claim a free burrito. People suspected that it was due to a mass influx of people, I knew (and a bunch of people too) that this wasn't an issue with influx of people. At the end of the day, it was a fun journey (more or less with the conspiracies, guessing correctly that it went down for 3 days months before this outage, and youtubers just milking on the outage). Thank you for making a video about this.

    • @frezzingaces
      @frezzingaces 29 days ago +9

      Wait. How in tf could Chipotle's traffic affect Roblox's servers? What's the theoretical connection?

    • @baribari1000
      @baribari1000 29 days ago +34

      @@frezzingaces it was a sort of partnership between chipotle and roblox, so if you installed roblox and did a bunch of stuff you'd get a free burrito. I think that's what it was, roblox has done tons of these

    • @fitmotheyap
      @fitmotheyap 29 days ago +6

      Oh this happened during that time? Man the memes about the roblox crashes during its downtime were so enjoyable

    • @bruhlake
      @bruhlake 24 days ago +3

      @@baribari1000 Yeah, it was super easy too, you could do it in like 2 minutes on a new account, it gave you a free entrée instead of a free burrito, so you could actually choose most meals you wanted. The few times they did the event with chipotle, I probably earned like 35 or so free entrees, which is pretty decent!

    • @argynews2825
      @argynews2825 24 days ago +1

      wasnt there also a massive adopt me update at the time which also probably caused a large increase of active accounts

  • @ccccy-o7x
    @ccccy-o7x 29 days ago +210

    Hi Kevin, amazing content as always! One minor correction @9:54 tho: Go unbuffered channel's length is 0, instead of 1, and it means the sender will get blocked until a receiver receives the value. What the video @9:54 showcase is actually a buffered channel with length 1 (e.g. result of make(chan string, 1)).
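
The distinction in this correction can be checked with a few lines of Go. What follows is only a minimal sketch, not code from the video or from Consul; `trySend` is a hypothetical helper that uses `select` with a `default` case so that a send which would block is reported instead of hanging the program.

```go
package main

import "fmt"

// trySend is a hypothetical helper: it attempts a send without blocking
// and reports whether the channel accepted the value right away.
func trySend(ch chan string, v string) bool {
	select {
	case ch <- v:
		return true
	default:
		// The send would have blocked: no receiver is waiting and
		// there is no free buffer slot.
		return false
	}
}

func main() {
	unbuffered := make(chan string)  // capacity 0
	buffered := make(chan string, 1) // capacity 1

	fmt.Println(cap(unbuffered), trySend(unbuffered, "a")) // 0 false
	fmt.Println(cap(buffered), trySend(buffered, "a"))     // 1 true
	fmt.Println(trySend(buffered, "b"))                    // false: the single slot is taken
}
```

With a plain `ch <- v` in place of `trySend`, the unbuffered send would simply wait until another goroutine receives, which is the blocking behaviour the correction describes.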

  • @theprantadutta
    @theprantadutta 29 days ago +315

    This is one of the biggest challenges of modern programming, depending on various 3rd party packages, not knowing what that package is, what it does, or whether it's even reliable, and moreover knowing what are the dependency of that 3rd party package and whether they are safe or not.

    • @Paulo27
      @Paulo27 29 days ago +20

      Also never update anything

    • @juniorwmg
      @juniorwmg 29 days ago

      @@Paulo27 *If it's not a security fix

    • @tbuk8350
      @tbuk8350 29 days ago

      Or HashiCorp, being a multi-billion dollar company, could just maintain the fucking project themselves instead of blindly using a 4-year-old abandoned pet project from some random person's GitHub page and trusting it to work in a large production environment.

    • @ironcanon4920
      @ironcanon4920 28 days ago +5

      And that's before the issues of relying on additional 3rd party companies to supply the correct 3rd party packages. Supply chain issues the whole way down.

  • @useruser-ti1og
    @useruser-ti1og 29 days ago +167

    This is like the XKCD of all of the world depending on a toy project someone abandoned 10 years ago

    • @imgladnotu9527
      @imgladnotu9527 29 days ago +15

      probably 2347... as someone mentioned in some comment above.....

  • @mahnibba2674
    @mahnibba2674 19 days ago +27

    Came here to look for crowdstrike, seems like im way too late🤣

  • @TheeSirRandom
    @TheeSirRandom 27 days ago +34

    Imagine how it must feel, starting a free project just as a hobby, and planning to abandon it eventually, then pretty much half the internet starts using it as an important building block to support the web. Now you're just sitting there, and have a choice to make. Stop maintaining the software, and pretty much break half the internet or keep going, getting zero thanks, and zero dollars for your work.

  • @shalodey
    @shalodey 20 days ago +20

    IT global outage vid gonna go crazy

  • @sergelorenzvillasica2361
    @sergelorenzvillasica2361 20 days ago +10

    Can't wait for the CrowdStrike episode 😀

  • @levimatheri7682
    @levimatheri7682 19 days ago +14

    Waiting for the Crowdstrike outage video!

  • @arnavn2554
    @arnavn2554 19 days ago +10

    You gotta make a video about the CrowdStrike outage

  • @michaellin7936
    @michaellin7936 19 days ago +22

    Crowdstrike video incoming in 2 years

  • @andreyabrz
    @andreyabrz 20 days ago +24

    Well.. now we know the next video

  • @pdlbackup
    @pdlbackup 29 days ago +30

    Roblox players figuring out about the DNS steering and sharing ips for early access is kinda crazy 💀

  • @Sam_Hue
    @Sam_Hue 20 days ago +11

    The Crowdstrike video is going to hit pretty hard

  • @mortred4144
    @mortred4144 20 days ago +12

    yo when is the CrowdStrike video coming

  • @KieranMahoney
    @KieranMahoney 19 days ago +8

    WHENS THE CLOUDSTRIKE EPISODE COMING OUT??? 😂😂😂

  • @BananasAintCheap
    @BananasAintCheap 29 days ago +18

    It’s crazy how much of the internet as a whole is in the hands of solo developers who made a thing in their spare time for fun

  • @ChineseKiwi
    @ChineseKiwi 20 days ago +8

    Kevin, get busy and make the Crowdstrike video 😂😢

  • @bummbumm6
    @bummbumm6 29 days ago +163

    This happened in the middle of my friends sleepover, when we were COMPLETELY into Roblox. He pretty much just came to play it. We checked like every 5 minutes if it got better.
    We eventually just slept. THROUGH THE WHOLE THING
    Edit: Are some of you really watching videos on Roblox and just hate people in the comment section who used to like the game? Find something better to do jeez

    • @ProblematicParag0n
      @ProblematicParag0n 29 days ago +17

      Seems like you guys need to find better games

    • @N30ZUK1
      @N30ZUK1 29 days ago +2

      @@ProblematicParag0n Isn't your avatar from a ripoff of Minetest?

    • @Hellscaped
      @Hellscaped 29 days ago

      @@N30ZUK1 minetest is a clone of minecraft...

    • @tbuk8350
      @tbuk8350 29 days ago +41

      @@N30ZUK1 Calling Minecraft a ripoff of Minetest is the most sweaty nerd Redditor thing you could do

    • @dexahtheman
      @dexahtheman 29 days ago

      @@tbuk8350 Tbh nothing is correct here. Minetest is not trying to be Minecraft; it's trying to be a general-purpose voxel game engine (check out its other game modes, there's some pretty unique cool stuff in there)

  • @darthmaul5413
    @darthmaul5413 19 days ago +6

    can you do a video about the current CrowdStrike Outage?

  • @TheRealStevenPolley
    @TheRealStevenPolley 19 days ago +5

    Kevin Fang, big fan here. Please cover the clownstrike incident

  • @nebufabu
    @nebufabu 29 days ago +175

    Whatever it took to make a video about a Roblox server crash and not use the "oof" SFX even once... I salute it.

    • @MartijnvanBerkel
      @MartijnvanBerkel 29 days ago +73

      It's on 6:45

    • @yeetyeet7070
      @yeetyeet7070 29 days ago +5

      @@MartijnvanBerkel gottem

    • @aze4308
      @aze4308 29 days ago +2

      6:45

    • @nebufabu
      @nebufabu 29 days ago +15

      @@MartijnvanBerkel I stand corrected. Frankly, using it only once is even more impressive.

    • @vincentschumann937
      @vincentschumann937 27 days ago

      i read this 2 seconds before the oof sound played, well done sir

  • @MaximumADHD
    @MaximumADHD 29 days ago +59

    Oh shit I was gonna suggest this as an idea, awesome to see that you did it!

    • @0x7f2c
      @0x7f2c 29 days ago +4

      Lol nice you're here

    • @glefyr
      @glefyr 29 days ago +4

      is that

    • @use2l
      @use2l 27 days ago +1

      @@glefyr hello call of duty black ops guy

  • @ski3r3n
    @ski3r3n 29 days ago +82

    the kids enter angry
    the kids leave confused

  • @superbobsaget9000
    @superbobsaget9000 29 days ago +33

    Thank you for all the work you put into making this!!

  • @yaakovwaxman4807
    @yaakovwaxman4807 29 days ago +19

    This is by far my favorite documentary channel on yt

  • @iqmal
    @iqmal 20 days ago +4

    Great. Hopefully you'll make a video about Windows bsod due to CrowdStrike

  • @HarishDoredla
    @HarishDoredla 20 days ago +5

    Next video on Crowd Strike update causing global outage!!

    • @ChineseKiwi
      @ChineseKiwi 20 days ago

      It was Crowdstrike, not Microsoft

  • @_tylerkinney
    @_tylerkinney 29 days ago +3

    Thank you for this, been waiting for this one for awhile now!

  • @tekratek4077
    @tekratek4077 29 days ago +14

    Nice technical aspect of the outage!

  • @mwalton9526
    @mwalton9526 19 days ago +11

    How fast can you pop out a video? I think there might be something video worthy.

  • @pompomaddons
    @pompomaddons 29 days ago +152

    KEVIN FANG JUST DROPPED A VIDEO ABOUT THE HALLOWS OUTAGE OH MY GOD

    • @jakfjfrgnei
      @jakfjfrgnei 27 days ago +1

      The burrito incident

    • @pompomaddons
      @pompomaddons 27 days ago

      @@jakfjfrgnei the slippery cord incident

  • @Aunarky
    @Aunarky 29 days ago +1

    I'm glad you made a video on that. I had no idea how it went down behind the scenes! :D

    • @Komas19Gaming
      @Komas19Gaming 23 days ago

      There was a blog post made after the outage

  • @davidslevs
    @davidslevs 29 days ago +6

    Roblox is actually a bigger company than most think. Thanks for doing a video on it.

  • @i-am-linja
    @i-am-linja 29 days ago +12

    I'd imagine programmer Hell is just a bug like this which takes all of Eternity to fix, also it takes down the company's internal issue tracker and communication system.

  • @rusprice
    @rusprice 27 days ago

    Thanks! I submitted this in as a suggestion a while ago, never thought it’d be published.

  • @patahgaming
    @patahgaming 29 days ago +42

    Saddest day ever for 7-year-olds. I hope they can recover from this 😢

    • @dagdnoob
      @dagdnoob 29 days ago

      😂😂😂😂😂😂😂😂😂😂😂

    • @gn2b445
      @gn2b445 22 days ago

      developers probably missed out on millions of dollars too!

  • @Hopgop1
    @Hopgop1 29 days ago

    Man I love your videos, this was a particularly technical one, but still really well presented and interesting.

  • @zenobikraweznick
    @zenobikraweznick 29 days ago +1

    Amazing CGI as always, thanks !!!

  • @poketopa1234
    @poketopa1234 26 days ago

    Great great video, I seriously love the format and I learn so much

  • @wormonastring6888
    @wormonastring6888 29 days ago +2

    Another super interesting well researched + explained video! As a back end game dev, thanks for the nightmares!

  • @xFrednet
    @xFrednet 29 days ago

    Awesome summary, as always. Thank you! :D

  • @MohamedAruham
    @MohamedAruham 29 days ago +8

    Damn I was waiting for this one

  • @pitust
    @pitust 29 days ago +8

    9:45 "A default channel can only hold one piece of data at a time" It's actually even worse than this: an unbuffered channel also requires that this piece of data is received before a send can complete (!)

  • @hdgrove5567
    @hdgrove5567 29 days ago +2

    Love these videos please keep them coming!

  • @passenger175
    @passenger175 29 days ago

    Good work, these are both interesting from the tech perspective and just plain fun hah

  • @asmith7966
    @asmith7966 27 days ago +2

    Haven't finished the video yet, but this already makes me feel better about the half-day internet outage I fixed at work

  • @flokibyarian6832
    @flokibyarian6832 27 days ago +1

    Thank you for the great information and entertainment video like always😊

  • @_xord
    @_xord 29 days ago +12

    new kevin fang video
    today is a good day

  • @Cmanorange
    @Cmanorange 29 days ago +1

    daily appreciation of kevin's visual style, i love how you're able to break down the language i might take for granted and make it easily followable

  • @ElioAllen-sb6by
    @ElioAllen-sb6by 29 days ago +1

    I like your stuff keep it up make more security related stuff!

  • @Viniter
    @Viniter 29 days ago +23

    I love this series. It's like true crime or airplane disaster videos, but it can be fun, because nobody really gets hurt. Except for big corporations and Roblox players, and well... screw them.

    • @hagangray8006
      @hagangray8006 29 days ago +10

      That’s a bit harsh on Roblox players… I mean most of them are like 9 years old

    • @absoultethings4213
      @absoultethings4213 28 days ago

      @@hagangray8006 if they aren’t 9 there’s a 50% chance they’re a predator or another kind of scum

    • @apersoniguess_
      @apersoniguess_ 27 days ago +7

      @@absoultethings4213 or… just normal people. Big shocker I know

    • @adityaramadhan1708
      @adityaramadhan1708 26 days ago

      @@apersoniguess_ impossible😱😱😱😱😱

    • @enthuscimandiri1640
      @enthuscimandiri1640 26 days ago

      unti money some rando get involved, yeahhh its really fun

  • @hasanpatel9029
    @hasanpatel9029 29 days ago

    The oof sound was a chef's kiss on this masterpiece of a video. Great work as always.

  • @Dudex11a
    @Dudex11a 29 days ago +2

    This video is very well executed!

  • @TheGrimravager
    @TheGrimravager 29 days ago +12

    > And probably some machine learning and block chain for good measure
    lmao nice

  • @warw
    @warw 29 days ago +3

    Great video!

  • @sg9257
    @sg9257 1 day ago

    Your videos are awesome Kevin!

  • @Evercreeper
    @Evercreeper 29 days ago

    YAY GLAD YOU DROPPED THIS

  • @JustDeeevin
    @JustDeeevin 26 days ago

    Tons of love for captioning your videos❤❤

  • @mat-hu5ys
    @mat-hu5ys 29 days ago +2

    love your vids! please make more

  • @matze489
    @matze489 29 days ago +4

    It's a good day when there is a new Kevin Fang video

  • @3rdalbum
    @3rdalbum 23 days ago

    Another great video, I really enjoyed it.
    There's probably heaps of outages you can do next, but perhaps you could do a video on the "OpenOffice can't print on Tuesdays" bug?

  • @ImTotallyTechy
    @ImTotallyTechy 29 days ago +2

    In life... you have roblox
    (another BANGER kevin fang video, cant wait for the next)

  • @ishan6771
    @ishan6771 27 days ago

    Well done as always

  • @thegammingbros6231
    @thegammingbros6231 19 days ago

    love this video makes everything understandable!

  • @heyjakeay
    @heyjakeay 29 days ago +27

    yo honey wake up, new Kevin Fang video to watch while at work

  • @gareth2021
    @gareth2021 29 days ago

    great video, thanks dude

  • @Mihacappy
    @Mihacappy 23 days ago +1

    Ah yes, that day in 2021 when I was working in Studio and the toolbox stopped working, and my ass almost had a heart attack because I thought I got banned.

  • @redslashed
    @redslashed 21 hours ago

    very good explanation tysm

  • @No-day-off
    @No-day-off 17 days ago +2

    Let’s go bro. CrowdStrike is giving free material to your next video.

  • @frosty4769
    @frosty4769 28 days ago

    the goat's back with another banger

  • @fang-penlin4482
    @fang-penlin4482 29 days ago +3

    Oh man, I love your channel so much. I can't wait to see the XZ backdoor video made by you, it's gonna be fun 😂

  • @ibis8566
    @ibis8566 29 days ago +1

    These videos make me feel like I'm watching some type of CSI crime documentary

  • @Spiffycaius
    @Spiffycaius 29 days ago +2

    Man I remember when this happened this was crazy.

  • @vash47
    @vash47 28 days ago +1

    your videos are quality over quantity

  • @NoobieNoodle89
    @NoobieNoodle89 29 days ago +1

    I love the way that you explain these complex incidents. You deserve a 冰淇淋🍦

  • @H-E-S-C
    @H-E-S-C 25 days ago

    finally, a good video on the infamous outage

  • @greatcanadianmoose3965
    @greatcanadianmoose3965 29 days ago +1

    Always love kevin fang videos... but would you mind using I Home's we're finally landing closer to the end of the video please? Thx

  • @thecatmanager
    @thecatmanager 1 day ago

    3:33 Roblox has a whole dedicated Fast Flag system, including Dynamic Fast Flags, which can be updated while servers and clients are live and will apply within around 10 minutes. The infrastructure for this is quite interesting, and you may want to touch on it

  • @GardenOfUna
    @GardenOfUna 29 days ago +1

    I don't understand a single thing but I'm so incredibly curious that I want to know more
    I genuinely really love this for some reason

  • @randomazzy11
    @randomazzy11 28 days ago +3

    5:04 I also heard avatars broke before the whole game went out, and some players were able to play roblox but most of the scripts were missing so it was pretty unplayable. Is it because the game couldnt fetch those? Wow

  • @Core533
    @Core533 12 days ago +4

    Waiting for the crowdstrike video

  • @CiY3
    @CiY3 29 days ago

    Finally, a Kevin Fang video about an outage I was witness to.

  • @caduhidalgo4996
    @caduhidalgo4996 29 days ago +4

    Baby, wake up!
    New Kevin Fang lore just dropped 🎉

  • @siz1700
    @siz1700 29 days ago +1

    Nice! I wish Roblox never recover from that!

  • @Jmcgee1125
    @Jmcgee1125 29 days ago

    I paused the video when that perf screenshot came up. 5 seconds later I'm like "why the hell did nobody check this before?" We love lock contention.

  • @mementomori8856
    @mementomori8856 29 days ago

    GO mentioned!
    So you're telling me that I should continue to be paranoid about how every single line of code of my personal projects is not efficient or secure enough? Deal! Love this thank you!

  • @chawrx3
    @chawrx3 29 days ago

    STARTING THE DAY OFF NICE !

  • @broisvoldemort
    @broisvoldemort 29 days ago +1

    new kevin fang video finally

  • @5TC
    @5TC 29 days ago

    Wasn't expecting him to talk about this but man I remember when this happened

  • @soulsmwc
    @soulsmwc 29 days ago +1

    Great!

  • @06NinjaKid06
    @06NinjaKid06 29 days ago +1

    best roblox video

  • @leosh9026
    @leosh9026 23 days ago

    Thanks for the explanation dude