Server Meltdown: One School System's Recovery Story

Поделиться
HTML-код
  • Опубликовано: 26 дек 2024

Комментарии • 28

  • @qualin1974
    @qualin1974 3 года назад +21

    For those that get this recommended in the RUclips Algorithm, the way that this disaster could have been avoided:
    1. Redundant A/C Units on different circuits are a must.
    2. Environmental monitoring is an absolute must in any equipment room. Most datacentre grade UPS units offer environmental cards.
    3. Implementation of monitoring software that can query the environmental cards and send out pages when things are out of whack. (Such as PRTG)
    4. Implementing software that can automatically shut down hosts in the event of a power failure or environmental issue.
    By doing these things, not only could the IT team have saved money on replacing servers, but an outage could have been avoided. (Also, Novell in 2011? Yikes.)

    • @riccardoz2953
      @riccardoz2953 3 года назад +1

      have u seen the windows Xp logo in one of the classroom?

    • @marvinlueken25
      @marvinlueken25 3 года назад

      My Qestion is, why the Server didn´t shut down themself? They have onboard heat sensors. When they detect too high temperature they should gracefully shut down the System. Or ist the Hardware to old for this? (Isn´t it implemented in every Server?) Or did they turn off the heat control? The Server could have died on other things too then. Like the fan of the CPU could have failed and killed itself without someone noticing.
      Please correct me, if i am wrong in some situations. But as i know is, that Data and Hardware have more priority than aviability. (At least on modern Systems)

    • @joshface04
      @joshface04 3 дня назад

      @@marvinlueken25 Depends on the manufacturer. Working in I.T. myself I've seen some Dell servers that just throw up a high temp error on the front LCD but don't shut the system down. As weird as it sounds, I think this is to protect against data loss as if the server shut itself down without confirmation it could cause service issues. If you're implementing servers in any type of infrastructure where outages cost time and money (from schools up to big businesses) then they should be monitoring the environment the equipment is installed in.

  • @rdwatson
    @rdwatson 3 года назад +20

    I liked seeing the Sun logo without Oracle at 3:12

  • @infl
    @infl 7 лет назад +83

    Or just have 2 AIR CONDITIONERS

    • @highkicker11
      @highkicker11 6 лет назад +3

      and then you have a cascading failure of both and you are still up the creek without a paddle. meaning that no matter how much redundancy you build in there is always room for Murphy.

    • @IsaacBG84
      @IsaacBG84 6 лет назад +6

      and energy fault can take both them down. So the bes thing in my opinion is to have room sensors hardware to let you know of this problems and then you can shutdown everything until you fix the AC. But yes two AC can give you better chances

    • @macsrule94
      @macsrule94 6 лет назад +1

      Most places have the data room off the main air handler with a thermostat that uses the chilled water the rest of the building uses, and then have a mini split that is dedicated to it should for instance the chiller fail.

  • @l0calnet
    @l0calnet 6 лет назад +9

    Novell Netware at 0:53 :-)

  • @astrixistheman
    @astrixistheman 3 года назад +2

    At first i thought this was a joke video because of how serious they were but turned out they went joking.

  • @juoig7799
    @juoig7799 3 года назад +1

    There should be some sort of AES (Automatic Emergency Shutdown) that would automatically make the servers save all the data to their hard drives and shut down if the temperature got too high,

    • @brushy
      @brushy 3 года назад

      Yeah it should but if you shutdown the server it might have some issues

  • @jamesdean8864
    @jamesdean8864 3 года назад

    Air Crash Investigation meets a school IT team

  • @watcher206
    @watcher206 7 лет назад +15

    And that is the reason why you don't have everything hooked up to computers. They are going to fail sooner or later, computers are complex and sensitive machines after all. For the best reliability, design a fault tolerant network and keep backups somewhere else. Have a Backup plan in case of total computer network failure.

    • @jamescollins6085
      @jamescollins6085 6 лет назад +14

      watcher206 It's not the computer's fault that the air conditioning failed.

    • @McRambro
      @McRambro 5 лет назад +2

      This is why you have off site data centres.

    • @rippspeck
      @rippspeck 5 лет назад +1

      +McRambro Especially if you're located in tornado country.

  • @williambusseyftwaafdftl2511
    @williambusseyftwaafdftl2511 9 месяцев назад

    Bruh just turn it on at the Plotagon High School and install Windows Server 2012 R2

  • @StateCollegeCONELRAD
    @StateCollegeCONELRAD 4 года назад +7

    Bruh just turn it on

    • @AIC69420
      @AIC69420 3 года назад +2

      the motherboard died so u couldnt

    • @randalfik7822
      @randalfik7822 3 года назад +3

      @@AIC69420 just fix it lol

    • @AIC69420
      @AIC69420 3 года назад +2

      @@randalfik7822 The servers were old anyway as mentioned so he recommended to change them

    • @AIC69420
      @AIC69420 3 года назад

      @@godlyghost3111 its actually not a woosh because its just an explanation and if you read it carefully its irony and satire