Keynote: How Spotify Accidentally Deleted All its Kube Clusters with No User Impact - David Xia
- Published: 27 Jul 2024
- Join us for Kubernetes Forums Seoul, Sydney, Bengaluru and Delhi - learn more at kubecon.io
Don't miss KubeCon + CloudNativeCon 2020 events in Amsterdam March 30 - April 2, Shanghai July 28-30 and Boston November 17-20! Learn more at kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects
Keynote: How Spotify Accidentally Deleted All its Kube Clusters with No User Impact - David Xia, Infrastructure Engineer, Spotify
During Spotify's Kubernetes migration, David's team deleted most of their production Kubernetes clusters. Accidentally. Twice. With little to no user impact. David shares how they recovered and learned to operate many clusters automatically and safely.
In 2017, Spotify planned the migration of hundreds of teams, thousands of services, and tens of thousands of hosts to Google Kubernetes Engine (GKE). In the last half of 2018, Spotify migrated 50 teams and hundreds of services, including critical ones, onto multiple production clusters.
David describes what led to the cluster deletions and how they barely affected users. Since the postmortem, Spotify has minimized downtime and human error by declaratively defining clusters in code with Terraform, backing up and restoring clusters with Ark, and increasing scalability and availability by running many more clusters.
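The description mentions defining clusters declaratively in Terraform and guarding against accidental deletion. A minimal sketch of what that can look like, assuming a GKE cluster managed by the Google Terraform provider (the resource type and `lifecycle` meta-argument are real Terraform constructs; the names and locations here are placeholders, not Spotify's actual configuration):

```hcl
# Hypothetical example: a production GKE cluster defined declaratively.
resource "google_container_cluster" "prod" {
  name     = "prod-cluster"   # placeholder name
  location = "europe-west1"   # placeholder region

  # Guard against accidental deletion: with prevent_destroy set,
  # Terraform errors out on any plan that would destroy this resource
  # instead of silently deleting the cluster.
  lifecycle {
    prevent_destroy = true
  }
}
```

With the cluster in code, a diff in review shows exactly what would change, and state can be recreated from configuration rather than reconstructed by hand after an incident.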
sched.co/MQbb
I believe Spotify is a company that really appreciates its engineering team. A culture of learning is a new thing.
This is like one of those water cooler conversations you get to have with that seasoned architect at work who has made enough interesting mistakes! Also, I feel isolating your bundles from your infra can actually help avoid these errors.
This is super entertaining. Sorry you guys had to deal with a mangled tfstate file in production. It's a terrible rite of passage.
Loved it. Honest, great learning.
2 Teams - Kubernetes [Cluster Operators and Cluster Users]
Wonderful test case even for a beginner like me. Good talk!
If the internal slack channel was "eerily quiet" it's probably because teams don't have enough alerting set up to notify them that their applications/services aren't running in production.
Go through your envs and protect those clusters from deletion now that you can!
let's do it before bad things happen 😂
Ouch, this was really painful. I guess you were running -auto-approve or had no manual review of the tf plan before applying.
Yeah, guess so... quite an ignorant and not recommended approach for prod.
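The safer workflow these comments allude to can be sketched as follows (these are real Terraform CLI commands and flags; no Spotify-specific tooling is implied):

```shell
# Risky: applies whatever the plan computes, with no human review.
terraform apply -auto-approve

# Safer: write the plan to a file, inspect it, then apply exactly that plan.
terraform plan -out=tfplan
terraform show tfplan      # review the diff; watch for "destroy" actions
terraform apply tfplan     # applies only the reviewed plan, nothing newer
```

Applying a saved plan file also protects against drift: if the state changed between plan and apply, Terraform refuses to proceed rather than destroying something unexpected.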
Yes, I've accidentally deleted a k8s cluster many times...