Kubernetes Failure Stories and How to Crash Your Clusters - Henning Jacobs, Zalando SE

Поделиться
HTML-код
  • Опубликовано: 21 май 2019
  • Join us for Kubernetes Forums Seoul, Sydney, Bengaluru and Delhi - learn more at kubecon.io
    Don't miss KubeCon + CloudNativeCon 2020 events in Amsterdam March 30 - April 2, Shanghai July 28-30 and Boston November 17-20! Learn more at kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects
    Kubernetes Failure Stories and How to Crash Your Clusters - Henning Jacobs, Zalando SE
    Bootstrapping a Kubernetes cluster is easy, rolling it out to nearly 200 engineering teams and operating it at scale is a challenge. In this talk, we are presenting our approach to Kubernetes provisioning on AWS, operations and developer experience for our growing Zalando developer base. We will walk you through our horror stories of operating 100+ clusters and share the insights we gained from incidents, failures, user reports and general observations. Our failure stories will be sourced from recent and past incidents, so the talk will be up-to-date with our latest experiences. Most of our learnings apply to other Kubernetes infrastructures (EKS, GKE, ..) as well. This talk strives to reduce the audience's unknown unknowns about running Kubernetes in production.
    sched.co/MPcM
  • НаукаНаука

Комментарии • 3

  • @towolf
    @towolf 5 лет назад

    Does he keep saying "badly" when he means "basically"?

    • @towolf
      @towolf 5 лет назад

      @@henningjacobs6205 23:45 "this one small bugfix which made ??? configmaps and secrets read-only." What's the word? I'm really puzzled.

    • @towolf
      @towolf 5 лет назад

      @@henningjacobs6205 yeah, old habits die hard :)