SREcon24 Europe/Middle East/Africa - The Frontiers of Reliability Engineering
HTML-код
- Опубликовано: 16 дек 2024
- The Frontiers of Reliability Engineering
Heinrich Hartmann, Zalando SE
We take the 10s anniversary of SRECon as an occasion to reflect over the past decade of advancements in Reliability Engineering and provide an overview about the Frontiers we are facing today. Within Zalando we followed major trends of the industry in outsourcing hardware provisioning to AWS, package applications into Docker images, fully automated deployments (CI/CD), and implemented Distributed Tracing for Microservice Observability. Despite these advances, many challenges remain in building reliable, observable software systems and new areas arose which require new methods and tools. In the talk we are proving a number of conceptual view that help to map out the larger Reliability Engineering landscape and zone-in on 3 specific frontiers that we are actively investing in at Zalando: (1) Data Operations and Monitoring Event Based Systems (2) Mobile Observability (3) Effective Management Practices for Reliability.
View the full SREcon24 Europe/Middle East/Africa program at www.usenix.org...