Build Reliable Systems with Chaos Engineering // Benjamin Wilms // MLOps Podcast

Поделиться
HTML-код
  • Опубликовано: 11 июл 2024
  • Join us at our first in-person conference on June 25 all about AI Quality: www.aiqualityconference.com/
    Build Reliable Systems with Chaos Engineering // MLOps podcast #237 with Benjamin Wilms, CEO & Co-Founder of Steadybit.
    Huge thank you to ‪@amazonwebservices‬ for sponsoring this episode. AWS - aws.amazon.com/
    // Abstract
    How to build reliable systems under unpredictable conditions with Chaos Engineering.
    // Bio
    Benjamin has over 20 years of experience as a developer and software architect. He fell in love with chaos engineering 7 years ago and shares his knowledge as a speaker and author. In October 2019, he founded the startup Steadybit with two friends, focusing on developers and teams embracing chaos engineering. He relaxes by mountain biking when he's not knee deep in complex and distributed code.
    // MLOps Jobs board
    mlops.pallet.xyz/jobs
    // MLOps Swag/Merch
    mlops-community.myshopify.com/
    // Related Links
    Website: steadybit.com/
    -------------- ✌️Connect With Us ✌️ ------------
    Join our slack community: go.mlops.community/slack
    Follow us on Twitter: @mlopscommunity
    Sign up for the next meetup: go.mlops.community/register
    Catch all episodes, blogs, newsletters, and more: mlops.community/
    Connect with Demetrios on LinkedIn: / dpbrinkm
    Connect with Benjamin on LinkedIn: / benjamin-wilms
    Timestamps:
    [00:00] Benjamin's preferred coffee
    [00:28] Takeaways
    [02:10] Please like, share, leave a review, and subscribe to our MLOps channels!
    [02:53] Chaos Engineering tldr
    [06:13] Complex Systems for smaller Startups
    [07:21] Chaos Engineering benefits
    [10:39] Data Chaos Engineering trend
    [15:29] Chaos Engineering vs ML Resilience
    [17:57 - 17:58] AWS Trainium and AWS Inferentia Ad
    [19:00] Chaos engineering tests system vulnerabilities and solutions
    [23:24] Data distribution issues across different time zones
    [27:07] Expertise is essential in fixing systems
    [31:01] Chaos engineering integrated into machine learning systems
    [32:25] Pre-CI/CD steps and automating experiments for deployments
    [36:53] Chaos engineering emphasizes tool over value
    [38:58] Strong integration into observability tools for repeatable experiments
    [45:30] Invaluable insights on chaos engineering
    [46:42] Wrap up
  • НаукаНаука

Комментарии •