Chaos Engineering: System Resiliency in Practice

1st Edition
308
English
1492043869
9781492043867
27 Apr

As more companies move toward microservices and other distributed technologies, the complexity of these systems increases. You can't remove the complexity, but through Chaos Engineering you can discover vulnerabilities and prevent outages before they impact your customers. This practical guide shows engineers how to navigate complex systems while optimizing to meet business goals.

Two of the field's prominent figures, Casey Rosenthal and Nora Jones, pioneered the discipline while working together at Netflix. In this book, they expound on the what, how, and why of Chaos Engineering while facilitating a conversation from practitioners across industries. Many chapters are written by contributing authors to widen the perspective across verticals within (and beyond) the software industry.

  • Learn how Chaos Engineering enables your organization to navigate complexity
  • Explore a methodology to avoid failures within your application, network, and infrastructure
  • Move from theory to practice through real-world stories from industry experts at Google, Microsoft, Slack, and LinkedIn, among others
  • Establish a framework for thinking about complexity within software systems
  • Design a Chaos Engineering program around game days and move toward highly targeted, automated experiments
  • Learn how to design continuous collaborative chaos experiments

Reviews (95)

excellent and varied real world examples

This is a very accessible (and quotable!) overview of the principles and practices of chaos engineering. I also really enjoyed the case study chapters. The book contains a wide variety of case studies, and each one felt authentic and real. This book feels like it is meant to build community as much as educate technologists. I think that is super important at this point in the evolution of the chaos engineering and resilience engineering communities.

Fascinating concepts - even for non-engineers

I am not a software engineer or coder, but I found the concepts in this book really interesting. The examples used to illustrate the use of chaos engineering begin very simply and increase in complexity, allowing you to follow along even if you don't know the lingo used by coders. Now that I know more about chaos engineering, I think differently every time I read the news about a major outage at Netflix or Google, for example. The key concept that really grabbed me is that in complex computer systems, there is no way for a single human brain to understand how it all works. Therefore, there is no way to predict the outcome as an infinite number of variables come into play. The system itself must be made resilient enough to detect and adjust when something is about to crash.

Highly recommended book for all engineers - "Resilience is created by people. "

As a lead, of a team of engineers, responsible for mitigating the risks and ensuring the "high availability" of multple products, I found this book as a great resource in understandnig and leveraging Chaos principles in my profession. I highly recommend this welll written book to those in software design/development/quality engineering. who want to learn more about how Chaos engineering can help you to proactively extract the possible risks and unknowns .. by leveraging well explained methodologies, .. exploring and unearthing critical pathways that can prevent crashes and unpredictable behaviors your customers will (hopefully not) enounter. The book provides details and explains the theories, including experiments, while offering insight on the "how's" and "whys" .. driving the point that learning from "failures" and even inducing "faults" in the system are so critical.

An informative and "enjoyable" read!

Chaos Engineering deserves 5 stars for both readability and content. I am certainly a novice in the field but I was able to grasp the concepts laid out in the book as opposed to feeling out of my depth. And what is more, it is a very enjoyable read. I recommend this book to anyone working in, or interested in, software engineering as well as fans of the Chaos Community Broadcast (youtube). The industry certainly seems to be leaning heavily into chaos engineering and continuous verification at all levels, from small startups to enterprise business.

This book changes the way you think of systems

Chaos Engineering is not about breaking all the things or wreaking havoc in production. Chaos Engineering is a discipline that helps navigate the inherent complexity in our systems. This book is packed with insight from engineering leaders at Google, Slack, and LinkedIn in addition to the authors' experience at Netflix. The chapters Security Chaos Engineering, Continuous Verification, ROI of CE, and People in the Loop were all very helpful in addition to the more case study style chapters. If you are new chaos engineering or if you are taking your resilience and reliability engineering to the next level, you need to read this book. Disclaimer: I am not an impartial reviewer. I was a reviewer for the Security Chaos Engineering chapter, and as this book was getting written/published, I joined the chaos engineering startup (verica.io) which was founded by one of the authors of the book.

establishes Chaos Engineering as a paradigm sprouted from roots in Site Reliability Engineering/SRE

Although the title might suggest this is an anarchists approach to Site Reliability Engineering(SRE), this book expresses the dynamics of applying functional safety practices (like FMAE) to complex systems. It is a subset of SRE, and although the book at a couple places tries to differentiate itself completely from SRE, it extends the SRE paradigm in great detail in a way that can be applied to most any complex process --even medical and financial ones which traditionally one might consider too risky for such things. This book looks at the process (system) from all practical perspectives in order to give the reader a good sense of guidelines on how Chaos Engineering might be applied with their very different (and maybe unique) complex system. A must read. Just know that verify, validate and test have different definitions in different contexts, and the authors' definitions are just to differentiate specific processes for part of the book.

Great In Depth Exploration Of The Topic

When people first hear about chaos experimentation, usually through Netflix' Chaos Monkey, it blows their minds - seems like a super cool idea, very "Netflix dramatic," they are sure elite engineers, but that's probably not for me and my shop... But with a thorough understanding of the principles underlying it, you can perform safe and helpful resilience experiments anywhere. This book explains the theory, goes through a bunch of case studies of real companies from CapitalOne to LinkedIn doing chaos experimentation, and presents a maturity model and many practical tips on how to start using experiments to determine the safe boundaries of your systems. If you're serious about ensuring the resilience of your systems, this is a new development that you can't ignore, and this book does an extremely thorough job of explaining the concepts and practice behind chaos engineering.

Reality based engineering for modern internet distributed applications

It's ironic that so many organizations have lost sight of their business processes being built on top of a communications protocol (tcp/ip) which was designed for resilience and adaptation to expected failure. This has led to some embarrassing transitions from legacy client/server architectures. The complexity and potential paths for failure in even a relatively simple distributed application is beyond the traditional management models for QA, security, and disaster recovery modeling. Chaos Engineering is a realistic, practical, and thorough guide to reintroduce technology systems management to the original principles of the internet - that failure is intrinsic to distributed computing. So embrace, plan, and adapt. Chaos Engineering blasts past the shiny acronyms to prescribe reality based engineering. If you are in any way responsible for or part of the modern application process I recommend you read this book. (disclaimer: I was provided a complimentary copy while attending a recent AllDayDevOps conference. This review is based though on my 30+ years in engineering, analyzing, and securing internet based systems, from Arpanet era to cloud microservices)

An excellent resource for improving resiliency

Chaos Engineering is an excellent resource that provides a comprehensive view of this titular modern practice which has become so crucial for successful distributed systems. Throughout the book, Casey and Nora explain the history of Chaos Engineering, how to implement it in your own complex systems, and how to scale its experiment-driven approach to a large environment. But they go beyond that as well, with two particular sections that stood out to me. The first is the concrete examples they provide from major companies like Slack, LinkedIn, and Capital One. These really helped illustrate the principles with real-world references. The second was the commentary on business factors and the ROI of chaos engineering. This is critical, as getting the business to buy in to willfully breaking things can be a hurdle if you don't know how to effectively communicate the benefits. Overall this book addressed everything I was looking for and more, and left me eager to begin my own resiliency experiments.

Great for team leads of for providing solid information when proposing Chaos Engineering to an org

I come from a developer background, so my initial expectation when getting a hold of this book is that it would be more practical, focusing on the technology step-by step. Instead, it gives a broader view of the theory behind chaos engineering, the supporting evidence to validate its need, and organizational steps to implement it. To be honest, I believe that for a book, this second approach offers greater value. Focusing on tech specifics would get outdated quickly, and that info can be easily searched online. From my personal experience, the biggest challenge of selling internally and implementing a new approach like Chaos Engineering is how to address cultural change. Corporate inertia is pretty hard to fight, so this book will equip you with the arguments, with references of other companies and success cases, and provide an action plan on how to implement in an org. Highly recommended!

excellent and varied real world examples

This is a very accessible (and quotable!) overview of the principles and practices of chaos engineering. I also really enjoyed the case study chapters. The book contains a wide variety of case studies, and each one felt authentic and real. This book feels like it is meant to build community as much as educate technologists. I think that is super important at this point in the evolution of the chaos engineering and resilience engineering communities.

Fascinating concepts - even for non-engineers

I am not a software engineer or coder, but I found the concepts in this book really interesting. The examples used to illustrate the use of chaos engineering begin very simply and increase in complexity, allowing you to follow along even if you don't know the lingo used by coders. Now that I know more about chaos engineering, I think differently every time I read the news about a major outage at Netflix or Google, for example. The key concept that really grabbed me is that in complex computer systems, there is no way for a single human brain to understand how it all works. Therefore, there is no way to predict the outcome as an infinite number of variables come into play. The system itself must be made resilient enough to detect and adjust when something is about to crash.

Highly recommended book for all engineers - "Resilience is created by people. "

As a lead, of a team of engineers, responsible for mitigating the risks and ensuring the "high availability" of multple products, I found this book as a great resource in understandnig and leveraging Chaos principles in my profession. I highly recommend this welll written book to those in software design/development/quality engineering. who want to learn more about how Chaos engineering can help you to proactively extract the possible risks and unknowns .. by leveraging well explained methodologies, .. exploring and unearthing critical pathways that can prevent crashes and unpredictable behaviors your customers will (hopefully not) enounter. The book provides details and explains the theories, including experiments, while offering insight on the "how's" and "whys" .. driving the point that learning from "failures" and even inducing "faults" in the system are so critical.

An informative and "enjoyable" read!

Chaos Engineering deserves 5 stars for both readability and content. I am certainly a novice in the field but I was able to grasp the concepts laid out in the book as opposed to feeling out of my depth. And what is more, it is a very enjoyable read. I recommend this book to anyone working in, or interested in, software engineering as well as fans of the Chaos Community Broadcast (youtube). The industry certainly seems to be leaning heavily into chaos engineering and continuous verification at all levels, from small startups to enterprise business.

This book changes the way you think of systems

Chaos Engineering is not about breaking all the things or wreaking havoc in production. Chaos Engineering is a discipline that helps navigate the inherent complexity in our systems. This book is packed with insight from engineering leaders at Google, Slack, and LinkedIn in addition to the authors' experience at Netflix. The chapters Security Chaos Engineering, Continuous Verification, ROI of CE, and People in the Loop were all very helpful in addition to the more case study style chapters. If you are new chaos engineering or if you are taking your resilience and reliability engineering to the next level, you need to read this book. Disclaimer: I am not an impartial reviewer. I was a reviewer for the Security Chaos Engineering chapter, and as this book was getting written/published, I joined the chaos engineering startup (verica.io) which was founded by one of the authors of the book.

establishes Chaos Engineering as a paradigm sprouted from roots in Site Reliability Engineering/SRE

Although the title might suggest this is an anarchists approach to Site Reliability Engineering(SRE), this book expresses the dynamics of applying functional safety practices (like FMAE) to complex systems. It is a subset of SRE, and although the book at a couple places tries to differentiate itself completely from SRE, it extends the SRE paradigm in great detail in a way that can be applied to most any complex process --even medical and financial ones which traditionally one might consider too risky for such things. This book looks at the process (system) from all practical perspectives in order to give the reader a good sense of guidelines on how Chaos Engineering might be applied with their very different (and maybe unique) complex system. A must read. Just know that verify, validate and test have different definitions in different contexts, and the authors' definitions are just to differentiate specific processes for part of the book.

Great In Depth Exploration Of The Topic

When people first hear about chaos experimentation, usually through Netflix' Chaos Monkey, it blows their minds - seems like a super cool idea, very "Netflix dramatic," they are sure elite engineers, but that's probably not for me and my shop... But with a thorough understanding of the principles underlying it, you can perform safe and helpful resilience experiments anywhere. This book explains the theory, goes through a bunch of case studies of real companies from CapitalOne to LinkedIn doing chaos experimentation, and presents a maturity model and many practical tips on how to start using experiments to determine the safe boundaries of your systems. If you're serious about ensuring the resilience of your systems, this is a new development that you can't ignore, and this book does an extremely thorough job of explaining the concepts and practice behind chaos engineering.

Reality based engineering for modern internet distributed applications

It's ironic that so many organizations have lost sight of their business processes being built on top of a communications protocol (tcp/ip) which was designed for resilience and adaptation to expected failure. This has led to some embarrassing transitions from legacy client/server architectures. The complexity and potential paths for failure in even a relatively simple distributed application is beyond the traditional management models for QA, security, and disaster recovery modeling. Chaos Engineering is a realistic, practical, and thorough guide to reintroduce technology systems management to the original principles of the internet - that failure is intrinsic to distributed computing. So embrace, plan, and adapt. Chaos Engineering blasts past the shiny acronyms to prescribe reality based engineering. If you are in any way responsible for or part of the modern application process I recommend you read this book. (disclaimer: I was provided a complimentary copy while attending a recent AllDayDevOps conference. This review is based though on my 30+ years in engineering, analyzing, and securing internet based systems, from Arpanet era to cloud microservices)

An excellent resource for improving resiliency

Chaos Engineering is an excellent resource that provides a comprehensive view of this titular modern practice which has become so crucial for successful distributed systems. Throughout the book, Casey and Nora explain the history of Chaos Engineering, how to implement it in your own complex systems, and how to scale its experiment-driven approach to a large environment. But they go beyond that as well, with two particular sections that stood out to me. The first is the concrete examples they provide from major companies like Slack, LinkedIn, and Capital One. These really helped illustrate the principles with real-world references. The second was the commentary on business factors and the ROI of chaos engineering. This is critical, as getting the business to buy in to willfully breaking things can be a hurdle if you don't know how to effectively communicate the benefits. Overall this book addressed everything I was looking for and more, and left me eager to begin my own resiliency experiments.

Great for team leads of for providing solid information when proposing Chaos Engineering to an org

I come from a developer background, so my initial expectation when getting a hold of this book is that it would be more practical, focusing on the technology step-by step. Instead, it gives a broader view of the theory behind chaos engineering, the supporting evidence to validate its need, and organizational steps to implement it. To be honest, I believe that for a book, this second approach offers greater value. Focusing on tech specifics would get outdated quickly, and that info can be easily searched online. From my personal experience, the biggest challenge of selling internally and implementing a new approach like Chaos Engineering is how to address cultural change. Corporate inertia is pretty hard to fight, so this book will equip you with the arguments, with references of other companies and success cases, and provide an action plan on how to implement in an org. Highly recommended!

Great book! Definitely recommend!

I highly recommend this book to those in software design/development/quality engineering, who also want to learn more about how Chaos engineering can help you to proactively extract the possible risks and unknowns. This book leverages well explained methodologies, explores critical pathways that can prevent crashes and unpredictable behaviors your customers will enounter. The book provides details and explains the theories, including experiments, while offering insight on the "how's" and "whys" .. driving the point that learning from "failures" and even inducing "faults" in the system are so critical.

There is no better way to test your system than Chaos Engineering!

I'll be honest. I started to read this book because Team Verica offered a cool t-shirt if I read the book and posted a review here on Amazon.com. And then, during a meeting with my manager about how to test some code I was writing, we discussed how to ensure that the system we were designing was resilient and provided high availability and security. Having just read the first few chapters of this book, I recognized that we were talking about Chaos Engineering. A few days later Werner Vogels, the CTO of Amazon.com, at AWS re:Invent 2020, said this during his keynote address: "There is no better way to test your system than Chaos Engineering!" Realizing the right answer when told (now twice), I sat down with the book and powered through it that afternoon. It's a good, informative read! Chaos Engineering, which was born at Netflix and has been adapted by thousands of companies, seeks to help improve the resilience of systems. This book helps you to understand the principles of Chaos Engineering so that you know when you're doing it and how to do it well. It then provides several case studies from very well known companies to show how they are doing Chaos Engineering. It then goes on to provide insight on the human in the loop, as well as business factors and, finally, the current trends in Chaos Engineering which provide an insight into the evolution of the field. This book will help you and your team to better test your systems. Highly recommended!

The best approach to resiliency in distributed systems

This is such a good book. I've worked in distributed software at scale for 20 years, and seen many approaches to the never-ending increase in infrastructure complexity and organizational demand. Many of these approaches make the problem worse by introducing additional complication, or demanding that humans gain a stepwise increase in understanding, or somehow stop making mistakes. Chaos Engineering tells us that the complexity is inevitable, that humans will not change much in the near future, and that we need a different way entirely to think about these problems. Rather than focus on infrastructure or tooling, Chaos Engineering focuses on the important humans in the equation -- the customers, and the IT professionals (largely infrastructure & software engineers) charged with caring for their needs. Chaos Engineering teaches us to develop systems and practices that prioritize humans needs, and speed human learning by affording safety and confidence. We can't just "engineer harder" anymore. These large distributed systems are too big and volatile to understand. This book gives us the tools to approach these problems in new and useful ways, with more humane and practical goals.

Chaos accessible to the engineering masses

The book as a whole is fantastic. it is a great deep dive into chaos engineering at multiple levels. Where so many blog posts and talks skim the surface, this gets into the lower level of details and opens your eyes to what could go wrong that you have never thought about. As a good book does, this spurs the imagination to conceiving new challenges to explore, new ways to test failure, and new ideas about how to prevent and deal with future failures. While I find the entire book (and intend to read quite a bit of it a few more times), I particularly enjoyed chapters 19 and 20 which hit two of my biggest questions on how to inject chaos: database and security (and thus some of the inspirational thoughts: what happens to applications when databases fail, does the cluster approach actually work automatically, etc.). All around, great book, super useful, practical, and inspirational. Kudos to all of the authors and contributors.

Important book on how to design resilient systems

Isn't it great how Netflix seemingly never goes down? Much of that is the result of many years of effort to design systems that intentionally fail to en sure other systems don't. Chaos Engineering is the approach to design for failure so your systems can become more resilient. This book has a lot of actionable advice to begin thinking about Chaos Engineering, and even some specific details about implementing these systems. A good read for any Ops/DevOps person.

Great book to gain knowledge on Choas engineering

While the concept of Chaos engineering is not new, it is not something that many companies actively perform. This is a great book to get an introduction into the principles of choas engineering and why its such a critical area of testing. The cases studies provide really useful and interesting information. I would recommend this as a started for anyone interested in choas testing.

Good read for site reliability engineers

This book touched upon many aspects of site reliability engineering: from unexpected application failures to network failures and infrastructure failures and the how to build resiliency and robustness in your application, network and infrastructure to avoid, prevent and recover from them, specially disaster recovery. This is a must read for site reliability engineers (aka SREs, the new term for system administrators), application developers and even security researchers.

Detailed and authoritative reference on this innovative testing technique

Chaos Engineering: System Resiliency in Practice is an easy to read and highly informative book about Chaos Engineering. We use this technique at my company to validate our system design for resiliency and fault tolerance and it is highly effective. I recommend this book to all new employees coming in to our org and to anyone outside our org who is interested in this testing strategy because it is such an effective book for enhancing understanding.

Move fast and don't break things

Engineers and operators of modern software systems are waking up to the reality that the systems they build and maintain are growing increasingly complex, and it's neither feasible nor practical to try to reduce that complexity. I appreciated a number of things about this book: first and foremost, it puts theory into practice. You're not Netflix, where these practices arose, but you can learn from others who have been putting them to work in their organizations. Additionally, the theory behind these practices is rooted in both research and deep practitioner-based experience. You'll walk away from this book not only seeing the systems you build (and those around you) differently, you'll be armed with practical ideas and tactics to put into play in your own work.

Level up your CICD pipeline

Just got this book recently and am in chapter 4 but OMG what an amazing treasure trove of information. I am tired of tiny little things crashing my jobs and failing my builds. I am ready to finish this front to back and be a CICD God. An amazing quality unique to this book is that its very easy to read. I feel like I am reading a sci-fi comic and learning a lot! Cutting edge stuff!

Cutting edge! Get your copy now.

Chaos Engineering may be an unfamiliar term for most. This concept is so masterfully articulated and broken down into practical, consumable and most importantly, real life examples from the leading experts and pioneers in this field. System resiliency is a model and desired state for most organizations that are moving toward microservices in order to drive real business goals into reality. Chaos engineering helps organizations and their operators to manage complexity, focus on innovation and customer outcomes that drive real change. The authenticity of the authors shines through from chapter to chapter. This is a must read for all lines of business who are seeking to manage complexity to build resiliency into their most mission critical business technologies and systems. You will enjoy this!

the reference manual for an entire new industry

I'm new to this discipline and have vaguely absorbed the author's work over the years. This was such an accessible, yet detailed explanation of the lessons and principles of chaos engineering earned over decades' worth of experience from the authors. Easily the highest ROI considering the amount of money spent and lost over the lessons learned in running large scale resilient systems.

Excellent book and insights of Engineering

This is a very accessible (and quotable!) overview of the principles and practices of chaos engineering. I also really enjoyed the case study chapters. The book contains a wide variety of case studies, and each one felt authentic and real. This book feels like it is meant to build community as much as educate technologists.

Great addition to learn a new discipline within software engineering

This is a very good book to read if you want to learn about chaos engineering. It explains the principles and also provided several useful case studies. I find these concepts easy to understand and digest and I would recommend this book to anyone who wants to start their chaos engineering journey and to learn more about the discipline.

Excellent resource, highly recommend!

I came into this book having heard about chaos engineering but not really knowing anything about it. I learned enough to enable me to confidently and competently discuss the topic in professional settings. The principles are clear and I felt they were easy to understand. I will revisit and refer to this book often!

Great book for going through evolution and details of Chaos Engineering

We ourselves are early adopters of Chaos Engineering, and the book has a lot of details on the types of attacks, observations and details around the landscaping , execution and observation collection process of Chaos Engineering.

Hard-won knowledge from distributed systems veterans

It's a common trope that the "internet goes down" when AWS has issues, but practitioners of Chaos Engineering are prepared for this eventuality. The discipline of Chaos Engineering isn't about creating chaos, but about realizing that chaos is inherent in the world around us -- including (or especially!) in complex distributed systems -- and designing systems to be resilient in the face of unexpected conditions. If you manage systems of technology and people, you need to read this book from two of the founding engineers in the Chaos Engineering movement.

Great book - both for in troduction of principal of chaos engineering and practical examples

I really liked this book. It provides a very good overview and explanation of the principles of chaos engineering, good variety of use cases and case studies as well as various chaos engineering methodologies. It is very timely given the wide adoption of cloud computing and migration projects that are happening at masse within organizations. I can attest it is happening in the organization I work for. It has given me a lot of useful information and inspired to start implementation of some chaos engineering steps on the projects I work with.

Objective guide to chaos engineering

Great read if you are interested in the topic of chaos engineering, but don't have much experience at how and why it is done. I first learnt abut chaos engineering at a Netflix session during Re:invent, and it was fascinating. This book adds more context on that session on the whys and hows of chaos engineering. I feel like it is the right level of specificity for this book, and will be a handy guide for days to come.

Great coverage of the topic!

Being an entry level researcher into this subject, I found this book to be an excellent resource in that it contains a very detailed compendium of definitions, principles, and examples of implementations taking place across different organizations. Having this sort of information in one place is very valuable. I also enjoyed the contributing actors' role in creating this work; I believe it enhanced the quality of it. Highly recommended.

Excellent read. Highly recommended

I read this book after listening to Casey speak at the ADDO conference. Its an excellent read into embracing chaos and how to proactively handle it.

Great book - an interesting read even if you aren't in the field.

The book gets you to think about systems in a different way. I had heard about it before but relegated it as just "breaking things in production". It is something more systematic and complex than that, and this book gives a well-written overview on how it has evolved to this point. I recommend it. Even if you aren't planning on going into this field, it provides a fresh perspective for you to think about. And besides that, it's a nice read.

Excelente Libro.

El mejor libro de Chaos Engineering que he leído, buenas prácticas, habilidades requeridas. Muy bien explicados. Recomendado para los que estamos implementado software altamente resiliente y con capacidad de adaptarse a las necesidades del mercado.

Fault tolerance and resilience

Chaos Engineering: System Resiliency in Practice is an interesting book wish is easy to understand. As lead project manager at Orange we need this book here in the CDN engineering team. Happy to have a copy of the book I’ll recommend it to all new comers and the hole head of the project Tiktok we as primary project. The book still a good source of information about resilience and fault tolerance. Regards,

great book on chaos engineering

Very clearly articulates as to how SRE's can do well in their job by well articulated chaos engineering ahead of actual chaos happening

A very practical and insightful book

This book is straight to the point, giving clear explanations and usable guidelines. I enjoyed reading it and recommend it.

Great guideline for securing your cyber initiatives

There's so much information, and in great detail, that I'll be reading through this book many times over.

Good Overall Coverage of the topic

I found this really helpful for exploring the topic of Chaos Engineering beyond just the buzzword hype. As I try to grow the resiliency of the products my team works on, there are a lot of practical approaches to both what to do and how to communicate about it..

The first book you'll ever need for chaos engineering

Loved this. I honestly didn't have a great grasp on what chaos engineering really was till I read this.

Excellent book to understand chaos engineering

This is an excellent book to understand chaos engineering. It is very detailed and self explanatory.

Awesome book on Chaos Engineering

It clearly outlines the principles and methods of Chaos Engineering. It helps with resiliency and fault tolerance as well as how to validate the architecture and design of the microservices. Also clearly articulates a great testing strategy.

Great depth and breath

The authors were literally at the infancy of the movement to improve software quality and resiliency through chaos engineering.

Great insights which will help me to build more resilient systems.

I really liked the examples given in the book. The level of detail and explanation enabled me to see how I could use what was being done in the application and service stacks that I am implementing. Great information which will lead me to create more robust and resilient systems.

Great Read for engineers and management

Thoroughly enjoyed the book. Highly enlightening and interesting. Was nice to catch up on some new ideas. Thank you...

They figuratively wrote the book on Chaos Engineering, and then they literally wrote it.

Chaos Engineering is an idea that seems crazy in theory, but once you learn about the benefits it provides, it becomes obvious in retrospect. This is the book to take you on that journey. Accessible, informative, and, dare I say, entertaining. If you want to learn about Chaos Engineering and why your organization needs it, this is the book.

A must-read for software engineers and DevOps practitioners

Excellent reading on chaos engineering to improve resilience. This book explains what chaos engineering is with in action examples, including its business motivation and considerations, in an easy lecture. Definitively, a must-read for software engineers and DevOps practitioners.

Very Informative Book.

Easy enough to understand that even I could understand the concepts as an Ops guy.

Clear writing by folks who have earned their proverbial t-shirt

Well organized and replete with real world examples that communicate both the complexities involved and the advantages accruing from implementing chaos engineer principles. If you are interested in or responsible for providing IT capabilities at scale, this book is a must read.

A paradigm shift in thinking about resiliency

Found this to be a great way to think about resiliency through experimentation and hypothesis testing. The practices discussed are transferrable to nearly anyone running complex systems. Chaos eng is the proactive approach you should take.

Highly recommended if you want your software to be highly resilient

Really good playbook to learn and implement chaos engineering practices to your products solutions and tackle the problem of resiliency.

Wonderful starting resource

This book is great. Explores the topic and various aspects so you can continue to build and learn further. Excellent place to start if you are new to chaos engineering and what to take your system engineering skill set up a level!

Great history and overview at several levels

The book provides many cases studies and opinions on chaos engineering. You can apply this well beyond the software industry. I had to read it more than once as each time it made me reflect and think abut possibilities.

Essential to anyone wanting to learn about Chaos Engineering

I've always been interested in Chaos Engineering and had heard the term thrown around, but never really understood it. This book does a great job explaining the subject and is essential for anyone wanting to learn more about this discipline.

Wonderful

This is such an amazing book to learn about chaos engineering, I consider this book as a good guide to start in this discipline. People can onborard on the topics that are covered in this book and can begin in the path of caos engineering.

Very well written in every aspect

As a beginner to Chaos Engineering, I got a lot out of this book. The language was very clear, and the relevant examples not only helped me understand the concepts better, but also improved my way of thinking about systems in general. It's a great read!

Excellent book on building resilient systems.

This book is a great resource for any company that is building large (especially distributed) systems. It will help you discover systems vulnerabilities and prevent outages before they impact your customers.

Muy util

No existen muchos libros de Ingeniería del Caos, si estas involucrado de alguna forma en DevOps o SRE debes agregarlo a tu lista de lecturas. Este libro explica la teoría incluyendo casos de estudio lo que hace de manera muy ameno e ilustrativos.

Great material for systems engineers

Chaos engineering is the only way to guarantee system reliablity - and this book covers it all. A great in-depth look at everything one would need to engineer the best reliability practices.

Concepts really covered well

The book has a good coverage and all the concepts explained really well

Great Read

Great book to start learning about chaos engineering!!!

Good book

This is a good book and I enjoy reading it. As a software engineer, it's helpful to learn from the book how to discover vulnerabilities and prevent outages before they impact the customers.

Helps me to design a resilient system

This book helps me to design a resilient system. Dealing with a highly scalable and distributed system comes with a cost. This book helps me to tackle issues that we are usually not aware of.

Effective and Insightful

This contains effective practices that I can apply to everyday work and notions that help ensure security and stability. Definitely insightful and worth reading!

Very accessible

I appreciated how approachable this book was for anyone no matter what your skill level of system design is. It's a gentle lead into a topic that gets complex.

Great book to learn chaos engineering techniques.

Excellent source of valuable information, authors describe all concepts in readable and engaging manner with lots of examples. Easy to read and digest.

Different perspective to look at the systems - Must Read

Fantastic read at different levels with many real world examples. It will give you a different perspective in the way you look at the software systems in general.

A very good book for Engineers getting to know about Chaos engineering practices and concepts

A very good book on the concepts of Chaosing. Sets you up with a lot of real world examples.

Great Book

This was a great read. It was highly relevant and provided guidance that was directly helpful. I highly recommend this for anyone getting into chaos engineering.

Excellent read for someone looking to Invest in Chaos

The book had lot of great ideas and fundaments about Chaos practice. I read the book and it helped me to start a new thread as part of our SRE to work on Chaos.

Fantastic book

This book really helped me in my cloud engineering job on a daily basis, the lessons were fantastic and I plan on reading it over again soon!

Book has useful information

Very useful information in this book as I learned more about this topic

Great Introduction to Chaos Engineering

A really nice introduction to chaos engineering. I thoroughly enjoyed this book.

Start chaos from here

Excellent book! For anyone learning about chaos, you should start from here

Practical Reference for Chaos Engineering

Contributed chapters from people at Slack, Google, Microsoft, LinkedIn.. are really helpful

Strongly recommended for s/w engineers

Thoroughly enjoyed the book. Highly enlightening and interesting. Was nice to catch up on some new ideas.

Learned a lot!

Great book, learned a lot from the examples and laying out Chaos Engineering

Just right for engineers and management

Really enjoyed the book. Very enlightening and interesting. Great to learn some new ideas. Thank you...

Awesome Book!

A great book on Chaos Engineering and I learn many new fresh knowledge from this awesome book!

this book is awesome

very good content and very useful detail. You can easy to follow and apply to the reality

Great place to start

Good organization and flow, good details and case studies.

awesome read

awesome read, with real world examples! highly recommend

Helpful

It is a helpful book, great examples and clear explanation

very helpful

I would introduce to my staffs this book. Really love it

Good book

Great book for engineers interested in building resilient systems

Excellent contextualization of an under appreciated engineering skill

Every high level engineer needs to read this title.

Trending Books