The Infection Monkey

A couple of years ago, Netflix introduced a concept they called the “Simian Army”. The idea was to have a set of automated processes that check their cloud’s resilience to various failure scenarios. A prime example is the “Chaos Monkey”, which randomly shuts down servers in their infrastructure to test the application’s ability to withstand server failures. When you know that a Chaos Monkey is running free in your infrastructure and your service stays up, you know that you can handle server failure effectively. We think that a similar approach applies to securing cloud infrastructure.