# Concept

Reliability Platform is a set of SDKs, tooling and integrations that are designed for the creation of reliable applications. The project provides a vendor-agnostic implementation to inject faults to the backend(s) of your choice. It supports a variety of popular open-source projects including Python Django or Java Spring Boot.

# Why you need Reliability Platform and what it can do

Software project teams build cloud-native applications in a distributed and polyglot architecture. Distributed architectures introduce a variety of operational challenges including how to solve availability and reliability.

Fault injections are needed to empower resilient, cloud-native software applications. Traditionally, fault injections provide mechanisms to stop or delete infrastructure resources. The most popular example is the chaos monkey by Netflix. The chaos monkey stops a virtual machine instance in your productive environment by random. The deletion or stopping of infrastructure resources has a big blast radius. Such high blast radiuses might cause unwanted harm and introduces side effects. Application attacks are considered to be less invasive as infrastructure or platform attacks. Application attacks are great for spotting issues in error handling functions or issues related to request timeouts or slow network.

The Reliability Platform project solves these problems by providing a vendor-agnostic, minimized blast radius solution.

Reliability Platform provides you with:

  • A vendor-agnostic instrumentation library per language with support for fault injections on the application level.
  • An end-to-end implementation to fault injections.
  • A hosted portal to active and deactive fault injections in your backend.

# What Reliability Platform is not

Reliability Platform is not a fault injection for your infrastructure.