Ebook

Site Reliability Engineering for Business Continuity

The 8 Core Principles of site reliability engineering for longevity and success in business.

Site Reliability Engineering for Business Continuity

The 8 Core Principles of site reliability engineering for longevity and success in business.

Summary

The 8 Core Principles of site reliability engineering for longevity and success in business.

Table of Contents

1. Principle #1: Minimizing SPOFs (single points of failure)

2. Principle #2: Preparing for staffing reductions and changes to continuity plans

3. Principle #3: Creating margin in the system to allow for adaptive capacity

4. Principle #4: Using SLOs (compassionately) to drive prioritization

5. Principle #5: Fostering teamwork and culture in difficult times

6. Principle #6: Adapting to remote work

7. Principle #7: Enabling continuous improvement through work-as-done VS imagined

8. Principle #8: Learning for the long term

9. Conclusion

"I have less anxiety being on-call now. It’s great knowing comms, tasks, etc. are pre-configured in Blameless. Just the fact that I know there’s an automated process, roles are clear, I just need to follow the instructions and I’m covered. That’s very helpful."
Jean Clermont, Sr. Program Manager, Flatiron
"I love the Blameless product name. When you have an incident, "Blameless" serves as a great reminder to not blame anything or anyone (not even yourself) and just focus on the incident resolving itself."
Lili Cosic, Sr. Software Engineer, Hashicorp
Read their stories

Sign up for our monthly newsletter

Be the first to hear about new content and events happening at Blameless.