SREview Issue #11 March 2021

Is it spring yet? Or spring still? Time sure is strange nowadays. At least we have a ton to look forward to in the next few weeks! Here are some of the most exciting Tweets, content, and events happening in the SRE and resilience engineering community this month.


White dog running from a winter scene with snow and bare trees into a spring scene with green grass, flowers, and butterflies.

Tweets that have us twittering

SREading

SRE2AUX: How Flight Controllers were the first SREs: Geoff White writes about what vintage space lore has to do with site reliability engineering in the 21st century.

The Netflix Cosmos Platform: This article explains why the Netflix team built Cosmos and how it works, and shares some of the things the team learned along the way.

SRE as Organizational Transformation: Lessons from Activist Organizers: Chris Hendrix writes about how we can learn from activist organizers while driving company-wide change.

What is a Canary Deployment?: This post contains a thorough description of canary releases, including benefits, visual examples, and how they fit into an effective deployment strategy.

How We Built and Use Runbook Documentation: Alicia Li and Lucas Bartroli write about runbooks. “Even if you don’t notice, you are executing runbooks everyday, all the time.”

Increment’s Reliability Issue: This issue contains articles on reliability from thought leaders such as Tanya Reilly, Mads Hartmann, Ana Margarita Medina, and more.

Give it a whirl

Teams have a new tool in their tool belts. Blameless Runbook Documentation is available for early access.

Product image of Blameless Runbook Documentation. GIF shows a user adding steps, including a code snippet.

Runbooks are an industry best practice, empowering teams to codify the incident response process and drive process repeatability and consistency. These sets of instructions allow teams to resolve incidents faster with greater confidence and less toil.

Events

SRE Thought Leader Panel: SRE Adoption as Organizational Transformation March 25 at 11 AM PDT: Hear from experts Kurt Andersen, Vanessa Yiu, and Tony Hansmann. Hosted by Chris Hendrix.

Blameless Bi-Weekly Demo March 30 at 8 AM PDT: Check out a live demo of Blameless as we walk you through operations best practices, and get your questions answered.

DevOps Online Summit April 26-30: DevOps professionals throughout the world come together and share their learnings.

Deserted Island DevOps April 30: A single-day virtual event streamed on Twitch. All presentations will take place in the world of Animal Crossing: New Horizons.

Want to contribute?

If you’re looking to share your insights with the SRE and resilience engineering community, we’d love to partner with you on content. Fill out our form here and we’ll reach out!
