
What's difficult about problem detection?

Description

In this episode, Joanna Mazgaj, Director, Production Support, and Laura Nolan, SRE at Flatiron, join Matt Davis and Kurt Andersen from the Blameless team to detect the problems of problem detection! Knowing what's going wrong isn't always easy. Learn how to get ahead by building collective intelligence, stopping things from slipping, and more!

Speakers

Kurt Andersen

Strategy, Blameless

Kurt Andersen is a practitioner and an active thought leader in the SRE community. He speaks at major DevOps & SRE conferences and publishes his work through O'Reilly in quintessential SRE books such as Seeking SRE, What is SRE?, and 97 Things Every SRE Should Know. Before Blameless, Kurt was a Sr. Staff SRE at LinkedIn, implementing SLOs (reliability metrics) at scale. Kurt is a member of the USENIX Board of Directors and part of the steering committee for the world-wide SREcon conferences.

Matt Davis

Staff Infrastructure Engineer, Blameless

Matt is a Sr. Infrastructure Engineer at Blameless. He brings a varied background that includes data-center operations, storage hardware and distributed databases, IT security, site reliability, support services, observability systems, and techops leadership. He has a passion for exploring the relationships between the artistic mind and operating distributed software architectures.

Joanna Mazgaj

Director, Production Support, Tala

I manage the production engineering organization at Tala, which is part of our CloudOps/DevOps group. My teams build internal tools and platforms for customer and product management, and we own the production incident escalation process, all the way from simple change request escalations to P0/P1 incidents. In my spare time I'm probably on a hike, playing a computer game, or cooking. I collect cookbooks; I have about 85 right now.

Laura Nolan

Principal Software Engineer, Stanza Systems

Laura Nolan is a software engineer and SRE. She has contributed to several books on SRE, such as the Site Reliability Engineering book, Seeking SRE, and 97 Things Every SRE Should Know. Laura is a Principal (and principled) Engineer at Stanza Systems, where she is building software to help humans understand and control their production systems. Laura is a member of the USENIX board of directors and a long-time SREcon volunteer. She lives in rural Ireland in a small village full of medieval ruins.