In this episode, Joanna Mazgaj, Director, Production Support, and Laura Nolan, SRE at Flatiron, join Matt Davis and Kurt Andersen from the Blameless team to detect the problems of problem detection! Knowing what's going wrong isn't always easy. Learn how to get ahead by building collective intelligence, stopping things from slipping, and more!
In this episode, Joanna Mazgaj, Director, Production Support, and Laura Nolan, SRE at Flatiron, join Matt Davis and Kurt Andersen from the Blameless team to detect the problems of problem detection! Knowing what's going wrong isn't always easy. Learn how to get ahead by building collective intelligence, stopping things from slipping, and more!

Speakers

Kurt Anderson

Strategy, Blameless
Read Bio

Kurt Anderson

Strategy, Blameless
Kurt Andersen is a practitioner and an active thought leader in the SRE community. He speaks at major DevOps & SRE conferences and publishes his work through O'Reilly in quintessential SRE books such as Seeking SRE, What is SRE?, and 97 Things Every SRE Should Know. Before Blameless, Kurt was a Sr. Staff SRE at LinkedIn, implementing SLOs (reliability metrics) at scale. Kurt is a member of the USENIX Board of Directors and part of the steering committee for the world-wide SREcon conferences.

Matt Davis

Staff Infrastructure Engineer, Blameless
Read Bio

Matt Davis

Staff Infrastructure Engineer, Blameless
Matt is a Sr. Infrastructure Engineer at Blameless. His expertise brings to bear a variegated background including data-center operations, storage hardware and distributed databases, IT security, site reliability, support services, observability systems, and techops leadership. He has a passion for exploring the relationships between the artistic mind and operating distributed software architectures.

Joanna Mazgaj

Director, Production Support, Tala
Read Bio

Joanna Mazgaj

Director, Production Support, Tala
I manage production engineering organization at Tala, which is a part of our CloudOps/DevOps group. My teams build internal tools and platforms for customer and product management and we own the production incidents escalation process. All the way from simple change request escalations to P0/P1 incidents. In my spare time I'm probably on a hike, playing a computer game or cooking. I collect cookbooks, I have about 85 right now.

Laura Nolan

Principal Software Engineer, Stanza Systems
Read Bio

Laura Nolan

Principal Software Engineer, Stanza Systems
Laura Nolan is a software engineer and SRE. She has contributed to several books on SRE, such as the Site Reliability Engineering book, Seeking SRE, and 97 Things Every SRE Should Know. Laura is a Principal (and principled) Engineer at Stanza Systems, where she is building software to help humans understand and control their production systems. Laura is a member of the USENIX board of directors and a long-time SREcon volunteer. She lives in rural Ireland in a small village full of medieval ruins.

Video Transcript