1-day reliability engineering & product leader summit

Blameless Summit 2019

Mon, Aug 12th 2 Jack London Alley, San Francisco

About the Summit

If you're a manager, director, VP, or executive in Engineering or Product — join us on August to learn and talk about reliability.

You'll connect with peers & experts, participate in interactive case studies, and leave with actionable advice you can implement immediately — that's what makes the Blameless Summit unique.

Learn how to get teams to deliver innovative features while maintaining reliability, processes to decrease time during incidents, how to plan for outages, and how to be a manager that people love to work with. Grab a ticket before we're sold out! Seating is VERY limited!

Speakers

Dave Rensin

Keynote Speaker

Dave Rensin

Dave Rensin an Engineering Director in Google SRE. He is the head of Customer Reliability Engineering (CRE) at Google and is responsible for applying the lessons and practices of Google Site Reliability Engineering (SRE) to customer production systems. He is also responsible for Google's network capacity planning including network simulation, demand modeling and forecasting. He is an accomplished entrepreneur and author.

Liz Fong-Jones

Keynote Speaker

Liz Fong-Jones

Liz is a developer advocate, labor and ethics organizer, and Site Reliability Engineer (SRE) with 15+ years of experience. She is an advocate at Honeycomb.io for the SRE and Observability communities, and previously was an SRE working on products ranging from the Google Cloud Load Balancer to Google Flights.

Paul Osman

Case Study Speaker

Paul Osman

Paul Osman is the Senior Manager for Site Reliability Engineering at Under Armour where he and his team support the fitness tracking applications MapMyFitness, Endomondo, and MyFitnessPal. He has 15+ years of experience as a software engineer, focusing mostly on microservices, reliability, and DevOps practices. He is an advocate for chaos engineering and building just cultures that prioritize safety and resiliency. Prior to joining Under Armour, he helped build platform teams at PagerDuty, SoundCloud and 500px.

Ben Rockwood

Case Study Speaker

Ben Rockwood

Ben has been in the industry for over 20 years, from the Enterprise Computing of the 90's to building one of the first clouds at Joyent in 2006 and helping define the DevOps movement in 2009 and onward. He loves to learn and lives on an island near Seattle with his amazing wife and 5 children.

Summit Schedule

9:30AM-10:30AM • Keynote 1

Dave Rensin of Google

10:30AM-11AM

Break

11AM-11:30AM • Improving Postmortems: From Chores to Masterclasses

Paul Osman of Under Armour

Postmortems are an essential tool for learning from production incidents. Unfortunately, it's common for them to become laborious chores for engineers and incident response teams. Whether it's because they feel like theatre or blame accidentally sets in, this can have an adverse effect on the resiliency of your organization. An effective postmortem culture doesn't just happen, it has to be an intentional and ongoing effort at multiple levels of an organization.

In this talk, I'll walk through some of our efforts at Under Armour to improve our postmortem culture - I'll discuss some of the successes we've had, lessons we've learned along the way and areas we're excited to focus on next.

11:30AM-12PM • Incident Management for Small and Medium-Sized Startups

Ben Rockwood of Packet

A brief case study in the challenges of incident management in small to medium startups and how Packet is using Blameless to improve the process.

12PM-1:30PM

Lunch

1:30PM-2PM • Case Study 3

Jed Needle of Procore

2PM-3PM • Cultivating Production Excellence

Liz Fong-Jones of Honeycomb.io

Taming the complex distributed systems we're responsible for requires changing not just the tools and technical approaches we use; it also requires changing who is involved in production, how they collaborate, and how we measure success.

In this talk, you'll learn about several practices core to production excellence: giving everyone a stake in production, collaborating to ensure observability, measuring with Service Level Objectives, and prioritizing improvements using risk analysis.

3PM-5PM

Happy Hour

Receive news, announcements, and special offers.