Resources
Browse through videos, guides, and other educational resources that cover incident management, reliability, team culture, and more.


Podcasts
Ebook
9.16.2022
Resilience in Action E15: Scaling SRE and DevOps operations for the company that mouses around and its galaxies far far away with Brian Scott
Resilience in Action is a podcast about all things resilience, from SRE to software engineering, to how it affects our personal lives and more. This podcast is hosted by Kurt Andersen. Kurt is a practitioner and an active thought leader in the SRE community. He speaks at major DevOps & SRE conferences and publishes his work through O'Reilly in quintessential SRE books such as Seeking SRE, What is SRE?, and 97 Things Every SRE Should Know.


Blog
Ebook
9.12.2022
Blameless Expands Microsoft Partnership to Deliver Faster, More Intuitive Incident Response Collaboration
The integration between Blameless and Microsoft Teams is significant for our customers, because it enhances their main line of communication during the most pressing moments of incident response. Directly from Microsoft Teams, an on-call engineer initiates an incident, notifies stakeholders, and orchestrates rapid response, all while automatically collecting each event or “touch” that adds value to the retrospective (postmortem) for learning.


Blog
Ebook
8.31.2022
Software Metrics Every SRE Team Should Measure
Software metrics give important insight into the performance of your product, but which ones matter most to SRE teams? How do you decide which metrics to track?


Videos
Ebook
8.25.2022
What's difficult about problem detection?
In this episode, Joanna Mazgaj, Director, Production Support, and Laura Nolan, SRE at Flatiron, join Matt Davis and Kurt Andersen from the Blameless team to detect the problems of problem detection! Knowing what's going wrong isn't always easy. Learn how to get ahead by building collective intelligence, stopping things from slipping, and more!


.png)
Customer Stories
Ebook
8.25.2022
Procore went from disjointed ad-hoc tasks to a smooth, cohesive incident response process, tailored for their needs, with Blameless.


Blog
Ebook
8.24.2022
What is an SRE job description?
Whether you’re building an SRE team or looking for a job as an SRE, understanding the SRE job description is important. How would you define an SRE job?


.png)
Customer Stories
Ebook
8.23.2022
Agero’s Incident Management Is “Invincible” with the Help of Blameless Automation


Videos
Ebook
8.18.2022
Procore Case Study Video
Procore went from disjointed ad-hoc tasks to a smooth, cohesive incident response process, tailored for their needs, with Blameless


Blog
Ebook
8.17.2022
Chaos Engineering: What Is It & How Does It Work?
Distributed software systems have many points of failure. Can the process of chaos engineering help identify problems and gauge resiliency?


Videos
Ebook
8.12.2022
Reliability Insights: Out of the Box Reports
In this video, Matthew Dodge, Customer Success Manager at Blameless, walks you through all of the out-of-the-box Reliability Insights reports you get in Blameless automatically. He'll explain where the data comes from, why it's helpful in understanding reliability, and how you can translate these reports into actionable steps.
Incident Impact Calculator
Find out how much you could save
Incidents can do real damage to companies that aren't sufficiently prepared them. Use our calculator to estimate the full cost of incidents for your team.
use the calculator