Incident Response, Management & Alerts - where they fit in CloudOps? | Chris Riley | Conf42 SRE 2021
Chris Riley - Senior Technology Advocate @ Splunk Responding to incidents is not just about wiring up the right tools, it’s also a strategy and process to know how to respond, how to record details of incidents, and how to learn from them after everything has been resolved. There has been a lot of confusion about the relationship of incident response to incident management, and alerting. In this session we will talk about the differences and the best practices for these key processes in healthy cloud operations environments. Expect to learn in this session. 1) What is an alert and how does it feed incidents 2) What is the difference between incident response (IR), and incident management (IM) 3) What are the best practices for IR and IM tooling 4) How incidents are being handled in modern dev environments We will also talk about on-call scheduling, shift-left, and how machine learning supports incident response strategies. Other talks at this conference 🚀🪐 https://www.conf42.com/sre2021 — 0:00 Intro 1:16 Talk