Incident Response? Let's do science instead | Ivan Merrill | Conf42 Platform Engineering 2023
Read the abstract ➤ https://www.conf42.com/Platform_Engineering_2023_Ivan_Merrill_incident_response_lets_do_science_instead Other sessions at this event ➤ https://www.conf42.com/platform2023 Join Discord ➤ https://discord.gg/DnyHgrC7jC Chapters 0:00 intro 0:25 preamble 1:27 incident response can learn from safety engineers in other domains 3:13 a definition... 4:12 catastrophe is always around the corner 5:30 incident response isn't easy 6:02 an overreliance of dashboards and runbooks 8:21 guesswork 9:46 spending a long time on the wrong hypothesis 11:10 fear of failure 12:42 'history doesn't repeat itselg but it often rhymes' 13:30 'it seems easy to look back at an incident and determine what went wrong (...)' 14:54 normative language 16:47 mechanistic reasoning 18:55 above the line, below the line 22:07 change introduces new forms of failure 23:35 experienced troubleshootes rely more on case-based strategies 26:48 science - definition 27:23 the theory of falsifiability 30:32 'a more scientific, hypothesis-driven, approach to how humans perform (...) can improve reliability 31:30 why bother? 34:20 3 steps 36:13 all practitioner acts are a gamble 37:16 thank you