Debugging cluster issues as an on-call SRE | Pravar Agrawal | Conf42 SRE 2024

Read the abstract ➤ https://www.conf42.com/Site_Reliability_Engineering_SRE_2024_Pravar_Agrawal_debugging_cluster_oncall Other sessions at this event ➤ https://www.conf42.com/sre2024 Support our mission ➤ https://www.conf42.com/support Join Discord ➤ https://discord.gg/DnyHgrC7jC Chapters 0:00 intro 0:26 preamble 0:40 agenda 1:21 whoami 1:39 introduction to sre 2:55 understanding on-call process 4:23 some common cluster issues 6:32 approach to debugging 8:10 automation to the rescue? 9:16 shades of automation 12:06 advice for beginners 13:09 thank you!