Health checks & dependency management | Andrew Robinson | Conf42 Cloud Native 2021
Andrew Robinson Principal Solutions Architect @ AWS We can improve the reliability of services by decoupling dependencies, using health checks, and understanding when to use fail-open and fail-closed behaviours. In this session we’ll talk about and demonstrate how to implement graceful degradation, monitor all the layers of your workload to help detect failures, route traffic only to healthy nodes, use fail-open and fail-closed as appropriate in response to faults, and reduce mean time to recovery. We’ll take some lessons learnt from the AWS Well-Architected framework and from the Amazon Builder’s Library, showing some of how Amazon builds and operates it’s software. — 00:00 Intro 00:40 Talk — 🥇 Gold Sponsors: Chaos Native Fugue Honeycomb.io StackPulse Teleport 🥈 Silver Sponsors: 7bulls.com Aerospike Alwaysdata Aspecto bxblue Capgemini CircleCi Container Solutions CloudSkiff Earthly Hazelcast Imply Lightrun Logz.io Lumigo MongoDB Nuaware Qwinix Zeti 🤝 Media Partners: Manning AWS Inside Dev — Website 🚀🪐 https://www.conf42.com Reach Out 📧📭 mark@conf42.com Discord Server 🧑🤝🧑💬 https://discord.com/invite/dT6ZsFJ5ZM LinkedIn 👨💼💼 https://www.linkedin.com/company/4911... Twitter 🎵🐦https://twitter.com/conf42com Conf42Cast 🎧 http://www.conf42.com/podcast