Let the machines optimize the machines | Stefano Doni | Conf42: SRE 2021

Stefano Doni - CTO @ Akamas

SREs’ main goal is to achieve optimal application performance, efficiency, and availability. Configurations play a crucial role (e.g. JVM and DBMS settings, container CPU and memory, etc.): wrong settings can cause poor performance and incidents. But tuning configurations is a manual and lengthy task, as there are hundreds of settings in the stack, all interacting in counterintuitive ways.

In this talk, we present a new approach that leverages machine learning to find optimal configurations of the tech stack. The optimization process is automated and driven by performance goals and constraints that SREs can define (e.g. minimize resource footprint while meeting latency and throughput SLOs). We show examples of optimizing Kubernetes microservices for cost efficiency and latency by tuning container sizing and JVM options. With the help of ML, SREs can achieve higher application performance in days instead of months, and have a lot of fun in the process!

Other talks at this conference 🚀🪐 https://www.conf42.com/sre2021

0:00 Intro
1:16 Talk