List of videos

Automate merging to keep builds healthy at scale | Ankit Jain | Conf42 Incident Management 2022

Code-submission processes can highly impact developer productivity, especially as engineering teams scale and codebase complexity grows. Often, teams that work on a monorepo struggle with keeping their main branch stable, especially as the number of engineers merging changes (and consequently, the number of code-submissions per day) grows. This happens because incompatibilities emerge when multiple changes are combined, causing builds to break frequently. This in-turn cases costly rollback and blocked deployments and hours of engineering lost. Poly-repo setups present their own challenges: synchronizing merges when changes span multiple repositories, rolling back related changes across repos, and testing across multiple build/test pipelines can become coordination time-sinks for developers. This talk will feature a distillation of various merge strategies that help teams scale, and their associated developer-productivity trade offs. Other talks at this conference πŸš€πŸͺ https://www.conf42.com/im2022 β€” 0:00 Intro 1:40 Talk

Watch
Get Ready to Recover with Reliability Management | Jeff Nickoloff | Conf42 Incident Management 2022

The best way to minimize the impact of incidents is to prepare to respond ahead of time. But it is difficult and expensive to prepare for every possible symptom or cause. This talk will cover how to test the reliability of your system, evaluate your incident readiness, and prioritize future preparation. Other talks at this conference πŸš€πŸͺ https://www.conf42.com/im2022 β€” 0:00 Intro 1:40 Talk

Watch
Using incidents to level-up your teams | Lisa Karlin Curtis | Conf42 Incident Management 2022

Incidents are a great opportunity to gather both context and skill. They take people out of their day-to-day roles, and force ephemeral teams together to solve unexpected and challenging problems. The first part of the talk will walk through the different things you can learn from incidents, including: - Taking you to the edges of the systems your team owns, and beyond - incidents help broaden your understanding of the context in which you're building - Showing you how systems fail, so you can learn to identify and build software with good observability, and considerations of failure modes - Expanding your network inside your organisation, making connections with different people, who you can learn from and collaborate with We'll then talk about how to get the best value from the incidents which you do have as an individual, thinking about when is an appropriate time to ask questions, and how to get your own learnings without 'getting in the way'. Finally, we'll discuss how to make this part of the culture of an organisation: as part of the leadership team, what can you do to encourage this across your teams? Other talks at this conference πŸš€πŸͺ https://www.conf42.com/im2022 β€” 0:00 Intro 1:40 Talk

Watch
Incident Management - Talk the Talk, Walk the Walk | Hila Fish | Conf42 Incident Management 2022

Remember when we were at school, and people said - ""Actively listening in class guarantees 50% prep for the upcoming test""? The same goes for being proactive at work in ways that will instantly prepare you to manage incidents better (at night or in general). In this talk, I will lay out the foundations of incident management, including key questions that if you'll be able to answer - You will be able to easily manage incidents, no matter the time and place. I will also show the best practices I've finalized over the years that helped me get a clear vision of how to manage production incidents in the quickest & efficient way possible. Embracing the tips I'll give you will guarantee you'll not only talk the talk but also walk the walk when it comes to incident management. Other talks at this conference πŸš€πŸͺ https://www.conf42.com/im2022 β€” 0:00 Intro 1:40 Talk

Watch
Plan for Unplanned Work: Game Days & Chaos Engr. | Mandi Walls | Conf42 Incident Management 2022

How do you plan for unplanned incidents? You practice with Chaos Engineering. Strong incident response doesn't just happen, you have to build the skills and train your team. Practicing for major incidents gives your team insight into how your applications will behave when something goes wrong as well as how the team will interact to solve problems. Combining your Incident Response practices with Chaos Engineering roots your response practice in real-world scenarios, helping your team build confidence. Other talks at this conference πŸš€πŸͺ https://www.conf42.com/im2022 β€” 0:00 Intro 1:40 Talk

Watch
Create a learning culture | Amir Shaked | Conf42 Incident Management 2022

Building and marinating a five 9s system isn’t just about the tools and technologies. Development culture has a big part in how you keep a system available while scaling it up and supporting more features, users, and locations. A healthy learning culture, supporting the development, not repairing mistakes, and identifying weak points is another tool in the engineering toolbox. In this talk, we will discuss how to create a learning culture using debriefs, what to avoid, and how to instill change in an engineering organization. Other talks at this conference πŸš€πŸͺ https://www.conf42.com/im2022 β€” 0:00 Intro 1:40 Talk

Watch
Relia...bility? | Ricardo Castro | Conf42 Incident Management 2022

Technology ecosystems are complex and it is really important to understand every change and how it affects our systems, as well as the service provided. Users expect systems to be up, responsive, fast, consistent, and reliable. Reliability for systems means that they are doing what their users need them to do. A system's reliability is essentially how happy users are and we know those happy users are good for business. If reliability is one of the most important requirements of any system, users determine what reliability means, and it’s okay to not be perfect all the time. We need a way of thinking that can address this way of thinking since we have limited resources to spend, be they financial, human, or political. Other talks at this conference πŸš€πŸͺ https://www.conf42.com/im2022 β€” 0:00 Intro 1:40 Talk

Watch
Build a low-cost CI/CD solution on top of AWS | Valera Bronshtein | Conf42 Incident Management 2022

Typical start-up build its initial infrastructure quick and dirty to get relevant and grow fast. Its awesome, but the fee comes later as not-bestpractised tools that consume a lot of time and money to manage them. In this session we will show how to switch from huge one-node Jenkins server to high-performance Jenkins fleet based on on-spot agents. Other talks at this conference πŸš€πŸͺ https://www.conf42.com/im2022 β€” 0:00 Intro 1:40 Talk

Watch
Policy as [versioned] Code | Chris Nesbitt-Smith | Conf42 Incident Management 2022

In this talk Chris will trace back the origins of how policies are often incepted, how it can get out of hand, be slow if not impossible to update and measure compliance, and often lead us to question of is the policy helping or hindering. From this talk you'll learn how to use a software development pattern and product ways of thinking towards how your organization can manage policy; achieve continual updates to policy allowing the risk mitigations to move as fast as the risk does, not get in the way and be easy to measure compliance. Key take aways: - Policy often causes more harm than good, is slow to update, exemptions are harder still to manage, measuring compliance at scale is near on impossible. - Throwing some curly braces at a problem is not the solution. Policy if it is articulated as code, needs to embrace all the best practices of code. - Purposeless policy is potentially practically pointless. (now say it 5 times quickly) Other talks at this conference πŸš€πŸͺ https://www.conf42.com/im2022 β€” 0:00 Intro 0:39 Talk

Watch