List of videos

No WAFs: Don’t use a Web Application Firewall, and when you should | Joshua Fox | Conf42 SRE 2024

Read the abstract ➤ https://www.conf42.com/Site_Reliability_Engineering_SRE_2024_Joshua_Fox_wafs_web_application Other sessions at this event ➤ https://www.conf42.com/sre2024 Support our mission ➤ https://www.conf42.com/support Join Discord ➤ https://discord.gg/DnyHgrC7jC Chapters 0:00 intro 0:26 preamble 0:33 about joshua fox 1:03 doit 1:25 article 1:36 scenario 2:02 what is a waf? 2:16 drivers for getting a waf 2:22 hacker attack 2:35 penetration test 3:09 urgency 3:18 expertise 3:32 outside requirement/audit 4:00 security blanket 4:18 web threats 4:50 walktrhrough: cross site scripting 5:09 wihout waf 5:44 demo waf architecture 5:59 make it safe! 6:28 a simple chat message is executed 6:34 with waf 7:05 sql injection 8:02 ddos 8:10 why distributed? 8:33 application-level threats 8:39 broken access control 9:15 toss in a waf 9:20 how cloud armor works 9:25 architecture 9:57 policies and rules 10:16 rules 10:41 types of rules 11:28 preconfigured rules (use these!) 11:48 sensitivity (paranoia) 12:08 standard signatures 12:41 sample signature 13:03 rule language 13:28 waf won't protect you! 13:37 blocking your own app 14:24 false positives 15:06 job zero 15:33 secure your app 16:33 but the most important 16:43 ddos 17:27 ip address 17:45 geo 17:52 dry run 17:56 preview 18:26 problem with preview 18:47 false negatives 18:54 imperfection detection 19:17 the worst: broken access control 19:40 attackers shift 19:57 attackers are smart 20:08 flexibility? 21:22 waf adds risk, man-in-the-middle 21:37 risk: complacency 22:01 risk to performance 22:12 pricing 23:03 at long last... 23:07 eternal requirement 23:18 third-party apps 23:33 central supervision 24:36 the one go-to feature 24:43 consider advanced services 25:11 if you're going to do it, do it now 25:19 prefer your cloud's waf 25:46 minuses of waf 26:09 plusses of a waf 26:31 conclusion 26:45 we're hiring!

Watch
Zero-instrumentation observability based on eBPF | Nikolay Sivko | Conf42 SRE 2024

Read the abstract ➤ https://www.conf42.com/Site_Reliability_Engineering_SRE_2024_Nikolay_Sivko_zeroinstrumentation_observability_ebpf Other sessions at this event ➤ https://www.conf42.com/sre2024 Support our mission ➤ https://www.conf42.com/support Join Discord ➤ https://discord.gg/DnyHgrC7jC Chapters 0:00 intro 0:26 preamble 0:33 observability is ... 1:06 systems a while ago 1:42 modern systems 2:31 making a system observable 4:32 collecting telemetry data 10:50 a quick intro into ebpf 11:42 how to use ebpf 15:24 caroot-node-agent (apache 2.0 license) 16:32 how the agent leverages ebpf 21:32 ssl 22:59 ebpf: performance impact 25:13 ebpf-based metrics 28:18 ebpf-based traces (spans) 28:47 ebpf-based tracing limitations 31:05 ebpf-based continuous cpu profiling 32:10 ebpf-based cpu profiling 33:10 how coroot works 34:10 conclusion 34:54 thank you, let's connect!

Watch
2024: I Don't Know K8S and at This Point, I'm Too Afraid To Ask | Alex Dejanu | Conf42 SRE 2024

Read the abstract ➤ https://www.conf42.com/Site_Reliability_Engineering_SRE_2024_Alex_Dejanu_k8s_strategies_ask Other sessions at this event ➤ https://www.conf42.com/sre2024 Support our mission ➤ https://www.conf42.com/support Join Discord ➤ https://discord.gg/DnyHgrC7jC Chapters 0:00 intro 0:26 preamble 0:55 agenda 1:14 whoami 1:34 history 3:37 are you looking for a job? 4:18 kube_flex 5:59 architecture 6:48 architecture 7:28 controller 7:50 operators 8:37 operator under 5min - demo 19:50 more memes 20:58 conclusions 21:55 thank you

Watch
Building reliable product through SRE community | Jorge Luis Castro Toribio | Conf42 SRE 2024

Read the abstract ➤ https://www.conf42.com/Site_Reliability_Engineering_SRE_2024_Jorge_Luis_Castro_Toribio_building_reliable_community Other sessions at this event ➤ https://www.conf42.com/sre2024 Support our mission ➤ https://www.conf42.com/support Join Discord ➤ https://discord.gg/DnyHgrC7jC Chapters 0:00 intro 0:26 preamble 1:21 jorge luis castro toribio 2:15 in this talk 3:41 let's take care of the basics 3:55 what is sre? 10:03 #real_life 11:59 some of our problems 14:20 community of practices (cop) 15:21 our first thoughts 17:05 sre community of practices (srecop) 19:42 sre cop team 22:04 cop is an investment 22:24 sre cop aligned to business strategy 29:22 some metrics recommendation 29:55 how do we make cop last longer and more engager? 30:15 meg: minimum enjoyable game 30:44 octalysis 31:52 what we achieved? 32:42 learned lessons 33:42 to sum up 34:07 books 34:26 thank you!

Watch
Resilient Systems & Teams: A CTO’s Blueprint for SRE Excellence | Evgenii Korneev | Conf42 SRE 2024

Read the abstract ➤ https://www.conf42.com/Site_Reliability_Engineering_SRE_2024_Evgenii_Korneev_resilient_teams_blueprint Other sessions at this event ➤ https://www.conf42.com/sre2024 Support our mission ➤ https://www.conf42.com/support Join Discord ➤ https://discord.gg/DnyHgrC7jC Chapters 0:00 intro 0:26 preamble 0:42 topics 1:34 do startups need sre? 4:25 on-call duties 7:17 when do you need a dedicated sre engineer or sre team? 9:47 communications 11:24 strong leads are key to success 12:26 system resilience is a comprehensive set of measures 13:40 thank you!

Watch
How to Avoid Being an Agile Victim | David Argent | Conf42 SRE 2024

Read the abstract ➤ https://www.conf42.com/Site_Reliability_Engineering_SRE_2024_David_Argent_avoid_agile_victim Other sessions at this event ➤ https://www.conf42.com/sre2024 Support our mission ➤ https://www.conf42.com/support Join Discord ➤ https://discord.gg/DnyHgrC7jC Chapters 0:00 intro 0:26 preamble 0:44 agenda 1:02 introduction 2:13 a (very) brief review of agile 4:25 how to fail 4:40 failure 101 6:03 failure 201 7:49 failure 301 10:54 think before you code 11:19 design 101 15:04 design 201 17:42 design 301 20:08 balancing tactics and strategy 20:37 tactics 101 23:53 strategy 101 25:34 strategy 201 27:13 code is not your only deliverable 27:38 deliverables 101 28:23 deliverables 201 30:23 summary 31:29 thank you

Watch
Clinical troubleshooting: diagnose production issues | Dan Slimmon | Conf42 SRE 2024

Read the abstract ➤ [https://www.conf42.com/Site_Reliability_Engineering_SRE_2024_Dan_Slimmon_clinical_troubleshooting_diagnose Other sessions at this event ➤ https://www.conf42.com/sre2024 Support our mission ➤ https://www.conf42.com/support Join Discord ➤ https://discord.gg/DnyHgrC7jC Chapters 0:00 intro 0:26 preamble 0:59 who's this guy? 1:39 clinical troubleshooting 26:42 reach out to dan

Watch
Debugging cluster issues as an on-call SRE | Pravar Agrawal | Conf42 SRE 2024

Read the abstract ➤ https://www.conf42.com/Site_Reliability_Engineering_SRE_2024_Pravar_Agrawal_debugging_cluster_oncall Other sessions at this event ➤ https://www.conf42.com/sre2024 Support our mission ➤ https://www.conf42.com/support Join Discord ➤ https://discord.gg/DnyHgrC7jC Chapters 0:00 intro 0:26 preamble 0:40 agenda 1:21 whoami 1:39 introduction to sre 2:55 understanding on-call process 4:23 some common cluster issues 6:32 approach to debugging 8:10 automation to the rescue? 9:16 shades of automation 12:06 advice for beginners 13:09 thank you!

Watch
Mastering the Maze: Navigating Complexity in Distributed Systems | Aleksei Popov | Conf42 SRE 2024

Read the abstract ➤ https://www.conf42.com/Site_Reliability_Engineering_SRE_2024_Aleksei_Popov_maze_complexity_distributed_systems Other sessions at this event ➤ https://www.conf42.com/sre2024 Support our mission ➤ https://www.conf42.com/support Join Discord ➤ https://discord.gg/DnyHgrC7jC Chapters 0:00 intro 0:26 preamble 0:33 agenda 0:49 what is a distributed system? 1:41 what is complexity? 2:24 monolithic architecture 2:56 disadvantages 3:54 microservices architecture 4:42 what do distributed systems give us? 6:58 challenges, quality attributes 9:05 what are main troubles? 11:28 unreliable networks 12:44 strategy: timeout 13:13 strategy: retry 14:01 strategy: idempotency 14:51 strategy: circuit breaker 15:42 concurrency and lost writes 16:22 strategy: snapshot isolation 18:49 strategy: compare and set 20:03 strategy: lease 20:51 dual write problem 22:25 strategy: transactional outbox 23:08 strategy: log tailing 23:29 unreliable clocks 25:16 availability and consistency 25:41 high availability 25:55 failure 26:24 consistency types 26:44 linearizability 27:23 strategy: distributed consensus algorithm, e.g. raft 29:16 more complexities 30:03 eventual consistency 30:56 strategy: read from leader 31:19 process pauses 32:00 strategy: fencing 34:01 observability 34:24 strategy: distributed tracing 35:40 strategy: orchestration over choreography 36:36 evolvability and cybernetics principles 37:32 systems thinking 38:30 feedback loops 39:13 adaptability and learning 39:46 goal-oriented design 40:40 big ball of mud 41:30 hierarchy 42:04 fallacy: all microservices are the same 42:22 strategy: service types 45:30 sre principles 48:24 infrastructure as code 49:11 chaos engineering and testing: jepsen tests 50:39 simplicity and measuring complexity 51:58 thank you for attending!

Watch