More talks in the program:
17:45 - 18:45
The complexity of the socio-technical systems we engineer, operate, and exist within is staggering. Despite this complexity, it remains a fact of life in software development and operations, one which can become easy to ignore, due to our daily interactions with and familiarity of that system (and, let’s face it, as a good strategy to cope with it). When those systems falter or fail, we often find in the postmortems and retrospectives afterward that there were "weak signals" that portended doom, but we didn’t know they were there or how to sense them.
In this talk, we’ll look at what decades of research in the safety sciences has to say about humans interacting with and operating complex socio-technical systems, including what air craft carriers have to do with Internet infrastructure operations, how resilience engineering can help us, and the use of heuristics in incident response. All of these provide insight into ways we can improve one the most advanced — and most effective — monitoring tools we have available to keep those systems running: ourselves.