A decade ago, Joyent accidentally rebooted an entire datacenter, an experience that I described in my 2017 GOTO Chicago talk, "Debugging Under Fire":

https://www.youtube.com/watch?v=30jNsCVLpAE

On the next Oxide and Friends, @ahl we will be joined by folks who were at Joyent a decade ago, both to recall the fateful outage and to reflect on its ramifications, both at Joyent and beyond. Join us, Monday, 5p Pacific:

https://discord.gg/dqUCRwsx?event=1243638578484088842

@bcantrill Ah right, that was the talk I was thinking about when I read this last week: https://blog.danslimmon.com/2024/05/15/ask-questions-first-shoot-later/
Ask questions first, shoot later

The fact that fixing and diagnosing often converge to the same actions doesn’t change the fact that these two concurrent activities have different goals. The goal of fixing is to bring the sy…

Dan Slimmon