Site reliability engineering (SRE) uses software engineering to automate IT operations tasks – e.g. production system management, change management, incident response, even emergency response – that would otherwise be performed manually by systems administrators. The concept behind SRE is that using software code to automate oversight of large software...