You vastly underestimate the complexity of systems in a company like Amazon.
COEs and Operation Readiness Reviews are already the documents that you mention, but they are largely useless in preventing incidents.