Please enable JavaScript.
Coggle requires JavaScript to display documents.
AWS Well-Architecture Operational Excellence - Coggle Diagram
AWS Well-Architecture Operational Excellence
Operational excellence
cloud env
refine operations procedures frequently
Anticipate failure
make frequent, small, reversible changes
Learn from all operational failures
Perform operations as code
organization
Prepare
design telemetry
Improve flow
Mitigate deployment risks
Understand operational readines
use multiple environments
dev env
infra as code
increase control env approach prod
turn off env are not in use
Operate
understanding workload health
understanding operations health
Event, incident and problem management
Event
Obs of interest
Incident
An event requires a response
Problem
an incident can be resolved or not
owner of events
alert users when compute impacted
alert when back normal
Evolve
learn from exp
make improvements
share learning
feedback loops
identify areas of improv
improv where need
operations activiies, customer exp
recognize improv
Key points
understand business priorities
evaluate operational readiness
understand workload and operational health
design for operations
learn from your exp
prepare for events