Please enable JavaScript.
Coggle requires JavaScript to display documents.
Site Reliability Engineering - Coggle Diagram
Site Reliability Engineering
Create
Create Virtual machines / PXE Baremetals
Setup Services
Manage Users on the machines
Create monitoring scripts
Write automation scripts
Create optimizations in Infrastructure
Monitor
Use monitoring scripts & tools
retain and plot historical data
predict failures
alert when threshold breached
React to alerts / troubleshoot / debug
Keep an eye on capacity requirement
Keep an eye on potential failures
Audit
Maintain
Scale up the infrastructure
Upgrade & update the components
Estimate the growth of the infrastructure
Use resources wisely
Communicate
Backup & High Availability
Destroy
Scale down the infrastructure
Delete data that is no longer needed
database
logs
expired certs
cache
Decommission services & machines