Designing for Reliability in GCP
Improving Reliability
Cloud Monitoring
Metrics
Time Series
Dashboards
Alerting
Policies
Conditions
Notifications
Cloud Logging
Export to BigQuery if logs must be kept for more than 30 days
Open Source Observability tools
Prometheus
Grafana
Release Management
Continuous Delivery
Continuous Integration
Test before deploying
Reliability
Acceptance
Code versioning
Cloud Source Repositories
GitHub
Code build and test
Google Cloud Build
Jenkins
Deployment Strategies
Complete Deployment
Rolling Deployment
Canary Deployment
Blue/Green Deployment
System Reliability
Testing for reliability
Reliability stress tests
Load testing
Integration tests
Unit tests
System tests
How to respond to overload?
Shedding Load
Degrading Quality of Service
Upstream Throttling
Tripping (circuit breaker)
Cascading failures
Avoid thrashing!
Incident Management and Post-Mortem Analysis
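
Example: shedding load and tripping
The "Shedding Load" and "Tripping" items under "How to respond to overload?" can be illustrated with a minimal Python sketch. It reads "Tripping" as circuit breaker tripping, which is an assumption, and every name in it (LoadShedder, CircuitBreaker, max_in_flight, failure_threshold, reset_after_s) is hypothetical rather than part of any GCP API. It is an illustration of the two ideas, not a production implementation.

```python
import time


class LoadShedder:
    """Sheds load: rejects new work once too many requests are in flight."""

    def __init__(self, max_in_flight):
        self.max_in_flight = max_in_flight  # hypothetical capacity limit
        self.in_flight = 0

    def try_acquire(self):
        # Refuse the request instead of queueing it when at capacity.
        if self.in_flight >= self.max_in_flight:
            return False
        self.in_flight += 1
        return True

    def release(self):
        # Call when a request finishes, whether it succeeded or failed.
        self.in_flight = max(0, self.in_flight - 1)


class CircuitBreaker:
    """Trips open after repeated failures so callers stop hitting a sick backend."""

    def __init__(self, failure_threshold, reset_after_s, clock=time.monotonic):
        self.failure_threshold = failure_threshold
        self.reset_after_s = reset_after_s
        self.clock = clock  # injectable so the behavior can be tested
        self.failures = 0
        self.opened_at = None  # None means the breaker is closed

    def allow(self):
        if self.opened_at is None:
            return True
        # After the cool-down, calls are allowed again; a further
        # failure re-opens the breaker and restarts the cool-down.
        return self.clock() - self.opened_at >= self.reset_after_s

    def record_success(self):
        self.failures = 0
        self.opened_at = None

    def record_failure(self):
        self.failures += 1
        if self.failures >= self.failure_threshold:
            self.opened_at = self.clock()
```

A caller would check allow() and try_acquire() before forwarding a request and return a fast error (for example HTTP 503) when either refuses. Failing fast keeps an overloaded backend from being hit by more traffic, which is one way to limit cascading failures.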