GCP Operations Suite (formerly Stackdriver)
Error reporting
Used in:
GKE
App Engine
Features
Error monitoring
Error Dashboard
Aggregates and displays errors from running GCP services
Tracing
Collect latency data
Tracing system
Near real-time
Latency reporting
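To make the idea of collecting latency data and reporting on it concrete, here is a minimal sketch in plain Python. It does not use the Cloud Trace API; the `span` helper, the in-memory `SPANS` store, and `latency_report` are hypothetical names chosen for illustration only.

```python
import time
from contextlib import contextmanager
from statistics import quantiles

# Hypothetical in-memory store of span durations in milliseconds, keyed by span name.
SPANS = {}


@contextmanager
def span(name):
    """Time the enclosed block and record its latency under `name`."""
    start = time.perf_counter()
    try:
        yield
    finally:
        elapsed_ms = (time.perf_counter() - start) * 1000.0
        SPANS.setdefault(name, []).append(elapsed_ms)


def latency_report(durations_ms):
    """Summarize a non-empty list of latencies as count, p50, p95, and max."""
    ordered = sorted(durations_ms)
    if len(ordered) < 2:
        # statistics.quantiles() needs at least two data points.
        p50 = p95 = ordered[0]
    else:
        cuts = quantiles(ordered, n=100, method="inclusive")
        p50, p95 = cuts[49], cuts[94]  # 50th and 95th percentile cut points
    return {"count": len(ordered), "p50": p50, "p95": p95, "max": ordered[-1]}
```

A request handler could wrap its work in `with span("checkout"):` and later call `latency_report(SPANS["checkout"])`. A real tracing system also propagates trace context across services, which this sketch leaves out.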
Cloud Monitoring (formerly Stackdriver Monitoring)
Metrics
Offers custom metrics
Uptime and health checks
Ensure availability of your public resources
Dashboard
Visualize utilization and network traffic
Alerts
Alerting policies can notify you of certain conditions - using emails, SMS, webhooks, etc.
Monitoring agent
Installed on Compute Engine VMs and AWS EC2 instances
Workspace
Root that holds monitoring information
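The alerting idea above (notify when a metric meets a condition) can be sketched as a threshold check that must hold for a minimum duration. This is an illustration in plain Python, not the Cloud Monitoring API; `Condition` and `should_alert` are hypothetical names, and the sample values are made up.

```python
from dataclasses import dataclass


@dataclass
class Condition:
    threshold: float  # the metric value that must be exceeded
    duration_s: int   # how long the violation must last, in seconds


def should_alert(points, condition):
    """Return True if the metric stays above the threshold long enough.

    `points` is a list of (timestamp_s, value) pairs sorted by time.
    """
    violation_start = None
    for ts, value in points:
        if value > condition.threshold:
            if violation_start is None:
                violation_start = ts  # first sample of a new violation
            if ts - violation_start >= condition.duration_s:
                return True
        else:
            violation_start = None  # metric recovered, so reset the window
    return False


# CPU utilization sampled once a minute (made-up values).
cpu = [(0, 0.50), (60, 0.90), (120, 0.95), (180, 0.92)]
high_cpu = Condition(threshold=0.80, duration_s=120)
```

Requiring the violation to persist for a duration, rather than firing on a single sample, is one common way to keep short spikes from generating notifications.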
Cloud Logging
Features
Analyze logs in BigQuery
30-day retention; logs can be exported to:
GCS
Pub/Sub
BigQuery
Data Studio
Log based metrics
Log search / view / alerts
Logging Agent
Built into all services except Compute Engine VMs, where it must be installed manually
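A log-based metric is essentially a count of log entries that match a filter. The sketch below shows that idea in plain Python over JSON log lines; it is not the Cloud Logging API. The `count_matching` function and the `service` field are assumptions made for illustration.

```python
import json
from collections import Counter


def count_matching(log_lines, severity="ERROR"):
    """Count JSON log entries at `severity`, grouped by a hypothetical 'service' field."""
    counts = Counter()
    for line in log_lines:
        try:
            entry = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip lines that are not valid JSON
        if isinstance(entry, dict) and entry.get("severity") == severity:
            counts[entry.get("service", "unknown")] += 1
    return counts


# Made-up sample log lines.
sample_logs = [
    '{"severity": "ERROR", "service": "checkout", "message": "payment failed"}',
    '{"severity": "INFO", "service": "checkout", "message": "order placed"}',
    '{"severity": "ERROR", "service": "search", "message": "timeout"}',
    '{"severity": "ERROR", "service": "checkout", "message": "card declined"}',
    'plain text line that is not JSON',
]
```

Calling `count_matching(sample_logs)` yields per-service error counts, which is the kind of value a log-based metric would feed into a dashboard or alert.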
Debugger
Debug snapshot
Capture a snapshot of application state at the line causing the error
Debug logpoints
Inject a log statement into running code without redeploying
Key benefits
Reduce overhead
Reduce noise
Streamlined use
Faster problem fixes
Foundation for site reliability engineering (SRE)