Compute - Coggle Diagram

- - - - Target Tracking
        
        Choose a metric and a target value; the ASG adds or removes instances to keep the metric close to that target
      - Scheduled Actions
        
        Scale on a schedule, based on known usage patterns (peak hours, etc.)
      - Simple / Step
        
        Add or remove instances in response to an external trigger (e.g., a CloudWatch alarm)
        
        Simple
        
        Define one policy that controls scaling
        
        Can respond to only one external trigger
        
        Has a cooldown period: after a scaling action, the ASG waits before taking the next one (300 seconds by default)
        
        Step
        
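        Defines multiple step adjustments, each applied according to how far the metric has breached the alarm threshold
        
        Scaling can be more fine-grained because the size of the adjustment follows the size of the breach
        
        Can configure a warm-up period; while an instance is warming up, it is not counted toward the ASG's aggregated EC2 metrics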
      - Predictive Scaling
        
        Uses historical data to forecast usage and scale ahead of demand
        
        Enough historical data must be accumulated before forecasts can be made
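
The Step policy described above can be illustrated with a small sketch. It only models how a step adjustment might be selected from the size of the alarm breach; the `StepAdjustment` class, the `pick_adjustment` helper, and the bounds in `STEPS` are hypothetical names and values chosen for illustration, not an AWS API.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class StepAdjustment:
    """One step of a step scaling policy (hypothetical model, not an AWS class)."""
    lower: Optional[float]  # inclusive offset from the alarm threshold; None = no lower bound
    upper: Optional[float]  # exclusive offset from the alarm threshold; None = no upper bound
    change: int             # number of instances to add (negative to remove)


def pick_adjustment(metric: float, threshold: float,
                    steps: Sequence[StepAdjustment]) -> int:
    """Return the capacity change for the step whose interval contains the breach."""
    breach = metric - threshold
    for step in steps:
        above_lower = step.lower is None or breach >= step.lower
        below_upper = step.upper is None or breach < step.upper
        if above_lower and below_upper:
            return step.change
    return 0  # metric has not breached the threshold: no scaling action


# Illustrative scale-out steps for an alarm threshold of 70% CPU (assumed values).
STEPS = [
    StepAdjustment(lower=0, upper=10, change=1),
    StepAdjustment(lower=10, upper=20, change=2),
    StepAdjustment(lower=20, upper=None, change=4),
]

small_breach = pick_adjustment(75, 70, STEPS)   # breach of 5  -> add 1 instance
large_breach = pick_adjustment(95, 70, STEPS)   # breach of 25 -> add 4 instances
```

A Simple policy corresponds to the degenerate case of a single step, which is why it cannot react differently to small and large breaches.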
    - - Lifecycle Hooks
        
        Optionally, hooks can be added to pause instances as they launch or terminate, allowing more thorough control of the instances in the ASG
      - Health Check
        
        ASG periodically checks EC2 instances.
        
        Manual configuration is needed (the endpoint to health-check)
        
        The check should cover only the status of the instance itself, not its dependencies
    - - Doubling
        
        Double the ASG's desired capacity so that old and new versions run side by side
        
        Once all new instances are running, reduce the capacity back to the original size so that the legacy instances are removed
      - ALB Redirection
        
        Create a new scaling group and add it to the existing ALB.
        
        Traffic shifts gradually from the old ASG to the new ASG
        
        Once all traffic flows into the new ASG, the old ASG is removed
      - Instance Refresh
        
        Instances are gradually terminated and replaced to avoid service interruption
        
        Warm-up Time
        
        The time a replacement instance needs before it is ready for service
        
        Min-Healthy Percentage
        
        The minimum percentage of instances that must remain running during the refresh
        
        If the number of running instances falls below this threshold, the refresh pauses until enough instances are running again
      - DNS Redirection
        
        Create a new ALB and an ASG
        
        A Route 53 weighted record routes a share of the traffic to the new ALB
        
        Once the new ALB with its ASG is ready, retire the old one
        
        The new stack can also be tested manually by accessing the new ALB directly
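
For the DNS Redirection approach, the share of traffic each ALB receives follows from its record weight divided by the sum of all weights in the record group. A minimal sketch of that arithmetic is below; the `traffic_shares` helper and the record names `old-alb` and `new-alb` are hypothetical and only serve the example.

```python
from typing import Dict


def traffic_shares(weights: Dict[str, int]) -> Dict[str, float]:
    """Return each record's expected fraction of traffic: weight / sum of weights."""
    total = sum(weights.values())
    if total == 0:
        # The all-zero case is deliberately left out of this sketch.
        raise ValueError("at least one record needs a non-zero weight")
    return {name: weight / total for name, weight in weights.items()}


# Start by sending a small share to the new ALB, then shift weight over time.
canary = traffic_shares({"old-alb": 90, "new-alb": 10})
cutover = traffic_shares({"old-alb": 0, "new-alb": 100})
```

Raising the new record's weight in small increments gives the gradual shift, and setting the old record's weight to 0 completes the cutover before the old ALB is retired.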
    - - Instance Tenancy
        
        Defines how EC2 instances are placed across physical hardware
        
        Types
        
        Dedicated (dedicated instance)
        
        Run on single-tenant hardware
        
        If an instance fails, its replacement may be placed on any of the available single-tenant hardware
        
        Host (dedicated host)
        
        A physical server dedicated entirely to your use
        
        If an instance fails, its replacement launches on the same hardware
        
        Shared (default)
        
        Instances from multiple AWS accounts may share the same physical hardware
        
        Tenancy attribute of VPC
        
        The VPC can set a default tenancy that the ASG respects
        
        If the launch configuration does not specify a tenancy but the VPC has one set, the VPC's tenancy is followed
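
The interaction between the launch configuration and the VPC tenancy attribute can be sketched as a small resolution function. This is a simplified model under stated assumptions: the `effective_tenancy` helper is hypothetical, and the rule that a launch setting of `default` also defers to a dedicated VPC goes beyond what the notes above state.

```python
from typing import Optional


def effective_tenancy(launch_tenancy: Optional[str], vpc_tenancy: str = "default") -> str:
    """Resolve the tenancy an instance would get (simplified model).

    launch_tenancy: None (not specified), "default", "dedicated", or "host".
    vpc_tenancy:    "default" or "dedicated".
    """
    # An explicit dedicated or host setting on the launch side is kept as-is.
    if launch_tenancy in ("dedicated", "host"):
        return launch_tenancy
    # Not specified (or "default", by assumption): the VPC attribute decides.
    return "dedicated" if vpc_tenancy == "dedicated" else "default"


from_vpc = effective_tenancy(None, vpc_tenancy="dedicated")   # VPC setting is followed
shared = effective_tenancy(None, vpc_tenancy="default")       # shared hardware
pinned = effective_tenancy("host", vpc_tenancy="default")     # explicit launch setting wins
```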
  - - - Dedicated Hosts
        
        Pay per physical host running the instances.
        
        Can bring your own per-socket, per-core, or per-VM software licenses
      - Reserved
        
        Commit to an instance type or family for a fixed term, with optional upfront payment
      - Savings Plans
        
        Lower EC2 costs in exchange for committing to a consistent amount of usage
        
        Available as 1-year or 3-year commitments
      - Spot
        
        Set a maximum price and run the instance only while the Spot price stays below it
        
        A 2-minute warning is given before the instance is interrupted
      - On-Demand
        
        Pay by usage
      - Dedicated Instances
        
        Pay per instance running on single-tenant hardware
      - Capacity Reservation
        
        Reserve EC2 capacity in a specific AZ
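
The 2-minute Spot warning is something a workload can react to. The sketch below assumes the notice is delivered as a small JSON document with `action` and `time` fields, which is the shape served by the instance metadata path `/latest/meta-data/spot/instance-action`; that detail is not in the notes above. The `seconds_until_interruption` helper and the sample notice are hypothetical, and the code only parses a string, it does not call the metadata service.

```python
import json
from datetime import datetime, timezone


def seconds_until_interruption(notice_json: str, now: datetime) -> float:
    """Return how many seconds remain before the Spot instance is interrupted."""
    notice = json.loads(notice_json)
    # The notice carries a UTC timestamp such as "2017-09-18T08:22:00Z".
    deadline = datetime.strptime(notice["time"], "%Y-%m-%dT%H:%M:%SZ")
    deadline = deadline.replace(tzinfo=timezone.utc)
    return (deadline - now).total_seconds()


# Sample notice (illustrative values) received two minutes before the deadline.
SAMPLE_NOTICE = '{"action": "terminate", "time": "2017-09-18T08:22:00Z"}'
received_at = datetime(2017, 9, 18, 8, 20, 0, tzinfo=timezone.utc)
remaining = seconds_until_interruption(SAMPLE_NOTICE, received_at)  # 120.0
```

A worker could use the remaining time to stop accepting new jobs and checkpoint its state before the instance goes away.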
    - - R (Memory)
        
        Focused on more memory allocation
      - C (Processing)
        
        Focused on more processing power
      - G (GPU)
        
        Special unit equipped with GPU
        
        Usually for video rendering and ML
      - T2/T3 (Burstable)
        
        CPU can burst above the baseline during periods of high demand
        
        Most affordable option
        
        Unlimited
        
        Bursting can continue above the baseline even after earned CPU credits run out, at additional cost
      - D (Storage)
        
        Best performance on sequential I/O access
      - A (Graviton)
        
        Custom processor designed by AWS
        
        Not available for Windows
        
        ARM based
        
        Graviton 2
        
        Up to 40% better price-performance compared to 5th gen x86
        
        Graviton 3
        
        Up to 3 times the performance of Graviton 2 for ML workloads
      - I (Throughput)
        
        Best performance on random access I/O
      - M (General Purpose)
        
        Balance between memory and processing
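
The burstable (T2/T3) behavior is governed by CPU credits, where one credit corresponds to one vCPU running at 100% for one minute. The arithmetic below is a rough sketch under that definition; the helper names are hypothetical, and the figures used (12 credits per hour, 2 vCPUs, a balance of 288 credits) are illustrative inputs rather than values taken from the notes above.

```python
def baseline_utilization_pct(credits_per_hour: float, vcpus: int) -> float:
    """Baseline CPU utilization per vCPU that the earned credits can sustain."""
    # An hour offers vcpus * 60 vCPU-minutes; credits cover this fraction of them.
    return credits_per_hour * 100 / (vcpus * 60)


def full_burst_minutes(credit_balance: float, vcpus: int) -> float:
    """Minutes all vCPUs can run at 100% using only the accrued credit balance."""
    return credit_balance / vcpus


# Illustrative instance: earns 12 credits per hour and has 2 vCPUs.
baseline = baseline_utilization_pct(credits_per_hour=12, vcpus=2)   # 10.0 (% per vCPU)
burst = full_burst_minutes(credit_balance=288, vcpus=2)             # 144.0 minutes
```

This sketch ignores credits earned while bursting. In unlimited mode the instance keeps bursting after the balance reaches zero and the surplus usage is billed.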
- - - - ECS Task
        
        Definition of a single container image, resource requirements, and network configuration
        
        CPU quota/count, RAM, storage options, etc.
      - ECS Service
        
        Defines how many tasks to run, the restart policy, load balancing, etc.
      - ECS IAM Roles
        
        Default roles that can be assumed by any EC2 instance of the ECS Cluster.
        
        Required to make API calls to ECS, send logs to CloudWatch, etc.
        
        Role: Instance Profile
        
        The actual role that any EC2 instance of the ECS Cluster can assume
        
        Can be assigned to the ECS Cluster when creating the cluster
        
        Role: Task IAM Role
        
        Role that is assumed by the ECS task itself rather than by the EC2 instance
        
        Can be assigned to the ECS Task when creating a new task
      - ECS Cluster
        
        Logical grouping of all EC2 instances deployed by the ECS
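
To make the ECS Task and Task IAM Role notes concrete, here is a minimal sketch that assembles a task definition as a plain dictionary and serializes it to JSON. The field names follow the ECS task definition format; the `build_task_definition` helper, the family name, the image, and the role ARN are hypothetical placeholders, and nothing here calls AWS.

```python
import json
from typing import Any, Dict


def build_task_definition(family: str, image: str, cpu: int, memory_mib: int,
                          task_role_arn: str) -> Dict[str, Any]:
    """Assemble a single-container task definition (illustrative subset of fields)."""
    return {
        "family": family,
        "networkMode": "awsvpc",              # network configuration
        "requiresCompatibilities": ["EC2"],
        "cpu": str(cpu),                      # task-level CPU units, given as a string
        "memory": str(memory_mib),            # task-level memory in MiB, given as a string
        "taskRoleArn": task_role_arn,         # role assumed by the task, not the instance
        "containerDefinitions": [
            {
                "name": family,
                "image": image,
                "essential": True,
                "portMappings": [{"containerPort": 80, "protocol": "tcp"}],
            }
        ],
    }


task_def = build_task_definition(
    family="example-web",                                            # placeholder name
    image="nginx:latest",
    cpu=256,
    memory_mib=512,
    task_role_arn="arn:aws:iam::123456789012:role/example-task-role",  # placeholder ARN
)
task_def_json = json.dumps(task_def, indent=2)
```

An ECS Service would then reference this task definition and add the number of tasks to run and the load balancer settings.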
    - - Classic Spot Instances
        
        Spot instances are provisioned manually, and ECS deploys containers to the instances as necessary
        
        Because Spot instances may not always be available, this option is not suitable for tasks that require high reliability
      - Fargate Spot Instances
        
        Serverless Spot capacity is provisioned automatically, and ECS deploys the containers to it
        
        The region and instance type are fixed, so pricing is more predictable while being more reliable