HA, DR and Scalability
Launch Templates
- Specifies all of the needed settings that go into building out an EC2 instance.
- It is a collection of settings that you can configure so you do not have to walk through the EC2 wizard over and over
- It includes: AMI, EC2 Instance Type, SG, user data, potentially network
- If we include network information, we cannot use the Launch Template with an Auto Scaling group
Create a template:
- from scratch
- new version of an existing template
- copy the parameters from a launch configuration, running instance, or other template
- you can configure the same parameters you set at EC2 launch
Auto Scaling Groups
Key Elements
ELB Configuration: EC2 instances can be registered with an ELB (existing or created with the Auto Scaling group). The Auto Scaling group can be set to use the ELB health check to terminate/replace unhealthy instances; you need to request this explicitly
Set Scaling Policies: Minimum, Maximum and Desired capacity need to be set to ensure you do not have too few/too many instances
Auto Scaling Policies
Type of Scaling
Dynamic Scaling: measures load and determines if more capacity is needed (reactive). The following policies are supported:
- Target tracking scaling
- Step Scaling
- Simple Scaling
Step Scaling
Warm-Up (stops instances from being placed behind the ELB, failing the health check, and getting terminated). During warm-up, scale-in is blocked. A warming-up instance is not counted toward the aggregated metrics of the ASG
Increase or decrease the current capacity of the group (by an absolute number, by a percentage, or by setting an exact size) based on a set of scaling adjustments, known as step adjustments, that vary based on the size of the alarm breach.
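The step-adjustment lookup can be sketched in Python; the breach bands and adjustment sizes below are illustrative examples, not AWS defaults:

```python
# Illustrative step adjustments for a scale-out policy on a CPU alarm
# with threshold 70%: each tuple is (lower_bound, upper_bound, adjustment),
# where the bounds are relative to the alarm threshold.
STEP_ADJUSTMENTS = [
    (0, 10, 1),    # 70% <= CPU < 80%  -> add 1 instance
    (10, 20, 2),   # 80% <= CPU < 90%  -> add 2 instances
    (20, None, 3), # CPU >= 90%        -> add 3 instances
]

def step_scale(current_capacity, metric, threshold,
               steps=STEP_ADJUSTMENTS, max_size=10):
    """Pick the step adjustment whose bounds contain the alarm breach."""
    breach = metric - threshold
    if breach < 0:
        return current_capacity  # alarm not breached, no scaling
    for lower, upper, adjustment in steps:
        if breach >= lower and (upper is None or breach < upper):
            return min(current_capacity + adjustment, max_size)
    return current_capacity
```

For example, with a 70% threshold a CPU reading of 85% is a breach of 15, falling in the second band, so two instances are added.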
Simple Scaling
Cooldown pauses Auto Scaling for a set amount of time. Helps to avoid runaway scaling events. Default is 5 minutes
Increase and decrease the current capacity of the group based on a single scaling adjustment, with a cooldown period between each scaling activity.
Target Tracking Scaling
Increase and decrease the current capacity of the group based on an Amazon CloudWatch metric and a target value. It works similar to the way that your thermostat maintains the temperature of your home—you select a temperature and the thermostat does the rest.
Metrics that decrease when capacity increases (and vice versa) can be used to proportionally scale the number of instances out or in using target tracking (e.g. a custom metric .. per instance)
Includes Warm-up parameters. You can set default instance warmup to avoid setting it in the scaling policy (target tracking, step and instance refresh)
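The thermostat intuition can be sketched as a proportional rule in Python (a simplification of what the service does; the function name and bounds are illustrative):

```python
import math

def target_tracking_desired(current_capacity, metric_value, target_value,
                            min_size=1, max_size=10):
    """Proportional rule: if the metric is above target, capacity grows in
    roughly the same ratio; if below target, it shrinks. The result is
    clamped to the group's min/max size."""
    desired = math.ceil(current_capacity * metric_value / target_value)
    return max(min_size, min(desired, max_size))
```

With a target of 50 and a current metric of 80 across 4 instances, the rule asks for ceil(4 * 80/50) = 7 instances.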
Scheduled Scaling: if you have a predictable workload, create a scaling event to get your resources ready before they are actually needed.
- Min/Desired/Max
- Recurrence (once/cron/every...)
- Start Date Time
- (optional) End Date Time
- Predictive Scaling
- AI/ML to determine when you need to scale
- This is re-evaluated every 24 hours to create a forecast for the next 48 hours
- Needs at least 2 weeks of data
Manual Scaling: At any time, you can change the size of an existing Auto Scaling group manually. You can either update the desired capacity of the Auto Scaling group, or update the instances that are attached to it. Automatic scaling is not always needed
Steady state Auto Scaling Group: this is when Max, Min and Desired are all set to 1; use it when you cannot have multiple copies of an EC2 instance but still need HA
- Bear in mind cost considerations when dealing with Auto Scaling groups
- You will be given scenarios where you'll need to know the cost implications and reasons why you might not want to change Auto Scaling group parameters
Best Practices
Keep an eye on provisioning time, configure AMIs to minimize it
Lifecycle Hooks
Let you create solutions that are aware of events in the Auto Scaling instance lifecycle, and then perform a custom action on instances when the corresponding lifecycle event occurs
Use Cases
- in a scale-in event, pause the instance termination for a certain amount of time to allow the EC2 instance to upload all data/logs before it is completely terminated
- control when instances are registered with Elastic Load Balancing. By adding a launch lifecycle hook to your Auto Scaling group, you can ensure that your bootstrap scripts have completed successfully and the applications on the instances are ready to accept traffic before they are registered to the load balancer at the end of the lifecycle hook
- Complete the lifecycle action with result = CONTINUE to finish before the timeout expires. If you don't complete the lifecycle action, the hook goes to the status specified for Default result after the timeout period ends
- use aws autoscaling complete-lifecycle-action --lifecycle-action-result CONTINUE to either manually or automatically complete the lifecycle action
- By default, when you add a lifecycle hook in the console, Amazon EC2 Auto Scaling sends lifecycle event notifications to Amazon EventBridge
- Using EventBridge or a user data script is a recommended best practice
- To create a lifecycle hook that sends notifications directly to Amazon SNS or Amazon SQS, use the AWS CLI, AWS CloudFormation, or an SDK to add the lifecycle hook
Amazon EC2 Auto Scaling honors cooldown periods when using simple scaling policies, but not when using other scaling policies or scheduled scaling. A default cooldown period automatically applies to any scaling activities for simple scaling policies, and you can optionally request to have it apply to your manual scaling activities
- Auto Scaling Group only for EC2
- A collection of instances that are treated as a collective group for purposes of scaling and management
- Auto Scaling is vital to creating a highly available app; opt for an architecture that spreads resources across multiple AZs and uses ELBs
- Availability Zone distribution
- Balanced best effort: If launches fail in an AZ, the ASG will attempt to launch in another healthy AZ
- Balanced only: If launches fail in an AZ, the ASG will continue to attempt to launch in the unhealthy AZ to preserve balanced distribution
- You can manually attach a running instance to an Auto Scaling group
- You can manually detach an InService instance from an Auto Scaling group
Default Termination Policy:
- Determine which AZs have the most instances, and then identify at least one instance that is not scale-in protected
- Determine whether any of the instances eligible for termination use the oldest launch template or launch configuration (terminates instances that use a launch configuration before those with a launch template)
- After applying the preceding criteria, if there are multiple unprotected instances to terminate, determine which instances are closest to the next billing hour. If there are multiple unprotected instances closest to the next billing hour, terminate one of these instances at random
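The selection order above can be sketched in Python. This is a simplified model: the `template_age` field stands in for launch configuration/template age, and the real policy has more fallbacks than shown here:

```python
import random
from collections import Counter

def choose_instance_to_terminate(instances):
    """instances: list of dicts with keys 'az', 'protected',
    'template_age' (higher = older) and 'secs_to_billing_hour'.
    Illustrative sketch of the default termination policy order."""
    # 1. candidates come from the AZ(s) with the most instances,
    #    excluding scale-in protected instances
    counts = Counter(i["az"] for i in instances)
    busiest = max(counts.values())
    pool = [i for i in instances
            if counts[i["az"]] == busiest and not i["protected"]]
    if not pool:
        return None
    # 2. prefer instances using the oldest launch configuration/template
    oldest = max(i["template_age"] for i in pool)
    pool = [i for i in pool if i["template_age"] == oldest]
    # 3. prefer instances closest to the next billing hour
    closest = min(i["secs_to_billing_hour"] for i in pool)
    pool = [i for i in pool if i["secs_to_billing_hour"] == closest]
    # 4. tie-break at random
    return random.choice(pool)
```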
If the Availability Zones have an equal number of instances, Amazon EC2 Auto Scaling checks for the oldest launch configuration
Warm Pool
Instances in the warm pool are kept in one of three states: Stopped, Running, or Hibernated
Scale-in Protection
- prevents instances from being terminated during scale-in events
- if you enable instance (scale-in) protection on an existing Auto Scaling group, all new instances launched will have instance scale-in protection
- you can disable the scale-in protection setting on individual instances
- ASG Instance Protection is a specific instruction to the Auto Scaling service, not a lock on the instance itself
- does not guarantee that instances won't be terminated in the event of a human error (e.g. someone terminates the instance with the terminate-instance-in-auto-scaling-group command)
- EC2 termination protection doesn't prevent termination due to scale-in event, it only protects your instance from accidental termination
- even with termination protection and instance scale-in protection enabled, data saved to instance storage can be lost if a health check fails
- if you only enable termination protection this does not prevent Amazon EC2 Auto Scaling from terminating instances
Health Checks
Health check type
- Amazon EC2 status checks and scheduled events
- Default in ASG
- Checks that the instance is running
- EC2 instance status checks and system status checks
- Checks for underlying hardware or software issues that might impair the instance
- Turn on EBS health checks (EBS monitors whether the EC2 root volume / an attached volume stalls; if an alarm fires, the ASG replaces the instance)
- If an instance is affected by a scheduled event, the ASG considers the instance to be unhealthy and replaces it according to the timestamp of the event
- ELB health checks
- Checks whether the load balancer reports the instance as healthy, confirming whether the instance is available to handle requests
- To run this health check type, you must enable it for your ASG
- If connection draining (deregistration delay) is enabled, ASG waits for either in-flight requests to complete or the max timeout to expire before it terminates unhealthy instances
- VPC Lattice health checks
- Checks whether VPC Lattice reports the instance as healthy, confirming whether the instance is available to handle requests
- To run this health check type, you must enable it for your ASG
- Custom health checks
- Checks for any other problems that might indicate instance health issues, according to your custom health checks
- The health status of an ASG instance indicates whether it's healthy or unhealthy
- All instances in your ASG start with a Healthy status
- Instances are assumed to be healthy unless ASG receives notification that they are unhealthy
- This notification can come from sources such as Amazon EC2, Elastic Load Balancing, VPC Lattice, or custom health checks
- When ASG detects an unhealthy instance, it terminates it and launches a new one
Health check grace period (ELB Target Group) - This time period delays the first health check until your instances finish initializing. It doesn't prevent an instance from terminating when placed into a non-running state
Rebalance
AZ Rebalance
- EC2 Auto Scaling automatically rebalances the Auto Scaling group. It does this by launching instances in the enabled AZ with the fewest instances and terminating instances elsewhere
- The following actions can lead to rebalancing activity:
- You change the AZ associated with your Auto Scaling group
- You explicitly terminate or detach instances or place instances in standby, and then the group becomes unbalanced
- An AZ that previously had insufficient capacity recovers and now has additional capacity
- An AZ that previously had a Spot price above your maximum price now has a Spot price below your maximum price
- When rebalancing, new instances are launched before others are terminated (so performance and availability are not compromised)
- Being at or near the specified maximum capacity could impede or completely halt rebalancing activities
- To avoid this problem, the system can temporarily exceed the specified maximum capacity of a group during a rebalancing activity (by the greater of 10% or one instance)
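The temporary headroom rule can be sketched as follows (rounding the 10% up to a whole instance is an assumption here; AWS does not document the rounding):

```python
import math

def rebalance_capacity_limit(max_size):
    """Temporary capacity ceiling during rebalancing: the group may exceed
    its configured maximum by the greater of 10% or one instance."""
    headroom = max(math.ceil(max_size * 0.10), 1)
    return max_size + headroom
```

So a group with a maximum of 4 may briefly reach 5 instances, and a group with a maximum of 20 may briefly reach 22.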
Suspend AZ Rebalance
- When a scale-out or scale-in event occurs, the scaling process still tries to balance the Availability Zones. For example, during scale-out, it launches the instance in the Availability Zone with the fewest instances
- If you suspend the Launch process, AZ Rebalance neither launches new instances nor terminates existing instances. This is because AZRebalance terminates instances only after launching the replacement instances
- If you suspend the Terminate process, your Auto Scaling group can grow up to 10% larger than its maximum size because this is allowed temporarily during rebalancing activities. If the scaling process cannot terminate instances, your Auto Scaling group could remain above its maximum size until you resume the Terminate process
Capacity Rebalance
- When using Spot Instances you can turn on Capacity Rebalancing
- Attempts to launch a new Spot Instance whenever EC2 reports that a running Spot Instance is at an elevated risk of interruption
- After launching the new instance, it then terminates the earlier instance
Suspend and resume a process for an Auto Scaling group
Types of processes
- Launch—Adds instances to ASG when the group scales out, or when EC2 Auto Scaling chooses to launch instances for other reasons, such as when it adds instances to a warm pool
- Terminate—Removes instances from the ASG when the group scales in, or when EC2 Auto Scaling chooses to terminate instances for other reasons, such as when an instance is terminated for exceeding its maximum lifetime duration or failing a health check
- AddToLoadBalancer—Adds instances to the attached load balancer target group or Classic Load Balancer when they are launched
- AlarmNotification—Accepts notifications from CloudWatch alarms that are associated with dynamic scaling policies
- AZRebalance—Balances the number of EC2 instances in the group evenly across all of the specified Availability Zones when the group becomes unbalanced, for example, when a previously unavailable Availability Zone returns to a healthy state
- HealthCheck—Checks the health of the instances and marks an instance as unhealthy if Amazon EC2 or Elastic Load Balancing tells Amazon EC2 Auto Scaling that the instance is unhealthy. This process can override the health status of an instance that you set manually
- InstanceRefresh—Terminates and replaces instances using the instance refresh feature
- ReplaceUnhealthy—Terminates instances that are marked as unhealthy and then creates new instances to replace them
- ScheduledActions—Performs the scheduled scaling actions that you create or that are created for you when you create an AWS Auto Scaling scaling plan and turn on predictive scaling
Considerations
- You can suspend and resume individual processes or all processes
- Suspending a process affects all instances in your Auto Scaling group
- Suspending AlarmNotification allows you to temporarily stop the group's target tracking, step, and simple scaling policies without deleting the scaling policies or their associated CloudWatch alarms
- If you suspend the Launch and Terminate processes, or AZRebalance, and then you make changes to your Auto Scaling group (e.g. detaching instances or changing the AZs) your group can become unbalanced between Availability Zones. If that happens, after you resume the suspended processes, Amazon EC2 Auto Scaling gradually redistributes instances evenly between AZs
- Suspending the Terminate process doesn't prevent the successful termination of instances using the force delete option with the delete-auto-scaling-group command
Scaling DynamoDB Options
Table Capacity Modes
Provisioned
Requires effort to review past usage and set up upper/lower scaling limits; better to enable auto scaling mode
- Minimum capacity units
- Maximum capacity units
- Target Utilization %
- Initial provisioned units
You must specify table provisioned throughput capacity: amount of read and write activity that the table can support. DynamoDB uses this information to reserve system resources to meet your throughput needs
Optionally enable auto scaling to manage your table's throughput capacity. You still must provide initial settings for read and write capacity when you create the table. Auto scaling uses initial settings as a starting point, and then adjusts them dynamically
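The target-utilization adjustment can be sketched as follows (illustrative; the real service also applies cooldowns and scaling rate limits not shown here):

```python
import math

def scaled_capacity(consumed_units, target_utilization, min_units, max_units):
    """Provision enough capacity so that consumption sits at roughly the
    target utilization (e.g. 0.70 for 70%), clamped to the min/max
    capacity units configured for auto scaling."""
    desired = math.ceil(consumed_units / target_utilization)
    return max(min_units, min(desired, max_units))
```

For example, at a 50% target utilization, consuming 50 units drives the provisioned capacity toward 100 units.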
- DynamoDB tables require a partition key. Avoid hot keys, i.e. keys with similar values, which lead to hot partitions
- Always using the same value for the partition key will impact performance
4 Questions in Mind
Which is appropriate in this scenario: horizontal or vertical? Generally favour horizontal, which also brings HA by spreading across AZs, but in some scenarios vertical still applies
Is the scaling cost effective? Keep this in mind even if the question does not refer to cost-effectiveness. Always balance cost in the architecture
Is it highly available? Keep this in mind even if the question does not specifically mention HA; disregard HA only if the question rules it out explicitly
Would switching DB fix the problem? In real life switching DB is painful, but in the exam you can easily go with this option
Disaster Recovery
Your workload must perform its intended function correctly and consistently. To achieve this, you must architect for resiliency, that is, the ability of a workload to:
- recover from infrastructure, service, or application disruptions
- dynamically acquire computing resources to meet demand, and mitigate disruptions, such as misconfigurations or transient network issues
Disaster Recovery (DR) is an important part of your resiliency strategy and concerns how your workload responds when a disaster strikes. This response must be based on your organization's business objectives which specify your workload's strategy for avoiding loss of data, Recovery Point Objective (RPO), and reducing downtime, Recovery Time Objective (RTO).
DR can be compared to availability, which is another important component of your resiliency strategy. Whereas disaster recovery measures objectives for one-time events, availability objectives measure mean values over a period of time:
- Resiliency
- DR
- RTO - How quickly must you recover? What is the cost of downtime?
- RPO - How much data can you afford to recreate or lose?
- Availability
Availability = MTBF / (MTBF + MTTR) or Successful Responses / Valid Requests
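Both formulas as a quick sketch:

```python
def availability_mtbf(mtbf_hours, mttr_hours):
    """Availability from mean time between failures and mean time to repair."""
    return mtbf_hours / (mtbf_hours + mttr_hours)

def availability_requests(successful, valid):
    """Availability as the fraction of valid requests served successfully."""
    return successful / valid
```

For example, an MTBF of 999 hours with an MTTR of 1 hour gives 0.999, i.e. "three nines" of availability.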
High availability is not disaster recovery. DR has different objectives from Availability, measuring time to recovery after the larger scale events that qualify as disasters. You should first ensure your workload meets your availability objectives
Single AWS Region
- for a disaster event based on disruption or loss of one physical data center, implementing a highly available workload in multiple AZs within a single AWS Region helps mitigate against natural and technical disasters
- for extra assurance with your single-Region deployment, you can back up data and configuration (including infrastructure definition) to another Region
- this strategy reduces the scope of your disaster recovery plan to only include data backup and restoration
- in case of regulatory data residency requirements, then in addition to designing multi-AZ workloads for high availability as discussed above, you can also use the AZs within that Region as discrete locations
Multiple AWS Regions
Pilot Light (RPO/RTO: 10s of minutes)
- Data live (e.g. database, S3, EFS, ...) & services/apps switched off
- Provision core AWS resources and scale after event
- €€
Warm Standby (RPO/RTO: minutes)
- Business Critical
- Always running, but smaller
- Promote DB to primary and scale resources/instances after event
- €€€
Backup and Restore (RPO/RTO: hours)
- Low priority use cases
- Provision all AWS resources after event
- Restore backups after event
- €
Multi-site Active/Active (RPO/RTO: real-time/seconds)
- Mission Critical Services
- Zero downtime
- Near zero data loss (data disaster e.g. corruption, deletion, or obfuscation will always have RTO>0 and RPO some point before the disaster)
- Cost €€€€
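The trade-off between the four strategies can be sketched as a lookup; the RPO/RTO tiers below are rough illustrations of the ranges above, not official figures:

```python
# Rough RPO/RTO tiers for the strategies above (illustrative, in minutes),
# ordered from most to least expensive.
STRATEGIES = [
    ("Multi-site Active/Active", 1, "€€€€"),
    ("Warm Standby", 15, "€€€"),
    ("Pilot Light", 60, "€€"),
    ("Backup and Restore", 24 * 60, "€"),
]

def cheapest_strategy(required_rto_minutes):
    """Pick the least expensive strategy whose typical RTO still meets the
    requirement (tiers are rough, for illustration only)."""
    for name, max_rto_minutes, cost in reversed(STRATEGIES):
        if max_rto_minutes <= required_rto_minutes:
            return name
    return "Multi-site Active/Active"
```

A 30-minute RTO requirement, for instance, rules out Backup and Restore and Pilot Light, leaving Warm Standby as the cheapest fit.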