Auto Scaling allows you to scale your capacity up or down automatically according to conditions you define. With Auto Scaling, you can ensure that the number of instances you’re using increases seamlessly during demand spikes to maintain performance, and decreases automatically during demand lulls. Auto Scaling is particularly well suited for applications that experience hourly, daily, or weekly variability in usage. Auto Scaling is enabled by CloudWatch and/or scaling can be done with manual intervention.
Auto scaling is a new service in Eucalyptus. It is tightly integrated with Cloud Watch and ELB. Alarms and policies can be created such that when cloud watch detects any of the set criteria scaling can occur either up or down. For example, scaling can happen at predetermined times, dynamically based on CPU or network usage or manually by executing a scaling policy.
The new component service is called AutoScaling. At this time the service is colocated with the CLC
At its core auto scaling relies on 2 fundamental objects: a launch configuration and an autoscaling group. the very basis for autoscaling is the launch configuration and is required for autoscaling.
A launch configuration is the blue print for what an autoscaling group will launch when it needs to scale. At a minimum a launch config defines the emi, the vm type and name of the launch config. Once a launch configuration is created it can be used as the base for an autoscaling group.
Among other things, an autoscaling group defines an availability zone, minimum number of instances, maximum number of instances, desired capacity, the launch configuration to use when launching new instances and name.
Autoscaling policies describe different modes of scaling and can be executed manually or triggered based on cloud watch metrics. Different types of scaling policies are ChangeInCapacity increases or decreases current capacity by a given number of instances, PercentChangeInCapacity changes capacity by a percentage of current capacity. For instance PercentChangeInCapacity of 0.5 when there were 4 instances in the group would terminate 2 instances such that the result would be 2 running instances in the group. ExactCapacity will change the capacity to an exact number of instances.
Scaling can be done by increasing or decreasing the desired capacity of the autoscaling group, by executing a policy manually or alarmed based trigger of policy execution.
At all times an autoscaling group is monitoring the health of its instances and if enabled instances are submitting metrics to CloudWatch. If at any time an instance in an autoscaling group becomes unhealthy it will be replaced. Also if there is any scaling action then instances will be launched or terminated based on the scaling type. Scaling can be configured to allow for cooldown times, amount of time, in seconds, after a scaling activity completes before any further trigger-related scaling activities may start. Another important feature of autoscaling is grace period. A grace period is the number of seconds to wait before starting health checks on newly-created instances.
Users can interact with AutoScaling in a number of ways. Euca2ools 3, EucaLobo, boto and AWS Java SDK to name a few. The new Euca2ools 3 delivers a whole suite of commandline tooling for autoscaling. The new autoscaling commands begin with the prefic "euscale-" pronounced "you scale"
- Create a launch configuration:
euscale-create-launch-config -i emi-24E73962 -t t1.micro --group securtiy-group-1 --key don-key My-LC
- Create an autoscaling group (no instances will be launched in this example yet):
euscale-create-auto-scaling-group -l My-LC --min-size 0 --max-size 5 -z PARTI00 My-ASG
- Set desired capacity of the ASG (this will launch instances)
euscale-set-desired-capacity -c 2 My-ASG
- You can see the scaling activity
[root@h-17 ~]# euscale-describe-scaling-activities ACTIVITY 94633c90-0784-431f-8178-7030f47af331 2013-06-11T00:10:27.193Z My-ASG Successful
- How will the administrators configure the feature? An admin will have to create a policy for users to allow access to autoscaling example:
{ "Statement":[{ "Effect":"Allow", "Action":"autoscaling:", "Resource":"" } ] }
- How will admins monitor/create/delete resources from other accounts? The cloud admin cal view all launch configs and autoscaling groups defined in a cloud across all accounts using:
euscale-describe-launch-configs verbose
- Checking "euscale-describe-scaling-activities" can give you an idea if scaling that has been triggered including the reason in the case of a scaling failure
- This is autoscaling. Instances belonging to an autoscaling group that are terminated will be replaced.
- You cannot delete an autoscaling group that has running instances. In order to delete an autoscaling group the recommended method is to set minimum, maximum and desired capacity of the ASG all to ZERO wait for all instances to terminate then delete the ASG.
- Alternatively you can force delete. In Euca2ools add the optrion "-force-delete", again this is not recommended procedure but most tooling has the force option available.
- You cannot change a launch config. The best you can do is to create a new launch config with your desired parameters then update your autoscaling group specifying your new launch config. Subsequent instances launched in the group will be of the new launch config type.
[[category.Training]]