Bobcares

All about AWS EMR managed scaling

by | Oct 1, 2021

Stuck with AWS EMR managed scaling? Read on to find out what our expert Support Engineers have to say.

Learn how to use AWS CLI to configure managed scaling from the skilled Support Team at Bobcares.

Dealing with AWS EMR managed scaling

Let’s take a look at look at how to configure managed scaling for EMR with CLI commands. You can opt to use a shorthand syntax with the JSON configuration inline included in the relevant commands or reference the file with the configuration JSON.

Enabling the managed scaling during cluster launch

This example demonstrates how to enable managed scaling during cluster launch,

aws emr create-cluster \
 --service-role EMR_DefaultRole \
 --release-label emr-5.33.0 \
 --name EMR_Managed_Scaling_Enabled_Cluster \
 --applications Name=Spark Name=Hbase \
 --ec2-attributes KeyName=keyName,InstanceProfile=EMR_EC2_DefaultRole \
 --instance-groups InstanceType=m4.xlarge,InstanceGroupType=MASTER,InstanceCount=1 InstanceType=m4.xlarge,InstanceGroupType=CORE,InstanceCount=2 \
 --region us-east-1 \
 --managed-scaling-policy ComputeLimits='{MinimumCapacityUnits=2,MaximumCapacityUnits=4,UnitType=Instances}'

Furthermore, you can specify the managed policy configuration via the –managed-scaling-policy option while using create-cluster.

Applying managed scaling policy to an existing cluster

The following example applies managed scaling to an existing cluster:

aws emr put-managed-scaling-policy  
--cluster-id j-123456  
--managed-scaling-policy ComputeLimits='{MinimumCapacityUnits=1,
MaximumCapacityUnits=10,  MaximumOnDemandCapacityUnits=10, UnitType=Instances}'

Moreover, this can be done also via the AWS EMR put-managed-scaling-policy command. For instance, this example makes a reference to managedscaleconfig.json, a JSON file.

aws emr put-managed-scaling-policy --cluster-id j-123456 --managed-scaling-policy file://./managedscaleconfig.json

Here is a look at the contents of the managedscaleconfig.json file. It defines the managed scaling policy.

{
    "ComputeLimits": {
        "UnitType": "Instances",
        "MinimumCapacityUnits": 1,
        "MaximumCapacityUnits": 10,
        "MaximumOnDemandCapacityUnits": 10
    }
}

Retrieve an EMR managed scaling policy configuration

In order to retrieve the policy configuration, use the GetManagedScalingPolicy command. For instance, it is used here to retrieve the configuration of the cluster with the cluster id j-123455.

aws emr get-managed-scaling-policy --cluster-id j-123455

This results in the following output:

{
   "ManagedScalingPolicy": { 
      "ComputeLimits": { 
         "MinimumCapacityUnits": 1,
         "MaximumOnDemandCapacityUnits": 10,
         "MaximumCapacityUnits": 10,
         "UnitType": "Instances"
      }
   }
}

Removing EMR managed scaling policy

The RemoveManagedScalingPolicy is used to remove the managed scaling policy. For instance, this command removes the policy of the cluster with cluster id j-123455:

aws emr remove-managed-scaling-policy --cluster-id j-123455

[Need help with Server Management? We are here to help.]

Conclusion

In brief, we learned how to use AWS CLI to configure AWS EMR managed scaling. With the guidance of the skilled Support Engineers at Bobcares, Server management is now a breeze.

PREVENT YOUR SERVER FROM CRASHING!

Never again lose customers to poor server speed! Let us help you.

Our server experts will monitor & maintain your server 24/7 so that it remains lightning fast and secure.

GET STARTED

0 Comments

Submit a Comment

Your email address will not be published. Required fields are marked *

Never again lose customers to poor
server speed! Let us help you.

Privacy Preference Center

Necessary

Necessary cookies help make a website usable by enabling basic functions like page navigation and access to secure areas of the website. The website cannot function properly without these cookies.

PHPSESSID - Preserves user session state across page requests.

gdpr[consent_types] - Used to store user consents.

gdpr[allowed_cookies] - Used to store user allowed cookies.

PHPSESSID, gdpr[consent_types], gdpr[allowed_cookies]
PHPSESSID
WHMCSpKDlPzh2chML

Statistics

Statistic cookies help website owners to understand how visitors interact with websites by collecting and reporting information anonymously.

_ga - Preserves user session state across page requests.

_gat - Used by Google Analytics to throttle request rate

_gid - Registers a unique ID that is used to generate statistical data on how you use the website.

smartlookCookie - Used to collect user device and location information of the site visitors to improve the websites User Experience.

_ga, _gat, _gid
_ga, _gat, _gid
smartlookCookie
_clck, _clsk, CLID, ANONCHK, MR, MUID, SM

Marketing

Marketing cookies are used to track visitors across websites. The intention is to display ads that are relevant and engaging for the individual user and thereby more valuable for publishers and third party advertisers.

IDE - Used by Google DoubleClick to register and report the website user's actions after viewing or clicking one of the advertiser's ads with the purpose of measuring the efficacy of an ad and to present targeted ads to the user.

test_cookie - Used to check if the user's browser supports cookies.

1P_JAR - Google cookie. These cookies are used to collect website statistics and track conversion rates.

NID - Registers a unique ID that identifies a returning user's device. The ID is used for serving ads that are most relevant to the user.

DV - Google ad personalisation

_reb2bgeo - The visitor's geographical location

_reb2bloaded - Whether or not the script loaded for the visitor

_reb2bref - The referring URL for the visit

_reb2bsessionID - The visitor's RB2B session ID

_reb2buid - The visitor's RB2B user ID

IDE, test_cookie, 1P_JAR, NID, DV, NID
IDE, test_cookie
1P_JAR, NID, DV
NID
hblid
_reb2bgeo, _reb2bloaded, _reb2bref, _reb2bsessionID, _reb2buid

Security

These are essential site cookies, used by the google reCAPTCHA. These cookies use an unique identifier to verify if a visitor is human or a bot.

SID, APISID, HSID, NID, PREF
SID, APISID, HSID, NID, PREF