Wondering how to connect AWS RDS SQL Server with AWS Glue? We can help you.
The AWS Glue service is an ETL service that utilizes a fully managed Apache Spark environment.
Here at Bobcares, we often handle requests from our customers with similar queries as a part of our AWS Support Services.
Today, we will see steps followed by our support engineers to connect with AWS Glue.
What is AWS Glue?
AWS Glue is a serverless service offering from AWS for metadata crawling, metadata cataloging, ETL, data workflows and other related operations.
It can be used to connect to different types of data repositories, crawl the database objects to create a metadata catalog, which can be used as a source and targets for transporting and transforming data from one point to another.
It supports workflows to enable complex data load operations.
AWS RDS for SQL Server instance
Firstly, we need working Amazon RDS for SQL Server instance.
You can create the AWS RDS SQL Server instance using any edition of SQL Server supported by the RDS service.
Ensure that you have the required privileges to connect and access data from the instance.
How to connect AWS RDS SQL Server with AWS Glue?
Today, let us see steps performed by our Support Techs in order to make this connection.
- The first step for any operation is connecting to the data source of interest by creating a new connection.
To learn the required configurations for creating a new connection, navigate to the AWS Glue home page from the AWS Search console by searching for the Glue service
- Once you are on the home page of AWS Glue service, click on the Connection tab on the left pane and you would be seeing a screen.
- Next, to create a new connection to our AWS RDS SQL Server instance.
Click on the Add connection button to start creating a new connection.
- Then, a new wizard screen would appear which will have multiple steps to collect details regarding the data source to which we intend to create a connection.
Provide a relevant name for the connection.
- Next, we must select the type of connection.
We need to select Amazon RDS as it’s the service that holds our AWS RDS SQL Server instance.
- Next, we need to select SQL Server.
If the AWS RDS SQL Server instance is configured to allow only SSL enabled connections, then select the checkbox titled “Requires SSL Connection”. Then click on Next.
- Then, you would be able to find the instance name listed in the instance dropdown.
Select this instance name from the instance list.
- Next, provide the database name and database connection credentials (user id and password).Then click next.
- Next, review the details and create a new connection.
- Then, click on the Test Connection button and it would start opening an active connection to the AWS RDS SQL Server instance.
- Next, you can click on the Actions menu to edit the connection, delete the connection, or view the details of the connection.
- AWS Glue connection properties and details differ for each type of connection and data source selected in the connection.
- Once the connection is in place, the same can be used in ETL Jobs and Workflows.
There can be numerous ETL jobs that share the same connection.
The benefit of this centralized connection approach is that just by changing the connection detail.
All the ETL jobs start pointing dynamically to the updated target location.
[Need help with AWS connection? We’d be happy to help you]
In short, today we saw steps followed by our Support Techs in order to Connect AWS RDS SQL Server with AWS Glue.