

# Connecting to your data


For more information on connecting to the following data sources, choose the section that applies to you. 
+ **AWS Glue Data Catalog** – You can use the Data Catalog to define references to data objects stored in the AWS Cloud, including the following services:
  + Amazon Redshift
  + Aurora MySQL
  + Aurora PostgreSQL
  + Amazon RDS for MySQL
  + Amazon RDS for PostgreSQL

  DataBrew recognizes all Lake Formation permissions that have been applied to Data Catalog resources, so DataBrew users can only access these resources if they're authorized.

  To create a dataset, you specify a Data Catalog database name and a table name. DataBrew takes care of the other connection details.
+ **AWS Data Exchange** – You can choose from hundreds of third-party data sources that are available in AWS Data Exchange. By subscribing to these data sources, you always have the most up-to-date version of the data.

  To create a dataset, you specify the name of a Data Exchange data product that you're subscribed to or entitled to use.
+  **JDBC driver connections** – You can create a dataset by connecting DataBrew to a JDBC-compatible data source. DataBrew supports connecting to the following sources through JDBC:
  + Amazon Redshift
  + Microsoft SQL Server
  + MySQL
  + Oracle
  + PostgreSQL
  + Snowflake

**Topics**
+ [

# Using drivers with AWS Glue DataBrew
](dbms-driver-connections.md)
+ [

# Supported JDBC drivers
](jdbc-drivers.md)

# Using drivers with AWS Glue DataBrew
Using JDBC drivers to connect data

A *database driver* is a file or URL that implements a database connection protocol, for example Java Database Connectivity (JDBC). The driver functions as an adaptor or a translator between a specific database management system (DBMS) and another system. 

In this case, it allows AWS Glue DataBrew to connect to your data. Then you can access a database object, like a table or view, from a supported data source. The data source that you're using might be called a database, a data warehouse, or something else. However, for the purpose of this documentation we refer to all data providers as data sources or connections.<a name="jdbc"></a><a name="download-jars"></a>

To use a JDBC driver or jar file, download the file or files you need and put them in an S3 bucket. The IAM role that you use to access the data needs to have read permissions for both the driver files. 

**Note**  
With AWS Glue 4.0, connecting to Snowflake as a data source is supported natively. You don't need to provide custom `jar` files. In AWS Glue DataBrew, choose Snowflake as the External source connection and provide the URL of your Snowflake instance. The URL will use a hostname in the form `https://account_identifier.snowflakecomputing.com`.   
 Provide the data access credentials, Snowflake database name, and Snowflake schema name. Additionally, if your Snowflake user does not have a default warehouse set, you will need to provide a warehouse name.   
 Snowflake connections use an AWS Secrets Manager secret to provide credential information. Your project and job roles in must have permission to read this secret.   

![\[The screenshot shows a Connection access wizard with Snowflake selected as the External source. Complete the other fields to finish the connection: JDBC URL and Database access credentials.\]](http://docs.aws.amazon.com/databrew/latest/dg/images/databrew-connection-access-snowflake-new.png)


**To use drivers with DataBrew**

1. Find out which version of your data source you're on, using the method provided by the product. 

1. Find the latest version of connectors and driver required. You can locate this information on the data providers website.

1. Download the required version of the JDBC files. These are normally stored as Java ARchives (.JAR) files.

1. Either upload the drivers from the console to your S3 bucket or provide the S3 path to your .JAR files.

1. Enter the basic connection details, for example class, instance, and so on.

1. Enter any additional configuration information that your data source needs, for example virtual private cloud (VPC) information.

# Supported JDBC drivers



| Product | Supported version | Driver instructions and downloads | SQL queries supported | 
| --- | --- | --- | --- | 
|   Microsoft SQL Server  |  v6.x or higher  |  [Microsoft JDBC Driver for SQL Server](https://docs.microsoft.com/sql/connect/jdbc/download-microsoft-jdbc-driver-for-sql-server)  | Not supported | 
|   MySQL  |  v5.1 or higher  |  [MySQL Connectors](https://www.mysql.com/products/connector/)  | Not supported | 
|   Oracle  |  v11.2 or higher  |  [Oracle JDBC downloads](https://www.oracle.com/database/technologies/appdev/jdbc-downloads.html)  | Not supported | 
|   PostgreSQL  |  v4.2.x or higher  |  [PostgreSQL JDBC driver](https://jdbc.postgresql.org/download.html)  | Not supported | 
|   Amazon Redshift  |  v4.1 or higher  |  [Connecting to Amazon Redshift with JDBC](https://docs.aws.amazon.com/redshift/latest/mgmt/configuring-connections.html#connecting-drivers)  | Supported | 
|   Snowflake  |  To see your Snowflake version, use [CURRENT\$1VERSION](https://docs.snowflake.com/en/sql-reference/functions/current_version.html) as described in the Snowflake documentation.  |  To connect to Snowflake you need both of the following:  [\[See the AWS documentation website for more details\]](http://docs.aws.amazon.com/databrew/latest/dg/jdbc-drivers.html)  | Supported | 

To connect to databases or data warehouses that require a different version of the driver from what DataBrew natively supports, you can provide a JDBC driver of your choice. The driver must be compatible with JDK 8 or Java 8. For instructions on how to find the latest driver version for your database, see [Using drivers with AWS Glue DataBrew](dbms-driver-connections.md).