

# Using datasets in AWS Glue DataBrew
<a name="datasets.creating"></a>

To view a list of your datasets in the DataBrew console, choose **DATASET** at left. In the datasets page, you can view detailed information for each dataset by clicking its name or choosing **Actions**, **Edit** from its context menu. 

To create a new dataset, you choose **DATASET**, **Connect new dataset**. Different data sources have different connection parameters, and you enter these so that DataBrew can connect. When you save your connection and choose **Create dataset**, DataBrew connects to your data and begins loading data. For more information, see [Connecting to your data](datasets.connecting-to-data.md). 

The dataset page has the following elements to help you explore your data.

**Dataset preview** – On this tab, you can find connection information for the dataset and an overview of the overall structure of the dataset, as shown following.

![\[Dataset details and preview showing metadata and sample rows from a JSON file stored in S3.\]](http://docs.aws.amazon.com/databrew/latest/dg/images/dataset-preview.png)


**Data profile overview** – On this tab, you can find a graphical data profile of statistics and volumetrics for your dataset, as shown following.

![\[Data profile overview showing dataset summary, data types, missing cells, and correlations.\]](http://docs.aws.amazon.com/databrew/latest/dg/images/data-profile-overview2.png)


**Note**  
To create a data profile, run a DataBrew profile job on your dataset. For information about how to do this, see [Step 5: Create a data profile](getting-started.05.md).

**Column statistics** – On this tab, you can find detailed statistics about each column in your dataset, as shown following.

![\[Data profile overview showing column statistics, data quality, and value distribution for a dataset.\]](http://docs.aws.amazon.com/databrew/latest/dg/images/dataset-column-stats.png)


**Data lineage** – This tab shows a graphical representation of how your dataset was created and how it's used in DataBrew, as shown following.

![\[Data lineage diagram showing dataset creation and usage flow in DataBrew.\]](http://docs.aws.amazon.com/databrew/latest/dg/images/dataset-lineage.png)


**Topics**
+ [Deleting a dataset](datasets.deleting.md)

# Deleting a dataset
<a name="datasets.deleting"></a>

If you no longer need a dataset, you can delete it. Deleting a dataset doesn't affect the underlying data source in any way. It simply removes the information that DataBrew used to access the data source.

You can't delete a dataset if any other DataBrew resources rely on it. For example, if you currently have a DataBrew project that uses the dataset, delete the project first before you delete the dataset.

To delete a dataset, choose **Dataset** from the navigation pane. Choose the dataset that you want to delete, and then for **Actions**, choose **Delete**.