Creating a dataset using an Apache Impala data source - Amazon QuickSight

Creating a dataset using an Apache Impala data source

Apache Impala is a high-performance massively parallel processing (MPP) SQL query engine designed to run natively on Apache Hadoop. Use the procedure below to establish a secure connection between Amazon QuickSight and Apache Impala.

All traffic between Amazon QuickSight and Apache Impala is encrypted using SSL. QuickSight supports standard username and password authentication for Impala connections.

Establishing a connection involves four tasks: configuring SSL settings in your Impala instance, preparing your authentication credentials, setting up the connection in Amazon QuickSight using your Impala server details, and validating the connection.

To create a dataset using an Apache Impala data source
  1. On the Amazon QuickSight start page, choose Datasets.

  2. On the Datasets page, choose New Dataset.

  3. In the FROM NEW DATA SOURCES section, choose Impala.

  4. Enter a name for the data source.

  5. For public connections:

    1. Enter connection details for Database server, HTTP Path, Port, Username, and Password.

    2. After validation succeeds, choose Create data source.

  6. For private connections:

    1. Coordinate with your administrator to set up a VPC connection before entering connection details.

      You or your administrator can configure the VPC connection in QuickSight. SSL is enabled by default to secure data in transit. If connection validation fails, verify your connection details and VPC settings.

      If issues persist, consult your administrator to confirm that your Certificate Authority is included in QuickSight's approved list of certificates.

  7. In the Choose your table menu, you can either:

    1. Choose a specific schema or table, then choose Select.

    2. Choose Use custom SQL to write your own SQL query.

  8. After you make your selection, QuickSight opens the data preparation page. Adjust your data as needed, then choose Publish & visualize to analyze your Impala data in QuickSight.
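
When connection validation fails, it can help to confirm separately that the Impala endpoint completes a TLS handshake with a certificate that chains to a trusted Certificate Authority. The following is a minimal Python sketch of such a check, not part of QuickSight. The function names build_tls_context and check_tls are hypothetical, and the check uses the trust store of the machine it runs on (or a CA bundle you pass in), which may differ from QuickSight's approved list of certificates. For private connections, run it from a host that can reach the server inside the VPC.

```python
import socket
import ssl


def build_tls_context(ca_file=None):
    """Return a client context that verifies the server certificate and hostname."""
    # With ca_file=None the system's default CA certificates are loaded.
    context = ssl.create_default_context(cafile=ca_file)
    # create_default_context already enables both checks; set them explicitly for clarity.
    context.check_hostname = True
    context.verify_mode = ssl.CERT_REQUIRED
    return context


def check_tls(host, port, ca_file=None, timeout=5.0):
    """Attempt a TLS handshake and return (ok, detail).

    On success, detail is the negotiated protocol version.
    On failure, detail is the exception type and message.
    """
    context = build_tls_context(ca_file)
    try:
        with socket.create_connection((host, port), timeout=timeout) as raw:
            with context.wrap_socket(raw, server_hostname=host) as tls:
                return True, tls.version()
    except OSError as exc:
        # Covers refused connections, timeouts, DNS failures, and
        # ssl.SSLError (including certificate verification errors).
        return False, f"{type(exc).__name__}: {exc}"
```

For example, check_tls("impala.example.com", 443) uses a placeholder host and port; substitute the Database server and Port values you enter in QuickSight. A failure whose detail starts with SSLCertVerificationError points to a certificate or Certificate Authority problem rather than a network problem.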

Note

This connector supports:

  • Username and password authentication

  • Public and private connections

  • Table discovery and custom SQL queries

  • Full data refresh during ingestion

  • SPICE storage only
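
Because the connector stores data in SPICE and reloads it in full, a refresh can also be started programmatically. The sketch below assumes the AWS SDK for Python (boto3) is installed with credentials configured, and uses the QuickSight CreateIngestion API with IngestionType set to FULL_REFRESH. The helper names full_refresh_request and start_full_refresh, and the "impala-refresh-" ID prefix, are hypothetical.

```python
import uuid


def full_refresh_request(account_id, dataset_id, ingestion_id=None):
    """Build keyword arguments for QuickSight's CreateIngestion call."""
    return {
        "AwsAccountId": account_id,
        "DataSetId": dataset_id,
        # Each ingestion needs a caller-chosen ID; generate one when none is given.
        "IngestionId": ingestion_id or f"impala-refresh-{uuid.uuid4().hex}",
        # Reload the whole dataset rather than appending new rows.
        "IngestionType": "FULL_REFRESH",
    }


def start_full_refresh(account_id, dataset_id, region):
    """Start a full SPICE refresh and return the ingestion status reported by the API."""
    import boto3  # imported lazily so the builder above works without boto3

    client = boto3.client("quicksight", region_name=region)
    response = client.create_ingestion(**full_refresh_request(account_id, dataset_id))
    return response["IngestionStatus"]
```

The dataset ID is the identifier of the dataset created in the procedure above. Keeping the request builder separate from the API call makes it possible to inspect or log the request before sending it.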