
Example to integrate AWS Athena tool with Integrated Data Lake

This example shows how to integrate the AWS Athena tool with Integrated Data Lake after cross-account access has been enabled.

Procedure to integrate AWS Athena tool

To integrate the AWS Athena tool with Integrated Data Lake, follow these steps:

  1. Open the AWS Athena tool.
  2. Create the table in Athena.

    create-table-athena

    ① Click to create table

    ② Choose the option "from S3 bucket data"

  3. Add the details related to database, table and input data.

    • Database: Select an existing database or create a new one by selecting "Create a new database".
    • Table Name: Enter the table name.
    • Location of Input Data Set: Enter the path that was provided when cross-account access was configured.

    Note

    The location of the input data set has the form "s3://<<storage account>>/<<storage path>>/" and must end with '/'. For example:

    • Storage Account = datalake-prod-a-starter-576071197214
    • Storage Path = data/ten=starter/Brewery/Lobrau/Europe/Netherlands/Amsterdam
    • Location of Input Data Set = s3://datalake-prod-a-starter-576071197214/data/ten=starter/Brewery/Lobrau/Europe/Netherlands/Amsterdam/
  4. Click "Next".

  5. Select "Data Format".

    data-format

    Note

    The file uploaded in the Data Explorer must be a .csv file.

  6. Add the column name and column type for each column in the .csv file.

  7. Click "Next".

  8. Click "Create Table".
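
The procedure above can also be sketched in code. The following Python example is a minimal sketch, not the statement the Athena console itself generates: it builds the location of the input data set from the storage account and storage path, and then assembles a CREATE EXTERNAL TABLE statement for comma-separated data. The function names, the database name "brewery_db", the table name "amsterdam" and the column names are hypothetical; whether your .csv file has a header row is also an assumption you need to check.

```python
def input_data_location(storage_account: str, storage_path: str) -> str:
    """Return s3://<storage account>/<storage path>/ with the required trailing '/'."""
    return f"s3://{storage_account.strip('/')}/{storage_path.strip('/')}/"


def create_table_ddl(database, table, columns, location, skip_header=False):
    """Assemble an Athena DDL statement for a table backed by .csv files.

    columns is a list of (column name, column type) pairs, matching the
    columns of the .csv file (hypothetical values in the example below).
    """
    cols = ",\n  ".join(f"`{name}` {col_type}" for name, col_type in columns)
    ddl = (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS `{database}`.`{table}` (\n"
        f"  {cols}\n"
        ")\n"
        "ROW FORMAT DELIMITED FIELDS TERMINATED BY ','\n"
        f"LOCATION '{location}'"
    )
    if skip_header:
        # Assumption: the first line of each .csv file is a header row.
        ddl += "\nTBLPROPERTIES ('skip.header.line.count'='1')"
    return ddl


# Values taken from the note in step 3.
location = input_data_location(
    "datalake-prod-a-starter-576071197214",
    "data/ten=starter/Brewery/Lobrau/Europe/Netherlands/Amsterdam",
)

# Hypothetical database, table and columns, for illustration only.
ddl = create_table_ddl(
    "brewery_db",
    "amsterdam",
    [("timestamp", "string"), ("temperature", "double")],
    location,
    skip_header=True,
)
print(ddl)
```

The generated statement can be pasted into the Athena query editor as an alternative to the table creation dialog, provided the column list matches the .csv file.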

Result

The table is created successfully and you can now query the data in AWS Athena. To query the data, follow these steps:

aws-athena

① Select the table from the database

② Specify the table name in the query as database.tablename

③ Columns of the table

④ Click to run the query

⑤ Query result
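
As a rough illustration of callout ②, the following Python sketch builds a query string that refers to the table as database.tablename. The function name, the database name "brewery_db", the table name "amsterdam" and the column names are hypothetical; the row limit is only a suggestion for a first test query.

```python
def select_query(database, table, columns=None, limit=10):
    """Build a SELECT statement that addresses the table as database.tablename."""
    # Select all columns unless specific column names are given.
    col_list = ", ".join(f'"{c}"' for c in columns) if columns else "*"
    return f'SELECT {col_list} FROM "{database}"."{table}" LIMIT {limit}'


# Hypothetical names, for illustration only.
query = select_query("brewery_db", "amsterdam")
print(query)
```

The resulting string can be pasted into the Athena query editor and run as described in callout ④.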



Except where otherwise noted, content on this site is licensed under the MindSphere Development License Agreement.


Last update: August 29, 2022