About External Data - Developer Documentation

External data in Predictive Learning is stored in and accessed through Amazon S3 buckets, and is available from the Zeppelin environment. From a Zeppelin notebook you can load data from your external S3 bucket (e.g., with Spark APIs) and then save the resulting dataset in Predictive Learning.
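As an illustration of the loading step, the following is a minimal PySpark sketch, not the product's official procedure. It assumes that the Zeppelin interpreter exposes a `spark` session, that the bucket is reachable through the `s3a` scheme (your environment may use `s3` instead), and that the file is a CSV with a header row. The helper names, the bucket name, and the object key are hypothetical placeholders. Saving the dataset in Predictive Learning is not shown, because that step depends on your workspace setup.

```python
def external_s3_uri(bucket, key, scheme="s3a"):
    """Build the URI that Spark reads from.

    The default scheme is an assumption; pass scheme="s3" if your
    environment expects it. Bucket and key are placeholders.
    """
    return f"{scheme}://{bucket.strip('/')}/{key.lstrip('/')}"


def load_external_csv(spark, bucket, key):
    """Read a CSV file with a header row from the external bucket into a DataFrame."""
    return spark.read.option("header", "true").csv(external_s3_uri(bucket, key))


# In a Zeppelin paragraph (assuming the interpreter provides `spark`):
# df = load_external_csv(spark, "my-external-bucket", "my-user-folder/input.csv")
# df.show(5)
```

Keeping the URI construction separate from the read call makes it easy to switch schemes or file formats without touching the rest of the notebook.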

Data buckets are set up at the tenant level, with a folder for each user. The following rules apply to external buckets in Predictive Learning:

  • Each user can configure one external bucket, using the Advanced Configuration dialog box on the Manage Analytics Workspace page.
  • The external bucket must be in the same AWS region that hosts the Predictive Learning application (currently eu-central-1 only).
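Before configuring a bucket, it can help to confirm that it satisfies the region rule above. The sketch below is one possible check, assuming you have AWS credentials that allow `GetBucketLocation` and that boto3 is available where you run it. The function names and the bucket name are hypothetical. The only AWS call used is the S3 client's `get_bucket_location`.

```python
REQUIRED_REGION = "eu-central-1"  # region named in the rule above


def bucket_region(s3_client, bucket):
    """Return the AWS region of a bucket.

    S3 reports a LocationConstraint of None for buckets in us-east-1,
    so that case is mapped to the region name explicitly.
    """
    response = s3_client.get_bucket_location(Bucket=bucket)
    return response.get("LocationConstraint") or "us-east-1"


def is_supported_bucket(s3_client, bucket, required_region=REQUIRED_REGION):
    """Check whether the bucket is in the region Predictive Learning requires."""
    return bucket_region(s3_client, bucket) == required_region


# Example usage (requires boto3 and valid credentials):
# import boto3
# s3 = boto3.client("s3")
# print(is_supported_bucket(s3, "my-external-bucket"))
```

Passing the client in as an argument keeps the check independent of how credentials are obtained in your environment.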

Contact your Predictive Learning (PrL) administrator for more information about setting up access to your Amazon S3 bucket.

Last update: June 15, 2023