Prerequisites

These steps require VFS connections.

To use the Read Metadata or Write Metadata steps:

  • Set up a VFS connection to a stand-alone instance of Data Catalog and provide your role access credentials. For more information, see Access to Pentaho Data Catalog.

To use the Catalog Input and Catalog output steps:

  • Set up a VFS connection to a stand-alone instance of Data Catalog and provide your role access credentials. For more information see Access to Pentaho Data Catalog.

  • Configure S3 as the Default S3 Connection in VFS Connections to access S3 storage. For details, see Connecting to Virtual File Systems.

  • You must have an established PDI connection to the cluster(s) you plan on using. For example, a Hadoop driver must be configured as a named connection for your distribution for accessing HDFS. For information on named connections, see Connecting to a Hadoop cluster with the PDI client.

Last updated

Was this helpful?