Before you begin
Before using the Catalog Input step, make sure the following conditions are met:
You must have an established connection to Data Catalog. For details, see Access to Pentaho Data Catalog.
To access S3 storage, S3 must be configured as the Default S3 Connection in VFS Connections. For details, see Connecting to Virtual File Systems.
You must have an established PDI connection to each cluster you plan to use. For example, to access HDFS, a Hadoop driver for your distribution must be configured as a named connection. For information on named connections, see Connecting to a Hadoop cluster with the PDI client.