Import a cluster connection
You can add a cluster by importing the site.xml
files from an existing cluster. Perform the following steps to import a cluster connection.
In the PDI client, create a new transformation or job or open an existing transformation or job.
Click the View tab and then right-click the Hadoop Clusters folder.
Click Import cluster.
The Hadoop Clusters dialog box appears.
Hadoop Clusters Import dialog box Enter a user-defined name for the cluster connection in the Cluster name field.
Valid cluster names may include uppercase and lowercase letters, and numbers. In addition, the only special character allowed is the dash (
-
). To ensure a valid cluster name, do not use any other symbols, punctuation characters, or blank spaces.After you create the connection, you can locate this named connection in the View tab on the PDI client.
Note: If the Cluster name is already in use, you will be notified that proceeding will overwrite an existing cluster, which cannot be undone. In this case:
Click Cancel then enter a unique name to create the cluster.
Click Yes, Overwrite to overwrite the existing cluster.
Use the Driver and Version options to select the distribution of Hadoop on the cluster and its version number. The Support Portal provides supported drivers that you can download and install.
Click Browse to add file(s) and browse to the directory containing the
site.xml
files that were provided to you by the cluster administrator.The required files include:
hive-site.xml
mapred-site.xml
yarn-site.xml
core-site.xml
hbase-site.xml
hdfs-site.xml
oozie-site.xml
(if you are using Oozie in the configuration)
Click Open.
The Site XML files section displays the files you selected.
If you are connecting to a secure cluster, then enter the credentials in the Username and Password fields in the HDFS section.
Click Next and then specify the security option for the cluster.
If the Hadoop cluster is non-secure, select None and then click Next to test the connection.
If your Hadoop cluster is secure, you need to add security to the cluster connection. See Add security to cluster connections for instructions.
Last updated
Was this helpful?