Install a driver for the Pentaho Server
Before you can add a named connection to a cluster, you must install a driver for the vendor and version of the Hadoop cluster that you are connecting to. Perform the following steps to install a driver for the Pentaho Server.
This task assumes that you have downloaded your driver from the Support Portal or that you are using the Apache Hadoop driver that is shipped with Pentaho.
Verify that you are connected to a repository.
In the PDI client, select the View tab of your transformation or job.
Right-click the Hadoop clusters folder and click Add driver.
The Add driver dialog box appears.
Click Browse.
The Choose File to Upload dialog box appears.
Navigate to the directory where you downloaded your .kar file from the Support Portal.
Select the driver (.kar file) you want to add, click Open, and then click Next.
The selected file name appears in the Browse text field. The vendor distribution files contain their abbreviations in the .kar file names, as shown below:
Amazon EMR (emr)
Azure HDInsight (hdi)
Cloudera Data Platform (cdp)
Google Dataproc (dataproc)
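As an illustration of this naming convention, the following minimal Python sketch guesses the vendor from a driver file name before you upload it. Only the abbreviations come from the list above. The function name guess_vendor, the token-matching rule, and the example file names are assumptions made for illustration and are not part of Pentaho.

```python
import re
from pathlib import Path

# Abbreviations taken from the vendor list above.
VENDOR_ABBREVIATIONS = {
    "emr": "Amazon EMR",
    "hdi": "Azure HDInsight",
    "cdp": "Cloudera Data Platform",
    "dataproc": "Google Dataproc",
}


def guess_vendor(kar_file_name):
    """Return the vendor whose abbreviation starts a token of the file name.

    Returns None if the file is not a .kar file or no abbreviation matches.
    The matching rule is an assumption: the name is split on any character
    that is not a letter or digit, and each token is checked for a leading
    abbreviation (so a token such as "cdp71" matches "cdp").
    """
    name = Path(kar_file_name).name.lower()
    if not name.endswith(".kar"):
        return None
    tokens = re.split(r"[^a-z0-9]+", name)
    for token in tokens:
        for abbreviation, vendor in VENDOR_ABBREVIATIONS.items():
            if token.startswith(abbreviation):
                return vendor
    return None
```

With the hypothetical file name example-driver-cdp71.kar, guess_vendor returns "Cloudera Data Platform". A file name that contains none of the abbreviations returns None, in which case check the download against the Support Portal listing instead.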
Click Next.
The Congratulations dialog box appears, notifying you that you must restart the Pentaho Server and the PDI client. The installed driver is now available for selection in the Driver field in the New cluster and Import cluster dialog boxes.