Install a driver for the Pentaho Server

Before you can add a named connection to a cluster, you must install a driver for the vendor and version of the Hadoop cluster that you are connecting to. Perform the following steps to install a driver for the Pentaho Server.

This task assumes that you have downloaded your driver from the Support Portal or that you are using the Apache Hadoop driver that is shipped with Pentaho.

  1. Verify that you are connected to a repository.

  2. In the PDI client, select the View tab of your transformation or job.

  3. Right-click the Hadoop clusters folder and click Add driver.

    The Add driver dialog box appears.

    Add driver dialog box
  4. Click Browse

    The Choose File to Upload dialog box appears.

  5. Navigate to the directory where you downloaded your .kar file from the Support Portal.

  6. Select the driver (.kar file) you want to add, click Open, and then click Next.

    The selected file name appears in the Browse text field. The vendor distribution files contain their abbreviations in the .kar file names as shown below:

    • Amazon EMR (emr)

    • Azure HDInsight (hdi)

    • Cloudera Data Platform (cdp)

    • Google Dataproc (dataproc)

  7. Click Next.

    The Congratulations dialog box appears, notifying you that you must restart the Pentaho Server and the PDI client. The installed driver is now available for selection in the Driver field in the New cluster and Import cluster dialog boxes.

Last updated

Was this helpful?