Try Pentaho Data Integration and Analytics

Learn how to install an evaluation version of Pentaho Data Integration and Analytics, then get started with basic concepts, walk-throughs, and workflows.

Pentaho Data Integration and Analytics is a platform for access and analytics. It supports flat files, relational databases, Hadoop, NoSQL, and cloud sources. Use it to integrate, transform, visualize, and analyze data. Use the APIs to extend reports, queries, and transformations.

Install the 30-day trial of Pentaho Data Integration and Analytics

The Pentaho Installation Wizard is a streamlined way to install, learn about, and evaluate the Pentaho Suite. With this 30-day trial, you can install and test Pentaho Business Analytics (BA) and Data Integration (DI).

To get started fast after installation, follow the Pentaho Data Integration (PDI) tutorial.

If you have issues with the 30-day trial, contact the Pentaho Trial Expertsenvelope.

circle-exclamation

With the Pentaho Installation Wizard you can choose one of two install types:

  • Default: Select Keep it simple. Give me everything.

  • Custom: Select Let me decide for myself.

Process overview

Installation instructions are the same for Windows, Linux, and macOS.

Complete these tasks in order:

Explore Considerations

You Supply

A computer that meets Pentaho's operating system and hardware requirements.

We Supply

  • Installation Package

  • Oracle JRE

  • Repository Platform (PostgreSQL)

  • Repository Platform JDBC Driver (PostgreSQL)

  • Web Application Server (Tomcat)

Technologies Used

  • Tomcat web application server

  • PostgreSQL database to house the Pentaho Repository

Expertise

  • Basic computer knowledge.

Approximate Installation Time

30 minutes

Download the trial software

To download the software, complete these steps.

  1. Make sure you are logged in to the computer where you want to install the software.

    You should use an account that can install software.

    • On Windows, this is typically an account with administrator privileges.

    • On Linux, this is also an account with administrator privileges.

  2. Navigate to the Pentaho trial downloadarrow-up-right website.

  3. Click Start Your 30-Day Trial.

    The Submission Agreement form appears.

  4. Fill in the requested information and click Submit.

    The Getting Started with Pentaho Data Integration & Analytics page opens.

  5. In the On-Prem section, click Download for your operating system.

  6. When prompted, choose a directory for the installer and wait for the download to finish.

  7. If you are using Linux, make sure you can execute the file.

    Open a terminal, go to the directory where you downloaded the file, then run:

Start the Pentaho Installation Wizard

To start the installation wizard, complete the following steps.

circle-info

Note: Launch the installation wizard from a locally mounted hard drive only. Network-mounted drives (for example NFS) are not supported.

Do one of the following.

  • Windows: Open File Explorer, go to the installer, then double-click pentaho-business-analytics-10.3.0-x64.exe.

  • macOS: Unpack pentaho-business-analytics-10.3.0-x64.app.tar.gz, then double-click pentaho-business-analytics-10.3.0-x64.app.

  • Linux (graphics): Open a terminal, go to the installer, then run:

  • Linux (no graphics): Open a terminal, go to the installer, then run one of these:

    • GTK text mode:

    • OpenMotif X mode:

Default installation

If you want to install every component and some sample data, complete the following steps. If you want a custom install, see Custom installation.

circle-exclamation
  1. After you start the wizard, the splash screen appears, then the Pentaho Business Analytics installation is ready! window. Click Next.

    Install is ready. Click Next
  2. Read the license agreement. Select Accept, then click Next.

  3. In Installation folder, accept the default directory or enter a different path, then click Next.

    Choose Pentaho installation directory
  4. You are prompted to add a PostgreSQL password. Enter and confirm the password for the postgres user.

    circle-exclamation
  5. Click Next.

  6. In the What do you want to install? window, select Keep it simple. Give me everything, then click Next.

  7. If the Sample Database window appears, enter a port number, then click Next.

    This prompt appears only if the default ports are not available.

  8. In the Pentaho License Information window, leave the box empty and click Next. A 30-day license is installed for the trial.

  9. In We're Set. Let's install! window, click Next. Installation begins and takes about 30 minutes to complete.

  10. When installation is complete, select Pentaho Data Integration to launch Spoon.

    Select Pentaho Data Integration
  11. Click Finish.

    After PDI starts, you will see the Welcome! window.

    Welcome to Pentaho Data Integration

    To get started quickly after installation, follow the Pentaho Data Integration (PDI) tutorial.

If you have trouble with the installation wizard, see the Administer Pentaho Data Integration and Analytics document.

Custom installation

If you want to choose which components to install and whether to install sample data, complete the following steps. If you want the default install, see Default installation.

circle-exclamation
  1. After you start the wizard, the splash screen appears, then the Pentaho Business Analytics installation is ready! window. Click Next.

    Install is ready. Click Next
  2. Read the license agreement. Select Accept, then click Next.

  3. In Installation folder, accept the default directory or enter a different path, then click Next.

    Choose Pentaho installation directory
  4. You are prompted to add a PostgreSQL password. Enter and confirm the password for the postgres user.

    circle-exclamation
  5. Click Next.

  6. If the Sample Database window appears, enter a port number, then click Next.

    This prompt appears only if the default ports are not available.

  7. In the What do you want to install? window, select Let me decide for myself, then click Next.

  8. Select the components you want to install, then click Next.

  9. Select whether to include sample content, then click Next.

  10. If the Sample Database window appears, enter a port number, then click Next.

  11. In the Pentaho License Information window, leave the box empty and click Next. A 30-day license is installed for the trial.

  12. In We're Set. Let's install! window, click Next. Installation begins and takes about 30 minutes to complete.

  13. When installation is complete, select the components you want to launch and click Finish.

If you have trouble with the installer, see the Administer Pentaho Data Integration and Analytics document.

Verify installation

To verify the installation, review the directory structure and the installation summary file.

  1. Open a file explorer or command line tool.

  2. Navigate to the install directory and compare it to the structure below.

  3. Open installation-summary.txt. Make sure it lists the design tools and plugins you installed.

  4. Review the file locations below.

    File
    Description

    ctlscript.sh

    Starts, stops, restarts, and shows the status of Pentaho services. Available on Linux and macOS.

    installation-summary.txt

    Contains the information from the summary screen at the end of the installation process.

    uninstall

    A script that removes Pentaho Business Analytics.

    Tool/Plugin
    Location

    Pentaho Server

    pentaho/server/pentaho-server/

    Report Designer

    pentaho/design-tools/report-designer/

    Schema Workbench

    pentaho/design-tools/schema-workbench/

    Data Integration (Spoon)

    pentaho/design-tools/data-integration/

    Metadata Editor

    pentaho/design-tools/metadata-editor/

    Aggregation Designer

    pentaho/design-tools/aggregation-designer/

    Dashboard Designer

    pentaho/server/pentaho-server/pentaho-solutions/system/dashboards/

    Analyzer

    pentaho/server/pentaho-server/pentaho-solutions/system/analyzer/

    Interactive Reports

    pentaho/server/pentaho-server/pentaho-solutions/system/pentaho-interactive-reporting/

    License Installer

    pentaho/license-installer/

    Log
    Location

    Pentaho Server Logs for BA configuration

    pentaho/server/pentaho-server/logs/

    Tomcat Logs for Pentaho Server for BA configuration

    pentaho/server/pentaho-server/tomcat/logs/

    For macOS, copy the JDBC .jar into the location listed for Report Designer.

    JDBC Driver
    Location

    Pentaho Server for BA configuration

    pentaho/server/pentaho-server/tomcat/lib/

    Report Designer

    pentaho/design-tools/report-designer/lib/jdbc/

    Schema Workbench

    pentaho/design-tools/schema-workbench/drivers/

    Aggregation Designer

    pentaho/design-tools/aggregation-designer/drivers/

    Metadata Editor

    pentaho/design-tools/metadata-editor/libext/JDBC/

    PDI client (Spoon)

    pentaho/design-tools/data-integration/lib/

    Port Number
    Description

    5432

    PostgreSQL Server

    8080

    Pentaho Server Tomcat Web Server Startup Port

    8012

    Pentaho Server Shutdown Port

    9001

    HSQL Server Port

    9092

    Embedded H2 Database

    circle-info

    Note: Your port numbers might differ. If you installed using the installation wizard, check installation-summary.txt for the actual ports.

Getting started tutorial

To get started quickly after installation, follow the Pentaho Data Integration (PDI) tutorial.

Start and stop the PDI client on Windows

If you installed PDI on Windows using the installation wizard, start Spoon from: Start > All Programs > Pentaho Enterprise Edition > Design Tools > Data Integration.

Start and stop the Pentaho Server for configuration on Windows

When you ran the installation wizard on Windows, the Pentaho Server deployed in an included Apache Tomcat application server. Manage Pentaho and Tomcat from: Start > All Programs > Pentaho Enterprise Edition > Server Management.

Use one of these menu items:

  • Start Pentaho Server

  • Stop Pentaho Server

The installer also registered services for:

  • Pentaho Server

  • Data Integration

  • Pentaho Repository

You can start and stop these services from Control Panel > Administrative Tools > Services.

Start and stop the Pentaho Server for configuration on Linux

When you ran the installation wizard on Linux, the Pentaho Server deployed in an included Apache Tomcat application server. Use /pentaho/ctlscript.sh to start and stop services.

Common arguments:

  • start

  • stop

  • restart

  • status

  • help

Common services:

  • pentahoserver

  • postgresql

Examples:

Adjust Java VM memory limits for an installation on Windows

Windows users might see out-of-memory errors. Increase the Java VM memory allocation to fix them.

circle-info

Note: These steps apply when you installed with the installation wizard.

  1. Double-click pentahoserverw.exe in server\pentaho-server\tomcat\bin to open Pentaho Server Properties.

    You might need to select Run as Administrator.

  2. Select the Java tab.

    Windows Properties dialog box
  3. Set:

    • Initial memory pool: 6144 MB

    • Maximum memory pool: 8192 MB

If the JVM refuses to start, you might need to add RAM, stop memory-intensive services, or lower the maximum memory limit.

Adjust Java VM memory limits for a graphical installation on Linux

These steps apply when you installed using the installation wizard, your computer can display graphics, and you are running Linux.

  1. Go to pentaho-server/tomcat/scripts.

  2. Edit ctl.sh.

  3. Locate the line under start tomcat, which looks like this:

  4. Set the memory to a minimum of 4096 MB and a maximum of 6144 MB, based on available system memory.

If the JVM refuses to start, add RAM, stop memory-intensive services, or lower the maximum memory limit.

Increase Pentaho Server memory limit for installations on Windows

If you installed the Pentaho Server on Windows using the installation wizard, you can increase the server memory limits by editing the Tomcat Java settings.

circle-info

Notes:

  1. Stop the Pentaho Server if it is running. See Start and stop the Pentaho Server for configuration on Windows.

  2. Type services.msc into the Windows Search box.

  3. Find the Pentaho Server entry and note the service name.

    It should be pentahoserver.

  4. In C:\pentaho\server\pentaho-server\tomcat\bin\, rename tomcat8w.exe to match the service name.

    Example: pentahoserverw.exe.

  5. Double-click the renamed file to open the Properties window.

    You might need to select Run as Administrator.

  6. Select the Java tab.

  7. Set:

    • Minimum: 4096 MB

    • Maximum: 6144 MB

If the JVM refuses to start with increased limits, add RAM, stop memory-intensive services, or reduce the maximum memory limit.

See also: Increase the PDI client memory limit.

Increase Pentaho Server memory limit for installations on Linux

If you installed PDI on Linux using the installation wizard, you can increase memory limits by editing a variable in a Pentaho-supplied script.

circle-info

Note:

  1. Go to pentaho-server/tomcat/scripts.

  2. Edit ctl.sh.

  3. Locate the line under start tomcat, which looks like this:

  4. Set the memory to a minimum of 4096 MB and a maximum of 6144 MB, based on available system memory.

See also: Increase the PDI client memory limit.

Increase the PDI client memory limit

As a best practice, increase PDI's memory limit so Spoon can perform memory-intensive tasks. You must increase the memory limit for both the Pentaho Server and the PDI client.

circle-info

Note: Instead of modifying the PDI client startup script, you can set the environment variable PENTAHO_DI_JAVA_OPTIONS to -Xmx2g -XX:MaxPermSize=256m on your client.

Uninstalling the Pentaho Suite after evaluation

To uninstall the Pentaho Suite after evaluation:

  1. Go to the pentaho directory and run the uninstall file.

  2. Follow the Uninstall Wizard.

  3. When prompted, choose whether to delete the data files.

  4. Restart your computer.

Before you install a production version of Pentaho, uninstall the evaluation version first.

Tutorials

Review these tutorials to start using PDI, reporting tools, and dashboards.

If you are new to PDI, start with Getting Started with PDI.

Within this tutorial, you can also view:

The Getting Started with Analyzer, Interactive Reports, and Dashboard Designer tutorial covers product features, best practices, and troubleshooting.

Within that tutorial, you can also view:

The Getting started with Report Designer tutorial includes step-by-step report creation instructions.

Within that tutorial, you can also view:

Remove sample data from the Pentaho Server

By default, you have access to a sample data source and example content. When you are ready to move to development or production, remove the sample content.

To remove sample data and content:

  1. Stop the Pentaho Server.

  2. Delete samples.zip from /pentaho-server/pentaho-solutions/system/default-content.

    If you performed a manual WAR build and deployment, the path is /pentaho-server/pentaho-solutions/system.

  3. Edit /pentaho/WEB-INF/web.xml inside the deployed pentaho.war.

    For archive installs, the path is usually /pentaho-server/tomcat/webapps/pentaho/WEB-INF/web.xml.

  4. Remove the hsqldb-databases section:

  5. Remove the hsqldb-starter section:

  6. Remove the SystemStatusFilter.

    circle-info

    Note: This filter shows status messages useful only for development and testing.

  7. Remove the filter mapping:

  8. Save and close web.xml.

  9. Delete the /pentaho-server/data/ directory.

  10. Restart the Pentaho Server, then sign in to the Pentaho User Console as an administrator.

    On Browse Files, delete the sample folders under Public.

Data Integration and Analytics components and tools

Pentaho Data Integration and Analytics includes web-based components and design tools. What you use depends on your workflow and environment.

Data Integration and Analytics web-based components

Use the Pentaho web-based components to share business intelligence solutions by analyzing data, creating reports, and building dashboards.

These components include:

  • Pentaho User Console (PUC)

    A design environment for accessing Analyzer, Interactive Reports, and Dashboard Designer. PUC also offers administration features for configuring your Pentaho Server.

  • Analyzer

    Visualize data to make informed decisions. Create charts and visualizations, filter data, and configure drill-down links.

  • Interactive Reports

    Create simple and on-demand operational reports without relying on IT.

  • Dashboard Designer

    Create dashboards from templates, themes, and content. Combine Interactive Reports, Analyzer, and more.

  • CTools

    A community-driven framework for creating dashboards with web technologies.

Data Integration and Analytics design tools

Use Pentaho design tools to model, transform, and store data.

These tools include:

  • Pentaho Data Integration (PDI)

    An ETL engine for capturing data, cleansing it, and storing it in a usable format.

  • Report Designer

    Create pixel-perfect reports from virtually any data source.

  • Aggregation Designer

    Create aggregate tables for OLAP cubes to improve performance.

  • Metadata Editor

    Build metadata domains and models. Map physical database structures into a business model.

  • Schema Workbench

    Create and edit Mondrian models.

Evaluate Pentaho Data Integration and Analytics

Before you set up and use Pentaho Data Integration and Analytics in production, evaluate it:

Data Integration and Analytics supported technologies

Reference material for supported components and JDBC drivers:

Last updated

Was this helpful?