Pentaho Data Integration 11.0

Pentaho Data Integration (PDI) provides the Extract, Transform, and Load (ETL) capabilities that facilitate the process of capturing, cleansing, and storing data using a uniform and consistent format that is accessible and relevant to end users and IoT technologies.

If you or your administrator has not already installed PDI on your system, see the Install Pentaho Data Integration and Analytics document for details.

Get started with Pentaho Data Integration (PDI) by learning core ETL concepts, data types, client setup, and project organization.

  • Basic concepts of ETL in PDI

    PDI uses a workflow metaphor as building blocks for transforming your data and other tasks. Workflows are built using steps or entries as you create transformations and jobs. Each step or entry is joined by a hop which passes the flow of data from one item to the next.

  • Understanding PDI data types and field metadata

    As a best practice for producing consistent, predictable outcomes when working with your data in PDI, you must consider how the Pentaho engine processes different data types and field metadata in transformations and jobs.

  • Starting the PDI client

    After you have installed Pentaho Data Integration (PDI), you can use the PDI client (also known as Spoon) desktop application to start building transformations of your data.

  • Use the PDI client perspectives

    Pentaho Data Integration (PDI) empowers you with tools that include ETL and scheduling in one unified environment — the PDI client interface. This integrated environment enables you to work in close cooperation with business users to build business intelligence solutions more quickly and efficiently.

Last updated

Was this helpful?