Transforming data with PDI

Transform data and manage jobs in Pentaho Data Integration (PDI) with tools for development, scheduling, monitoring, and optimization.

  • Work with transformations

    In the PDI client, you can develop transformations, which are data workflows representing your ETL activities.

  • Work with jobs

    Develop jobs to orchestrate your ETL activities. The entries used in your jobs define the individual ETL elements, such as transformations, applied to your data.

  • PDI run modifiers

    Use arguments, parameters, or variables to modify how you run PDI transformations and jobs.

  • Partitioning data

    You can use partitioning to distribute all the data from a set into distinct subsets according to the rule applied on a table or row, where these subsets form a partition of the original set with no item replicated into multiple groups.

  • Logging and performance monitoring

    Monitor and analyze the performance of Pentaho Data Integration (PDI) transformations and jobs using logging, performance monitoring, and impact analysis.

  • Add notes to transformations and jobs

    Notes can help you and others understand the structure, design decisions, business rules, dependencies, and other aspects of your transformations and jobs.

  • Manage PDI transformations and job schedules

    Schedule, edit, or manage Pentaho Data Integration (PDI) transformations and jobs to run at specific times or intervals.

Last updated

Was this helpful?