Partitioning data

You can use partitioning to distribute all the data in a set into distinct subsets according to a rule applied to a table or row. Together, these subsets form a partition of the original set: every item belongs to exactly one subset, and no item is replicated across groups.
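As a rough illustration of such a rule, the following Python sketch assigns each row to exactly one subset by taking a key modulo the number of partitions. It is not Pentaho Data Integration's API; the function `partition_rows`, the `customer_id` and `amount` fields, and the modulo rule are all assumptions chosen for the example.

```python
from collections import defaultdict


def partition_rows(rows, key, num_partitions):
    """Assign each row to exactly one partition using a modulo rule on `key`.

    Hypothetical helper for illustration only; not part of any Pentaho API.
    """
    partitions = defaultdict(list)
    for row in rows:
        # The rule: the integer key value modulo the partition count.
        partition_id = row[key] % num_partitions
        partitions[partition_id].append(row)
    return dict(partitions)


# Six sample rows with made-up field names.
rows = [{"customer_id": i, "amount": 10 * i} for i in range(1, 7)]
partitions = partition_rows(rows, "customer_id", 3)

for partition_id in sorted(partitions):
    ids = [row["customer_id"] for row in partitions[partition_id]]
    print(partition_id, ids)
```

Because the rule maps each key to a single partition number, every row lands in one subset only, and the subsets together contain all the original rows.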

You can scale up and scale out your transformations and jobs when you partition your data in Pentaho Data Integration. Scaling up makes the most of a single server with multiple CPU cores, while scaling out maximizes the resources of multiple servers operating in parallel.
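The idea behind both kinds of scaling can be sketched in Python: once the data is split into disjoint partitions, each partition can be handled by its own worker and the partial results combined afterwards. This is only a conceptual sketch, not how Pentaho Data Integration runs transformations. The names `total_amount` and `process_partitions` and the sample data are hypothetical, and the threads here merely stand in for parallel workers; in Python, using several CPU cores for compute-heavy work would typically need processes instead.

```python
from concurrent.futures import ThreadPoolExecutor


def total_amount(partition):
    """Work done independently on one partition (hypothetical example task)."""
    return sum(row["amount"] for row in partition)


def process_partitions(partitions, max_workers):
    """Hand each partition to its own worker and collect results in order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map returns results in the same order as the input partitions.
        return list(pool.map(total_amount, partitions))


# Three disjoint partitions of made-up rows.
example_partitions = [
    [{"amount": 30}, {"amount": 60}],
    [{"amount": 10}, {"amount": 40}],
    [{"amount": 20}, {"amount": 50}],
]

partial_totals = process_partitions(example_partitions, max_workers=3)
grand_total = sum(partial_totals)
print(partial_totals, grand_total)
```

Since no row appears in more than one partition, the workers never process the same row twice, and the partial totals can simply be added together.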
