# Partitioning during data processing

To take advantage of the processing resources in your server, you can scale up the transformation using the multi-threading option **Change Number of Copies to Start** to produce copies of the steps (right-click the step to access the menu). As shown below, the **x2** notation indicates that two copies will be started at runtime. By default, this data movement from the CSV file input step into the `count by state` step will be performed in round-robin order. This means that if there are 'N' copies, the first copy gets the first row, the second copy gets the second row, and the Nth copy receives the Nth row. Row N+1 goes to the first copy again, and so on until there are no more rows to distribute. Reading the data from the CSV file is done in parallel. Attempting to aggregate in parallel, however, produces incorrect results because the rows are split arbitrarily (without a specific rule) over the two copies of the `count by state` aggregation step, as shown in the preview data.

![Aggregate in parallel, data error example](/files/awTOgaaXra3JLOTRWgHj)


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.pentaho.com/pdia-data-integration/10.2-data-integration/advanced-topics-pentaho-data-integration-overview/partitioning-data/get-started/partitioning-during-data-processing.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
