> For the complete documentation index, see [llms.txt](https://docs.pentaho.com/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.pentaho.com/pdia-data-integration/10.2-data-integration/pdi-transformation-steps-reference-overview/s3-csv-input-cp/options-s3-csv-input.md).

# Options

![S3 CSV Input](/files/ghKMWn5174BATYC2ycj8)

**CAUTION:**

For technical reasons, parallel reading of S3 files is only supported on files that do not have fields with line breaks or carriage returns in them.

The following options are available for the S3 CSV Input transformation step.

| Option                        | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                          |
| ----------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| **Step name**                 | Specify the unique name of the S3 CSV Input step on the canvas. You can customize the name or leave it as the default.                                                                                                                                                                                                                                                                                                                                               |
| **S3 Bucket**                 | Specify the S3 container where the CSV object file is stored. Or click **Select bucket** to browse to and select the S3 container where the CSV object file is stored.                                                                                                                                                                                                                                                                                               |
| **Filename**                  | <p>Specify one of the following names (or click <strong>Browse</strong>) for the input file:- The name (<strong>Filename</strong>) of the S3 source file.</p><ul><li>The field which will become the name of the S3 source file or files at runtime, if the S3 CSV Input step receives data from another step.</li></ul><p>The file name of a file in the S3 Cloud uses the following schema:</p><p><code>s3n://s3\_bucket\_name/absolute\_path\_to\_file</code></p> |
| **Delimiter**                 | <p>Specify the file delimiter character used in the source file. The default delimiter for the CSV File Input step is a semicolon (;).</p><p>Click <strong>Insert Tab</strong> to use a tab as the delimiter.</p><p>Special characters can be set with the format: <code>$\[value]</code></p><p>For example: <code>$\[01]</code> or <code>$\[6F,FF,00,1F]</code></p>                                                                                                 |
| **Enclosure**                 | <p>Specify the enclosure character used in the source file. The default value is double quotes (<code>“</code>).</p><p>Special characters can be set with the format: <code>$\[value]</code></p><p>For example: <code>$\[01]</code> or <code>$\[6F,FF,00,1F]</code></p>                                                                                                                                                                                              |
| **Max line size**             | Specify the maximum characters to be read per line by the input operation. The default is 5000.                                                                                                                                                                                                                                                                                                                                                                      |
| **Lazy conversion?**          | Select this option to delay the conversion of a row of data until it is absolutely necessary.                                                                                                                                                                                                                                                                                                                                                                        |
| **Header row present?**       | Select to indicate whether the source file contains a header row with column names.                                                                                                                                                                                                                                                                                                                                                                                  |
| **The row number field name** | Specify the name of the field that will contain the row number in the output of this step.                                                                                                                                                                                                                                                                                                                                                                           |
| **Running in parallel**       | <p>Select to indicate whether you will have multiple instances of this step running (step copies) and if you want each instance to read a separate part of the S3 file(s).</p><p>When reading multiple files, the total size of all files is taken into consideration to split the workload. In that specific case, make sure that ALL step copies receive all files that need to be read; otherwise, the parallel algorithm will not work correctly.</p>            |


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://docs.pentaho.com/pdia-data-integration/10.2-data-integration/pdi-transformation-steps-reference-overview/s3-csv-input-cp/options-s3-csv-input.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
