> For the complete documentation index, see [llms.txt](https://docs.pentaho.com/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.pentaho.com/pdia-data-integration/10.2-data-integration/pdi-transformation-steps-reference-overview/hadoop-file-output-cp-main-page/options-hadoop-file-output-reuse/file-tab-hadoop-file-output-kettle.md).

# File tab

![File tab](/files/fq4d2BviIHh9El0A0ufk)

The **File** tab contains the following options that define the basic properties for the file being created:

| Option                                 | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |
| -------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ |
| **Hadoop Cluster**                     | <p>Specify which Hadoop cluster configuration to use.</p><p>You can specify information like host names and ports for HDFS, Job Tracker, and other big data cluster components through the Hadoop Cluster configuration dialog box. Click <strong>Edit</strong> to edit an existing cluster configuration in the dialog box, or click <strong>New</strong> to create a new configuration with the dialog box. Once created, Hadoop cluster configurations settings can be reused by other transformation steps and job entries. See <a href="/pages/pAzzVzwCU5XRZ9ANZ1SU">Connecting to a Hadoop cluster with the PDI client</a> for more details on the configuration settings.</p> |
| **Folder/File**                        | Specify the location and/or name of the output text file written to the Hadoop Cluster. Click **Browse** to navigate to the source file or folder in the [VFS browser](/pdia-data-integration/10.2-data-integration/data-integration-perspective-in-the-pdi-client/virtual-file-system-browser/vfs-browser-connecting-to-virtual-file-systems.md).                                                                                                                                                                                                                                                                                                                                   |
| **Create Parent Folder**               | Indicate if a parent folder should be created for the output text file.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |
| **Do not create file at start**        | Avoid empty files when no rows are processed.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |
| **Accept file name from field?**       | <p>Indicate if you want to specify the file name(s) in a field in the input stream.</p><p>This setting can be fine-tuned with the <code>kettle.properties</code> file. See <a href="/pages/4y589gJGQnFtoK3IBy7f">Improving performance when writing multiple files</a>.</p>                                                                                                                                                                                                                                                                                                                                                                                                          |
| **File name field**                    | Specify the field that contains the filename(s) in the input stream during runtime.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |
| **Extension**                          | Add an extension to the end of the file name. The default is .txt.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |
| **Include stepnr in filename**         | Include the copy number in the file name (`_0` for example) when you run the step in multiple copies (launching several copies of a step).                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
| **Include partition nr in file name?** | Include the data partition number in the file name.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |
| **Include date in file name**          | Include the system date in the filename (`_20181231` for example).                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |
| **Include time in file name**          | Include the system time in the filename (`_235959` for example).                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |
| **Specify Date time format**           | Indicate you want to specify the date time format from the list in the **Date time format** drop-down list.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |
| **Date time format**                   | Specify date time formats.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
| **Show file name(s)**                  | Display a list of the files generated. The list is a simulation and depends on the number of rows that go into each file.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |
| **Add filenames to result**            | Add the filename to the internal file result set.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://docs.pentaho.com/pdia-data-integration/10.2-data-integration/pdi-transformation-steps-reference-overview/hadoop-file-output-cp-main-page/options-hadoop-file-output-reuse/file-tab-hadoop-file-output-kettle.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
