# PDI big data transformation steps

You can use the following Pentaho Data Integration transformation steps to help enable PDI to work with big data technologies:

* Avro Input
* Avro Output
* Cassandra Input
* Cassandra Output
* CouchDB
* Hadoop File Input
* Hadoop File Output
* HBase Input
* HBase Output
* HBase Row Decoder
* Kafka Consumer
* Kafka Producer
* MapReduce Input
* MapReduce Output
* MongoDB Input
* MongoDB Output
* ORC Input
* ORC Output
* Parquet Input
* Parquet Output
* Splunk Input
* Splunk Output
* SSTable Output

See the **Transformation step reference** in the **Pentaho Data Integration** document for details and additional job entries.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.pentaho.com/install/9.3-install/use-hadoop-with-pentaho/advanced-topics/pdi-big-data-transformation-steps.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
