PDI job entries

Entries extend and expand the functionality of Pentaho Data Integration (PDI) entries. You can use the following steps in PDI.

Entries: A - F

Name
Category
Description

Utility

Abort the job.

File management

Add filenames to result.

Big Data

Execute MapReduce jobs in Amazon EMR.

Big Data

Execute Hive jobs in Amazon EMR.

Bulk loading

Load from a MySQL table into a file.

Bulk loading

Bulk load data into Databricks tables

Bulk loading

Bulk loads files located in S3 buckets into an Amazon Redshift database.

Bulk loading

Load data into an Azure SQL database from Azure Data Lake Storage.

Bulk loading

Load data from a file into a MSSQL table.

Bulk loading

Load data from a file into a MySQL table.

Bulk loading

Load data from a file into a Snowflake data warehouse

Conditions

Check if we can connect to one or several databases.

Conditions

Check if one or several files are locked by another process.

Conditions

Check if a folder is empty.

Repository

Return true if we are connected to a repository.

XML

Check if one or several XML files is/are well formed.

Conditions

Check if a webservice is available.

Conditions

Checks if files exist.

Conditions

Check if one or several columns exist in a table on a specified connection.

File management

Compare two folders (or two files).

File management

Convert file content between Windows and Unix. Converting to Unix will replace CRLF (carriage return and line feed) by LF (line feed).

File management

Copy files.

File management

Copy or move result filenames (since version 5.0, this job entry has been renamed to Process result filenames and it handles Delete as well).

File management

Create a folder.

File management

Create an empty file.

Service management

Create a Snowflake virtual warehouse.

File encryption

Decrypt files encrypted with PGP (Pretty Good Privacy). This job entry needs GnuPG to work properly.

File management

Delete a file.

File management

Delete filenames from result.

File management

Delete files.

File management

Delete specified folders. If a the folder contains files, PDI will delete them all.

Service management

Drops a Snowflake warehouse.

Utility

Display a simple message information box.

XML

Verify if an XML file corresponds to a certain structure or format.

General

Use the Dummy job entry to do nothing in a job.

File encryption

Encrypt files with PGP (Pretty Good Privacy). This job entry needs GnuPG to work properly.

Conditions

Evaluate files size or files count.

Conditions

Evaluate the content of a table. You can also specify an SQL query.

Repository

Export repository to XML file.

File management

Compare two files.

Conditions

Check if a file exists.

File transfer

Delete files in a remote host.

Entries: G - L

Name
Category
Description

File transfer

Get files using FTP (File Transfer Protocol).

File transfer

Get a file with FTP secure.

File transfer

Get files using SFTP (Secure File Transfer Protocol).

Mail

Get mails (POP3/IMAP) server and save into a local folder.

Big Data

Load data into Google BigQuery from a Google Cloud Storage account.

Big Data

Copies files in a Hadoop cluster from one location to another.

Big Data

Execute a map/reduce job contained in a jar file.

Utility

Acknowledge HL7 messages.

Utility

Read data from HL7 data streams within a transformation.

File management

Get or upload a file using HTTP (Hypertext Transfer Protocol).

Scripting

Evaluate the result of the execution of a previous job entry.

General

Execute a job.

Utility

Reset the Kafka offset value.

Entries: M - R

Name
Category
Description

Mail

Send an email.

Mail

Check the validity of an email address (SNMP trap to a target host.)

Service management

Modify a Snowflake virtual warehouse.

File management

Move files.

Big Data

Execute Oozie workflows.

Big Data

Execute transformation-based MapReduce jobs in Hadoop.

Pig Script Executor

Deprecated

No longer a part of the PDI distribution. Contact Pentaho Support for details.

Utility

Ping a host.

File transfer

Put a file with FTP.

File management

Copy, move, or delete result filenames.

File transfer

Put files using SFTP (Secure File Transfer Protocol).

Entries: S - Z

Name
Category
Description

Utility

Send information to another server using the Syslog protocol.

Utility

Send Nagios passive checks.

Utility

Send SNMP trap to a target host.

General

Set one or several variables.

Scripting

Execute a shell script.

Conditions

Evaluate one field or variable.

Big Data

Submit Spark jobs to Hadoop clusters.

Scripting

Executes SQL on a certain database connection.

Big Data

Export data from the Hadoop Distributed File System (HDFS) into a relational database (RDBMS) using Apache Sqoop.

Big Data

Import data from a relational database (RDBMS) into the Hadoop Distributed File System (HDFS) using Apache Sqoop.

SSH2 Get (deprecated)

Deprecated

Get files using SSH2 (Deprecated in 5.0 in favor of the SFTP job entry).

SSH2 Put (deprecated)

Deprecated

Put files in a remote host using SSH2 (Deprecated in 5.0 in favor of the SFTP job entry).

General

Defines the starting point for job execution. Every job must have one (and only one) Start.

Big Data

Start a PDI Cluster on YARN.

Big Data

Stop a PDI Cluster on YARN.

Service management

Resumes a Snowflake warehouse.

Service management

Suspends a Snowflake warehouse.

General

Clear any error state encountered in a job and forces it to a success state.

Conditions

Check if a table exists on a database connection.

General

Run a transformation.

Utility

Truncate one or several tables.

File management

Unzip file in a target folder.

File transfer

Upload files to a FTP secure.

File encryption

Verify file signature with PGP (Pretty Good Privacy). This job entry needs GnuPG to work properly.

Conditions

Wait for a delay.

File management

Wait for a file.

Utility

Scan a database and success when a specified condition on returned rows is true.

File management

Write text content to file.

Utility

Write message to log.

XML

Perform an XSD validation against data in a file or in an input field.

XML

Make an XSL transformation.

File management

Zip files from a directory and process files.

Last updated

Was this helpful?