Configure and execute PDI-CLI

Configure and run AWS Batch using the PDI-CLI image.

See the AWS instructions for the following steps at Getting Started with AWS Batch.

  1. Navigate to the AWS Batch home page.

  2. Create a compute environment by selecting Compute environments and follow the instructions.

  3. Create a job queue by selecting Job queuesand follow the instructions.

  4. Create a job definition by selecting Job definitions and follow the instructions.

    Provide the image name in the section for configuring the container.

  5. Create a job by selecting Jobs and follow the instructions.

    In the Environment Variables section, configure the following variables:

Variable

Description

PROJECT_S3_LOCATION

Configures the S3 path from where the data is downloaded. It is then uploaded to the container.

Example: Set PROJECT_S3_LOCATION to s3://pentaho-samples/

METASTORE_LOCATION

Configures the metastore path from where the metastore content and configuration will be downloaded. It is then uploaded to the path of the container: /home/pentaho/.pentaho.

Example: Set METASTORE_LOCATION to metastore

PROJECT_STARTUP_JOB

Path used to execute KJB files.

Example: Set PROJECT_STARTUP_JOB to jobs/run_job_write_to_s3/read_csv_from_s3_job.kjb

LICENSE_TOKEN

License token or server URL that is used to provide access to Pentaho.Example: Set LICENSE_TOKEN to http://localhost:7070/license-server/request(Sample).

PARAMETERS

Parameters passed to the running job or transformation. Example: Set PARAMETERS to -param:my_param_name=MYVALUE.

You can now run Pentaho transformations and jobs using PDI-CLI.

Last updated

Was this helpful?