Open and rename the job
To copy files in these instructions, use either the Hadoop Copy Files job entry or Hadoop command line tools.
Perform the following steps to modify the sample Spark job and understand how a Spark Submit entry works in PDI:
Copy a text file that contains words that you would like to count to the HDFS on your cluster.
Start the PDI client.
Open the
Spark Submit.kjb
job, which can be found in thedesign-tools/data-integration/samples/jobs/Spark Submit
folder.Select File > Save As, and then rename and save the file as
Spark Submit Sample.kjb
.
The Spark Submit Sample.kjb
file is saved to the jobs
folder.

Last updated
Was this helpful?