To use the HBase Input and HBase Output steps with EMR 5.21, you must add the following parameter:
spark.hadoop.validateOutputSpecs=false
You can use any of these methods to set the parameter:
Specify the parameter in the properties file
Specify the parameter in Transformation properties
Specify the parameter as an environment variable in PDI
For more information about the properties file and processing Spark parameters, see the Administer Pentaho Data Integration and Analytics document.
Last updated 8 months ago
Was this helpful?