Unable to read or write files to HDFS on the Amazon EMR cluster
When running a transformation on an EMR cluster, the transformation appears to run successfully, but an empty file is written to the cluster. When PDI is not installed on the Amazon EC2 instance where you are running your transformation, you are unable to read or write files to the HDFS cluster. Any files written to the cluster are empty.
To resolve this issue, perform the following steps to edit the hdfs-site.xml
file on the PDI client
:
Navigate to the
<username>/.pentaho/metastore/pentaho/NamedCluster/Configs/<user-defined connection name>
directory.Open the
hdfs-site.xml
file with any text editor.Add the following code:
<property> <name>dfs.client.use.datanode.hostname</name> <value>true</value> </property>
Save and close the file.
PreviousLegacy mode activated when named cluster configuration cannot be locatedNextUse YARN with S3
Last updated
Was this helpful?