Use a Spark client already installed on a cluster
To use a Spark client that already resides on a cluster, specify the cluster path in the sparkHome= parameter in the application.properties
file. For example:
sparkHome=/cluster_path/spark-2.4.5-bin-hadoop2.7/
where cluster_path
is your specific path.
The Spark client is started as part of the AEL execution and does not require any manual startup. The following examples show common cluster configurations.
Cluster Configuration
Example Entry
CDH 6.1
sparkHome=/opt/cloudera/parcels/CDH-6.1.1-1.cdh6.1.1.p0.875250/lib/spark/
CDH 6.2
sparkHome=/opt/cloudera/parcels/CDH-6.2.0-1.cdh6.2.0.p0.967373/lib/spark/
EMR 5.24
sparkHome=/usr/lib/spark/
GDP 1.4.2.1
sparkHome=/usr/lib/spark/
HDP 3.1
sparkHome=/usr/hdp/current/spark2-client
Last updated
Was this helpful?