Use a Spark client already installed on a cluster

To use a Spark client that already resides on a cluster, specify the cluster path in the sparkHome= parameter in the application.properties file. For example:

sparkHome=/cluster_path/spark-2.4.5-bin-hadoop2.7/

where cluster_path is your specific path.

The Spark client is started as part of the AEL execution and does not require any manual startup. The following examples show common cluster configurations.

Cluster Configuration
Example Entry

CDH 6.1

sparkHome=/opt/cloudera/parcels/CDH-6.1.1-1.cdh6.1.1.p0.875250/lib/spark/

CDH 6.2

sparkHome=/opt/cloudera/parcels/CDH-6.2.0-1.cdh6.2.0.p0.967373/lib/spark/

EMR 5.24

sparkHome=/usr/lib/spark/

GDP 1.4.2.1

sparkHome=/usr/lib/spark/

HDP 3.1

sparkHome=/usr/hdp/current/spark2-client

Last updated

Was this helpful?