LZO support
LZO is a compression format supported by Amazon EMR. It is required for running AEL on EMR. To configure for LZO compression, you will need to add several properties.
Follow the instructions available here to install the Linux LZO compression library from the command line: https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.1/bk_command-line-installation/content/install_compression_libraries.html
Navigate to the
data-integration/adaptive-execution/config/
directory and open theapplication.properties
file.Add the following properties:
spark.executor.extraClassPath= /usr/lib/hadoop-lzo/lib/hadoop-lzo.jar
spark.driver.extraClassPath = /usr/lib/hadoop-lzo/lib/hadoop-lzo.jar
Append the following properties to include
-Djava.library.path=/usr/lib/hadoop-lzo/lib/native
at the end of each line:sparkExecutorExtraJavaOptions
sparkDriverExtraJavaOptions
Save and close the file.
Last updated
Was this helpful?