LZO support
LZO is a compression format supported by Amazon EMR. It is required for running AEL on EMR. To configure for LZO compression, you will need to add several properties.
Follow the instructions available here to install the Linux LZO compression library from the command line: https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.1/bk_command-line-installation/content/install_compression_libraries.html
Navigate to the
data-integration/adaptive-execution/config/directory and open theapplication.propertiesfile.Add the following properties:
spark.executor.extraClassPath= /usr/lib/hadoop-lzo/lib/hadoop-lzo.jarspark.driver.extraClassPath = /usr/lib/hadoop-lzo/lib/hadoop-lzo.jar
Append the following properties to include
-Djava.library.path=/usr/lib/hadoop-lzo/lib/nativeat the end of each line:sparkExecutorExtraJavaOptions
sparkDriverExtraJavaOptions
Save and close the file.
Last updated
Was this helpful?

