Add the Pentaho Data Optimizer service to the cluster
To add Data Optimizer to your cluster, perform the following steps:
Log in to the Cloudera Manager dashboard.
Navigate to the cluster and open the action menu dropdown for the cluster then select Add Service.
Select Pentaho Data Optimizer from the list of available services and click Continue.
Assign hosts for the Volume role.
On the Assign Roles page, locate the Volume role for the Data Optimizer service and click the Volume dialog box.
Select the hosts to assign to the Volume role and click OK.
Only hosts that have the HDFS Datanode role are valid candidates to add the Data Optimizer Volume role.
Assign hosts for the Volume Monitor role:
Navigate back to the Assign Roles page and locate the named Volume Monitor for the Data Optimizer service.
Click the Volume Monitor dialog box then assign hosts in your cluster to the Volume Monitor role.
If prompted, select Custom.
Select each of the hosts to which you added the Volume role and click OK.
Note: Each host with a Volume instance must have a Volume Monitor instance as well. Do not select hosts without a Volume instance.
Click Continue.
Proceed to the Review Changes page. then enter the Data Optimizer volume configuration parameters for your environment.
See the Data Optimizer configuration parameters section for information about how to configure Data Optimizer volumes.
Note: Remember the value of the
MOUNT_POINT
parameter. You will need this value when configuring HDFS to use the Data Optimizer volume.After you have entered and confirmed all your Data Optimizer configuration values, click Continue.
The Command Details page opens. From here, you can monitor the First Run Command.
At this point in the process, Cloudera Manager attempts to start the service and launch the Volume instances for the initial time.
Monitor the start commands as they run in the background. Verify that all Volume and Volume Monitor instances start without error.
(Optional) If you encounter errors, you may need to troubleshoot.
Look at the
stdout
,stderr
, and role logs in the Cloudera Manager UI.If necessary, see Troubleshoot Data Optimizer.
After all Data Optimizer volumes have started, you can click through the remaining pages in the Add Service wizard to return to the Cloudera Manager dashboard.
Last updated
Was this helpful?