# Add the Pentaho Data Optimizer service to the cluster

To add Data Optimizer to your cluster, perform the following steps:

1. Log in to the Cloudera Manager dashboard.
2. Navigate to the cluster, open the cluster's action menu, and select **Add Service**.
3. Select Pentaho Data Optimizer from the list of available services and click **Continue**.
4. Assign hosts for the **Volume** role.
   1. On the Assign Roles page, locate the **Volume** role for the Data Optimizer service and click the **Volume** dialog box.
   2. Select the hosts to assign to the **Volume** role and click **OK**.

      Only hosts that have the HDFS DataNode role are valid candidates for the Data Optimizer **Volume** role.
5. Assign hosts for the **Volume Monitor** role:
   1. Navigate back to the Assign Roles page and locate the **Volume Monitor** role for the Data Optimizer service.
   2. Click the **Volume Monitor** dialog box, then assign hosts in your cluster to the **Volume Monitor** role.
   3. If prompted, select **Custom**.
   4. Select each of the hosts to which you added the **Volume** role and click **OK**.

      **Note:** Each host with a **Volume** instance must have a **Volume Monitor** instance as well. Do not select hosts without a **Volume** instance.
6. Click **Continue**.
7. On the Review Changes page, enter the Data Optimizer volume configuration parameters for your environment.

   See the [Data Optimizer configuration parameters](/pdc-10.2-data-optimizer/pdso-install-landing-page/pdso-install-in-hadoop-cluster/pdso-configure-data-storage-optimizer/pdso-configuration-parameters.md) section for information about how to configure Data Optimizer volumes.

   **Note:** Remember the value of the `MOUNT_POINT` parameter. You will need this value when configuring HDFS to use the Data Optimizer volume.
8. After you have entered and confirmed all your Data Optimizer configuration values, click **Continue**.

   The Command Details page opens. From here, you can monitor the **First Run Command**.

   At this point, Cloudera Manager attempts to start the service and launch the **Volume** instances for the first time.
9. Monitor the start commands as they run in the background. Verify that all **Volume** and **Volume Monitor** instances start without error.
10. (Optional) If you encounter errors, troubleshoot them:
    1. Look at the `stdout`, `stderr`, and role logs in the Cloudera Manager UI.
    2. If necessary, see [Troubleshoot Data Optimizer](/pdc-10.2-data-optimizer/pdso-install-landing-page/pdso-install-in-hadoop-cluster/pdso-maintain-landing-page/troubleshoot-data-storage-optimizer-fs.md).
11. After all Data Optimizer volumes have started, you can click through the remaining pages in the Add Service wizard to return to the Cloudera Manager dashboard.
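
The host constraints from steps 4 and 5 can be checked on paper before you open the wizard. The following is a minimal sketch, not part of Data Optimizer or Cloudera Manager: the function name `check_role_assignment` and the host names are hypothetical, and the host lists are assumed to be collected by hand from your cluster. It only encodes the two rules stated above: every **Volume** host must have the HDFS DataNode role, and the **Volume** and **Volume Monitor** host sets must match.

```python
def check_role_assignment(datanode_hosts, volume_hosts, monitor_hosts):
    """Return a list of problems; an empty list means the plan is consistent."""
    datanodes = set(datanode_hosts)
    volumes = set(volume_hosts)
    monitors = set(monitor_hosts)
    problems = []

    # Rule from step 4: only HDFS DataNode hosts may carry the Volume role.
    for host in sorted(volumes - datanodes):
        problems.append(f"{host}: Volume role assigned, but host has no HDFS DataNode role")

    # Rule from step 5: every Volume host needs a Volume Monitor instance...
    for host in sorted(volumes - monitors):
        problems.append(f"{host}: Volume instance without a Volume Monitor instance")

    # ...and no Volume Monitor may run on a host without a Volume instance.
    for host in sorted(monitors - volumes):
        problems.append(f"{host}: Volume Monitor instance without a Volume instance")

    return problems


# Hypothetical host names, for illustration only.
datanodes = ["dn1.example.com", "dn2.example.com", "dn3.example.com"]
volumes = ["dn1.example.com", "dn2.example.com"]
monitors = ["dn1.example.com", "dn3.example.com"]

for problem in check_role_assignment(datanodes, volumes, monitors):
    print(problem)
```

With the sample lists, the check reports `dn2.example.com` (a **Volume** host with no **Volume Monitor**) and `dn3.example.com` (a **Volume Monitor** host with no **Volume**). An empty result means the planned assignment satisfies both rules.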

