Troubleshooting Pentaho Data Catalog

The Pentaho Data Catalog log files contain information that can help you determine the root cause of error messages you might see. Refer to the following topics for information on how to resolve the issues causing the error messages.

Low disk space message

If you see a Low disk space message from Pentaho Data Catalog while loading images into the Docker repository, you can resolve this issue by linking the Docker root directory to another directory.

Important: The other directory should have at least 100 GB of free space.

Use the following steps to resolve this issue:

  1. Enter the following commands to link the /var/lib/docker directory to a directory with at least 100 GB of free space.

    Note: In this example, the directory with at least 100 GB of free space is <dir with min 100 GB free>. You should replace <dir with min 100 GB free> in the command with the full path to your directory with a minimum of 100 GB of free space.

    sudo systemctl stop docker
    sudo mv /var/lib/docker <dir with min 100 GB free>
    sudo ln -s <dir with min 100 GB free> /var/lib/docker
    sudo systemctl start docker
  2. Repeat the action that produced the Low disk space message.

The action should succeed without producing a Low disk space message.

The service "um-alpine-init" didn't complete successfully message

If you see the message service "um-alpine-init" didn't complete successfully when using Keycloak with Pentaho Data Catalog, there are two possible causes.

When you use the Keycloak identity and access management (IAM) tool for user authentication with Pentaho Data Catalog, the service um-alpine-init checks whether Keycloak is up and running. You can determine the cause of the message service "um-alpine-init" didn't complete successfully by checking the log files for other messages that appear with the um-alpine-init message. The problem could be that the GLOBAL_SERVER_HOST_NAME variable or the security certificates need to be updated.

Use the following steps on the PDC server to determine the cause of the message and resolve the issue.

  1. Enter the following command to search the logs for the message about the um-alpine-init service:

    ./pdc.sh logs um-alpine-init

    When you find the service "um-alpine-init" didn't complete successfully message, you need to follow different steps depending on what you see. Use one of the optional steps below.

  2. (Optional) If you see the message Max retries reached. Exiting…, this means that Data Catalog cannot connect to Keycloak. Use the following steps on the PDC server to establish a connection to Keycloak:

    1. Edit the conf/.env file to update the GLOBAL_SERVER_HOST_NAME variable with your hostname or IP address, as in the following example:

      GLOBAL_SERVER_HOST_NAME="myhost.pdc.eng.example.com"

    2. Enter the following command to restart PDC services:

      ./pdc.sh restart

    PDC should start up and run without an error message.

  3. (Optional) If you see the message Update TokenResp error 401, use the following steps:

    1. Use the following command to look for "um-css-admin-api" in the logs:

      ./pdc.sh logs um-css-admin-api

      Sample output:

      um-css-admin-api-1  | [2024-03-28T16:56:27.558Z]
              [css-admin-api : css-admin-api] - [error]:   
              Failed to fetch public key self-signed certificate 
       um-css-admin-api-1  |  Error:
              self-signed certificate
       um-css-admin-api-1 
                |     at TLSSocket.onConnectSecure
              (node:_tls_wrap:1659:34)
       um-css-admin-api-1 
                |     at TLSSocket.emit
              (node:events:517:28)
       um-css-admin-api-1  |     at TLSSocket._finishInit
              (node:_tls_wrap:1070:8)
       um-css-admin-api-1 
                |     at ssl.onhandshakedone
              (node:_tls_wrap:856:12)
       um-css-admin-api-1 
                |     at TLSWrap.callbackTrampoline
              node:internal/async_hooks:128:17) {
       um-css-admin-api-1 
                |   code:
              'DEPTH_ZERO_SELF_SIGNED_CERT'
       um-css-admin-api-1 
              | }  

      This output includes the message Failed to fetch public key self-signed certificate, which points to an error with the PDC self-signed certificates.

    2. Stop PDC services by entering the following command:

      ./pdc.sh stop

    3. Edit the conf/.env file to change the GLOBAL_SERVER_HOST_NAME variable to the fully qualified domain name (FQDN) for the host server, as in the following example:

      GLOBAL_SERVER_HOST_NAME="myhost.pdc.eng.example.com"

    4. Remove all certificates by entering the following command:

      rm -rf conf/{https,extra-certs,mongodb}

    5. Restart PDC services by entering the following command:

      ./pdc.sh restart

      The server generates new self-signed certificates based on the FQDN that you provided.

    PDC should start up and run without an error message.

opensearch-cluster-init service fails to start on existing deployment

The opensearch-cluster-init service may fail to start when you run ./pdc.sh up on an existing Pentaho Data Catalog deployment. This happens if OpenSearch crashed previously and did not remove the lock file, preventing new processes from accessing the data folder.

Perform the following steps to fix the issue:

  1. Stop the services:

    ./pdc.sh stop
  2. Remove the OpenSearch lock file:

    sudo rm -rf /var/lib/docker/volumes/pdc_opensearch_data/_data/nodes/0/node.lock
  3. Restart the services:

    ./pdc.sh start

The OpenSearch service starts successfully.

Last updated

Was this helpful?