Upgrade Data Catalog

Upgrading Pentaho Data Catalog ensures that you can take advantage of the latest features, improvements, and security updates while maintaining your existing configurations and data. This section provides step-by-step instructions to upgrade your Data Catalog deployment from earlier versions to newer releases.

This section covers the following upgrade paths for Pentaho Data Catalog 10.2.x:

  • Upgrade PDC 10.2.1 to 10.2.5

  • Upgrade PDC 10.2.0 to 10.2.1

Before starting, ensure you review the prerequisites, back up your configuration and data, and prepare the deployment environment as described in the respective upgrade topics.

Upgrade PDC 10.2.1 to 10.2.5

Upgrading Pentaho Data Catalog (PDC) to the latest version ensures access to new features, improved performance, and security updates.

Before you begin, make sure that:

  • PDC 10.2.1 is installed. For a new installation of Data Catalog, see Installing Data Catalog.

  • You have a backup of your configurations and data.

  • You have administrative access to the server where Data Catalog is installed.

Perform the following steps to upgrade from PDC 10.2.1 to PDC 10.2.5:

  1. Go to the Data Catalog installation folder, where PDC 10.2.1 is installed:

    cd /opt/pentaho/pdc-docker-deployment/
  2. Stop the currently running PDC services.

    ./pdc.sh stop
  3. Back up configuration files:

    1. Create a backup folder in /opt/ to store the container data and configuration backups:

      mkdir /opt/pdc_backup_10_2_1
    2. Copy the conf folder to the backup folder you created:

      cp -a /opt/pentaho/pdc-docker-deployment/conf/ /opt/pdc_backup_10_2_1/
    3. Copy the vendor folder to the backup folder you created:

      cp -a /opt/pentaho/pdc-docker-deployment/vendor/ /opt/pdc_backup_10_2_1/
  4. Back up Docker volumes:

    1. Identify PDC Docker volumes:

      docker volume ls | grep pdc
    2. Go to the Docker volumes path:

      cd /var/lib/docker/volumes/
    3. Copy PDC Docker volumes to the backup folder:

      cp -a pdc* /opt/pdc_backup_10_2_1/
  5. Verify that all configuration files and Docker volumes are present and complete in the backup folder (/opt/pdc_backup_10_2_1/).

  6. Remove the vendor folder of PDC 10.2.1 build:

    rm -rf /opt/pentaho/pdc-docker-deployment/vendor
  7. Load the PDC 10.2.5 images into the Docker repository:

    docker load -i pdc-10.2.5-images.tgz
  8. Extract the contents of the PDC 10.2.5 package and overwrite the existing files in the deployment folder (/opt/pentaho/pdc-docker-deployment/):

    Important: Before overwriting the files, review any custom configurations made in the Docker Compose files, note those changes, and reapply them to the new files.

    tar -xzvf pdc-<DEPLOYMENT_PACKAGE_TYPE>-10.2.5-compose.tgz -C /opt/

    The <DEPLOYMENT_PACKAGE_TYPE> placeholder corresponds to the type of PDC service you want to deploy. For example:

    • For PDC full services, use: pdc-full-10.2.5-compose.tgz

    • For PDC with Pentaho Data Optimizer services, use: pdc-pdo-10.2.5-compose.tgz

    • For PDC with Pentaho Data Mastering services, use: pdc-pdm-10.2.5-compose.tgz

    If you are unsure which deployment package to use, contact Pentaho Support for guidance.

  9. Start PDC services with the new version:

    ./pdc.sh up
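The backup check in step 5 can be partly automated. The following is a minimal sketch, not part of the PDC tooling: verify_backup is a hypothetical helper that uses diff -rq to confirm that a backed-up directory matches its source. It assumes you run it at step 5, before the vendor folder is removed or any files are overwritten.

```shell
# Hypothetical helper (not shipped with PDC): compare a source directory
# with its copy in the backup folder.
# Returns 0 when the trees match, 1 when they differ, 2 when a directory is missing.
verify_backup() {
    local src="$1"     # original directory, for example the conf folder
    local backup="$2"  # copied directory inside the backup folder

    if [ ! -d "$src" ] || [ ! -d "$backup" ]; then
        echo "missing directory: $src or $backup" >&2
        return 2
    fi

    # diff -r walks both trees; -q only reports whether files differ.
    if diff -rq "$src" "$backup" >/dev/null; then
        echo "OK: $backup matches $src"
    else
        echo "MISMATCH: $backup differs from $src" >&2
        return 1
    fi
}
```

For example, verify_backup /opt/pentaho/pdc-docker-deployment/conf /opt/pdc_backup_10_2_1/conf checks the configuration copy. The same call can be repeated for the vendor folder and for each copied Docker volume.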

You have successfully upgraded Pentaho Data Catalog version 10.2.1 to 10.2.5.
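The package naming described in step 8 can be captured in a small function so that the archive name is not typed by hand. This is only an illustrative sketch: pdc_package_name is a hypothetical helper, and it assumes the three package types listed above (full, pdo, pdm) are the only valid values.

```shell
# Hypothetical helper: build the compose package file name for a
# deployment package type. The version defaults to 10.2.5.
pdc_package_name() {
    local pkg_type="$1"
    local version="${2:-10.2.5}"

    case "$pkg_type" in
        full|pdo|pdm)
            printf 'pdc-%s-%s-compose.tgz\n' "$pkg_type" "$version"
            ;;
        *)
            # Reject anything that is not one of the documented types.
            echo "unknown deployment package type: $pkg_type" >&2
            return 1
            ;;
    esac
}
```

With this in place, the extraction command could be written as tar -xzvf "$(pdc_package_name full)" -C /opt/, which fails early on a mistyped package type instead of passing a wrong file name to tar.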

Upgrade PDC 10.2.0 to 10.2.1

Upgrading Pentaho Data Catalog (PDC) to a newer version ensures access to new features, improved performance, and security updates.

Before you begin, make sure you have a backup of your configurations and data, and you have administrative access to the server where Data Catalog is installed.

Perform the following steps to upgrade from PDC 10.2.0 to PDC 10.2.1:

  1. Go to the Data Catalog installation directory, where PDC 10.2.0 is installed (/opt/pentaho/pdc-docker-deployment/).

  2. Stop the currently running PDC services:

    ./pdc.sh stop

  3. Back up configuration files:

    1. Create a backup folder in /opt/ to store the container data and configuration backups:

      mkdir /opt/pdc_backup_10_2_0

    2. Copy the config directory to the backup folder you have created:

      cp -a /opt/pentaho/pdc-docker-deployment/conf/ /opt/pdc_backup_10_2_0/

  4. Back up Docker volumes:

    1. Identify PDC Docker volumes:

      docker volume ls | grep pdc

    2. Go to the Docker volumes directory:

      cd /var/lib/docker/volumes/

    3. Copy PDC Docker volumes to the backup folder:

      cp -a pdc* /opt/pdc_backup_10_2_0/

  5. Verify that all configuration files and Docker volumes are present and complete in the backup folder (/opt/pdc_backup_10_2_0/).

  6. Extract the contents of the PDC 10.2.1 package and overwrite the existing files in the deployment directory (/opt/pentaho/pdc-docker-deployment/):

    Important: Before overwriting the files, review any custom configurations made in the Docker Compose files, note those changes, and reapply them to the new files.

    tar -xzvf pdc-10.2.1.tar.gz -C /opt/

  7. Open the .env file in /opt/pentaho/pdc-docker-deployment/conf/, add or update the licensing server URL as shown in the following line, and then save the file:

    LICENSING_SERVER_URL=<your_license_server_url>

  8. After updating the configurations, load the Docker images for PDC 10.2.1:

    ./pdc.sh load-images

  9. Start PDC services with the new version:

    ./pdc.sh up
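One way to carry out the review recommended in step 6 is to extract the new package into a scratch directory first and compare it with the deployed files before overwriting anything. The sketch below is an assumption about workflow, not a documented PDC procedure: review_changes is a hypothetical helper, and the staging path in the usage note is illustrative.

```shell
# Hypothetical helper: print a unified diff between the deployed files and a
# separately extracted copy of the new package. Nothing is modified.
review_changes() {
    local deployed="$1"  # current deployment directory
    local staged="$2"    # directory holding the extracted new package
    local rc=0

    # diff exits 0 when the trees are identical and 1 when they differ;
    # both are normal outcomes here. Exit status 2 signals a real error.
    diff -ru "$deployed" "$staged" || rc=$?
    [ "$rc" -le 1 ]
}
```

For instance, after extracting the package with tar and -C /tmp/pdc_staging, calling review_changes /opt/pentaho/pdc-docker-deployment /tmp/pdc_staging/pentaho/pdc-docker-deployment would list every line that differs. This assumes the archive unpacks to pentaho/pdc-docker-deployment, so check the archive layout first. Lines prefixed with a minus sign exist only in the deployed files and are candidates for custom settings to reapply.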

You have successfully upgraded PDC from 10.2.0 to 10.2.1.
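The .env change in step 7 can also be scripted, which helps when the same upgrade is repeated on several servers. The following is a rough sketch rather than an official tool: set_licensing_url is a hypothetical helper that removes any existing LICENSING_SERVER_URL line and appends the new value, so the entry ends up at the end of the file.

```shell
# Hypothetical helper: add or update LICENSING_SERVER_URL in a .env file.
set_licensing_url() {
    local env_file="$1"
    local url="$2"
    local tmp

    [ -f "$env_file" ] || { echo "no such file: $env_file" >&2; return 1; }
    tmp=$(mktemp) || return 1

    # Keep every line except an existing LICENSING_SERVER_URL entry.
    # grep exits 1 when nothing is left to print, which is fine here.
    grep -v '^LICENSING_SERVER_URL=' "$env_file" > "$tmp" || true

    # Append the new value; printf avoids any special handling of
    # characters such as '&' or '/' in the URL.
    printf 'LICENSING_SERVER_URL=%s\n' "$url" >> "$tmp"

    # Write back through the original file so its owner and mode are kept.
    cat "$tmp" > "$env_file"
    rm -f "$tmp"
}
```

A call such as set_licensing_url /opt/pentaho/pdc-docker-deployment/conf/.env "<your_license_server_url>" produces the same result as editing the file by hand. Back up the .env file before running it, as described in step 3.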
