Install and configure a Remote Worker
The Remote Worker in Data Catalog provides a secure and scalable way to manage metadata: it extracts metadata and runs tasks across distributed environments while complying with network security requirements. For more information, see the Remote Worker section in Use Pentaho Data Catalog.
Perform the following steps to install and configure a Remote Worker:
Prerequisites
Ensure that Docker and Docker Compose are installed on the machine where the Remote Worker will be deployed.
Ensure that you have the latest Data Catalog Docker deployment bundle, pdc-<version>-images.tgz, and the Remote Worker bundle, pdc-remote-<version>-compose.tgz. For example, for PDC 10.2.5, the Data Catalog Docker deployment bundle is pdc-10.2.5-images.tgz and the Remote Worker bundle is pdc-remote-10.2.5-compose.tgz. If you don't have these bundles, contact Pentaho support.
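The prerequisites above can be checked from a shell before starting. The following is a minimal sketch, not part of the product: the helper names have_cmd and check_bundles are hypothetical, and it assumes both bundles sit in the same directory under the file names given above.

```shell
#!/bin/sh
# Hypothetical helper: succeed if the named command is on PATH.
have_cmd() {
    command -v "$1" >/dev/null 2>&1
}

# Hypothetical helper: check that both bundles for a given version exist.
# $1 = PDC version (for example 10.2.5), $2 = directory to look in (default: .)
check_bundles() {
    version="$1"
    dir="${2:-.}"
    missing=0
    for f in "pdc-${version}-images.tgz" "pdc-remote-${version}-compose.tgz"; do
        if [ ! -f "${dir}/${f}" ]; then
            echo "missing: ${dir}/${f}"
            missing=1
        fi
    done
    return "$missing"
}
```

For example, `have_cmd docker || echo "Docker is not installed"` followed by `check_bundles 10.2.5 /tmp/bundles` reports anything that is missing. How Docker Compose is invoked (`docker compose` or `docker-compose`) depends on how it was installed, so that check is left out of the sketch.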
Procedure
Load the latest Data Catalog Docker deployment bundle pdc-<version>-images.tgz into Docker.
Extract the Data Catalog Remote Worker bundle and navigate into the pdc-docker-deployment directory:
    sudo tar -xvf pdc-remote-<version>-compose.tgz -C /opt/
    cd /opt/pentaho/pdc-docker-deployment/
Create the conf directory:
    sudo mkdir conf
Create a .env file, specify the necessary environment variables, and save the .env file:
Copy the variables PDC_WS_REMOTE_JOB_SERVER_ID and PDC_DATA_ENCRYPTION_KEY along with their values from the base server installation.
Add the PDC_WS_REMOTE_OPS_URL, PDC_WS_REMOTE_GLOSSARY_BASE_URL, and PDC_MONGODB_OPS_DATABASE_URL variables with the ops and MongoDB details.
    GLOBAL_SERVER_HOST_NAME=<Base Server FQDN or IP Address>
    PDC_WS_REMOTE_JOB_SERVER_ID="eb710d72-9613-a978-42c5-a101343bf6ca"
    PDC_DATA_ENCRYPTION_KEY="2eindcVFPic6uA1o0wRWnXsBKNKiMMhbc2P9qTtvUTE="
    COMPOSE_PROFILES=ws-remote
    PDC_WS_REMOTE_OPS_URL="https://<Base Server FQDN or IP Address>/internal/ops/"
    PDC_WS_REMOTE_GLOSSARY_BASE_URL="https://<Base Server FQDN or IP Address>/glossary-service/api/v1/"
    PDC_MONGODB_OPS_DATABASE_URL="mongodb://root:broot@<Base Server FQDN or IP Address>:27017/ops?directConnection=true&authSource=admin&replicaSet=rs0"
    PDC_WS_REMOTE_DQ_CLIENT_ID=
    PDC_WS_REMOTE_DQ_API_URL=
    PDC_WS_REMOTE_DQ_CLIENT_SECRET=
    LOG_FLUENTBIT_ELASTICSEARCH_HOST=${GLOBAL_SERVER_HOST_NAME}
Copy the extra-certs directory and its contents, such as certificates, into the conf directory on the Remote Worker machine:
    cp -r extra-certs/ conf/
Start the Remote Worker:
    sudo ./pdc.sh up
Verify the Remote Worker deployment in Data Catalog:
Log in to Data Catalog and click Management in the left navigation menu.
In the Resources card, click Data Centers.
You can see the Remote Worker listed with Affinity as Remote.

Remote Worker in Data Catalog
You have successfully installed the Remote Worker and registered it with Data Catalog.
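If the Remote Worker does not appear in the list, a mistake in the .env file from the procedure above is one possible cause. The following is a minimal sketch of a sanity check, not a Pentaho tool. The function name check_env_file is hypothetical. The list of required variables is an assumption drawn from the sample .env file: the variables that carry values there are treated as required, and the PDC_WS_REMOTE_DQ_* variables, which the sample leaves empty, are not checked.

```shell
#!/bin/sh
# Hypothetical helper: check that a .env file sets the variables the sample
# file fills in, and that no "<Base Server ...>" placeholder was left behind.
# $1 = path to the .env file
check_env_file() {
    env_file="$1"
    status=0
    if [ ! -f "$env_file" ]; then
        echo "not found: ${env_file}"
        return 1
    fi
    for key in GLOBAL_SERVER_HOST_NAME \
               PDC_WS_REMOTE_JOB_SERVER_ID \
               PDC_DATA_ENCRYPTION_KEY \
               COMPOSE_PROFILES \
               PDC_WS_REMOTE_OPS_URL \
               PDC_WS_REMOTE_GLOSSARY_BASE_URL \
               PDC_MONGODB_OPS_DATABASE_URL; do
        # Require KEY=value or KEY="value" with a non-empty value.
        if ! grep -Eq "^${key}=\"?[^\"]+" "$env_file"; then
            echo "missing or empty: ${key}"
            status=1
        fi
    done
    # Flag template placeholders that were never replaced.
    if grep -q '<Base Server' "$env_file"; then
        echo "placeholder not replaced: <Base Server FQDN or IP Address>"
        status=1
    fi
    return "$status"
}
```

Running `check_env_file .env` from the deployment directory prints each problem it finds and returns a non-zero status if there is at least one. It checks only that values are present, not that the URLs or credentials are correct.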

