Install Pentaho Data Catalog
Includes setup steps for PDC, Remote Workers, and optional components.
Pentaho Data Catalog is a powerful solution for data governance, discovery, and cataloging. Installing Data Catalog in your environment enables you to manage structured and unstructured data using intelligent automation and machine learning, while laying the groundwork for advanced features like Pentaho Data Optimizer and Pentaho Data Mastering (if licensed).
This guide helps you install PDC and its optional components across various deployment scenarios, ranging from a single-node server to a distributed, containerized environment. You can also deploy Remote Workers to support scalable and secure metadata processing across different network zones.
What’s included with installation
When you install PDC, the following are also installed (based on your license):
Pentaho Data Optimizer (PDO): Automates intelligent data tiering to object storage.
Pentaho Data Mastering (PDM): Enables advanced data mastering and curation workflows.
To install Data Catalog, see the following topics:
To install and configure a remote worker, see Install and configure a Remote Worker.
For Data Catalog upgrade instructions, see Upgrade Data Catalog and to upgrade to a patch version, see Upgrade Data Catalog to a patch version.
For cloud-based deployments, see Hyperscalers.
For more help or access to downloads, visit the Pentaho Support Portal.
Additional Resources
For advanced setup and custom configurations, see the Advanced configuration section in the Administer Pentaho Data Catalog guide.
For help with common issues, refer to the Troubleshooting Pentaho Data Catalog section in the Administer Pentaho Data Catalog guide.
Last updated
Was this helpful?