Use a Pentaho Repository in PDI
The PDI client (also known as Spoon) supports several types of file storage. A Pentaho Repository stores transformations, jobs, and schedules in a central environment through the Pentaho Server. It is the recommended storage type for enterprise deployments and for fully supported features.
If your team needs a collaborative ETL (Extract, Transform, and Load) environment, we recommend using one or more Pentaho Repositories. In addition to storing and managing your jobs and transformations, a Pentaho Repository provides full revision history so that you can track changes, compare revisions, and revert to previous versions when necessary. These features, along with enterprise security and content locking, make a Pentaho Repository an ideal platform for collaboration.
Advanced topics
To extend your knowledge of a Pentaho Repository beyond basic setup and use, see the following topics in the Administer Pentaho Data Integration and Analytics guide:
Repository content can also be imported and exported through either the PDI client or a command line interface.
Purge transformations, jobs, and shared objects from the Pentaho Repository
If the Pentaho Repository becomes too large for effective system performance, consider purging some of the data.
Backup and restore a Pentaho Repository
Perform routine backups to minimize potential data loss through machine failure, theft, disaster, or accidental change.
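As an illustration of how a routine backup might be scripted, the following minimal Python sketch assembles a command line for the `import-export` script that ships with the Pentaho Server. It only builds and prints the command and does not run it. The script location, server URL, credentials, backup directory, and file naming scheme are all assumptions for illustration, and the `--backup`, `--url`, `--username`, `--password`, `--file-path`, and `--logfile` options should be checked against the Administer Pentaho Data Integration and Analytics guide for your version before use.

```python
import shlex
from datetime import date


def build_backup_command(script, url, username, password, backup_dir, day):
    """Assemble (but do not run) a repository backup command line.

    Every argument is a placeholder supplied by the caller; the option
    names are assumed to match the import-export script of your
    Pentaho Server version and should be verified before use.
    """
    # Hypothetical naming scheme: one dated archive per backup run.
    backup_file = f"{backup_dir}/pentaho-repo-{day.isoformat()}.zip"
    return [
        script,
        "--backup",
        f"--url={url}",
        f"--username={username}",
        f"--password={password}",
        f"--file-path={backup_file}",
        f"--logfile={backup_dir}/backup-{day.isoformat()}.log",
    ]


if __name__ == "__main__":
    # Example values only; replace them with your own environment settings.
    command = build_backup_command(
        script="./import-export.sh",
        url="http://localhost:8080/pentaho",
        username="admin",
        password="password",
        backup_dir="/tmp/pentaho-backups",
        day=date.today(),
    )
    # Dry run: print the quoted command so it can be reviewed first.
    print(shlex.join(command))
```

Printing the command instead of executing it keeps the sketch safe to try. In practice, a reviewed command like this could be run from a scheduler such as cron, with the password supplied from a secure source rather than written into the script.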