User roles and permissions in Data Catalog
Pentaho Data Catalog uses role-based access control (RBAC) to define what users can view, create, or manage in the catalog. By default, Data Catalog provides a set of predefined user roles that align with common responsibilities in data management and cataloging.
In addition to default roles, you can create communities to group users with similar responsibilities and tailor their access to specific data assets. Communities help you refine permissions for business areas or projects. For example, you can create a community for the Finance team that grants access only to financial glossaries and data sources. For more information, see Communities in Data Catalog.
Each user must be assigned at least one role or community when the account is created. You can assign multiple roles or communities to a user if the permissions do not overlap or originate from the same default role. When a user has multiple roles, either directly or through community membership, the highest-level role determines the user’s effective permissions.
Data Catalog supports two main user tiers: Business Users, who primarily view or explore data and business terms, and Expert Users, who create, ingest, or curate entities and metadata within the catalog. See Default user roles and permissions for details on the permissions for users in each tier. Contact your sales representative if you have questions about this feature.
Default user roles and permissions
Data Catalog provides default user roles with role-based permissions that enable administrators to control access as necessary across Data Catalog. These permissions are distributed across two tiers of licensed users: Business Users and Expert Users, as needed.
Administrators can also fine-tune access by creating communities of users to which they assign permissions, such as access to specific data source types or business glossaries. Administrators can further refine access by creating communities, which act as custom roles with additional or restricted permissions. Communities are useful when you need to limit access to specific asset types, such as particular data sources, glossaries, or policies.
Using a community, an administrator can grant or deny access to specific assets, such as business glossaries or data connections. For more information, see the Manage users and permissions #Add a community topic under the Manage users and permissions section in the Administer Pentaho Data Catalog guide.
All users, regardless of role, can comment on, rate, and receive notifications for assets they have permission to access. The following sections outline the permissions that are available in the tiered default roles.
Business Users
The first tier of licensed users is Business Users. This tier includes two roles with different levels of access: Business User and Data User. The following table lists the default permissions for each role. For example, a user with the Business User role can view business glossaries and policies but cannot access data sources. The Data User role includes all Business User permissions and provides additional access to data assets that are relevant to the user's specific line of business.
Business User
Business Glossary
View
Policies
View
Data User
Applications
View
Business Glossary
View
Business Intelligence
View
Data Sources
View, AddContent, DeleteContent, ViewDashboard
Policies
View
Expert Users
The second tier of licensed users is Expert Users, which includes four roles with differing permissions.
Note: Your license limits the number of Expert Users to whom you can assign a Data Catalog role. When the number of your allowed Expert Users reaches 75% of the limit allowed by your license agreement, you see a warning message. You also receive a message if you have exceeded the quota.
Business Steward
Business Stewards focus on maintaining business glossaries and governance policies. They can create, update, and manage definitions and policies, but cannot modify data sources.
Business Steward
Applications
View
Business Glossary
View, Create, Update, Delete, Import, Export
Business Intelligence
View
Data Sources
View
Policies
View, Create, Update, Delete, Import, Export
Data Steward
Data Stewards are responsible for data quality, profiling, and metadata curation. They can create and manage data sources, define business and metadata rules, and run profiling or validation jobs.
Data Steward
Applications
View, Create, Update, Delete, Import, Export
Business Glossary
View
Business Intelligence
View
Business Rules
View, Create, Update, Delete, Import, Export
Data Identification Methods
View, Create, Update, Delete, Import, Export
Data Sources
View, Create, Update, Delete, Import, Export, AddContent, ApplyRules, DeleteContent, RelationshipAssignment, RunJobs, ViewDashboard, ViewSamples
Domain Asset
View, ApplyRules, ApproveRecords
Match & Merge (Metadata) Rules
View, Modify
Policies
View
Reference Data
View, Create, Update, Delete, Import, Export, UpdateValues, ViewValues
Admin
Admins are responsible for managing user accounts, roles, permissions, and system configuration. They can view most assets but cannot create or edit business rules.
Admin
Administration
View, Modify
Applications
View
Business Glossary
View
Business Intelligence
View
Data Sources
View
Domain Asset
View, Modify, Export, ApplyRules, ApproveRecords, RunEngine
MDM Data Sources
View, Modify
Match & Merge (Metadata) Rules
View, Modify
Policies
View
Data Storage Administrator
Data Storage Administrators manage storage utilization and optimization across data sources, folders, and schemas. They monitor capacity, data temperature, business terms, and duplicate file analysis.
Data Storage Administrator
Applications
View
Business Glossary
View
Business Intelligence
View
Business Rules
View, Create, Update, Delete, Import, Export
Data Identification Methods
View, Create, Update, Delete, Import, Export
Data Sources
View, Create, ViewSamples, Update, RunJobs, ApplyRules, RelationshipsAssignment, Delete, Import, Export, AddContent, DeleteContent, ViewDashboard, ViewStorageReports
Policies
View
Data Developer
Data Developers design and maintain business rules, metadata rules, and data domain logic. They typically work with structured metadata, reference data, and rule automation.
Data Developer
Applications
View
Business Glossary
View
Business Intelligence
View
Business Rules
View, Create, Update, Delete, Import, Export
Data Sources
View
Domain Asset
View, Modify, Export, ApplyRules, ApproveRecords, RunEngine
MDM Data Sources
View, Modify
Match & Merge (Metadata) Rules
View, Modify
Policies
View
Communities in Data Catalog
Communities in Pentaho Data Catalog extend role-based access control (RBAC) by allowing administrators to manage access at a more granular level. A community groups users, who share similar responsibilities, projects, or business areas and assigns them customized permissions for specific catalog assets. Communities are useful when you want to limit or extend access within a department, project, or data domain without creating new global roles. For example, you can create a Finance community that grants analysts access only to finance-related data sources and glossaries, while members of other departments retain access to their respective assets.
Each community is based on an existing default role, such as Business User or Data Steward. Permissions from the base role are inherited, and you can fine-tune them by enabling or disabling actions for specific features. When you assign a user to a community:
- The user automatically inherits all permissions defined in the community. 
- The user’s permissions combine with those from any other assigned roles or communities. 
- If permissions conflict, the highest-level role or the least restrictive permission applies. 
When a user belongs to multiple communities, Data Catalog merges permissions from all assigned communities. The resulting access level always reflects the most permissive combination of granted rights.
To know more about creating and managing communities, see Manage users and permissions in the Administer Pentaho Data Catalog guide.
Data Catalog permission types and actions
The following table lists the permission types and actions available in Pentaho Data Catalog. These permissions can be fine-tuned through communities to grant or restrict specific capabilities for selected users. A user’s base role determines the general level of access for each feature. When you create or edit a community, you can modify these permissions to extend or limit access beyond the default role settings.
For example, the ViewSamples action for data sources is available by default to users with the Data Steward role that allows them to view sample data for profiled columns. If you create a community based on this role, you can assign other users to that community to give them the same permission to view sample data for profiled columns.

The permissions available for each Data Catalog feature are shown in the following table.
Administration
Modify, View
Applications
Create, Delete, Export, Import, Update, View
Business Glossary
Create, Delete, Export, Import, Update, View
Business Intelligence
Create, Delete, Export, Import, Update, View
Business Rules
Create, Delete, Export, Import, Update, View
Data Identification Methods
Create, Delete, Export, Import, Update, View
Data Sources
AddContent, ApplyRules, Create, Delete, DeleteContent, Export, Import, RelationshipAssignment, RunJobs, Update, View, ViewDashboard, ViewSamples
Domain Asset
ApplyRules, ApproveRecords, Export, Modify, RunEngine, View
Match & Merge (Metadata) Rules
Modify, View
MDM Data Sources
Modify, View
Policies
Create, Delete, Export, Import, Update, View
Reference Data
Create, Delete, Export, Import, Update, UpdateValues, View, ViewValues
Last updated
Was this helpful?

