> For the complete documentation index, see [llms.txt](https://docs.pentaho.com/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.pentaho.com/pdc-admin/ldc-manage-data-sources-cp.md).

# Manage data sources

With Pentaho Data Catalog, you can process data from file systems and relational databases.

To process data from these systems, Data Catalog establishes a data source definition. This data source stores the connection information to your sources of data, including their access URLs and user credentials. The number of data sources you can add is determined by your license agreement.

**Note:** Refer to the product release notes for the latest supported versions.

The following data sources are supported:

<table><thead><tr><th width="247">Type</th><th>Data source</th></tr></thead><tbody><tr><td>File System</td><td><ul><li>​<a href="/pages/4dorAgQlNItAPel334On#hdfs-data-source">HDFS data source</a>​</li><li>​<a href="/pages/4dorAgQlNItAPel334On#nfs-data-source">NFS data source</a>​</li><li>​<a href="https://app.gitbook.com/o/PtpmPYUKgAsUWgv8SVUt/s/cUaDtyTop3vo8cjqgjGk/~/edit/~/changes/139/ldc-manage-data-sources-cp/adding-a-data-source-ldc-manage-data-sources-ag#smb-cifs-data-source">SMB/CIFS data source</a>​</li><li>​<a href="/pages/4dorAgQlNItAPel334On#local-file-system-data-source">Local File System data source</a>​</li></ul></td></tr><tr><td>Relational Databases</td><td><ul><li>​<a href="/pages/4dorAgQlNItAPel334On#amazon-redshift-data-source">Amazon Redshift data source</a>​</li><li>​<a href="/pages/4dorAgQlNItAPel334On#ibm-db2-data-source">IBM Db2 data source</a>​</li><li>​<a href="https://app.gitbook.com/o/PtpmPYUKgAsUWgv8SVUt/s/cUaDtyTop3vo8cjqgjGk/~/edit/~/changes/139/ldc-manage-data-sources-cp/adding-a-data-source-ldc-manage-data-sources-ag#microsoft-sql-server-data-source">Microsoft SQL Server data source</a>​</li><li>​<a href="/pages/4dorAgQlNItAPel334On#mysql-data-source">MySQL data source</a>​</li><li>​<a href="/pages/4dorAgQlNItAPel334On#microsoft-access-data-source">Microsoft Access data source</a>​</li><li>​<a href="/pages/4dorAgQlNItAPel334On#oracle-data-source">Oracle data source</a>​</li><li>​<a href="https://app.gitbook.com/o/PtpmPYUKgAsUWgv8SVUt/s/cUaDtyTop3vo8cjqgjGk/~/edit/~/changes/139/ldc-manage-data-sources-cp/adding-a-data-source-ldc-manage-data-sources-ag#postgresql-data-source">PostgreSQL data source</a>​</li><li><a href="/pages/4dorAgQlNItAPel334On#sap-hana-data-source">​SAP HANA data source​</a></li><li>​<a href="https://app.gitbook.com/o/PtpmPYUKgAsUWgv8SVUt/s/cUaDtyTop3vo8cjqgjGk/~/edit/~/changes/139/ldc-manage-data-sources-cp/adding-a-data-source-ldc-manage-data-sources-ag#sybase-data-source">Sybase data source</a>​</li><li>​<a href="/pages/4dorAgQlNItAPel334On#vertica-data-source">Vertica data source</a>​</li></ul></td></tr><tr><td>NoSQL Databases</td><td><ul><li>​<a href="/pages/4dorAgQlNItAPel334On#dynamodb-data-source">DynamoDB data source</a>​</li></ul></td></tr><tr><td>Data Platforms</td><td><ul><li><a href="/pages/4dorAgQlNItAPel334On#apache-iceberg-data-source">​Apache Iceberg data source</a></li><li><a href="/pages/4dorAgQlNItAPel334On#databricks-data-source">Databricks data source</a></li></ul></td></tr><tr><td>Object Stores</td><td><ul><li>​<a href="/pages/4dorAgQlNItAPel334On#aws-s3-data-source">AWS S3 data source</a><sup>1</sup></li><li>​<a href="/pages/4dorAgQlNItAPel334On#azure-blob-storage-data-source">Azure Blob Storage data source</a>​</li><li>​<a href="/pages/4dorAgQlNItAPel334On#google-cloud-storage-data-source">Google Cloud Storage data source</a>​</li><li>​<a href="/pages/4dorAgQlNItAPel334On#hcp-data-source">HCP data source</a><sup>1</sup>​</li><li>​<a href="/pages/4dorAgQlNItAPel334On#onedrive-or-sharepoint-data-source">OneDrive and SharePoint data source</a>​</li><li>​<a href="https://app.gitbook.com/o/PtpmPYUKgAsUWgv8SVUt/s/cUaDtyTop3vo8cjqgjGk/~/edit/~/changes/139/ldc-manage-data-sources-cp/adding-a-data-source-ldc-manage-data-sources-ag#snowflake-data-source">Snowflake data source</a></li></ul></td></tr><tr><td>Others</td><td><ul><li>​<a href="https://app.gitbook.com/o/PtpmPYUKgAsUWgv8SVUt/s/cUaDtyTop3vo8cjqgjGk/~/edit/~/changes/139/ldc-manage-data-sources-cp/adding-a-data-source-ldc-manage-data-sources-ag#add-okta-as-a-data-source">Okta as a data source</a>​</li><li>​<a href="/pages/4dorAgQlNItAPel334On#active-directory-as-a-data-source">Active Directory as a data source</a></li></ul></td></tr></tbody></table>

{% hint style="info" %}
If you encounter an error while connecting to a data source, refer to the data source provider's documentation for more information about the error.
{% endhint %}

***

<sup>1</sup> AWS S3, HCP, Minio, and VSP One data sources support **Skip SSL validation** when the data source uses a self-signed SSL certificate. Use this option only in non-production environments. To enable SSL validation, install user-provided SSL certificates into Data Catalog instead. For more information, see [Advanced configuration](/pdc-admin/ldc-advanced-configuration-ut_cp.md#install-user-provided-ssl-certificates).


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://docs.pentaho.com/pdc-admin/ldc-manage-data-sources-cp.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
