# Manage data sources

With Pentaho Data Catalog, you can process data from file systems, relational and NoSQL databases, data platforms, object stores, and other systems such as Okta and Active Directory.

To process data from these systems, Data Catalog establishes a data source definition. This definition stores the connection information for your sources of data, including their access URLs and user credentials. The number of data sources you can add is determined by your license agreement.
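As a rough mental model only, a data source definition can be pictured as a small record that bundles a source type, an access URL, and credentials. The sketch below is hypothetical: the class name `DataSourceDefinition`, its field names, and the sample URL are illustrative assumptions and do not reflect how Data Catalog stores or exposes data sources.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class DataSourceDefinition:
    """Hypothetical record of the connection details a data source holds."""

    name: str         # display name chosen by the user (assumed field)
    source_type: str  # for example "PostgreSQL" or "AWS S3" (assumed field)
    url: str          # access URL of the underlying system
    username: str     # user credential: account name
    # Exclude the password from repr() so it does not leak into logs.
    password: str = field(repr=False)


# Placeholder values for illustration; none of them come from the product.
example = DataSourceDefinition(
    name="sales-db",
    source_type="PostgreSQL",
    url="jdbc:postgresql://db.example.com:5432/sales",
    username="catalog_reader",
    password="change-me",
)
print(example)
```

Keeping the password out of the printed representation is a design choice for this sketch; it mirrors the general practice of treating stored credentials as sensitive.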

**Note:** Refer to the product release notes for the latest supported versions.

The following data sources are supported:

<table><thead><tr><th width="247">Type</th><th>Data source</th></tr></thead><tbody><tr><td>File System</td><td><ul><li><a href="/pages/4dorAgQlNItAPel334On#hdfs-data-source">HDFS data source</a></li><li><a href="/pages/4dorAgQlNItAPel334On#nfs-data-source">NFS data source</a></li><li><a href="https://app.gitbook.com/o/PtpmPYUKgAsUWgv8SVUt/s/cUaDtyTop3vo8cjqgjGk/~/edit/~/changes/139/ldc-manage-data-sources-cp/adding-a-data-source-ldc-manage-data-sources-ag#smb-cifs-data-source">SMB/CIFS data source</a></li><li><a href="/pages/4dorAgQlNItAPel334On#local-file-system-data-source">Local File System data source</a></li></ul></td></tr><tr><td>Relational Databases</td><td><ul><li><a href="/pages/4dorAgQlNItAPel334On#amazon-redshift-data-source">Amazon Redshift data source</a></li><li><a href="/pages/4dorAgQlNItAPel334On#ibm-db2-data-source">IBM Db2 data source</a></li><li><a href="https://app.gitbook.com/o/PtpmPYUKgAsUWgv8SVUt/s/cUaDtyTop3vo8cjqgjGk/~/edit/~/changes/139/ldc-manage-data-sources-cp/adding-a-data-source-ldc-manage-data-sources-ag#microsoft-sql-server-data-source">Microsoft SQL Server data source</a></li><li><a href="/pages/4dorAgQlNItAPel334On#mysql-data-source">MySQL data source</a></li><li><a href="/pages/4dorAgQlNItAPel334On#microsoft-access-data-source">Microsoft Access data source</a></li><li><a href="/pages/4dorAgQlNItAPel334On#oracle-data-source">Oracle data source</a></li><li><a href="https://app.gitbook.com/o/PtpmPYUKgAsUWgv8SVUt/s/cUaDtyTop3vo8cjqgjGk/~/edit/~/changes/139/ldc-manage-data-sources-cp/adding-a-data-source-ldc-manage-data-sources-ag#postgresql-data-source">PostgreSQL data source</a></li><li><a href="/pages/4dorAgQlNItAPel334On#sap-hana-data-source">SAP HANA data source</a></li><li><a href="https://app.gitbook.com/o/PtpmPYUKgAsUWgv8SVUt/s/cUaDtyTop3vo8cjqgjGk/~/edit/~/changes/139/ldc-manage-data-sources-cp/adding-a-data-source-ldc-manage-data-sources-ag#sybase-data-source">Sybase data source</a></li><li><a href="/pages/4dorAgQlNItAPel334On#vertica-data-source">Vertica data source</a></li></ul></td></tr><tr><td>NoSQL Databases</td><td><ul><li><a href="/pages/4dorAgQlNItAPel334On#dynamodb-data-source">DynamoDB data source</a></li></ul></td></tr><tr><td>Data Platforms</td><td><ul><li><a href="/pages/4dorAgQlNItAPel334On#apache-iceberg-data-source">Apache Iceberg data source</a></li><li><a href="/pages/4dorAgQlNItAPel334On#databricks-data-source">Databricks data source</a></li></ul></td></tr><tr><td>Object Stores</td><td><ul><li><a href="/pages/4dorAgQlNItAPel334On#aws-s3-data-source">AWS S3 data source</a><sup>1</sup></li><li><a href="/pages/4dorAgQlNItAPel334On#azure-blob-storage-data-source">Azure Blob Storage data source</a></li><li><a href="/pages/4dorAgQlNItAPel334On#google-cloud-storage-data-source">Google Cloud Storage data source</a></li><li><a href="/pages/4dorAgQlNItAPel334On#hcp-data-source">HCP data source</a><sup>1</sup></li><li><a href="/pages/4dorAgQlNItAPel334On#onedrive-or-sharepoint-data-source">OneDrive and SharePoint data source</a></li><li><a href="https://app.gitbook.com/o/PtpmPYUKgAsUWgv8SVUt/s/cUaDtyTop3vo8cjqgjGk/~/edit/~/changes/139/ldc-manage-data-sources-cp/adding-a-data-source-ldc-manage-data-sources-ag#snowflake-data-source">Snowflake data source</a></li></ul></td></tr><tr><td>Others</td><td><ul><li><a href="https://app.gitbook.com/o/PtpmPYUKgAsUWgv8SVUt/s/cUaDtyTop3vo8cjqgjGk/~/edit/~/changes/139/ldc-manage-data-sources-cp/adding-a-data-source-ldc-manage-data-sources-ag#add-okta-as-a-data-source">Okta as a data source</a></li><li><a href="/pages/4dorAgQlNItAPel334On#active-directory-as-a-data-source">Active Directory as a data source</a></li></ul></td></tr></tbody></table>

{% hint style="info" %}
If you encounter an error while connecting to a data source, refer to the data source provider's documentation for more information about the error.
{% endhint %}

***

<sup>1</sup> AWS S3, HCP, MinIO, and VSP One data sources support the **Skip SSL validation** option for data sources that use a self-signed SSL certificate. Use this option only in non-production environments. To keep SSL validation enabled, install user-provided SSL certificates into Data Catalog instead. For more information, see [Advanced configuration](/pdc-admin/ldc-advanced-configuration-ut_cp.md#install-user-provided-ssl-certificates).
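
To illustrate the trade-off in general terms, the following sketch uses Python's standard `ssl` module to contrast a client that skips certificate validation with one that trusts a user-provided certificate. It is not Data Catalog's implementation: the function `build_ssl_context` and its parameters are hypothetical names chosen for this example.

```python
import ssl
from typing import Optional


def build_ssl_context(skip_validation: bool = False,
                      ca_file: Optional[str] = None) -> ssl.SSLContext:
    """Hypothetical helper contrasting the two approaches to self-signed certificates."""
    # The default context verifies the certificate chain and the host name.
    context = ssl.create_default_context()
    if skip_validation:
        # Roughly what skipping SSL validation means: accept any certificate.
        # check_hostname must be disabled before verify_mode can be CERT_NONE.
        context.check_hostname = False
        context.verify_mode = ssl.CERT_NONE
    elif ca_file is not None:
        # Safer alternative: keep validation on and trust the provided certificate.
        context.load_verify_locations(cafile=ca_file)
    return context


insecure = build_ssl_context(skip_validation=True)
secure = build_ssl_context()
print(insecure.verify_mode, secure.verify_mode)
```

With validation skipped, the client cannot detect an impersonated server, which is why the option is best limited to non-production environments.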

