Big Data Sources: General

Pentaho software supports the following Big Data sources. Check this list if you are evaluating Pentaho or checking for general compatibility with a specific vendor.

| Data Source | Supported Version |
| --- | --- |
| Amazon EMR (via Hive) | 7.0.0 (Certified) |
| Apache Vanilla Hadoop | 3.3.0 (Certified) |
| Cassandra (Datastax) | 6.8 (Certified) |
| Cloudera Data Platform (CDP) on-prem (Private cloud) | 7.1.9 (Certified) |
| Cloudera Data Platform (Public cloud) | 7.2.17 |
| Google BigQuery | 1.2.25 |
| Google Dataproc | 2.1 |
| Greenplum | 4.3 |
| Microsoft Azure HDInsight | 4.0 |
| MongoDB | 7 (Certified) |
| Vertica | 11 |
