Big Data Sources: General
Pentaho software supports the following Big Data sources. Check this list if you are evaluating Pentaho or checking for general compatibility with a specific vendor.
Data Source
Supported Version
Amazon EMR (via Hive)
5.21, 5.24, 5.32
Cloudera (via Hive or Impala)
6.1, 6.2, 6.3
Cloudera Data Platform (via Hive or Impala)
7.1.x
Datastax
4.6, 4.8
Google BigQuery
1.2.2.1004
Google Dataproc
1.4, 2.2
Greenplum
4.2, 4.3
Hortonworks (via Hive or Spark SQL)
3.0, 3.1
Microsoft Azure HDInsight
4.0
MongoDB
4.0.2
Netezza
7.1, 7.2
SAP HANA
SPS
Teradata
14.10, 15.0
Vertica
9.3.0.0
Last updated
Was this helpful?