LogoLogo
Ctrlk
Try Pentaho Data Integration and Analytics
  • Pentaho Documentation
  • Install Pentaho Data Integration and Analytics
  • Getting started with Pentaho Data Integration and Analytics installation
  • Pentaho installation
  • Components Reference
  • JDBC drivers reference
  • Pentaho configuration
  • Pentaho upgrade
  • Multidimensional Data Modeling in Pentaho
  • Relational Data Modeling in Pentaho
  • Use Hadoop with Pentaho
    • Pentaho, big data, and Hadoop
    • Get started with Hadoop and PDI
    • Advanced topics
    • Troubleshooting possible Big Data issues
      • General configuration problems
      • Cannot access cluster with Kerberos enabled
      • Cannot access the Hive service on a cluster
      • HBase Get Master Failed error
      • Sqoop export fails
      • Sqoop import into Hive fails
      • Pig job not executing after Kerberos authentication fails
      • Group By step is not supported in a single threaded transformation engine
      • Kettle cluster on YARN will not start
      • Hadoop on Windows
      • Spark issues
      • Legacy mode activated when named cluster configuration cannot be located
      • Unable to read or write files to HDFS on the Amazon EMR cluster
      • Use YARN with S3
      • Data Catalog searches returning incomplete or missing data
  • Using Spark Submit
Powered by GitBook
On this page

Was this helpful?

  1. Use Hadoop with Pentaho

Troubleshooting possible Big Data issues

Follow the suggestions in these topics to help resolve common issues when working with Big Data:

  • General configuration problems

  • Cannot access cluster with Kerberos enabled

  • Cannot access the Hive service on a cluster

  • HBase Get Master Failed error

  • Sqoop import into Hive fails

  • Pig job not executing after Kerberos authentication fails

  • Kettle cluster on YARN will not start

  • Group By step is not supported in a single threaded transformation engine

  • Hadoop on Windows

  • Spark issues

  • Legacy mode activated when named cluster configuration cannot be located

  • Unable to read or write files to HDFS on the Amazon EMR cluster

  • Use YARN with S3

  • Data Catalog searches returning incomplete or missing data

See the Administer Pentaho Data Integration and Analytics document for additional troubleshooting information.

PreviousBig data resourcesNextGeneral configuration problems

Last updated 4 months ago

Was this helpful?

LogoLogo

About

  • Pentaho.com

Support

  • Pentaho Support

Resources

  • Privacy

© 2025 Hitachi Vantara LLC