Metastore configuration

The elements listed in the following table are required for all remote environments. When you create a VFS connection using the PDI client, you will not need to manually edit anything in the <configuration> section.

Common elements

These elements are required for all VFS connections:

Element

Value

Description

scheme

<string>

The type of connection. The values are:

s3 - Amazon, MinIO, and HCP

gs - Google cloud storage

abfss - Azure Data Lake Storage Gen2

rootPath

<bucket-name>[/<path>]

The bucket name and optional folder path where you want to create the VFS metastore. The rootPath element must point to the location where you will store the metastore file on the cloud location.

This is analogous to the .pentaho folder in a local metastore.

Examples:

  • miniobucket/dir1

  • gcpbucket/dir1

children

A container for type-specific configurations. For example:

<children>
    <child>
<id>description</id>
         <value></value>
    <type>String</type>
</child>

</children>

## S3 elements

The elements listed in the table below apply to S3 environments. Some elements are conditional, based on your choices for other settings:

Element

Value

Description

accessKey

<s3-access-key>

The S3 user’s access key.

secretKey

<s3-secret-key>

The S3 user’s secret key.

endPoint

<s3-endpoint>

The URL to access the S3 location. Examples:

http://<host ip>:port

https://my-hcp-namespace.my-hcp-tenant.hcpdemo.hitachivantara.com

region

<s3-region>

The user-designated region. For example, us-east-1.

connectionType

0 or 1

The connection type value. The values are:

0 - to connect to AWS

1 - to connect to MinIO or HCP

credentialFile

An encrypted string that is not user editable

profileName

<string>

The AWS user profile connection when the Type is 0 (AWS) and the authType is 1 (credentials file)

defaultS3Config

true or false

The setting that controls whether the default S3 configuration is used. Set to true to use the default S3 configuration

credentialsFilePath

<path to AWS credentials file>

The path to the AWS credentials file when the connectionType is 0 (AWS) and the authType is 1 (credentials file),

pathStyleAccess

true or false

The setting that controls which access style is used. Specify true to use a path-style access; false to use S3 bucket-style access

signatureVersion

AWSS3V4SignerType

The version of signature used for communicating with the AWS S3 location of your metastore.

name

vfsMetastore

The name for the connection.

description

<string>

A description of the connection.

sessionToken

<session token string>

Optional. A temporary credential that is used if the AWS S3 bucket is configured to require a session token for access

authType

0 or 1

The authentication type to use when the connection type is 0 (AWS):

0 – Access key/Secret key

1 – Credentials file

## GCP elements

The elements listed in the table below apply to GCP environments:

Element
Value
Description

serviceAccountKey

<string>

A key that is generated based on the contents of the service account JSON.

keyPath

<path>

The path to the file containing the GCP service account JSON.

name

<string>

The name of the connection.

description

<string>

A description of the connection.

Azure Data Lake Storage Gen2 elements

The elements listed in the table below apply to Azure Data Lake Storage Gen2 environments. See Azure Blob Storage documentation for more information.

Element

Value

Description

sharedKey

<encrypted string>

The shared key for accessing the service.

accountName

<encrypted string>

The name of the account.

accessTier

<string>

The access tier value. The default is Hot.

blockSize

<Integer>

The default is 50.

maxSingleUploadSize

<Integer>

The default is 100.

bufferCount

<Integer>

The default is 5.[MB1]

name

<string>

The name of the connection.

authType

0, 1, or 2

The authorization type. The values are:

0 - Account Shared Key

1 - Azure Active Directory

2 - Shared Access Signature

Last updated

Was this helpful?