Metastore configuration
The elements listed in the following table are required for all remote environments. When you create a VFS connection using the PDI client, you will not need to manually edit anything in the <configuration>
section.
Common elements
These elements are required for all VFS connections:
Element
Value
Description
scheme
<string>
The type of connection. The values are:
s3 - Amazon, MinIO, and HCP
gs - Google cloud storage
abfss - Azure Data Lake Storage Gen2
rootPath
<bucket-name>[/<path>]
The bucket name and optional folder path where you want to create the VFS metastore. The rootPath
element must point to the location where you will store the metastore file on the cloud location.
This is analogous to the .pentaho
folder in a local metastore.
Examples:
miniobucket/dir1
gcpbucket/dir1
children
A container for type-specific configurations. For example:
<children>
<child>
<id>description</id>
<value></value>
<type>String</type>
</child>
…
</children>
## S3 elements
The elements listed in the table below apply to S3 environments. Some elements are conditional, based on your choices for other settings:
Element
Value
Description
accessKey
<s3-access-key>
The S3 user’s access key.
secretKey
<s3-secret-key>
The S3 user’s secret key.
endPoint
<s3-endpoint>
The URL to access the S3 location. Examples:
http://<host ip>:port
https://my-hcp-namespace.my-hcp-tenant.hcpdemo.hitachivantara.com
region
<s3-region>
The user-designated region. For example, us-east-1
.
connectionType
0 or 1
The connection type value. The values are:
0 - to connect to AWS
1 - to connect to MinIO or HCP
credentialFile
An encrypted string that is not user editable
profileName
<string>
The AWS user profile connection when the Type is 0 (AWS) and the authType
is 1 (credentials file)
defaultS3Config
true or false
The setting that controls whether the default S3 configuration is used. Set to true
to use the default S3 configuration
credentialsFilePath
<path to AWS credentials file>
The path to the AWS credentials file when the connectionType is 0 (AWS) and the authType is 1 (credentials file),
pathStyleAccess
true or false
The setting that controls which access style is used. Specify true
to use a path-style access; false
to use S3 bucket-style access
signatureVersion
AWSS3V4SignerType
The version of signature used for communicating with the AWS S3 location of your metastore.
name
vfsMetastore
The name for the connection.
description
<string>
A description of the connection.
sessionToken
<session token string>
Optional. A temporary credential that is used if the AWS S3 bucket is configured to require a session token for access
authType
0 or 1
The authentication type to use when the connection type is 0 (AWS):
0 – Access key/Secret key
1 – Credentials file
## GCP elements
The elements listed in the table below apply to GCP environments:
serviceAccountKey
<string>
A key that is generated based on the contents of the service account JSON.
keyPath
<path>
The path to the file containing the GCP service account JSON.
name
<string>
The name of the connection.
description
<string>
A description of the connection.
Azure Data Lake Storage Gen2 elements
The elements listed in the table below apply to Azure Data Lake Storage Gen2 environments. See Azure Blob Storage documentation for more information.
Element
Value
Description
sharedKey
<encrypted string>
The shared key for accessing the service.
accountName
<encrypted string>
The name of the account.
accessTier
<string>
The access tier value. The default is Hot.
blockSize
<Integer>
The default is 50.
maxSingleUploadSize
<Integer>
The default is 100.
bufferCount
<Integer>
The default is 5.[MB1]
name
<string>
The name of the connection.
authType
0
, 1
, or 2
The authorization type. The values are:
0 - Account Shared Key
1 - Azure Active Directory
2 - Shared Access Signature
Last updated
Was this helpful?