Configure connection tab

This tab contains HBase connection information. You can configure a connection in one of two ways:

  • Using the Hadoop cluster properties, or

  • By using an hbase-site.xml and (an optional) hbase-default.xml configuration file.

Below the connection details are fields to specify which target HBase table to write to, along with a mapping by which to encode incoming field values.

Configure connection tab

This tab includes the following fields:

Option

Definition

Hadoop cluster

Click the Hadoop Cluster drop-down menu to select an existing Hadoop cluster configuration.

URL to hbase-site.xml

Address of the hbase-site.xml file.

URL to hbase-default.xml

Address of the hbase-default.xml file.

HBase table name

The target HBase table you want to write data into.

Get table names (button)

Click to populate the drop-down list of available table names in the Hbase table name field. Only table names that have been mapped are retrieved. If you enter the namespace and colon in the HBase table name field, and then click Get table names, only the table names in that namespace display. The table names display the namespace, followed by a colon, then the table name. See Namespaces.

Mapping name

A mapping to decode and interpret column values. Click Get mappings for the specified table to populate the drop-down list of available mappings.

Store mapping info in step meta

Specifies whether to store mapping information in the step's metadata instead of loading it from HBase when it runs.

Delete rows by mapping key

Select to instruct HBase to delete rows using the row key on the mapped input field.

Disable write to WAL

Disables writing to the Write Ahead Log (WAL).

The WAL is used as a failsafe to restore the status quo if the server goes down while data is being inserted. Disabling WAL will increase performance.

Not available when Delete rows by mapping is selected.

Size of write buffer (bytes)

The size of the write buffer used to transfer data to HBase.

A larger buffer consumes more memory (on both the client and server), but results in fewer remote procedure calls.

If you leave this field empty, the default value (specified in thehbase-default.xml file) is 2MB (2097152 bytes).

Last updated

Was this helpful?