Workflows
Use the workflow examples in this section as a guide to rules-governed tiering and purging data operations.
Also see Rehydration.
Tiering workflow
Use the workflow example below to perform rules-governed file tiering to move files older than a specified date into long term storage.
Before you begin, the data sources and storage destinations should already be present in Data Catalog. Also, your role in Data Catalog and Data Optimizer must have permissions for the operations you want to perform. See Default user roles and permissions in Data Optimizer.
From Data Storage Optimizer, click Check Data Temperature to open the Glossary in Data Catalog.
Create a domain. For example,
Data Temperature. See Assign data temperatures.Create terms in the domain to apply to your files or folders. For example,
Boiling,Hot,Warm,Cold, andFrozen.Click Management to open the Manage Your Environment page. On the Metadata Rules card, click Add New > Add Rule Definition and create a rule definition with the following attributes:
Name the rule definition. For example,
Cold Files.In Criteria select Object equals File.
Select Create a condition and Attribute equals Last Access Date.
In Operator select Less Than Date, and then enter a time and date. For example,
1 October 2022, 00:00:00.In Add Action select Apply Business Terms.
In Add Term select
Cold(for example).In Add Action select Remove Business Terms.
In Add Term select
Boiling,Hot,Warm, andFrozen(for example).Click Save.
Open the Manage Your Environment page, and on the Metadata Rules card, click Add New > Add Rule then create a metadata rule with these attributes:
Name the metadata rule. For example,
Find Cold FilesSelect the Source Data Asset on which to run the metadata rule.
Select the rule definition created earlier. For example,
Cold Files.Select the checkbox to run the rule now, enter a time to run it later, or both.
Click Apply.
Click the App Switcher then select Data Optimizer.
Click Management to open the Manage Your Environment page. On the Rules card, click Add New > Add Rule Definition and create a rule definition to tier files:
Name the rule definition. For example,
Tier Cold Data.In Criteria select Object equals File.
Click Create a condition and select Attribute equals Business Term.
In Operator select Equals Ignore Case.
In Value select
Cold(for example).Click Start Process.
In Action select Move and then select the file destination.
Click Save.
On the Rules card, click Add New > Add Rule then create a metadata rule with these attributes:
Name the metadata rule. For example,
Move Cold Files.Select the Source Data Asset on which you want to run the metadata rule.
Select the rule definition created earlier. For example,
Tier Cold Data.Select the checkbox to run the rule now, enter a time to schedule it to run, or both.
Click Apply.
Result
Files marked Cold are moved to the target destination.
To view the progress of the tiering operation, go to the Manage Your Environment page and on the Data Operations card, click Submitted. On the Data Operations page, the status of the operation, file name, path, source, type, and destination are provided. What remains in the folder of the data source is an .html stub file, except in HDFS, that can be used for rehydration if needed.
Purging workflow
Use the workflow example below to perform rules-governed file purging to delete files older than a specified date for the data source.
Purged files are permanently deleted.
Before you begin, the data sources and storage destinations should already be present in Data Catalog. Also, your role in Data Catalog and Data Optimizer must have permissions for the operations you want to perform. See Default user roles and permissions in Data Optimizer.
From Data Optimizer, click Check Data Temperature to open the Glossary in Data Catalog.
Create a domain. For example,
Data Temperature. See Assign data temperatures.Create terms for the domain to apply to your files or folders. For example,
Boiling,Hot,Warm,Cold, andFrozen.Click Management to open the Manage Your Environment page, and on the Metadata Rules card, click Add New. Then click Add Rule Definition and create a rule definition with the following attributes:
Name the rule definition. For example,
Frozen Files.In Criteria select Object equals File
Click Create a condition and select Attribute equals Last Access Date.
In Operator select Less Than Date, and then enter a time and date. For example,
1 October 2020, 00:00:00In Add Action select Apply Business Terms
In Add Term enter
Frozen(for example)In Add Action select Remove Business Terms
In Add Term enter
Boiling,Hot,Warm, andCold(for example).Click Save.
Open the Manage Your Environment page, and on the Metadata Rules card, click Add New > Add Rule then create a metadata rule with these attributes:
Name the metadata rule. For example,
Find Frozen Files.Select the Source Data Asset on which to run the rule.
Select the rule definition created earlier. For example,
Frozen Files.Select the checkbox to run the rule now, enter a time to run it later, or both.
Click Apply.
Click the App Switcher then select Data Optimizer.
Click Management to open the Manage Your Environment page. On the Rules card, click Add New > Add Rule Definition and create a rule definition to delete files:
Name the rule definition. For example,
Purge Frozen Data.In Criteria select Object equals File.
Click Create a condition and select Attribute equals Business Term.
In Operator select Equals Ignore Case.
In Value select
Frozen(for example).Click Start Process.
In Action select Delete.
Click Save.
On the Rules card, click Add New > Add Rule then create a metadata rule with these attributes:
Name the metadata rule. For example,
Delete Frozen Data.Select the Source Data Asset on which to run the metadata rule.
Select the rule definition created earlier. For example,
Purge Frozen Data.Select the checkbox to run the rule now, enter a time to run it later, or both.
Click Apply.
Result
Files marked Frozen are deleted.
To view the history of the execution, on the Rules card, click Rules. Locate the name of the metadata rule and select View. Click the History tab to view the status and click Statistics to view detailed results.
Last updated
Was this helpful?

