Requirements
Data Optimizer requires specific external components and applications to operate optimally. This section provides a list of those components and applications along with details of their use and the versions Data Optimizer supports.
Environment considerations
To support sound development and deployment, maintain two separate environments:
Development or Staging
Production
System requirements
The software, hardware, and access requirements you should have before you install Data Optimizer are outlined below.
Checklist for infrastructure requests
Perform the following tasks as needed to prepare your environment for Data Storage Optimizer:
Request a Virtual Machine (VM) on Azure, AWS, or on-premises.
Request user IDs with remote access permissions to the VM, whether it is hosted in the cloud or on-premises.
Request necessary access to systems, applications, and data sources.
Request VDI or VPN access for Data Storage Optimizer data engineers to enable remote access to the VM.
Request a database user account (service account) or logins for connecting to the data sources.
Make sure the database user account has read-only permissions for the database objects.
Make sure that your system owner or Database Administrator (DBA) has copied or extracted any required data or files.
Obtain an SSL certificate from a certificate authority. If required by your organization's security policy, raise an infrastructure support request for an SSL certificate. The certificate authority will give you a key file and a certificate file.
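Once the certificate authority has returned the key file and the certificate file, it can be useful to confirm that the two belong together before installation. The following is a minimal sketch using Python's standard ssl module. The function name and the file names are hypothetical and are not part of Data Optimizer, and the sketch assumes PEM files with an unencrypted private key.

```python
import ssl
import sys


def cert_and_key_match(cert_file: str, key_file: str) -> bool:
    """Return True if the PEM certificate and private key load as a pair.

    load_cert_chain raises ssl.SSLError when the key does not match the
    certificate, and OSError when a file is missing or unreadable.
    """
    context = ssl.SSLContext(ssl.PROTOCOL_TLS_SERVER)
    try:
        context.load_cert_chain(certfile=cert_file, keyfile=key_file)
    except (ssl.SSLError, OSError):
        return False
    return True


if __name__ == "__main__":
    # Usage (hypothetical file names): python check_cert.py server.crt server.key
    if len(sys.argv) == 3:
        ok = cert_and_key_match(sys.argv[1], sys.argv[2])
        print("certificate and key match" if ok else "certificate and key do NOT load as a pair")
```

A False result only says that the pair could not be loaded; it does not distinguish a mismatched key from a missing file.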
Hardware requirements
Your server and network must meet the following requirements:
CPU: 16 cores (minimum), 32 cores (recommended)
RAM: 64 GB (minimum), 128 GB (recommended)
Disk storage: 1 TB (minimum)
Network: 1 Gbps
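The minimums above can be checked on a candidate Linux server with a short script. This is only a sketch: the function names are made up for illustration, os.cpu_count() reports logical CPUs rather than physical cores, and the operating system usually reports slightly less RAM than the nominal size of the machine, so treat a narrow miss on RAM with some judgment.

```python
import os
import shutil

# Minimums taken from the hardware requirements above.
MIN_CPU_CORES = 16
MIN_RAM_GB = 64
MIN_DISK_TB = 1


def hardware_shortfalls(cpu_cores, ram_gb, disk_tb):
    """Return a list of readable problems; an empty list means the minimums are met."""
    problems = []
    if cpu_cores < MIN_CPU_CORES:
        problems.append(f"CPU: {cpu_cores} cores, need at least {MIN_CPU_CORES}")
    if ram_gb < MIN_RAM_GB:
        problems.append(f"RAM: {ram_gb:.1f} GB, need at least {MIN_RAM_GB} GB")
    if disk_tb < MIN_DISK_TB:
        problems.append(f"Disk: {disk_tb:.2f} TB, need at least {MIN_DISK_TB} TB")
    return problems


def detect_hardware(path="/"):
    """Read logical CPU count, physical RAM (GB), and disk size (TB) on Linux."""
    cores = os.cpu_count() or 0
    ram_bytes = os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES")
    disk_bytes = shutil.disk_usage(path).total  # size of the file system holding `path`
    return cores, ram_bytes / 1e9, disk_bytes / 1e12


if __name__ == "__main__":
    issues = hardware_shortfalls(*detect_hardware())
    print("\n".join(issues) if issues else "Minimum hardware requirements met")
```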
If the server is running on AWS or Azure, review the following requirements.
AWS EC2 details
An AWS EC2 virtual machine has the following requirements:
Size: m5.4xlarge; vCPU: 16 cores; Memory: 64 GB
Size: m5.8xlarge; vCPU: 32 cores; Memory: 128 GB
Azure VM details
An Azure VM has the following requirements:
Size: B_16s_v2; vCPU: 16 cores; Memory: 64 GB
Size: B_32s_v2; vCPU: 32 cores; Memory: 128 GB
Server storage requirements
The server file systems and storage must meet the following requirements:
At least 10 GB of storage should be allocated for the root file system.
Ample storage should be mounted at the Docker data directory (by default /var/lib/docker on Linux servers).
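The storage requirements can be spot-checked with Python's standard shutil module. The sketch below assumes Docker uses its default data directory, /var/lib/docker; if your server is configured with a different location, change the constant. The helper name is hypothetical.

```python
import os
import shutil

MIN_ROOT_GB = 10                     # root file system minimum from the list above
DOCKER_DATA_DIR = "/var/lib/docker"  # Docker's default; an assumption about your server


def meets_minimum(total_bytes: int, min_gb: float) -> bool:
    """Compare a size in bytes against a minimum expressed in decimal gigabytes."""
    return total_bytes / 1e9 >= min_gb


if __name__ == "__main__":
    root_total = shutil.disk_usage("/").total
    print("Root file system large enough:", meets_minimum(root_total, MIN_ROOT_GB))
    if os.path.isdir(DOCKER_DATA_DIR):
        free = shutil.disk_usage(DOCKER_DATA_DIR).free
        print(f"Free space under {DOCKER_DATA_DIR}: {free / 1e9:.1f} GB")
    else:
        print(f"{DOCKER_DATA_DIR} not found; is Docker installed?")
```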
Operating system requirements
You must deploy Data Optimizer to a dedicated server, which can be either a physical server or a virtual machine. The hosting environment can be on-premises or on the cloud using platforms such as Azure or AWS.
For optimal compatibility and performance, the server must run a modern Linux operating system based on 64-bit (x86_64/amd64) architecture.
Linux kernel version
Version 4.0 or higher of the Linux kernel is required. For RHEL, use version 3.10.0-514 of the kernel or a higher version.
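The kernel rule can be expressed as a small version comparison. The following sketch parses a release string such as the one returned by uname -r and applies the two thresholds stated above. The function names are hypothetical, and detecting RHEL from an ".el" marker in the release string is a rough heuristic, not an official test.

```python
import platform
import re


def parse_kernel_release(release: str) -> tuple[int, ...]:
    """Split a release such as '3.10.0-514.el7.x86_64' into (3, 10, 0, 514)."""
    match = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?(?:-(\d+))?", release)
    if match is None:
        raise ValueError(f"Unrecognized kernel release: {release!r}")
    # Missing patch or build numbers are treated as 0.
    return tuple(int(part or 0) for part in match.groups())


def kernel_supported(release: str, is_rhel: bool = False) -> bool:
    """Apply the rule above: 4.0 or higher, or 3.10.0-514 or higher on RHEL."""
    version = parse_kernel_release(release)
    if version[:2] >= (4, 0):
        return True
    return is_rhel and version >= (3, 10, 0, 514)


if __name__ == "__main__":
    release = platform.release()
    looks_like_rhel = ".el" in release  # heuristic assumption
    print(release, "supported:", kernel_supported(release, is_rhel=looks_like_rhel))
```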
Note: The overlay and overlay2 storage drivers are supported on XFS backing file systems only when d_type=true is enabled.
To verify this, run the xfs_info command and check that the ftype option is set to 1 in the output. To format an XFS file system correctly, use the flag -n ftype=1.
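The ftype check can be scripted by reading the output of xfs_info. The sketch below is an illustration only: the function names are hypothetical, and the mount point passed on the command line is whatever XFS file system backs Docker storage on your server.

```python
import re
import subprocess
import sys


def ftype_enabled(xfs_info_output: str) -> bool:
    """Return True if the xfs_info output reports ftype=1."""
    match = re.search(r"\bftype=(\d+)", xfs_info_output)
    return match is not None and match.group(1) == "1"


def check_mount(mount_point: str) -> bool:
    """Run xfs_info against a mounted XFS file system and inspect its ftype."""
    result = subprocess.run(
        ["xfs_info", mount_point], capture_output=True, text=True, check=True
    )
    return ftype_enabled(result.stdout)


if __name__ == "__main__":
    # Usage: python check_ftype.py <mount point of the XFS file system>
    if len(sys.argv) == 2:
        print("ftype=1:", check_mount(sys.argv[1]))
```

Keeping the parsing separate from the subprocess call makes it possible to test the logic against saved xfs_info output.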
To make sure Docker starts automatically whenever the dedicated server is restarted, enable auto start-up by running the following commands:
sudo systemctl enable docker.service
sudo systemctl enable containerd.service
Network security and firewall requirements
The network security and firewall must meet the following requirements:
Ports
80 and 443 should be open to inbound traffic.
The application server must have network connectivity to the database server and port.
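Connectivity from the application server to the database server and port can be tested with a plain TCP connection attempt. This is a minimal sketch using Python's standard socket module. The function name is hypothetical, and a successful connection shows only that the port is reachable, not that the database credentials work.

```python
import socket
import sys


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


if __name__ == "__main__":
    # Usage: python check_port.py <database host> <port>
    if len(sys.argv) == 3:
        host, port = sys.argv[1], int(sys.argv[2])
        print(f"{host}:{port} reachable:", can_connect(host, port))
```

Run the script from the application server itself, because firewall rules are evaluated per source host.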
User account
The server user account used for the installation must either be the root user or have appropriate permissions to run Docker. To set up Docker permissions for non-root users, see the official Docker documentation at https://docs.docker.com/engine/install/linux-postinstall/.
Software requirements
Before you install Data Optimizer, make sure that Docker is installed on your server and configured to automatically start on boot. See the official Docker documentation for instructions on installing Docker.
Docker: Version 20.10 or later
Docker Compose: Use the latest version of Docker Compose supported by your OS platform.
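The Docker version requirement can be verified by parsing the output of docker --version. The sketch below assumes the usual output form, for example "Docker version 20.10.7, build f0df350"; the function names are hypothetical.

```python
import re
import subprocess

MIN_DOCKER = (20, 10)  # minimum version from the requirement above


def parse_docker_version(output: str) -> tuple[int, int]:
    """Extract (major, minor) from text such as 'Docker version 20.10.7, build ...'."""
    match = re.search(r"(\d+)\.(\d+)", output)
    if match is None:
        raise ValueError(f"No version number found in: {output!r}")
    return int(match.group(1)), int(match.group(2))


def docker_version_supported(output: str) -> bool:
    """Return True if the reported version is 20.10 or later."""
    return parse_docker_version(output) >= MIN_DOCKER


if __name__ == "__main__":
    try:
        result = subprocess.run(
            ["docker", "--version"], capture_output=True, text=True, check=True
        )
        print(result.stdout.strip(), "supported:", docker_version_supported(result.stdout))
    except FileNotFoundError:
        print("docker command not found; install Docker first")
```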
Additional software
For SSH connectivity and secure file transfer between your machine and the server, install the following software on your machine:
An SSH client such as PuTTY (recommended).
WinSCP for a graphical user interface to securely transfer files between the client and the server using SSH.
Data source connectivity
The following table lists the supported data sources and the requirements for connecting each one to Data Optimizer.
AWS S3
AWS region where the S3 bucket was created
Access key and secret access key
Read-only permissions to the S3 bucket
Azure Blob Storage
Account Fully Qualified Domain Name (FQDN)
Client ID and client key
authTokenEndpoint
HCP
AWS region where the S3 bucket was created
Access key and secret access key
Read-only permissions to the S3 bucket
HDFS
Hadoop version 2.7.2 and later
URI should provide a hostname and share folder details
Path of the directory that needs to be scanned
Read-only access to the directory
SharePoint and OneDrive
Application (client) ID, Directory (tenant) ID, and clientSecret from a registered app on the Azure portal
Delegated permissions and Application permissions in the registered app
Read-only permissions to the SharePoint and OneDrive sites
SMB/CIFS
URI should provide a hostname and share folder details
Username and password to access the SMB/CIFS share directory
Path of the directory that needs to be scanned
Read-only access to the directory
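Several of the data sources above require a URI that carries a hostname and share folder details. As a rough pre-check, a URI can be split into those parts before it is entered. The sketch below assumes a URI of the form smb://host/share/path, which is an illustrative format and not a confirmed Data Optimizer convention; the function name and the example host are hypothetical.

```python
from urllib.parse import urlsplit


def describe_share_uri(uri: str) -> dict:
    """Split a share URI into scheme, host, share folder, and remaining path.

    Raises ValueError if the hostname or the share folder is missing.
    """
    parts = urlsplit(uri)
    segments = [segment for segment in parts.path.split("/") if segment]
    if not parts.hostname or not segments:
        raise ValueError("URI must include a hostname and a share folder")
    return {
        "scheme": parts.scheme,
        "host": parts.hostname,
        "share": segments[0],
        "path": "/".join(segments[1:]),
    }


if __name__ == "__main__":
    # Hypothetical example URI.
    print(describe_share_uri("smb://fileserver.example.com/archive/2023/reports"))
```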
(Optional) Client Virtual Desktop Infrastructure
The following table contains the client’s Virtual Desktop Infrastructure (VDI) requirements.
Server configuration: Windows operating system, 16 GB RAM
Disk or storage: 100 GB minimum
Others: Internet connectivity, Google Chrome browser, and permission to download files from the FTP server (secure FTP access)