Administer a LogRhythm High Availability Deployment

LogRhythm High Availability is designed to help protect against downtime caused by planned or unplanned outages. It uses host-based replication technologies and constant monitoring of critical components and services.

The LogRhythm HA solution leverages SIOS products to deliver failover and data replication capabilities that are collectively called the SIOS Protection Suite (SPS). Combined with SIOS products, the LogRhythm HA solution can deliver superior uptime and flexibility to ensure that you are able to meet the constant demands of your enterprise.

This guide covers common tasks required to manage the LogRhythm High Availability (HA) Solution for both new and existing LogRhythm installations including:

High Availability Overview
Physical Configuration
Logical Configuration
Platform Updates
LogRhythm Software Updates

Definition of High Availability Terms and Concepts

Term	Definition
Cluster	A grouping of two or more networked servers or nodes
Communications Path	Physical connections between clustered servers
Data Replication	Software-based mirroring between servers via TCP/IP
Failover	The process of bringing a resource In Service when the active server or resource fails. This occurs when a node or resource failure is detected. It is also referred to as an unplanned outage
Heartbeat	The periodic message sent between servers to indicate the server health status
Host IP Address	The IP Address that is persistent to a physical system
Host Name	The logical name that is persistent to a physical system or node
In Service	The state of a resource hierarchy when it is active on a cluster node. When a resource is In Service, it is in an Active state
Local Recovery	Occurs when a resource failure is detected. The process involves restarting one or more resources locally and typically takes less time to recover than failover between nodes
Node	linked computer system that cooperates collectively with other parts of a cluster
Out of Service	The state of a resource hierarchy when it is not active on a node of the cluster. When a resource is Out of Service, it is in a Standby or Failed State
Resource	A system asset to be protected
Resource Hierarchy	A set of resources that are ordered in a way that models an application’s requirements
Shared IP Address	The logical IP Address used by protected services that can move between nodes in a cluster
Shared Name	The logical name used by protected services that can move between nodes in a cluster
Switchover	A process that takes a resource or resource hierarchy Out of Service in an orderly manner and brings them In Service on a backup system. Also referred to as a planned outage
Volume	Logical Volumes are partitions defined in Windows Disk Management. Depending on the appliance, they may be logical or physical volumes