Disaster Recovery Administration Checklist

Use this checklist to record your progress while administering a LogRhythm Disaster Recovery deployment.

Regular Monitoring Tasks

[ ] Check replication status using LogRhythm DR Control:
- [ ] Run DR Control (Start > All Programs > LogRhythm > Disaster Recovery > DR Control) as administrator
- [ ] Verify databases show "Synchronized" or "Synchronizing" status
- [ ] Review metrics (SendQueue, SendRate, RedoQueue, RedoRate, EstimatedRecoveryTime, SyncPerformance)
- [ ] Exit panel with 'Q'
[ ] Alternatively, use the AlwaysOn Availability Group Dashboard:
- [ ] Start SQL Server Management Studio and log in as administrator
- [ ] Expand the AlwaysOn High Availability folder, then the Availability Groups folder
- [ ] Right-click the Availability Group and select "Show Dashboard"
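
The status check above can also be scripted. The following is a minimal sketch and is not part of LogRhythm DR Control. It assumes the rows have already been fetched from SQL Server's `sys.dm_hadr_database_replica_states` view (for example with the query held in `REPLICA_STATE_QUERY`) and flags any database whose state is not Synchronized or Synchronizing. The function names and row layout are illustrative assumptions, and the redo-time estimate is the common "redo queue divided by redo rate" approximation, which may differ from the EstimatedRecoveryTime value that DR Control reports.

```python
from typing import Iterable, List, Optional, Tuple

# Query against a standard SQL Server DMV. Running it requires a database
# driver and a connection, which this sketch deliberately leaves out.
REPLICA_STATE_QUERY = """
SELECT DB_NAME(database_id) AS database_name,
       synchronization_state_desc,
       log_send_queue_size,   -- KB
       redo_queue_size,       -- KB
       redo_rate              -- KB/sec
FROM sys.dm_hadr_database_replica_states
"""

# States the checklist treats as acceptable.
HEALTHY_STATES = {"SYNCHRONIZED", "SYNCHRONIZING"}


def unhealthy_databases(rows: Iterable[Tuple[str, str]]) -> List[str]:
    """Return names of databases whose state is not Synchronized/Synchronizing.

    `rows` is an iterable of (database_name, state) pairs (assumed layout).
    """
    return [
        name for name, state in rows
        if state.strip().upper() not in HEALTHY_STATES
    ]


def estimated_redo_seconds(redo_queue_kb: float,
                           redo_rate_kb_per_s: float) -> Optional[float]:
    """Approximate time to drain the redo queue; None if the rate is unusable."""
    if redo_rate_kb_per_s <= 0:
        return None
    return redo_queue_kb / redo_rate_kb_per_s
```

An empty list from `unhealthy_databases` corresponds to the "Synchronized or Synchronizing" item above. Anything else is a prompt to open DR Control or the dashboard and investigate.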

Replication Mode Review

[ ] Review current replication mode (Asynchronous or Synchronous)
[ ] Determine if mode changes are needed based on:
- [ ] Current network performance
- [ ] Recovery Point Objective (RPO) requirements
- [ ] Performance requirements
- [ ] Distance between Primary and Secondary sites
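
For the RPO criterion, one common way to gauge exposure in Asynchronous mode is to divide the log send queue by the log generation rate. This approximates how many seconds of committed data have not yet reached the Secondary site. The sketch below is illustrative only: the function names, the inputs, and the direct comparison against an RPO expressed in seconds are assumptions, not LogRhythm guidance.

```python
def estimated_data_loss_seconds(log_send_queue_kb: float,
                                log_generation_kb_per_s: float) -> float:
    """Approximate seconds of log not yet sent to the Secondary.

    Both inputs are assumed to be measured by the reader, for example from
    the SendQueue metric and a log-generation performance counter.
    """
    if log_send_queue_kb < 0 or log_generation_kb_per_s <= 0:
        raise ValueError("queue must be >= 0 and generation rate must be > 0")
    return log_send_queue_kb / log_generation_kb_per_s


def meets_rpo(log_send_queue_kb: float,
              log_generation_kb_per_s: float,
              rpo_seconds: float) -> bool:
    """True if the estimated exposure is within the stated RPO."""
    exposure = estimated_data_loss_seconds(log_send_queue_kb,
                                           log_generation_kb_per_s)
    return exposure <= rpo_seconds
```

With a 6000 KB send queue and 200 KB/s of log generation, the estimate is 30 seconds. That would satisfy a 60-second RPO but not a 15-second one, which is one signal that Synchronous mode may be worth evaluating against the performance and distance items above.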

Planned Failover (Primary Site Available)

[ ] Access the Primary (active) Platform Manager
[ ] Run DR Control (Start > All Programs > LogRhythm > Disaster Recovery > DR Control) as administrator
[ ] Press 'D' to display DR Control Options
[ ] Type 'F' to initiate failover process
[ ] Confirm with 'Y' when prompted
[ ] Wait for automatic tasks to complete:
- [ ] Platform Manager services stopping on Primary site
- [ ] Database synchronization verification
- [ ] Secondary Platform Manager designation as Active site

Post-Failover Tasks

[ ] Verify DNS record updates (automatic or manual) to point to Secondary Platform Manager
[ ] Wait for TTL limit to be reached
[ ] Confirm Platform Manager services have started on Secondary site:
- [ ] Alarming and Response Manager (ARM) service
- [ ] Job Manager service
[ ] Start services for Data Processors, Data Indexers, and AI Engines if necessary
[ ] Verify remote systems reconnection to Secondary Platform Manager
[ ] Test system functionality on Secondary site
[ ] Document failover completion
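
The DNS verification and TTL wait above can be approximated with a small polling loop. This is a hedged sketch, not a LogRhythm tool: `wait_for_dns`, its parameters, and the example hostname and addresses are all hypothetical. It uses the operating system resolver through `socket.gethostbyname`, so it reflects what the machine running it sees, including any local cache.

```python
import socket
import time
from typing import Callable


def wait_for_dns(hostname: str,
                 expected_ip: str,
                 timeout_s: float = 300.0,
                 interval_s: float = 10.0,
                 resolve: Callable[[str], str] = socket.gethostbyname,
                 sleep: Callable[[float], None] = time.sleep) -> bool:
    """Poll until `hostname` resolves to `expected_ip` or the timeout expires.

    `resolve` and `sleep` are injectable so the loop can be exercised
    without real DNS lookups or real waiting.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        try:
            if resolve(hostname) == expected_ip:
                return True
        except OSError:
            # Lookup failures (socket.gaierror is an OSError) are treated
            # as "not updated yet" rather than as a fatal error.
            pass
        if time.monotonic() >= deadline:
            return False
        sleep(interval_s)
```

A call such as `wait_for_dns("pm.example.test", "10.0.0.2", timeout_s=600)` would return `True` once the shared name points at the Secondary Platform Manager. Choosing a timeout somewhat longer than the record's TTL matches the "wait for TTL" item.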

Forced Failover (Primary Site Unavailable)

[ ] Go to the Secondary (standby) Platform Manager
[ ] Run DR Control (Start > All Programs > LogRhythm > Disaster Recovery > DR Control) as administrator
[ ] Acknowledge potential data loss warning by typing 'Y'
[ ] Wait for automatic tasks to complete:
- [ ] Secondary Platform Manager switching to Active state
- [ ] Platform Manager services starting on Secondary site
- [ ] Replicated databases loading
[ ] Press Enter to exit when failover is complete

Disaster Recovery Uninstall

[ ] From the DR Install folder on the primary server, run DR Re-IP Uninstall.exe as administrator
[ ] Click the "Uninstall" tab
[ ] Review description of uninstall process
[ ] Click "Uninstall" and follow confirmation prompts
[ ] Enter sysadmin-level SQL credentials when prompted
[ ] Review script output and address any errors
[ ] Repeat steps on secondary server
[ ] For secondary server, correctly identify deployment type when prompted

Post-Uninstall Verification

[ ] Verify no databases are in Synchronizing, Not Synchronizing, Restoring, or Suspect state
[ ] Confirm LogRhythm folder in Windows Task Scheduler has been removed
[ ] Verify CONSUL_CLIENT environment variable does not exist on XM/PM
[ ] Confirm all LogRhythm PM services are running
[ ] Verify SQL job "LogRhythm DR Job Management" is gone and all remaining SQL Server agent jobs are enabled
[ ] Update components to use management IP instead of shared DNS or failover IPs
[ ] Re-run LRII and remove host record for Secondary server
[ ] In Deployment Properties, change "Does your deployment include Disaster Recovery (DR)?" to No
[ ] Run "Get-Cluster" from elevated PowerShell to verify cluster service is not running
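
Two of the verification items above lend themselves to a quick scripted check. The sketch below is an assumption-laden illustration rather than a LogRhythm utility: the function names are hypothetical, and the database states are expected to be supplied by the reader as (name, state) pairs gathered by whatever means is convenient, such as SQL Server Management Studio.

```python
import os
from typing import Iterable, List, Mapping, Tuple

# States the checklist says must not remain after uninstall.
BLOCKING_STATES = {"SYNCHRONIZING", "NOT SYNCHRONIZING", "RESTORING", "SUSPECT"}


def consul_client_removed(env: Mapping[str, str] = os.environ) -> bool:
    """True if no CONSUL_CLIENT variable is visible in `env`.

    Windows variable names are case-insensitive, so names are compared
    in upper case.
    """
    return all(name.upper() != "CONSUL_CLIENT" for name in env)


def databases_in_blocking_state(rows: Iterable[Tuple[str, str]]) -> List[str]:
    """Return names of databases still in a state the checklist disallows."""
    return [
        name for name, state in rows
        if state.strip().upper() in BLOCKING_STATES
    ]
```

A process only sees the environment it inherited at startup, so run the environment check from a freshly opened session on the XM/PM. Otherwise a variable that has already been removed may still appear to exist.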