Veritas™Resiliency Platform 2.2 Solutions for Virtual Business Services

Last Published:
Product(s): Resiliency Platform & CloudMobility (2.2)
  1. Overview of Resiliency Platform
    1.  
      About Veritas Resiliency Platform
    2.  
      About Resiliency Platform features and components
    3.  
      About permissions for operations in the console
  2. Using Resiliency Platform for disaster recovery
    1.  
      About disaster recovery using Resiliency Platform
    2.  
      Understanding the role of resiliency groups in disaster recovery operations
  3. About virtual business services
    1. About virtual business services
      1.  
        Understanding virtual business service tiers
    2.  
      Creating a virtual business service
    3.  
      Starting and stopping a virtual business service
    4.  
      Displaying virtual business service details
    5.  
      Editing a virtual business service
    6.  
      Deleting a virtual business service
    7.  
      Performing rehearsal on a virtual business service
    8.  
      Performing cleanup rehearsal on a virtual business service
    9.  
      Migrating a virtual business service
    10.  
      Taking over a virtual business service
    11.  
      Performing resync for a virtual business service
    12.  
      Performing restore for a virtual business service
  4. Monitoring risks
    1.  
      About risk insight
    2.  
      Displaying risk information
    3.  
      Predefined risks in Resiliency Platform
    4.  
      Viewing the current risk report
    5.  
      Viewing the historical risk report
  5. Managing activities and resiliency plans
    1. Managing activities
      1.  
        Viewing activities
      2.  
        Aborting a running activity
    2. Managing resiliency plans
      1.  
        About resiliency plans
      2. Creating a new resiliency plan template
        1. About manual task
          1.  
            Using manual tasks in resiliency plans
        2. About custom script
          1.  
            Using custom scripts in resiliency plans
      3.  
        Editing a resiliency plan template
      4.  
        Deleting a resiliency plan template
      5.  
        Viewing a resiliency plan template
      6.  
        Creating a new resiliency plan
      7.  
        Editing a resiliency plan
      8.  
        Deleting a resiliency plan
      9.  
        Executing a resiliency plan
      10.  
        Viewing a resiliency plan
      11.  
        Creating a schedule for a resiliency plan
      12.  
        Editing a schedule for a resiliency plan
      13.  
        Deleting a schedule for a resiliency plan
      14.  
        Viewing a schedule for a resiliency plan
  6. Managing evacuation plans
    1.  
      About evacuation plan
    2.  
      Generating an evacuation plan
    3.  
      Regenerating an evacuation plan
    4.  
      Performing evacuation
    5.  
      Performing rehearse evacuation
    6.  
      Performing cleanup evacuation rehearsal
  7. Appendix A. Troubleshooting
    1.  
      Viewing events and logs in the console
  8.  
    Glossary

Predefined risks in Resiliency Platform

Table: Predefined risks lists the predefined risks available in Resiliency Platform. These risks are reflected in the current risk report and the historical risk report.

Table: Predefined risks

Risks

Description

Risk detection time

Risk type

Affected operation

Fix if violated

Veritas Infoscale Operations Manager disconnected

Checks for Veritas Infoscale Operations Manager to Resiliency Manager connection state

1 minute

Error

All operations

Check Veritas Infoscale Operations Manager reachability

Try to reconnect Veritas Infoscale Operations Manager

vCenter Password Incorrect

Checks if vCenter password is incorrect

5 minutes

Error

  • On primary site: start or stop operations

  • On secondary site: migrate or takeover operations

In case of a password change, resolve the password issue and refresh the vCenter configuration

VM tools not installed

Checks if VM Tools are not Installed. It may affect IP Customization and VM Shutdown.

Real time, when resiliency group is created

Error

  • Migrate

  • Stop

  • In case of VMWare, install VMWare Tools

  • In case of Hyper-V, install Hyper-V Integration Tools

Snapshot removed from Virtual Machine

Checks if snapshot has been removed from virtual machine.

5 minutes

Error

Resiliency Platform Data Mover replication

Edit the resiliency group to refresh configuration

Snapshot reverted on Virtual Machine

Checks if snapshot has been reverted on virtual machine.

5 minutes

Error

Resiliency Platform Data Mover replication

Remove and re-add the virtual machine to the Resiliency group by editing Resiliency group

Data Mover Daemon Crash

Checks if VM Data Mover filter is not able to connect to its counterpart in ESX.

5 minutes

Error

Resiliency Platform Data Mover replication

In order to continue the replication, you can move (VMotion) the VM to a different ESX node in the cluster and either troubleshoot the issue with this ESX node or raise a support case with Veritas

Snapshot created on Virtual Machine

Checks if a snapshot has been created on Virtual machine.

5 minutes

Error

Resiliency Platform Data Mover replication

Edit the resiliency group to refresh configuration

DataMover virtual machine in noop mode

Checks if VM Data Mover filter is not able to connect to its counterpart in ESX.

5 minutes

Error

Resiliency Platform Data Mover replication

In order to continue the replication, you can move (VMotion) the VM to a different ESX node in the cluster and either troubleshoot the issue with this ESX node or raise a support case with Veritas

Resiliency group configuration drift

Checks if disk configuration of any of the assets in the resiliency group has changed.

30 minutes

Error

  • Migrate

  • Resync

Edit the resiliency group to first remove the impacted virtual machine from the resiliency group and then add it back to the resiliency group.

Global user deleted

Checks if there are no global users. In this case, the user will not be able to customize the IP for Windows machines in VMware environment.

Real time

Warning

  • Migrate

  • Takeover

Edit the resiliency group or add a Global user

Missing heartbeat from Resiliency Manager

Checks for heartbeat failure from a Resiliency Manager.

5 minutes

Error

All

Fix the Resiliency Manager connectivity issue

Infrastructure Management Server disconnected

Check for Infrastructure Management Server(IMS) to Resiliency Manager(RM) connection state.

1 minute

Error

All

Check IMS reachability

Try to reconnect IMS

Storage Discovery Host down

Checks if the discovery daemon is down on the storage discovery host

15 minutes

Error

Migrate

Resolve the discovery daemon issue

DNS removed

Checks if DNS is removed from the resiliency group where DNS customization is enabled

real time

Warning

  • Migrate

  • Takeover

Edit the Resiliency Group and disable DNS customization

IOTap driver not configured

Checks if the IOTap driver is not configured

2 hours

Error

None

Configure the IOTap driver

This risk is removed when the workload is configured for disaster recovery

VMware Discovery Host Down

Checks if the discovery daemon is down on the VMware Discovery Host

15 minutes

Error

Migrate

Resolve the discovery daemon issue

VM restart is pending

Checks if the VM has not been restarted after add host operation

2 hours

Error

Configure DR

Restart the VM after add host operation

New VM added to replication storage

Checks if a virtual machine that is added to a Veritas Replication Set on a primary site, is not a part of the resiliency group.

5 minutes

Error

  • Migrate

  • Takeover

  • Rehearsal

Add the virtual machine to the resiliency group.

Replication lag exceeding RPO

Checks if the replication lag exceeds the thresholds defined for the resiliency group. This risk affects the SLA for the services running on your production data center.

5 minutes

Warning

  • Migrate

  • Takeover

Check if the replication lag exceeds the RPO that is defined in the Service Objective

Replication state broken/critical

Checks if the replication is not working or is in a critical condition for each resiliency group.

5 minutes

Error

  • Migrate

  • Takeover

Contact the enclosure vendor.

Remote mount point already mounted

Checks if the mount point is not available for mounting on target site for any of the following reasons:

  • Mount point is already mounted.

  • Mount point is being used by other assets.

  • Native (ext3, ext4,NTFS ): 30 minutes

  • Virtualization (VMFS, NFS): 6 hours

Warning

  • Migrate

  • Takeover

Unmount the mount point that is already mounted or is being used by other assets.

Disk utilization critical

Checks if at least 80% of the disk capacity is being utilized. The risk is generated for all the resiliency groups associated with that particular file system.

  • Native (ext3, ext4,NTFS ): 30 minutes

  • Virtualization (VMFS, NFS): 6 hours

Warning

  • Migrate

  • Takeover

  • Rehearsal

Delete or move some files or uninstall some non-critical applications to free up some disk space.

ESX not reachable

Checks if the ESX server is in a disconnected state.

5 minutes

Error

  • On primary site: start or stop operations

  • On secondary site: migrate or takeover operations

Resolve the ESX server connection issue.

vCenter Server not reachable

Checks if the virtualization server is unreachable or if the password for the virtualization server has changed.

5 minutes

Error

  • On primary site: start or stop operations

  • On secondary site: migrate or takeover operations

Resolve the virtualization server connection issue.

In case of a password change, resolve the password issue.

Insufficient compute resources on failover target

Checks if there are insufficient CPU resources on failover target in a virtual environment.

6 hours

Warning

  • Migrate

  • Takeover

Reduce the number of CPUs assigned to the virtual machines on the primary site to match the available CPU resources on failover target.

Host not added on recovery data center

Checks if the host is not added to the IMS on the recovery data center.

30 minutes

Error

Migrate

Check the following and fix:

  • Host is up on recovery data center.

  • Host is accessible from recovery datacenter IMS.

  • Time is synchronized between host and recovery datacenter IMS.

NetBackup Notification channel disconnected

Checks for NetBackup Notification channel connection state

5 minutes

Error

Restore

Check if the NetBackup Notification channel is added to the NetBackup master server.

Backup image violates the defined RPO

Checks if the backup image violates the defined RPO

30 minutes

Warning

No operation

  • Check the connection state of NetBackup Notification channel

  • Check for issues due to which backup images are not available

NetBackup master server disconnected

Checks if NetBackup master server is disconnected or not reachable

5 minutes

Error

Restore

Check if IMS is added as an additional server to the NetBackup master server

Assets do not have copy policy

Checks if the assets do not have a copy policy

3 hours

Warning

No operation

Set up copy policy and then refresh the NetBackup master server

Target replication is not configured

Checks if the target replication is not configured

3 hours

Warning

No operation

Configure target replication and then refresh the NetBackup master server

Disabled NetBackup Policy

NetBackup policy associated with the virtual machine is disabled

3 hours

Warning

No operation

Fix the disabled policy