Veritas NetBackup™ for HBase Administrator's Guide

Last Published:
Product(s): NetBackup (9.0.0.1, 9.0)
  1. Introduction
    1.  
      Protecting HBase data using NetBackup
    2.  
      Backing up HBase data
    3.  
      Restoring HBase data
    4.  
      NetBackup for HBase terminologies
    5.  
      Limitations
  2. Deploying HBase plug-in for NetBackup
    1.  
      About the HBase plug-in deployment
    2.  
      Pre-requisites for installing the HBase plug-in
    3.  
      Operating system and platform compatibility
    4.  
      License for HBase plug-in for NetBackup
    5.  
      Preparing the HBase cluster
    6.  
      Best practices for deploying the HBase plug-in
    7.  
      Post installation procedures
    8.  
      Verifying the deployment of the HBase plug-in
  3. Configuring NetBackup for HBase
    1.  
      About configuring NetBackup for HBase
    2. Managing backup hosts
      1.  
        Whitelisting a NetBackup client on NetBackup master server
      2.  
        Configure a NetBackup Appliance as a backup host
    3.  
      Adding HBase credentials in NetBackup
    4. Configuring the HBase plug-in using the HBase configuration file
      1.  
        Configuring NetBackup for a highly-available HBase cluster
      2.  
        Configuring communication between NetBackup and HBase clusters that have SSL enabled (HTTPS)
    5.  
      Configuration for a HBase cluster that uses Kerberos
    6. Configuring NetBackup policies for HBase plug-in
      1. Creating a BigData backup policy
        1. Creating BigData policy using the NetBackup Administration Console
          1.  
            Using the Policy Configuration Wizard to create a BigData policy for HBase clusters
          2.  
            Using the NetBackup Policies utility to create a BigData policy for HBase clusters
        2.  
          Using NetBackup Command Line Interface (CLI) to create a BigData policy for HBase clusters
    7.  
      Disaster recovery of a HBase cluster
  4. Performing backups and restores of HBase
    1. About backing up a HBase cluster
      1.  
        Pre-requisite for running backup and restore operations for a HBase cluster with Kerberos authentication
      2.  
        Backing up a HBase cluster
      3.  
        Best practices for backing up a HBase cluster
    2. About restoring a HBase cluster
      1. Restoring HBase data on the same HBase cluster
        1.  
          Using the Restore Wizard to restore HBase data on the same cluster
        2.  
          Using the bprestore command to restore HBase data on the same HBase cluster
    3.  
      Restoring HBase data on an alternate HBase cluster
    4.  
      Restoring truncated tables
    5.  
      Best practices for restoring a HBase cluster
  5. Troubleshooting
    1.  
      About NetBackup for HBase debug logging
    2.  
      Backup fails with error 6609
    3.  
      Backup fails with error 6601
    4.  
      Backup fails with error 6623
    5.  
      Restore fails with error 2850
    6.  
      Backup fails with error 20
    7.  
      Backup fails with error 112
    8.  
      Backup operation fails with error 6654
    9.  
      NetBackup configuration and certificate files do not persist after the container-based NetBackup appliance restarts
    10.  
      Configuration file is not recovered after a disaster recovery
  6.  
    Index

Backing up HBase data

HBase data is backed up in parallel streams wherein HBase Region servers stream data blocks simultaneously to multiple backup hosts.

The following diagram provides an overview of the backup flow:

Figure: Backup flow

Backup flow

As illustrated in the following diagram:

  1. A scheduled backup job is triggered from the master server.

  2. Backup job for HBase data is a compound job. When the backup job is triggered, first a discovery job is run.

  3. During discovery, the first backup host connects with the Hmaster and performs a discovery to get details of data that needs to be backed up.

  4. A workload discovery file is created on the backup host. The workload discovery file contains the details of the data that needs to be backed up from the different Region servers.

  5. The backup host uses the workload discovery file and decides how the workload is distributed amongst the backup hosts. Workload distribution files are created for each backup host.

  6. Individual child jobs are executed for each backup host. As specified in the workload distribution files, data is backed up.

  7. Data blocks are streamed simultaneously from different Region servers to multiple backup hosts.

The compound backup job is not completed until all the child jobs are completed. After the child jobs are completed, NetBackup cleans all the snapshots from the HMaster. Only after the cleanup activity is completed, the compound backup job is completed.

See About backing up a HBase cluster.