Veritas NetBackup for HBase Administrator's Guide

Last Published:
Product(s): NetBackup (8.1.1)
Platform: Linux,UNIX,Windows
  1. Introduction
    1.  
      Protecting HBase data using NetBackup
    2.  
      Backing up HBase data
    3.  
      Restoring HBase data
    4.  
      Deploying the HBase plug-in
    5.  
      NetBackup for HBase terminologies
    6.  
      Limitations
  2. Installing and deploying HBase plug-in for NetBackup
    1.  
      About installing and deploying the HBase plug-in
    2.  
      Pre-requisites for installing the HBase plug-in
    3.  
      Operating system and platform compatibility
    4.  
      License for HBase plug-in for NetBackup
    5.  
      Preparing the HBase cluster
    6.  
      Downloading the plug-in
    7.  
      Installing the HBase plug-in
    8.  
      Best practices for deploying the HBase plug-in
    9.  
      Post installation procedures
    10.  
      Verifying the installation of the HBase plug-in
  3. Configuring NetBackup for HBase
    1.  
      About configuring NetBackup for HBase
    2. Managing backup hosts
      1.  
        Whitelisting a NetBackup client on NetBackup master server
      2.  
        Configure a NetBackup Appliance as a backup host
    3.  
      Adding HBase credentials in NetBackup
    4. Configuring the HBase plug-in using the HBase configuration file
      1.  
        Configuring NetBackup for a highly-available HBase cluster
    5.  
      Configuration for a HBase cluster that uses Kerberos
    6. Configuring NetBackup policies for HBase plug-in
      1. Creating a BigData backup policy
        1. Creating BigData policy using the NetBackup Administration Console
          1.  
            Using the Policy Configuration Wizard to create a BigData policy for HBase clusters
          2.  
            Using the NetBackup Policies utility to create a BigData policy for HBase clusters
        2.  
          Using NetBackup Command Line Interface (CLI) to create a BigData policy for HBase clusters
    7.  
      Disaster recovery of a HBase cluster
  4. Performing backups and restores of HBase
    1. About backing up a HBase cluster
      1.  
        Pre-requisite for running backup and restore operations for a HBase cluster with Kerberos authentication
      2.  
        Backing up a HBase cluster
      3.  
        Best practices for backing up a HBase cluster
    2. About restoring a HBase cluster
      1. Restoring HBase data on the same HBase cluster
        1.  
          Using the Restore Wizard to restore HBase data on the same cluster
        2.  
          Using the bprestore command to restore HBase data on the same HBase cluster
    3.  
      Restoring HBase data on an alternate HBase cluster
    4.  
      Restoring truncated tables
    5.  
      Best practices for restoring a HBase cluster
  5. Troubleshooting
    1.  
      About NetBackup for HBase debug logging
    2.  
      Backup fails with error 6609
    3.  
      Backup fails with error 6601
    4.  
      Backup fails with error 6623
    5.  
      Restore fails with error 2850
    6.  
      Backup fails with error 20

NetBackup for HBase terminologies

The following table defines the terms you will come across when using NetBackup for protecting HBase cluster.

Table: NetBackup terminologies

Terminology

Definition

Compound job

A backup job for HBase data is a compound job.

  • The backup job runs a discovery job for getting information of the data to be backed up.

  • Child jobs are created for each backup host that performs the actual data transfer.

  • After the backup is complete, the job cleans up the snapshots on the HMaster and is then marked complete.

Discovery job

When a backup job is executed, first a discovery job is created. The discovery job communicates with the HMaster and gathers information of the block that needs to be backed up and the associated Region servers. At the end of the discovery, the job populates a workload discovery file that NetBackup then uses to distribute the workload amongst the backup hosts.

Child job

For backup, a separate child job is created for each backup host to transfer data to the storage media. A child job can transfer data blocks from multiple Region servers.

Workload discovery file

During discovery, when the backup host communicates with the HMaster, a workload discovery file is created. The file contains information about the data blocks to be backed up and the associated Region servers.

Workload distribution file

After the discovery is complete, NetBackup creates a workload distribution file for each backup host. These files contain information of the data that is transferred by the respective backup host.

Parallel streams

The NetBackup parallel streaming framework allows data blocks from multiple Region servers to be backed up using multiple backup hosts simultaneously.

Backup host

The backup host acts as a proxy client. All the backup and restore operations are executed through the backup host.

You can configure media servers, clients, or a master server as a backup host.

The backup host is also used as destination client during restores.

BigData policy

The BigData policy is introduced to:

  • Specify the application type.

  • Allow backing up distributed multi-node environments.

  • Associate backup hosts.

  • Perform workload distribution.

Application server

HMaster is referred to as a application server in NetBackup.

Primary HMaster

In a high-availability scenario, you need to specify one HMaster with the BigData policy and with the tpconfig command. This HMaster is referred as the primary HMaster.

Fail-over HMaster

In a high-availability scenario, the HMaster other than the primary HMaster that are updated in the hbase.conf file are referred as fail-over HMaster.