Veritas NetBackup for Hadoop Administrator's Guide

Product(s): NetBackup (8.1)

Restoring Hadoop data on an alternate Hadoop cluster

NetBackup lets you restore Hadoop data to another NameNode or Hadoop cluster. This type of restore is also referred to as a redirected restore.

Note:

NetBackup supports redirected restores only through the Command Line Interface (CLI).

Note:

Make sure that you have added the credentials for the alternate NameNode or Hadoop cluster in the NetBackup master server and have completed the whitelisting tasks on the NetBackup master server. For more information about how to add Hadoop credentials in NetBackup and about the whitelisting procedures, see Adding Hadoop credentials in NetBackup and Whitelisting a NetBackup client on NetBackup master server.

To perform a redirected restore for Hadoop

  1. Modify the values for rename_file and listfile as follows:

    | Parameter | Value |
    | --- | --- |
    | rename_file | change /<source_folder_path> to /<destination_folder_path> ALT_APPLICATION_SERVER=<alternate name node> |
    | listfile | List of all the Hadoop files to be restored |

  2. On the NetBackup master server, run the bprestore -S master_server -D backup_host -C client -R rename_file -t 44 -L progress_log -f listfile command, using the values for rename_file and listfile that you modified in step 1.

    Where,

    -S master_server

    Specifies the name of the NetBackup master server.

    -D backup_host

    Specifies the name of the backup host.

    -C client

    Specifies a NameNode as a source to use for finding backups or archives from which to restore files. This name must match the name as it appears in the NetBackup catalog.

    -f listfile

    Specifies a file (listfile) that contains a list of files to be restored and can be used instead of the file names option. In listfile, each file path must be on a separate line.

    -L progress_log

    Specifies the whitelisted file path to which progress information is written.

    -t 44

    Specifies BigData as the policy type.

    -R rename_file

    Specifies the name of a file with name changes for alternate-path restores.

    Use the following form for entries in the rename file:

    change backup_filepath to restore_filepath ALT_APPLICATION_SERVER=<Application Server Name>

    The file paths must start with / (slash).

    Note:

    Ensure that you have whitelisted all file paths, such as <rename_file_path> and <progress_log_path>, that are not already included as part of the NetBackup install path.
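
The procedure above can be sketched as a small helper script. This is a minimal illustration, not part of NetBackup: the function names, the host names, the sample HDFS paths, and the file names rename_file, listfile, and progress.log are all assumptions chosen for the example. The script only writes the two input files and assembles the bprestore argument list from the options described in step 2. It does not run the command.

```python
import shlex
import tempfile
from pathlib import Path


def rename_entry(source_path, destination_path, alt_server):
    """Build one rename-file line in the form given in this section."""
    # The guide requires file paths in the rename file to start with a slash.
    for path in (source_path, destination_path):
        if not path.startswith("/"):
            raise ValueError(f"file path must start with '/': {path!r}")
    return f"change {source_path} to {destination_path} ALT_APPLICATION_SERVER={alt_server}"


def prepare_redirected_restore(work_dir, master_server, backup_host, client,
                               alt_server, renames, files):
    """Write rename_file and listfile, then return the bprestore argument list.

    work_dir is a hypothetical working directory; on a real master server it
    must be a whitelisted path unless it is under the NetBackup install path.
    """
    work_dir = Path(work_dir)
    rename_file = work_dir / "rename_file"
    listfile = work_dir / "listfile"
    progress_log = work_dir / "progress.log"

    # One "change ... to ..." entry per line.
    rename_file.write_text(
        "".join(rename_entry(src, dst, alt_server) + "\n" for src, dst in renames)
    )
    # One file path to restore per line.
    listfile.write_text("".join(path + "\n" for path in files))

    return [
        "bprestore",
        "-S", master_server,
        "-D", backup_host,
        "-C", client,
        "-R", str(rename_file),
        "-t", "44",  # BigData policy type
        "-L", str(progress_log),
        "-f", str(listfile),
    ]


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        argv = prepare_redirected_restore(
            tmp,
            master_server="master.example.com",       # hypothetical host names
            backup_host="backuphost.example.com",
            client="namenode-src.example.com",
            alt_server="namenode-alt.example.com",
            renames=[("/data/sales", "/restore/sales")],
            files=["/data/sales/2017.csv"],
        )
        # Print the command for review instead of executing it.
        print(shlex.join(argv))
```

Printing the command rather than executing it lets you review the generated files and confirm that their paths are whitelisted before you run bprestore on the master server.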