Translation Notice
Please note that this content includes text that has been machine-translated from English. Veritas does not guarantee the accuracy regarding the completeness of the translation. You may also refer to the English Version of this knowledge base article for up-to-date information.
NetBackup 8.1 / 3.1 Hotfix - CloudCatalyst EEB Bundle (Etrack 3958410)
Abstract
Description
A hotfix is available for NetBackup 8.1 which resolves the following issues seen after an upgrade:
- The cache eviction takes a long time. The delete request prevents the upload request.
- Performance improvement for uploads to cloud
- Certain error conditions could cause failed uploads to be reported as successful resulting in potential data loss.
- A hang could occur after several hours or days as a result of a previous non-graceful service shutdown.
- A fix to the Azure plugin is included - vxesp crashes leading to hung CC jobs.
- The rocksdb library has been upgraded to the latest available version to prevent vxesfsd crashes.
- The cache eviction logic fails on some CloudCatalyst servers installed on xfs or other filesystems.This results in a cache full scenario and jobs failing with write errors.
- vxesfsd is crashing in rocksdb. Set FSDB block cache size or disable it.
- vxesfsd crashes in del_file when deleting a file which had nlink set to 0 in FSDB.
- vxesfsd is crashing in esfs_opendir() due to a null-pointer exception.
- vxesfsd is crashing in boost libraries. Remove boost from vxesfsd.
- Added more detailed logging for cache eviction.
- Do not run cache eviction every 10 seconds if very little space is freed.
- Do not use status 1 in image stat file because it is treated as success by MSDP.
- Avoid crash from incorrect path in upload file list.
- Do not use world-writable permissions on created directories.
- Make sure that correct status is returned on failure for rmdir.
- Do not write __EMPTYFILE__ into .bin/.bhd files
- Download size is not the same value as fsdb. Disable download size check in case the
- size in fsdb is incorrect.
- Cannot remove certain directories. Change rmdir logic of checking directory is empty.
- This change makes sure that directory with garbage data is able to be deleted.
- vxesfsd crashing. Set max open file to limit RocksDB memory allocation. The allocated
- RSS memory will never be larger than 1GB.
- Implement write pause instead of fail when the cache hits the high watermark (the cache is full). Container write will be paused during cache eviction time.
- Round robin between upload and delete requests to avoid starving delete requests in very busy environments.
- Improve performance of opendir/readdir (remove support for optional d_type on readdir since MSDP does not use it)
- Cache eviction improvements
- Skip bhd files and recently modified files during cache eviction
- If unable to reclaim enough space, consider them for eviction the next time
- Check disk usage no more than once every 10 seconds
- Cache sys_ino for /data and /databases for performance
- Remove unnecessary lock for esfs_opendir for MSDP performance
- Improve parse_upload_filelist performance
- Cache sys_ino values while parsing the filelist to reduce stress on fsdb
- Assume root directory is ESFS_ROOT_INO instead of querying fsdb on every esfs_lookup call
- Fix error status for readonly mode for upload msdp system files
- Avoid possible duplicate ino in case of fsdb error
- Propagate error properly on failed cloud delete request
- Prevent inode reuse and change list result for Azure.
- Call putattr on error path.
- Fix crash on failed start during rapid start/stop/start sequence.
- Allow esfs running for non fatal error in fsdb. Avoid crash once vxesfs cannot continue at startup.
- For delete requests change the ino to ext_rscn if it's not null (case of duplicate ino) for DR from cloud.
- Storage manager uses ext_rscn as real inode for download because it might be reused. The utility of DR from cloud stores inode in cloud into ext_rscn.
- Comprehensive fsdb check at start. The allocated inode checking time is the same as metadata checking time.
- Updated pre and post install scripts for the EEB.
- Fix vxesfsd crashes by removing boost library.
Added to Version 16:
Remove libnbsqlite.so dependency from fsdb_check
Versions Affected
- NetBackup 8.1
- NetBackup Appliance 3.1
Read me
The formal resolution for this issue (Etrack 3958410) is included in the following release:
- NetBackup 8.2
This EEB should be installed on: Cloud Catalyst Media Servers
Installation Steps:
- Stop NBU services.
- Uninstall any previous version of this EEB (3958410 versions 1 to 15) before installing.
- If not an NBU appliance, please run the EEB installer with the create option.
- Run the cc_touch utility (runs the touch command on the container files in the /cache/userdata/data directory) located in the bin/goodies directory as follows: cc_touch /cache/userdata/data
- Change esfs.json (/cache/etc/esfs.json) to have LowWatermark 50 and HighWatermark 90 values. Note: DACDays setting in esfs.json has been deprecated.
- Start NBU services.
EEB Installer instructions:
Using the NetBackup Emergency Engineering Binary (EEB) installer
https://www.veritas.com/docs/100019405
Installing EEBs on a NetBackup 52x0 / 5330 Appliance
https://www.veritas.com/docs/100023444
Download Files
- NB_8.1_ET3958410_16.zip
- NBAPP_EEB_ET3958410-3.1.0.0-16.x86_64.rpm
- VRTSflex-nb_EEB_ET3958410-8.1-16.x86_64.rpm
EEB Installer Contents
File Checksum Byte count
linuxR_x86/cc_touch 1385895309 183520
linuxR_x86/dbdump 3335979864 2070008
linuxR_x86/esfs_check 4285991846 2334888
linuxR_x86/fsdb_check 3593585795 4098000
linuxR_x86/install-3958410 2156734147 2406
linuxR_x86/librocksdb.so.6.0.2 1725966266 5219784
linuxR_x86/libstspiazure.so 572332941 2321290
linuxR_x86/libstspiazureMT.so 1028344246 2323921
linuxR_x86/libtcmalloc.so.4.5.3 1715574939 2342432
linuxR_x86/mkesfs 3374189183 2367400
linuxR_x86/post_uninstall-3958410 2572799933 373
linuxR_x86/pre_proc_uninstall_3958410 4204104862 4223
linuxR_x86/preprocess_install_3958410 3415945899 1182
linuxR_x86/vxesfsd 240630947 2061784
linuxR_x86/vxesp 1404305135 247632
Recommended service state:
Stop all NetBackup services before applying this hotfix.
Update files
|
File name | Description | Version | Platform | Size |
---|
Applies to the following product releases
Knowledge base
Duplication jobs to CloudCatalyst may complete with status 0, but if cache fills up, dataloss could result
2020-02-11Severity Possible Data Loss Description Duplication jobs to the cloud using NetBackup CloudCatalyst will complete with status 0, but if the CloudCatalyst cache volume becomes full, data loss can result. Versions Affected NetBackup 8.1 EEB 3958410...