Translation Notice
Please note that this content includes text that has been machine-translated from English. Veritas does not guarantee the accuracy regarding the completeness of the translation. You may also refer to the English Version of this knowledge base article for up-to-date information.
NetBackup 8.1.1 / 3.1.1 Hotfix - CloudCatalyst EEB Bundle (Etrack 3956754)
Abstract
Description
Added A hotfix is available for NetBackup 8.1.1 which resolves the following issues seen after an upgrade:
- The cache eviction takes a long time. The delete request prevents the upload request.
- Performance improvement for uploads to cloud
- Certain error conditions could cause failed uploads to be reported as successful resulting in potential data loss.
- A hang could occur after several hours or days as a result of a previous non-graceful service shutdown.
- A fix to the Azure plugin is included - vxesp crashes leading to hung CC jobs.
- The rocksdb library has been upgraded to the latest available version to prevent vxesfsd crashes.
- The cache eviction logic fails on some CloudCatalyst servers installed on xfs or other filesystems.This results in a cache full scenario and jobs failing with write errors.
- vxesfsd is crashing in rocksdb. Set FSDB block cache size or disable it.
- vxesfsd crashes in del_file when deleting a file which had nlink set to 0 in FSDB.
- vxesfsd is crashing in esfs_opendir() due to a null-pointer exception.
- vxesfsd is crashing in boost libraries. Remove boost from vxesfsd.
- Added more detailed logging for cache eviction.
- Do not run cache eviction every 10 seconds if very little space is freed.
- Do not use status 1 in image stat file because it is treated as success by MSDP.
- Avoid crash from incorrect path in upload file list.
- Do not use world-writable permissions on created directories.
- Make sure that correct status is returned on failure for rmdir.
- Do not write __EMPTYFILE__ into .bin/.bhd files
- Download size is not the same value as fsdb. Disable download size check in case the
- size in fsdb is incorrect.
- Cannot remove certain directories. Change rmdir logic of checking directory is empty.
- This change makes sure that directory with garbage data is able to be deleted.
- vxesfsd crashing. Set max open file to limit RocksDB memory allocation. The allocated
- RSS memory will never be larger than 1GB.
- Implement write pause instead of fail when the cache hits the high watermark (the cache is full). Container write will be paused during cache eviction time.
- Improve performance of opendir/readdir (remove support for optional d_type on readdir since MSDP does not use it)
- Cache eviction improvements
- Skip bhd files and recently modified files during cache eviction
- If unable to reclaim enough space, consider them for eviction the next time
- Check disk usage no more than once every 10 seconds
- Cache sys_ino for /data and /databases for performance
- Remove unnecessary lock for esfs_opendir for MSDP performance
- Improve parse_upload_filelist performance
- Cache sys_ino values while parsing the filelist to reduce stress on fsdb
- Assume root directory is ESFS_ROOT_INO instead of querying fsdb on every esfs_lookup call
- Fix error status for readonly mode for upload msdp system files
- Avoid possible duplicate ino in case of fsdb error
- Propagate error properly on failed cloud delete request
- Prevent inode reuse and change list result for Azure.
- Call putattr on error path.
- Fix crash on failed start during rapid start/stop/start sequence.
- Allow esfs running for non fatal error in fsdb. Avoid crash once vxesfs cannot continue at startup.
- Add log for sts_get_physical_machine_name and sts_get_cred.
- For delete requests change the ino to ext_rscn if it's not null (case of duplicate ino) for DR from cloud.
- Storage manager uses ext_rscn as real inode for download because it might be reused. The utility of DR from cloud stores inode in cloud into ext_rscn.
- Comprehensive fsdb check at start. The allocated inode checking time is the same as metadata checking time.
- Updated pre and post install scripts for the EEB.
- Fix vxesfsd crashes by removing boost library.
- Implement fsdb check and integrate it into vxesfsd. vxesfsd will stop when fsdb has problem. fsdb check can remove garbage entries.
- Add option to check specific file consistency.
- Flush FSDB WAL at some important points. Add more info into fill_emptyfile for better analysis in future.
Added to Version 17:
- Remove libnbsqlite.so dependency from fsdb_check
Versions Affected
- NetBackup 8.1.1
- NetBackup Appliance 3.1.1
The formal resolution for this issue (Etrack 3956754) is included in the following release:
NetBackup 8.2
This EEB should be installed on: Cloud Catalyst Media Servers
EEB Installer instructions:
Using the NetBackup Emergency Engineering Binary (EEB) installer
https://www.veritas.com/docs/100019405
Installing EEBs on a NetBackup 52x0 / 5330 Appliance
https://www.veritas.com/docs/100023444
Installation Steps:
- Stop NBU services.
- Uninstall any previous version of this EEB (3956754 versions 1 to 16) before installing.
- If not an NBU appliance, please run the EEB installer with the create option.
- Run the cc_touch utility (runs the touch command on the container files in the
/cache/userdata/data
directory) located in the bin/goodies directory as follows:cc_touch /cache/userdata/data
- Change esfs.json (
/cache/etc/esfs.json
) to have LowWatermark 50 and HighWatermark 90 values. - Start NBU services.
Downloads:
- NB_8.1.1_ET3956754_17.zip
- NBAPP_EEB_ET3956754-3.1.1.0-17.x86_64.rpm
EEB Installer Contents:
File Checksum Byte count
linuxR_x86/cc_touch 1385895309 183520
linuxR_x86/dbdump 3868977608 7650024
linuxR_x86/esfs_check 947729854 8742200
linuxR_x86/fsdb_check 1921525586 13812688
linuxR_x86/install-3956754 3778100034 2408
linuxR_x86/librocksdb.so.6.0.2 1725966266 5219784
linuxR_x86/libstspiazure.so 3131267038 2729377
linuxR_x86/libstspiazureMT.so 3641006614 2731459
linuxR_x86/libtcmalloc.so.4.5.3 1715574939 2342432
linuxR_x86/mkesfs 704888488 8764128
linuxR_x86/post_uninstall-3956754 788363685 375
linuxR_x86/pre_proc_uninstall_3956754 2869712438 4225
linuxR_x86/preprocess_install_3956754 1795276267 1181
linuxR_x86/vxesfsd 2274859221 7608136
linuxR_x86/vxesp 318816760 733616
Recommended service state:
Stop all NetBackup services before applying this hotfix.
Applies to the following product releases
Update files
|
File name | Description | Version | Platform | Size |
---|
Knowledge base
Duplication jobs to CloudCatalyst may complete with status 0, but if cache fills up, dataloss could result
2020-02-11Severity Possible Data Loss Description Duplication jobs to the cloud using NetBackup CloudCatalyst will complete with status 0, but if the CloudCatalyst cache volume becomes full, data loss can result. Versions Affected NetBackup 8.1 EEB 3958410...