Status code: 13
131 - Connection reset by peer
13:00:05.827  <2> bpbrm read_backup_start: from client ClientA: read client start message
13:00:05.884  <2> bpbrm handle_backup: ADDED FILES TO DB FOR ClientA_1325703601 1 /opt/
13:05:07.496  <16> bpbrm readline: socket read failed: errno = 131 - Connection reset by peer
13:05:07.540  <2> bpbrm brm_sigcld: bpbrm child 16042 exit_status = 13, signal_status = 0
The socket read failure is 5 minutes after the last update to the media server
Firewall has a 300 second idle session timeout configured
One of the client backup ports may have been idle longer than 300 seconds.
The client opens a DATA port for file data to storage device (to bptm / bpdm daemon) and a NAME port for file metadata (to bpbrm daemon).
BPBRM Log message about the 2 sockets opened for the backup
13:00:05.594  <2> bpbrm start_bpcd_stat: DATA_SOCK from bpcr = 10
13:00:05.594  <2> bpbrm start_bpcd_stat: NAME_SOCK from bpcr = 11
The NAME port is usually less active (usually during large file backups) and backup will stop once it tries reading or writing to the closed port.
Extend the 300 second firewall idle session timeout to 7500 seconds or greater
Another option is to change the TCP keepalive setting on the master and media server operating systems to a frequency that is less than the current firewall idle port timeout setting.
Yet another option if the version of NetBackup is 7.5 or greater: enable Resilient Network for the affected client. To enable from the NetBackup administration console: Host Properties of the master server > Resilient Network, click Add, add the client hostname as it appears in the backup policy client list.
....or via command-line:
• Windows: install_path\NetBackup\bin\admincmd
• UNIX: /usr/openv/netbackup/bin/admincmd
# resilient_clients status | on | off <nbuclientname>
ex. resilient_clients on mynbuclientname
NOTE: can specify client name (same as in policy client list), or the client IP address, or a network address. Try to refrain from enabling this for entire network addresses where too many clients (100's of clients) would now be enabled.
About WAN Resiliency
NetBackup, any version.
- network hardware firewall
- or network load balancer hardware