NetBackup™ Deployment Guide for Kubernetes Clusters
- Introduction
- Section I. Configurations- Prerequisites
- Recommendations and Limitations
- Configurations
- Configuration of key parameters in Cloud Scale deployments- Tuning touch files
- Setting maximum jobs per client
- Setting maximum jobs per media server
- Enabling intelligent catalog archiving
- Enabling security settings
- Configuring email server
- Reducing catalog storage management
- Configuring zone redundancy
- Enabling client-side deduplication capabilities
- Parameters for logging (fluentbit)
 
 
- Section II. Deployment
- Section III. Monitoring and Management- Monitoring NetBackup
- Monitoring Snapshot Manager
- Monitoring fluentbit
- Monitoring MSDP Scaleout
- Managing NetBackup
- Managing the Load Balancer service
- Managing PostrgreSQL DBaaS
- Managing fluentbit
- Performing catalog backup and recovery
 
- Section IV. Maintenance- PostgreSQL DBaaS Maintenance
- Patching mechanism for primary, media servers, fluentbit pods, and postgres pods
- Upgrading
- Cloud Scale Disaster Recovery
- Uninstalling
- Troubleshooting- Troubleshooting AKS and EKS issues- View the list of operator resources
- View the list of product resources
- View operator logs
- View primary logs
- Socket connection failure
- Resolving an issue where external IP address is not assigned to a NetBackup server's load balancer services
- Resolving the issue where the NetBackup server pod is not scheduled for long time
- Resolving an issue where the Storage class does not exist
- Resolving an issue where the primary server or media server deployment does not proceed
- Resolving an issue of failed probes
- Resolving token issues
- Resolving an issue related to insufficient storage
- Resolving an issue related to invalid nodepool
- Resolving a token expiry issue
- Resolve an issue related to KMS database
- Resolve an issue related to pulling an image from the container registry
- Resolving an issue related to recovery of data
- Check primary server status
- Pod status field shows as pending
- Ensure that the container is running the patched image
- Getting EEB information from an image, a running container, or persistent data
- Resolving the certificate error issue in NetBackup operator pod logs
- Pod restart failure due to liveness probe time-out
- NetBackup messaging queue broker take more time to start
- Host mapping conflict in NetBackup
- Issue with capacity licensing reporting which takes longer time
- Local connection is getting treated as insecure connection
- Primary pod is in pending state for a long duration
- Backing up data from Primary server's /mnt/nbdata/ directory fails with primary server as a client
- Storage server not supporting Instant Access capability on Web UI after upgrading NetBackup
- Taint, Toleration, and Node affinity related issues in cpServer
- Operations performed on cpServer in environment.yaml file are not reflected
- Elastic media server related issues
- Failed to register Snapshot Manager with NetBackup
- Post Kubernetes cluster restart, flexsnap-listener pod went into CrashLoopBackoff state or pods were unable to connect to flexsnap-rabbitmq
- Post Kubernetes cluster restart, issues observed in case of containerized Postgres deployment
- Request router logs
- Issues with NBPEM/NBJM
- Issues with logging feature for Cloud Scale
- The flexsnap-listener pod is unable to communicate with RabbitMQ
 
- Troubleshooting AKS-specific issues
- Troubleshooting EKS-specific issues
- Troubleshooting issue for bootstrapper pod
 
- Troubleshooting AKS and EKS issues
 
- Appendix A. CR template
- Appendix B. MSDP Scaleout- About MSDP Scaleout
- Prerequisites for MSDP Scaleout (AKS\EKS)
- Limitations in MSDP Scaleout
- MSDP Scaleout configuration
- Installing the docker images and binaries for MSDP Scaleout (without environment operators or Helm charts)
- Deploying MSDP Scaleout
- Managing MSDP Scaleout
- MSDP Scaleout maintenance
 
About NetBackup operator logs
Note the following about the NetBackup operator logs.
- NetBackup operator logs can be checked using the operator pod logs using the kubectl logs <Netbackup-operator-pod-name> -c netbackup-operator -n <netbackup-opertaor-namespace> command. 
- NetBackup operator provides different log levels that can be changed before deployment of NetBackup operator. - The following log levels are provided: - -1 - Debug 
- 0 - Info 
- 1 - Warn 
- 2 - Error 
 - By default, the log level is 0. - It is recommended to use 0, 1, or 2 log level depending on your requirement. - To change the log level modify the - operators-values.yamlfile and upgrade the operators using the following command:- helm upgrade --install operators operators-10.5.tgz -f operators-values.yaml -n netbackup-operator-system 
- Config-Checker jobs that run before deployment of primary server and media server creates the pod. The logs for config checker executions can be checked using the kubectl logs <configchecker-pod-name> -n <netbackup-operator-namespace> command. 
- Installation logs of NetBackup primary server and media server can be retrieved using any of the following methods: - Run the kubectl logs <PrimaryServer/MediaServer-Pod-Name> -n <PrimaryServer/MediaServer namespace> command. 
- Execute the following command in the primary server/media server pod and check the - /mnt/nblogs/setup-server.logfile:- kubectl exec -it <PrimaryServer/MediaServer-Pod-Name> -n <PrimaryServer/MediaServer-namespace> -- bash 
 
- (AKS-specific) Data migration jobs create the pods that run before deployment of primary server. The logs for data migration execution can be checked using the following command: - kubectl logs <migration-pod-name> -n <netbackup-environment-namespace> 
- Execute the following respective commands to check the event logs that shows deployment logs for PrimaryServer, MediaServer and Request Router: - For primary server: kubectl describe PrimaryServer <PrimaryServer name> -n <PrimaryServer-namespace> 
- For media server: kubectl describe MediaServer<MediaServername> -n<MediaServer-namespace> 
- For request router: kubectl describe pod <request-router-pod-name> -n <PrimaryServer-namespace>