InfoScale™ 9.0 Storage Foundation Cluster File System High Availability Configuration and Upgrade Guide - Linux
- Section I. Introduction to SFCFSHA
- Introducing Storage Foundation Cluster File System High Availability
- Section II. Configuration of SFCFSHA
- Preparing to configure
- Preparing to configure SFCFSHA clusters for data integrity
- About planning to configure I/O fencing
- Setting up the CP server
- Planning your CP server setup
- Installing the CP server using the installer
- Configuring the CP server cluster in secure mode
- Setting up shared storage for the CP server database
- Configuring the CP server using the installer program
- Configuring the CP server manually
- Verifying the CP server configuration
- Configuring SFCFSHA
- Overview of tasks to configure SFCFSHA using the product installer
- Starting the software configuration
- Specifying systems for configuration
- Configuring the cluster name
- Configuring private heartbeat links
- Configuring the virtual IP of the cluster
- Configuring SFCFSHA in secure mode
- Configuring a secure cluster node by node
- Adding VCS users
- Configuring SMTP email notification
- Configuring SNMP trap notification
- Configuring global clusters
- Completing the SFCFSHA configuration
- About the License Audit Tool
- Verifying and updating licenses on the system
- Configuring SFDB
- Configuring SFCFSHA clusters for data integrity
- Setting up disk-based I/O fencing using installer
- Setting up server-based I/O fencing using installer
- Setting up non-SCSI-3 I/O fencing in virtual environments using installer
- Setting up majority-based I/O fencing using installer
- Enabling or disabling the preferred fencing policy
- Performing an automated SFCFSHA configuration using response files
- Performing an automated I/O fencing configuration using response files
- Configuring I/O fencing using response files
- Response file variables to configure disk-based I/O fencing
- Sample response file for configuring disk-based I/O fencing
- Configuring CP server using response files
- Response file variables to configure server-based I/O fencing
- Sample response file for configuring server-based I/O fencing
- Response file variables to configure non-SCSI-3 I/O fencing
- Sample response file for configuring non-SCSI-3 I/O fencing
- Response file variables to configure majority-based I/O fencing
- Sample response file for configuring majority-based I/O fencing
- Manually configuring SFCFSHA clusters for data integrity
- Setting up disk-based I/O fencing manually
- Setting up server-based I/O fencing manually
- Preparing the CP servers manually for use by the SFCFSHA cluster
- Generating the client key and certificates manually on the client nodes
- Configuring server-based fencing on the SFCFSHA cluster manually
- Configuring CoordPoint agent to monitor coordination points
- Verifying server-based I/O fencing configuration
- Setting up non-SCSI-3 fencing in virtual environments manually
- Setting up majority-based I/O fencing manually
- Section III. Upgrade of SFCFSHA
- Planning to upgrade SFCFSHA
- About the upgrade
- Supported upgrade paths
- Transitioning between the InfoScale products
- Considerations for upgrading SFCFSHA to 9.0 on systems configured with an Oracle resource
- Preparing to upgrade SFCFSHA
- Considerations for upgrading REST server
- Using Install Bundles to simultaneously install or upgrade full releases (base, maintenance, rolling patch), and individual patches
- Performing a full upgrade of SFCFSHA using the installer
- Performing a rolling upgrade of SFCFSHA
- Performing a phased upgrade of SFCFSHA
- About phased upgrade
- Performing a phased upgrade using the product installer
- Moving the service groups to the second subcluster
- Upgrading the operating system on the first subcluster
- Upgrading the SFCFSHA stack on the first subcluster
- Preparing the second subcluster
- Activating the first subcluster
- Upgrading the operating system on the second subcluster
- Upgrading the second subcluster
- Completing the phased upgrade
- Performing an automated SFCFSHA upgrade using response files
- Upgrading SFCFSHA using YUM
- Upgrading Volume Replicator
- Upgrading VirtualStore
- Performing post-upgrade tasks
- Resetting DAS disk names to include host name in FSS environments
- Re-joining the backup boot disk group into the current disk group
- Reverting to the backup boot disk group after an unsuccessful upgrade
- CVM master node needs to assume the logowner role for VCS managed VVR resources
- Consideration when KMS is used for volume encryption
- Planning to upgrade SFCFSHA
- Section IV. Post-configuration tasks
- Section V. Configuration of disaster recovery environments
- Section VI. Adding and removing nodes
- Adding a node to SFCFSHA clusters
- About adding a node to a cluster
- Before adding a node to a cluster
- Adding a node to a cluster using the Veritas InfoScale installer
- Adding the node to a cluster manually
- Starting Veritas Volume Manager (VxVM) on the new node
- Configuring cluster processes on the new node
- Setting up the node to run in secure mode
- Starting fencing on the new node
- After adding the new node
- Configuring Cluster Volume Manager (CVM) and Cluster File System (CFS) on the new node
- Configuring the ClusterService group for the new node
- Adding a node using response files
- Configuring server-based fencing on the new node
- Adding nodes to a cluster that is using authentication for SFDB tools
- Updating the Storage Foundation for Databases (SFDB) repository after adding a node
- Sample configuration file for adding a node to the cluster
- Removing a node from SFCFSHA clusters
- About removing a node from a cluster
- Removing a node from a cluster
- Modifying the VCS configuration files on existing nodes
- Modifying the Cluster Volume Manager (CVM) configuration on the existing nodes to remove references to the deleted node
- Removing the node configuration from the CP server
- Removing security credentials from the leaving node
- Updating the Storage Foundation for Databases (SFDB) repository after removing a node
- Sample configuration file for removing a node from the cluster
- Adding a node to SFCFSHA clusters
- Section VII. Configuration and Upgrade reference
- Appendix A. Installation scripts
- Appendix B. Configuration files
- Appendix C. Configuring the secure shell or the remote shell for communications
- About configuring secure shell or remote shell communication modes before installing products
- Manually configuring passwordless ssh
- Setting up ssh and rsh connection using the installer -comsetup command
- Setting up ssh and rsh connection using the pwdutil.pl utility
- Restarting the ssh session
- Enabling rsh for Linux
- Appendix D. High availability agent information
- Appendix E. Sample SFCFSHA cluster setup diagrams for CP server-based I/O fencing
- Appendix F. Configuring LLT over UDP
- Using the UDP layer for LLT
- Manually configuring LLT over UDP using IPv4
- Broadcast address in the /etc/llttab file
- The link command in the /etc/llttab file
- The set-addr command in the /etc/llttab file
- Selecting UDP ports
- Configuring the netmask for LLT
- Configuring the broadcast address for LLT
- Sample configuration: direct-attached links
- Sample configuration: links crossing IP routers
- Using the UDP layer of IPv6 for LLT
- Manually configuring LLT over UDP using IPv6
- About configuring LLT over UDP multiport
- Appendix G. Using LLT over RDMA
- Using LLT over RDMA
- About RDMA over RoCE or InfiniBand networks in a clustering environment
- How LLT supports RDMA capability for faster interconnects between applications
- Using LLT over RDMA: supported use cases
- Configuring LLT over RDMA
- Choosing supported hardware for LLT over RDMA
- Installing RDMA, InfiniBand or Ethernet drivers and utilities
- Configuring RDMA over an Ethernet network
- Configuring RDMA over an InfiniBand network
- Tuning system performance
- Manually configuring LLT over RDMA
- LLT over RDMA sample /etc/llttab
- Verifying LLT configuration
- Troubleshooting LLT over RDMA
- IP addresses associated to the RDMA NICs do not automatically plumb on node restart
- Ping test fails for the IP addresses configured over InfiniBand interfaces
- After a node restart, by default the Mellanox card with Virtual Protocol Interconnect (VPI) gets configured in InfiniBand mode
- The LLT module fails to start
Performing the upgrade
If you plan to upgrade the operating system, perform the following steps:
Note:
On a Cross replication VVR or CVR environment, full upgrade is not supported. Perform a rolling upgrade.
To upgrade the operating system
- Rename the
/etc/llttabfile to prevent LLT from starting automatically when the node starts:# mv /etc/llttab /etc/llttab.save
- Create
install-dbfile to prevent VxVM daemons or processes from starting automatically when the node starts.# touch /etc/vx/reconfig.d/state.d/install-db
- Upgrade the operating system on all nodes in the cluster.
For instructions, see the operating system documentation.
- If you upgraded the operating system, restart the nodes if required:
# shutdown -r now
- After the system restarts, rename the
/etc/llttabfile to its original name:# mv /etc/llttab.save /etc/llttab
- Enable VxVM to start after system restarts.
# rm /etc/vx/reconfig.d/state.d/install-db
To perform the upgrade
Note:
Root Disk Encapsulation (RDE) is not supported on Linux from 7.3.1 onwards.
- Log in as superuser.
- Insert the appropriate media disc per your distribution and architecture into your system's DVD-ROM drive.
- If volume management software is running on your system, the software disc automatically mounts as /mnt/cdrom.
If volume management software is not available to mount the disc, you must mount it manually, enter:
# mount -o ro /dev/cdrom /mnt/cdrom
- Change to the top-level directory on the disc:
# cd /mnt/cdrom
- Verify there are no VxFS file systems mounted on the nodes being upgraded:
# mount -t vxfs
If any VxFS file systems are mounted, offline the group on each node of the cluster:
# hagrp -offline group -sys sys1 # hagrp -offline group -sys sys2 # hagrp -offline group -sys sys3 # hagrp -offline group -sys sys4
where group is the VCS service group that has the CVMVolDg and CFSMount resource.
If VxFS are not managed by VCS then unmount them manually:
# umount mount_point
Repeat this step for each SFCFSHA service group.
- If a cache area is online, you must take the cache area offline before you upgrade the VxVM RPM. Use the following command to take the cache area offline:
# sfcache offline cachename
- Start the upgrade from any node in the cluster. Enter the following command, and then press y to upgrade the cluster configuration.
# ./installer -upgrade
- You are prompted to enter the system names (in the following example, "sys1" and "sys2") on which the software is to be upgraded. Enter the system name or names and then press Return.
Enter the system names separated by spaces: [q, ?] sys1 sys2
- During the initial system check, the installer verifies that communication between systems has been set up.
If the installer hangs or asks for a login password, you have the option to let the installer configure SSH or RSH communications between the systems. If you choose to allow this configuration, select the communication type and provide the root passwords for each system.
- At the prompt, specify whether you accept the terms of the End User License Agreement (EULA).
Do you agree with the terms of the End User License Agreement as specified in the EULA/en/EULA_ENTERPRISE_Ux_9.0.pdf file present on media? [y,n,q,?] y
- The installer discovers if any of the systems that you are upgrading have mirrored and encapsulated boot disks. For each system that has a mirrored boot disk, you have the option to create a backup of the system's boot disk group before the upgrade proceeds. If you want to split the boot disk group to create a backup, answer y.
- The installer then prompts you to name the backup boot disk group. Enter the name for it or press Enter to accept the default.
- If you are prompted to start the split operation. Press y to continue.
Note:
The split operation can take some time to complete.
- Output shows information that SFCFSHA must be stopped on a running system. Enter y to continue.
- The installer displays the following question before the installer stops the product processes if the current cluster is secured and version is prior to 6.2.:
Do you want to grant read access to everyone? [y,n,q,?]
To grant read access to all authenticated users, type y.
To grant usergroup specific permissions, type n.
Do you want to provide any usergroups that you would like to grant read access?[y,n,q,?]
To specify usergroups and grant them read access, type y
To grant read access only to root users, type n. The installer grants read access read access to the root users.
Note:
Separate the usergroup names with spaces. To grant read access to a usergroup on a specific node, specify usergroup as <usergroup>@<node_name>. You can also specify usergroups here and create them later.
- Enter y for summary information and reboots if the boot disk is encapsulated before the upgrade.
Do not remove the log files until the Veritas InfoScale product is working properly on your system. Technical Support will need these log files for debugging purposes.
- Only perform this step if you have split the mirrored root disk to back it up. After a successful reboot, verify the upgrade and re-join the backup disk group. If the upgrade fails, revert to the backup disk group.
See Re-joining the backup boot disk group into the current disk group.
See Reverting to the backup boot disk group after an unsuccessful upgrade.