Veritas InfoScale™ 7.4.1 Release Notes - Linux
- Introduction
- Requirements
- Changes introduced in 7.4.1- Changes related to installation and upgrades
- Changes related to licensing
- Changes related to security features
- Changes related to supported configurations
- Changes related to InfoScale in cloud environments
- Changes related to Cluster Server agents
- Changes related to Veritas Volume Manager
- Changes related to Veritas File System
- Changes related to replication
 
- Fixed issues
- Limitations- Virtualization software limitations
- Storage Foundation software limitations- Dynamic Multi-Pathing software limitations
- Veritas Volume Manager software limitations- Snapshot configuration with volumes in shared disk groups and private disk groups is not supported (2801037)
- SmartSync is not supported for Oracle databases running on raw VxVM volumes
- Veritas InfoScale does not support thin reclamation of space on a linked mirror volume (2729563)
- Cloned disks operations not supported for FSS disk groups
- Thin reclamation requests are not redirected even when the ioship policy is enabled (2755982)
- Veritas Operations Manager does not support disk, disk group, and volume state information related to CVM I/O shipping feature (2781126)
 
- Veritas File System software limitations- Limitations while managing Docker containers
- Linux I/O Scheduler for Database Workloads
- Recommended limit of number of files in a directory
- The vxlist command cannot correctly display numbers greater than or equal to 1 EB
- Limitations with delayed allocation for extending writes feature
- FlashBackup feature of NetBackup 7.5 (or earlier) does not support disk layout Version 8, 9, or 10
- Compressed files that are backed up using NetBackup 7.1 or prior become uncompressed when you restore the files
- On SUSE, creation of a SmartIO cache of VxFS type hangs on Fusion-io device (3200586)
- A NetBackup restore operation on VxFS file systems does not work with SmartIO writeback caching
- VxFS file system writeback operation is not supported with volume level replication or array level replication
 
- SmartIO software limitations
 
- Replication software limitations
- Cluster Server software limitations- Limitations related to bundled agents- GoogleIP service group comes online even though OverlayIP resource is already online outside the cluster
- Programs using networked services may stop responding if the host is disconnected
- Volume agent clean may forcibly stop volume resources
- False concurrency violation when using PidFiles to monitor application resources
- Mount agent limitations
- Share agent limitations
- Volumes in a disk group start automatically irrespective of the value of the StartVolumes attribute in VCS [2162929]
- Application agent limitations
- Campus cluster fire drill does not work when DSM sites are used to mark site boundaries [3073907]
- Mount agent reports resource state as OFFLINE if the configured mount point does not exist [3435266]
- Limitation of VMwareDisks agent to communicate with the vCenter Server [3528649]
- NFSRestart agent: In NFSv3, lock recovery is not supported with multiple NFS share service groups
 
- Limitations related to VCS engine- Loads fail to consolidate and optimize when multiple groups fault [3074299]
- Preferred fencing ignores the forecasted available capacity [3077242]
- Failover occurs within the SystemZone or site when BiggestAvailable policy is set [3083757]
- Load for Priority groups is ignored in groups with BiggestAvailable and Priority in the same group[3074314]
 
- Veritas cluster configuration wizard limitations
- Limitations related to IMF
- Limitations related to the VCS database agents
- Security-Enhanced Linux is not supported on SLES distributions
- Systems in a cluster must have same system locale setting
- VxVM site for the disk group remains detached after node reboot in campus clusters with fire drill [1919317]
- Limitations with DiskGroupSnap agent [1919329]
- System reboot after panic
- Host on RHEV-M and actual host must match [2827219]
- Cluster Manager (Java console) limitations
- Limitations related to LLT
- Limitations related to I/O fencing- Preferred fencing limitation when VxFEN activates RACER node re-election
- Stopping systems in clusters with I/O fencing configured
- Uninstalling VRTSvxvm causes issues when VxFEN is configured in SCSI3 mode with dmp disk policy (2522069)
- Node may panic if HAD process is stopped by force and then node is shut down or restarted [3640007]
 
- Limitations related to global clusters
- Clusters must run on VCS 6.0.5 and later to be able to communicate after upgrading to 2048 bit key and SHA256 signature certificates [3812313]
 
- Limitations related to bundled agents
- Storage Foundation Cluster File System High Availability software limitations
- Storage Foundation for Oracle RAC software limitations- Supportability constraints for normal or high redundancy ASM disk groups with CVM I/O shipping and FSS (3600155)
- Limitations of CSSD agent
- Oracle Clusterware/Grid Infrastructure installation fails if the cluster name exceeds 14 characters
- SELinux supported in disabled and permissive modes only
- Policy-managed databases not supported by CRSResource agent
- Health checks may fail on clusters that have more than 10 nodes
- Cached ODM not supported in Veritas InfoScale environments
 
- Storage Foundation for Databases (SFDB) tools software limitations
- Storage Foundation for Sybase ASE CE software limitations
 
- Known issues- Issues related to installation and upgrade- InfoScale 7.3.1 behavior on RHEL 7.4.(3929407)
- Installer fails on SLES 11 and SLES 12 for older NTP versions (3912493)
- Switch fencing in enable or disable mode may not take effect if VCS is not reconfigured [3798127]
- During an upgrade process, the AMF_START or AMF_STOP variable values may be inconsistent [3763790]
- Stopping the installer during an upgrade and then resuming the upgrade might freeze the service groups (2574731)
- The uninstaller does not remove all scripts (2696033)
- NetBackup 6.5 or older version is installed on a VxFS file system (2056282)
- Error messages in syslog (1630188)
- Ignore certain errors after an operating system upgrade - after a product upgrade with encapsulated boot disks (2030970)
- After a locale change restart the vxconfig daemon (2417547, 2116264)
- Dependency may get overruled when uninstalling multiple RPMs in a single command [3563254]
- Resource faults during Rolling upgrade due to perl changes (3930605)
- When using the response file, the installer must not proceed with the installation or upgrade, if you have not provided edge server details (3964335)
- Unable to update edge server details by running the installer (3964611)
 
- Storage Foundation known issues- Dynamic Multi-Pathing known issues
- Veritas Volume Manager known issues- Multiple Issues with Root Disk Encapsulation on RHEL
- Core dump issue after restoration of disk group backup (3909046)
- VxVM tunables not updated on SLES 12 SP2 systems with 4.4 kernel (3916902)
- Failed verifydata operation leaves residual cache objects that cannot be removed (3370667)
- LUNs claimed but not in use by VxVM may report "Device Busy" when it is accessed outside VxVM (3667574)
- If the disk with CDS EFI label is used as remote disk on the cluster node, restarting the vxconfigd daemon on that particular node causes vxconfigd to go into disabled state (3873123)
- Unable to set master on the secondary site in VVR environment if any pending I/O's are on the secondary site (3874873)
- After installing DMP 6.0.1 on a host with the root disk under LVM on a cciss controller, the system is unable to boot using the vxdmp_kernel command [3599030]
- VRAS verifydata command fails without cleaning up the snapshots created [3558199]
- SmartIO VxVM cache invalidated after relayout operation (3492350)
- VxVM fails to create volume by the vxassist(1M) command with maxsize parameter on Oracle Enterprise Linux 6 Update 5 (OEL6U5) [3736647]
- Performance impact when a large number of disks are reconnected (2802698)
- Machine fails to boot after root disk encapsulation on servers with UEFI firmware (1842096)
- device.map must be up to date before doing root disk encapsulation (2202047)
- Veritas Volume Manager (VxVM) might report false serial split brain under certain scenarios (1834513)
- VxVM starts before OS device scan is done (1635274)
- DMP disables subpaths and initiates failover when an iSCSI link is failed and recovered within 5 seconds. (2100039)
- During system boot, some VxVM volumes fail to mount (2622979)
- Removing an array node from an IBM Storwize V7000 storage system also removes the controller (2816589)
- Continuous trespass loop when a CLARiiON LUN is mapped to a different host than its snapshot (2761567)
- Disk group import of BCV LUNs using -o updateid and -ouseclonedev options is not supported if the disk group has mirrored volumes with DCO or has snapshots (2831658)
- After devices that are managed by EMC PowerPath lose access to storage, Veritas Volume Manager commands are delayed (2757198)
- vxresize does not work with layered volumes that have multiple plexes at the top level (3301991)
- Running the vxdisk disk set clone=off command on imported clone disk group luns results in a mix of clone and non-clone disks (3338075)
- vxunroot cannot encapsulate a root disk when the root partition has XFS mounted on it (3614362)
- Restarting the vxconfigd daemon on the slave node after a disk is removed from all nodes may cause the disk groups to be disabled on the slave node (3591019)
- DMP panics if a DDL device discovery is initiated immediately after loss of connectivity to the storage (2040929)
- Failback to primary paths does not occur if the node that initiated the failover leaves the cluster (1856723)
- Issues if the storage connectivity to data disks is lost on a CVM slave node while vxconfigd was not running on the node (2562889)
- The vxcdsconvert utility is supported only on the master node (2616422)
- Re-enabling connectivity if the disks are in local failed (lfailed) state (2425977)
- Issues with the disk state on the CVM slave node when vxconfigd is restarted on all nodes (2615680)
- Plex synchronization is not completed after resuming synchronization on a new master when the original master lost connectivity (2788077)
- A master node is not capable of doing recovery if it cannot access the disks belonging to any of the plexes of a volume (2764153)
- CVM fails to start if the first node joining the cluster has no connectivity to the storage (2787713)
- CVMVolDg agent may fail to deport CVM disk group when CVMDeportOnOffline is set to 1
- cvm_clus resource goes into faulted state after the resource is manually panicked and rebooted in a 32 node cluster (2278894)
- DMP uses OS device physical path to maintain persistence of path attributes from 6.0 [3761441]
- The vxsnap print command shows incorrect value for percentage dirty [2360780]
- Systems may panic after GPT disk resize operation (3930664)
- If LVM volume group has mirror volume, the conversion operation to VxVM fails (3930536)
- If recovery of columns on EC volumes fails, recovery of other columns on the other volumes also fails (3930435)
- Restarting vxconfigd during relayout operation causes the volume to go in an intermediate state.(3959429)
 
- Virtualization known issues- Configuring application for high availability with storage using VCS wizard may fail on a VMware virtual machine which is configured with more than two storage controllers [3640956]
- Host fails to reboot when the resource gets stuck in ONLINE|STATE UNKNOWN state [2738864]
- VM state is in PAUSED state when storage domain is inactive [2747163]
- Switching KVMGuest resource fails due to inadequate swap space on the other host [2753936]
- Policies introduced in SLES 11SP2 may block graceful shutdown if a VM in SUSE KVM environment [2792889]
- Load on libvirtd may terminate it in SUSE KVM environment [2824952]
- Offline or switch of KVMGuest resource fails if the VM it is monitoring is undefined [2796817]
- Increased memory usage observed even with no VM running [2734970]
- Resource faults when it fails to ONLINE VM beacuse of insufficient swap percentage [2827214]
- Migration of guest VM on native LVM volume may cause libvirtd process to terminate abruptly (2582716)
- Virtual machine may return the not-responding state when the storage domain is inactive and the data center is down (2848003)
- Guest virtual machine may fail on RHEL 6.1 if KVM guest image resides on CVM-CFS [2659944]
- System panics after starting KVM virtualized guest or initiating KVMGuest resource online [2337626]
- CD ROM with empty file vmPayload found inside the guest when resource comes online [3060910]
- VCS fails to start virtual machine on another node if the first node panics [3042806]
- VM fails to start on the target node if the source node panics or restarts during migration [3042786]
- High Availability tab does not report LVMVolumeGroup resources as online [2909417]
- Cluster communication breaks when you revert a snapshot in VMware environment [3409586]
- VCS may detect the migration event during the regular monitor cycle due to the timing issue [2827227]
 
- Veritas File System known issues- Cfsmount test fails with error logs that inaccessible block device path for the file system (3873325)
- The VxFS file system with local scope enabled may hang if two or more nodes are restarted simultaneously (3944891)
- Docker does not recognize VxFS backend file system
- On RHEL7 onwards, Pluggable Authentication Modules(PAM) related error messages for Samba daemon might occur in system logs [3765921]
- Delayed allocation may be turned off automatically when one of the volumes in a multi-volume file system nears 100%(2438368)
- The file system deduplication operation fails with the error message "DEDUP_ERROR Error renaming X checkpoint to Y checkpoint on filesystem Z error 16" (3348534)
- After upgrading a file system using the vxupgrade(1M) command, the sfcache(1M) command with the stat option shows garbage value on the secondary node. [3759788]
- XFS file system is not supported for RDE
- The command tab auto-complete fails for the /dev/vx/ file tree; specifically for RHEL 7 (3602082)
- Task blocked messages display in the console for RHEL5 and RHEL6 (2560357)
- Deduplication can fail with error 110 (3741016)
- System unable to select ext4 from the file system (2691654)
- The system panics with the panic string "kernel BUG at fs/dcache.c:670!" (3323152)
- A restored volume snapshot may be inconsistent with the data in the SmartIO VxFS cache (3760219)
- When in-place and relocate compression rules are in the same policy file, file relocation is unpredictable (3760242)
- During a deduplication operation, the spoold script fails to start (3196423)
- The file system may hang when it has compression enabled (3331276)
- "rpc.statd" in the "nfs-utils" RPM in the various Linux distributions does not properly cleanse the untrusted format strings (3335691)
- Mount agent type resource goes into faulted state if SELinux is enabled on RHEL 6.x (3945714)
 
 
- Replication known issues- The secondary vradmind may appear hung and the vradmin commands may fail (3940842,3944301)
- Data corruption may occur if you perform a rolling upgrade of InfoScale Storage or InfoScale Enterprise from 7.3.1 or earlier to 7.4 or later during replication (3951527)
- vradmind may appear hung or may fail for the role migrate operation (3968642, 3968641)
- After the product upgrade on secondary site, replication may fail to resume with "Secondary SRL missing" error [3931763]
- vradmin repstatus command reports secondary host as "unreachable"(3896588)
- RVGPrimary agent operation to start replication between the original Primary and the bunker fails during failback (2036605)
- A snapshot volume created on the Secondary, containing a VxFS file system may not mount in read-write mode and performing a read-write mount of the VxFS file systems on the new Primary after a global clustering site failover may fail [3761497]
- In an IPv6-only environment RVG, data volumes or SRL names cannot contain a colon (1672410, 1672417, 1825031)
- vxassist relayout removes the DCM (145413)
- vradmin functionality may not work after a master switch operation [2158679]
- Cannot relayout data volumes in an RVG from concat to striped-mirror (2129601)
- vradmin verifydata operation fails when replicating between versions 5.1 and 6.0 or later (2360713)
- vradmin verifydata may report differences in a cross-endian environment (2834424)
- vradmin verifydata operation fails if the RVG contains a volume set (2808902)
- Plex reattach operation fails with unexpected kernel error in configuration update (2791241)
- Bunker replay does not occur with volume sets (3329970)
- SmartIO does not support write-back caching mode for volumes configured for replication by Volume Replicator (3313920)
- During moderate to heavy I/O, the vradmin verifydata command may falsely report differences in data (3270067)
- The vradmin repstatus command does not show that the SmartSync feature is running [3343141]
- While vradmin commands are running, vradmind may temporarily lose heartbeats (3347656, 3724338)
- Write I/Os on the primary logowner may take a long time to complete (2622536)
- DCM logs on a disassociated layered data volume results in configuration changes or CVM node reconfiguration issues (3582509)
- After performing a CVM master switch on the secondary node, both rlinks detach (3642855)
- vradmin -g dg repstatus rvg displays the following configuration error: vradmind not reachable on cluster peer (3648854)
- The RVGPrimary agent may fail to bring the application service group online on the new Primary site because of a previous primary-elect operation not being run or not completing successfully (3761555, 2043831)
- A snapshot volume created on the Secondary, containing a VxFS file system may not mount in read-write mode and performing a read-write mount of the VxFS file systems on the new Primary after a global clustering site failover may fail (1558257)
- DCM plex becomes inaccessible and goes into DISABLED(SPARSE) state in case of node failure. (3931775)
- Initial autosync operation takes a long time to complete for data volumes larger than 3TB (3966713)
 
- Cluster Server known issues- Operational issues for VCS- LVM SG transition fails in all paths disabled status [2081430]
- SG goes into Partial state if Native LVMVG is imported and activated outside VCS control
- Switching service group with DiskGroup resource causes reservation conflict with UseFence set to SCSI3 and powerpath environment set [2749136]
- Stale NFS file handle on the client across failover of a VCS service group containing LVMLogicalVolume resource (2016627)
- NFS cluster I/O fails when storage is disabled [2555662]
- VVR configuration may go in a primary-primary configuration when the primary node crashes and restarts [3314749]
- CP server does not allow adding and removing HTTPS virtual IP or ports when it is running [3322154]
- CP server does not support IPv6 communication with HTTPS protocol [3209475]
- VCS fails to stop volume due to a transaction ID mismatch error [3292840]
- Some VCS components do not work on the systems where a firewall is configured to block TCP traffic [3545338]
 
- Issues related to the VCS engine- Invalid argument message in the message log due to Red Hat Linux bug (3872083)
- Extremely high CPU utilization may cause HAD to fail to heartbeat to GAB [1744854]
- The hacf -cmdtocf command generates a broken main.cf file [1919951]
- Trigger does not get executed when there is more than one leading or trailing slash in the triggerpath [2368061]
- Service group is not auto started on the node having incorrect value of EngineRestarted [2653688]
- Group is not brought online if top level resource is disabled [2486476]
- NFS resource goes offline unexpectedly and reports errors when restarted [2490331]
- Parent group does not come online on a node where child group is online [2489053]
- Cannot modify temp attribute when VCS is in LEAVING state [2407850]
- Service group may fail to come online after a flush and a force flush operation [2616779]
- Elevated TargetCount prevents the online of a service group with hagrp -online -sys command [2871892]
- Auto failover does not happen in case of two successive primary and secondary cluster failures [2858187]
- GCO clusters remain in INIT state [2848006]
- The ha commands may fail for non-root user if cluster is secure [2847998]
- Running -delete -keys for any scalar attribute causes core dump [3065357]
- Veritas InfoScale enters into admin_wait state when Cluster Statistics is enabled with load and capacity defined [3199210]
- Agent reports incorrect state if VCS is not set to start automatically and utmp file is empty before VCS is started [3326504]
- VCS crashes if feature tracking file is corrupt [3603291]
- RemoteGroup agent and non-root users may fail to authenticate after a secure upgrade [3649457]
- Global Cluster Option (GCO) require NIC names in specific format [3641586]
- If you disable security before upgrading VCS to version 7.0.1 or later on secured clusters, the security certificates will not be upgraded to 2048 bit SHA2 [3812313]
- Java console and CLI do not allow adding VCS user names starting with '_' character (3870470)
 
- Issues related to the bundled agents- Mount resource incorrectly goes into ONLINE|STOPPING state when bind mounts are configured on RHEL 7
- Mounting an NFSv4 volume on the NFS client side fails
- KVMGuest resource fails to work on VCS agent for RHEV3.5 (3873800)
- LVM Logical Volume will be auto activated during I/O path failure [2140342]
- KVMGuest monitor entry point reports resource ONLINE even for corrupted guest or with no OS installed inside guest [2394235]
- Concurrency violation observed during migration of monitored virtual machine [2755936]
- LVM logical volume may get stuck with reiserfs file system on SLES11 [2120133]
- KVMGuest resource comes online on failover target node when started manually [2394048]
- IMF registration fails for Mount resource if the configured MountPoint path contains spaces [2442598]
- DiskGroup agent is unable to offline the resource if volume is unmounted outside VCS
- RemoteGroup agent does not failover in case of network cable pull [2588807]
- VVR setup with FireDrill in CVM environment may fail with CFSMount Errors [2564411]
- CoordPoint agent remains in faulted state [2852872]
- RVGsnapshot agent does not work with volume sets created using vxvset [2553505]
- No log messages in engine_A.log if VCS does not find the Monitor program [2563080]
- KVMGuest agent fails to recognize paused state of the VM causing KVMGuest resource to fault [2796538]
- Concurrency violation observed when host is moved to maintenance mode [2735283]
- Logical volume resources fail to detect connectivity loss with storage when all paths are disabled in KVM guest [2871891]
- Resource does not appear ONLINE immediately after VM appears online after a restart [2735917]
- Unexpected behavior in VCS observed while taking the disk online [3123872]
- LVMLogicalVolume agent clean entry point fails to stop logical volume if storage connectivity is lost [3118820]
- VM goes into paused state if the source node loses storage connectivity during migration [3085214]
- Virtual machine goes to paused state during migration if the public network cable is pulled on the destination node [3080930]
- NFS client reports I/O error because of network split brain [3257399]
- Manual configuration of RHEVMInfo attribute of KVMGuest agent requires all its keys to be configured [3277994]
- SambaServer agent may generate core on Linux if LockDir attribute is changed to empty value while agent is running [3339231]
- Independent Persistent disk setting is not preserved during failover of virtual disks in VMware environment [3338702]
- LVMLogicalVolume resource goes in UNABLE TO OFFLINE state if native LVM volume group is exported outside VCS control [3606516]
- DiskGroup resource online may take time if it is configured along with VMwareDisks resource [3638242]
- SFCache Agent fails to enable caching if cache area is offline [3644424]
- RemoteGroup agent may stop working on upgrading the remote cluster in secure mode [3648886]
- VMwareDisks agent may fail to start or storage discovery may fail if SELinux is running in enforcing mode [3106376]
 
- Issues related to the VCS database agents- Unsupported startup options with systemD enabled [3901204]
- VCS ASMDG resource status does not match the Oracle ASMDG resource status (3962416)
- ASMDG agent does not go offline if the management DB is running on the same (3856460)
- ASMDG on a particular does not go offline if its instances is being used by other database instances (3856450)
- Sometimes ASMDG reports as offline instead of faulted (3856454)
- The ASMInstAgent does not support having pfile/spfile for the ASM Instance on the ASM diskgroups
- VCS agent for ASM: Health check monitoring is not supported for ASMInst agent
- NOFAILOVER action specified for certain Oracle errors
- Oracle agent fails to offline pluggable database (PDB) resource with PDB in backup mode [3592142]
- Clean succeeds for PDB even as PDB staus is UNABLE to OFFLINE [3609351]
- Second level monitoring fails if user and table names are identical [3594962]
- Monitor entry point times out for Oracle PDB resources when CDB is moved to suspended state in Oracle 12.1.0.2 [3643582]
- Oracle agent fails to come online and monitor Oracle instance if threaded_execution parameter is set to true (3644425)
 
- Issues related to the agent framework- Agent framework cannot handle leading and trailing spaces for the dependent attribute (2027896)
- The agent framework does not detect if service threads hang inside an entry point [1442255]
- IMF related error messages while bringing a resource online and offline [2553917]
- Delayed response to VCS commands observed on nodes with several resources and system has high CPU usage or high swap usage [3208239]
- CFSMount agent may fail to heartbeat with VCS engine and logs an error message in the engine log on systems with high memory load [3060779]
- Logs from the script executed other than the agent entry point goes into the engine logs [3547329]
- VCS fails to process the hares -add command resource if the resource is deleted and subsequently added just after the VCS process or the agent's process starts (3813979)
 
- Cluster Server agents for Volume Replicator known issues
- Issues related to Intelligent Monitoring Framework (IMF)- Registration error while creating a Firedrill setup [2564350]
- IMF does not provide notification for a registered disk group if it is imported using a different name (2730774)
- Direct execution of linkamf displays syntax error [2858163]
- Error messages displayed during reboot cycles [2847950]
- Error message displayed when ProPCV prevents a process from coming ONLINE to prevent concurrency violation does not have I18N support [2848011]
- AMF displays StartProgram name multiple times on the console without a VCS error code or logs [2872064]
- Core dump observed when amfconfig is run with set and reset commands simultaneously [2871890]
- VCS engine shows error for cancellation of reaper when Apache agent is disabled [3043533]
- Terminating the imfd daemon orphans the vxnotify process [2728787]
- Agent cannot become IMF-aware with agent directory and agent file configured [2858160]
- ProPCV fails to prevent a script from running if it is run with relative path [3617014]
 
- Issues related to global clusters
- Issues related to the Cluster Manager (Java Console)
- VCS Cluster Configuration wizard issues- VCS Cluster Configuration wizard does not automatically close in Mozilla Firefox [3281450]
- Configuration inputs page of VCS Cluster Configuration wizard shows multiple cluster systems for the same virtual machine [3237023]
- VCS Cluster Configuration wizard fails to display mount points on native LVM if volume groups are exported [3341937]
- IPv6 verification fails while configuring generic application using VCS Cluster Configuration wizard [3614680]
- InfoScale Enterprise: Unable to configure clusters through the VCS Cluster Configuration wizard (3911694)
 
- LLT known issues- LLT connections are not formed when a vlan is configured on a NIC (2484856)
- If you manually re-plumb (change) the IP address on a network interface card (NIC) which is used by LLT, then LLT may experience heartbeat loss and the node may panic (3188950)
- A network restart of the network interfaces may cause heartbeat loss for the NIC interfaces used by LLT
- Performance degradation occurs when RDMA connection between nodes is down [3877863]
- After configuring LLT over UDP using IPV6, one of the configured link may show DOWN status for lltstat command [3916374]
- When using FSS over RDMA links during heavy IO, LLT may face link fluctuations [3907179]
- The LLT window may drop to a very low value in CVM/FSS or CFS environment [3914954]
- When using response files for LLT configuration over UDP, the nodes become unresponsive (3946836)
- LLT causes node to panic during TCP connection failure when incomplete packets are received (3944294)
 
- I/O fencing known issues- Fencing port b is visible for few seconds even if cluster nodes have not registered with CP server (2415619)
- The cpsadm command fails if LLT is not configured on the application cluster (2583685)
- The vxfenswap utility does not detect failure of coordination points validation due to an RSH limitation (2531561)
- Hostname and username are case sensitive in CP server (2846392)
- Server-based fencing comes up incorrectly if default port is not mentioned (2403453)
- The vxfenswap utility deletes comment lines from the /etc/vxfemode file, if you run the utility with hacli option (3318449)
- The vxfentsthdw utility may not run on systems installed with partial SFHA stack [3333914]
- When a client node goes down, for reasons such as node panic, I/O fencing does not come up on that client node after node restart (3341322)
- VCS fails to take virtual machines offline while restarting a physical host in RHEV and KVM environments (3320988)
- Fencing may panic the node while shut down or restart when LLT network interfaces are under Network Manager control [3627749]
- The vxfenconfig -l command output does not list Coordinator disks that are removed using the vxdmpadm exclude dmpnodename=<dmp_disk/node> command [3644431]
- The CoordPoint agent faults after you detach or reattach one or more coordination disks from a storage array (3317123)
 
 
- Operational issues for VCS
- Storage Foundation and High Availability known issues- Cache area is lost after a disk failure (3158482)
- Installer exits upgrade to 5.1 RP1 with Rolling Upgrade error message (1951825, 1997914)
- In an IPv6 environment, db2icrt and db2idrop commands return a segmentation fault error during instance creation and instance removal (1602444)
- Process start-up may hang during configuration using the installer (1678116)
- Not all the objects are visible in the VOM GUI (1821803)
- An error message is received when you perform off-host clone for RAC and the off-host node is not part of the CVM cluster (1834860)
- A volume's placement class tags are not visible in the Veritas Enterprise Administrator GUI when creating a dynamic storage tiering placement policy (1880081)
 
- Storage Foundation Cluster File System High Availability known issues- Transaction hangs when multiple plex-attach or add-mirror operations are triggered on the same volume (3969500)
- In an FSS environment, creation of mirrored volumes may fail for SSD media [3932494]
- Mount command may fail to mount the file system (3913246)
- After the local node restarts or panics, the FSS service group cannot be online successfully on the local node and the remote node when the local node is up again (3865289)
- In the FSS environment, if DG goes to the dgdisable state and deep volume monitoring is disabled, successive node joins fail with error 'Slave failed to create remote disk: retry to add a node failed' (3874730)
- DG creation fails with error "V-5-1-585 Disk group punedatadg: cannot create: SCSI-3 PR operation failed" on the VSCSI disks (3875044)
- Write back cache is not supported on the cluster in FSS scenario [3723701]
- CVMVOLDg agent is not going into the FAULTED state. [3771283]
- On CFS, SmartIO is caching writes although the cache appears as nocache on one node (3760253)
- Unmounting the checkpoint using cfsumount(1M) may fail if SElinux is in enforcing mode (3766074)
- tail -f run on a cluster file system file only works correctly on the local node [3741020]
- In SFCFS on Linux, stack may overflow when the system creates ODM file [3758102]
- CFS commands might hang when run by non-root (3038283)
- The fsappadm subfilemove command moves all extents of a file (3258678)
- Certain I/O errors during clone deletion may lead to system panic. (3331273)
- Panic due to null pointer de-reference in vx_bmap_lookup() (3038285)
- In a CFS cluster, that has multi-volume file system of a small size, the fsadm operation may hang (3348520)
 
- Storage Foundation for Oracle RAC known issues- Oracle RAC known issues
- Storage Foundation Oracle RAC issues- CSSD configuration fails if OCR and voting disk volumes are located on Oracle ASM (3914497)
- When you upgrade to SF Oracle RAC 7.1, VxFS may fail to stop (3872605)
- ASM disk groups configured with normal or high redundancy are dismounted if the CVM master panics due to network failure in FSS environment or if CVM I/O shipping is enabled (3600155)
- PrivNIC and MultiPrivNIC agents not supported with Oracle RAC 11.2.0.2 and later versions
- CSSD agent forcibly stops Oracle Clusterware if Oracle Clusterware fails to respond (3352269)
- Intelligent Monitoring Framework (IMF) entry point may fail when IMF detects resource state transition from online to offline for CSSD resource type (3287719)
- Node fails to join the SF Oracle RAC cluster if the file system containing Oracle Clusterware is not mounted (2611055)
- The vxconfigd daemon fails to start after machine reboot (3566713)
- Health check monitoring fails with policy-managed databases (3609349)
- CVMVolDg agent may fail to deport CVM disk group
- Rolling upgrade not supported for upgrades from SF Oracle RAC 5.1 SP1 with fencing configured in dmpmode.
- "Configuration must be ReadWrite : Use haconf -makerw" error message appears in VCS engine log when hastop -local is invoked (2609137)
- Veritas Volume Manager can not identify Oracle Automatic Storage Management (ASM) disks (2771637)
- vxdisk resize from slave nodes fails with "Command is not supported for command shipping" error (3140314)
- CVR configurations are not supported for Flexible Storage Sharing (3155726)
- CVM requires the T10 vendor provided ID to be unique (3191807)
- SG_IO ioctl hang causes disk group creation, CVM node joins, and storage connects/disconnects, and vxconfigd to hang in the kernel (3193119)
- vxdg adddisk operation fails when adding nodes containing disks with the same name (3301085)
- FSS Disk group creation with 510 exported disks from master fails with Transaction locks timed out error (3311250)
- vxconfigrestore is unable to restore FSS cache objects in the pre-commit stage (3461928)
- Change in naming scheme is not reflected on nodes in an FSS environment (3589272)
- Intel SSD cannot be initialized and exported (3584762)
- VxVM may report false serial split brain under certain FSS scenarios (3565845)
 
 
- Storage Foundation for Databases (SFDB) tools known issues- Clone operations fail for instant mode snapshot (3916053)
- Sometimes SFDB may report the following error message: SFDB remote or privileged command error (2869262)
- SFDB commands do not work in IPV6 environment (2619958)
- When you attempt to move all the extents of a table, the dbdst_obj_move(1M) command fails with an error (3260289)
- Attempt to use SmartTier commands fails (2332973)
- Attempt to use certain names for tiers results in error (2581390)
- Clone operation failure might leave clone database in unexpected state (2512664)
- Clone command fails if PFILE entries have their values spread across multiple lines (2844247)
- Clone command errors in a Data Guard environment using the MEMORY_TARGET feature for Oracle 11g (1824713)
- Clone fails with error "ORA-01513: invalid current time returned by operating system" with Oracle 11.2.0.3 (2804452)
- Data population fails after datafile corruption, rollback, and restore of offline checkpoint (2869259)
- Flashsnap clone fails under some unusual archivelog configuration on RAC (2846399)
- In the cloned database, the seed PDB remains in the mounted state (3599920)
- Cloning of a container database may fail after a reverse resync commit operation is performed (3509778)
- If one of the PDBs is in the read-write restricted state, then cloning of a CDB fails (3516634)
- Cloning of a CDB fails for point-in-time copies when one of the PDBs is in the read-only mode (3513432)
- If a CDB has a tablespace in the read-only mode, then the cloning fails (3512370)
- SFDB commands fail when an SFDB installation with authentication configured is upgraded to InfoScale 7.4.1 (3644030)
- Benign message displayed upon execution of vxsfadm -a oracle -s filesnap -o destroyclone (3901533)
 
- Storage Foundation for Sybase ASE CE known issues- Sybase Agent Monitor times out (1592996)
- Installer warning (1515503)
- Unexpected node reboot while probing a Sybase resource in transition (1593605)
- Unexpected node reboot when invalid attribute is given (2567507)
- "Configuration must be ReadWrite : Use haconf -makerw" error message appears in VCS engine log when hastop -local is invoked (2609137)
 
- Application isolation feature known Issues- Addition of an Oracle instance using Oracle GUI (dbca) does not work with Application Isolation feature enabled
- Auto reattach of detached plexes may not happen for FSS disk groups when auto-mapping feature is used (3902004)
- CPI is not supported for configuring the application isolation feature (3902023)
- Thin reclamation does not happen for remote disks if the storage node or the disk owner does not have the file system mounted on it (3902009)
 
- Cloud deployment known issues- Systems in GCP may get stuck in the LEAVING state when multiple nodes are restarted a cascaded manner
- An error occurs during VVR or CVR configuration when alias IPs are assigned to GCP VM instances (3965275)
- In an Azure environment, the systems under InfoScale control may panic due to CPU soft lockup [3929534]
- In an Azure environment, an InfoScale cluster node may panic if any of the node is rebooted using Azure portal [3930926]
- If you disable a public IP from the Azure portal, the corresponding AzureIP resource goes into UNKNOWN state [3928222]
 
- Issues related to Veritas InfoScale Storage in Amazon Web Services cloud environments- Incorrect media type displayed for AWS EC2 volumes
- Inconsistencies in instance store volumes
- Stale remote disks on some nodes after failure of vxdisk unexport operation
- UDID of AWS volumes not updated after migration
- Partial detachment of volumes from AWS console
- Crash dump logs not available when EC2 instances crash
- vxcloudd daemon fails with a core dump when the bucket name on the target exceeds 32 characters (3916980)
- Migration of data to cloud volumes using S3 Connector fails with core dump (3915555)
 
 
- Issues related to installation and upgrade
Delayed response to VCS commands observed on nodes with several resources and system has high CPU usage or high swap usage [3208239]
You may experience a delay of several minutes in the VCS response to commands if you configure large number of resources for monitoring on a VCS node and if the CPU usage is close to 100 percent or swap usage is very high.
Some of the commands are mentioned below:
- # hares -online 
- # hares -offline 
- # hagrp -online 
- # hagrp -offline 
- # hares -switch 
The delay occurs as the related VCS agent does not get enough CPU bandwidth to process your command. The agent may also be busy processing large number of pending internal commands (such as periodic monitoring of each resource).
Workaround: Change the values of some VCS agent type attributes which are facing the issue and restore the original attribute values after the system returns to the normal CPU load.
- Back up the original values of attributes such as MonitorInterval, OfflineMonitorInterval, and MonitorFreq of IMF attribute.
- If the agent does not support Intelligent Monitoring Framework (IMF), increase the value of MonitorInterval and OfflineMonitorInterval attributes.# haconf -makerw # hatype -modify <TypeName> MonitorInterval <value> # hatype -modify <TypeName> OfflineMonitorInterval <value> # haconf -dump -makero Where <TypeName> is the name of the agent with which you are facing delays and <value> is any numerical value appropriate for your environment. 
- If the agent supports IMF, increase the value of MonitorFreq attribute of IMF.# haconf -makerw # hatype -modify <TypeName> IMF -update MonitorFreq <value> # haconf -dump -makero Where <value> is any numerical value appropriate for your environment. 
- Wait for several minutes to ensure that VCS has executed all pending commands, and then execute any new VCS command.
- If the delay persists, repeat step 2 or 3 as appropriate.
- If the CPU usage returns to normal limits, revert the attribute changes to the backed up values to avoid the delay in detecting the resource fault.