Clustered File Server unresponsive in a hung state following large number of FSA Placeholder recalls.

Article: 100027644
Last Published: 2014-12-11
Product(s): Enterprise Vault

Problem

When a large number of archived items are retrieved through placeholder recalls, the cluster node on which the file server is online can become unresponsive because of a deadlock.

Cause

Investigation into the cause of the deadlock has led to a functional change in the methods used by the FSA mini-filter driver.

If CSC (Client Side Caching) is running on the client, the file server sees the file I/O requests. On the file server, the Cache Manager’s read-ahead mechanism initiates a paged read operation. As part of this operation, the Cache Manager acquires the PagingIoResource lock in shared mode. The paged read then triggers the PreRead callback in the FSA driver.

In the PreRead or PreWrite callback, the file is recalled and converted back to a normal file. During this conversion, the reparse point is removed by calling FltUntagFile() and the placeholder is replaced with the actual file.

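The call to FltUntagFile() is synchronous and attempts to acquire the same PagingIoResource lock that the Cache Manager already holds. The call can never complete while the read that triggered it is still holding the lock, which results in a deadlock.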
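The lock sequence described above can be illustrated with a small user-mode simulation. This is only a sketch and not driver code: the SharedExclusiveLock class and the paged_read, pre_read_callback and untag_file functions are hypothetical stand-ins, and the sketch assumes that the untag step needs the lock in exclusive mode, which the article does not state. A timeout is used in place of the indefinite wait so that the example terminates.

```python
import threading


class SharedExclusiveLock:
    """Toy shared/exclusive lock (hypothetical stand-in for PagingIoResource)."""

    def __init__(self):
        self._cond = threading.Condition()
        self._shared_holders = 0
        self._exclusive_held = False

    def acquire_shared(self):
        with self._cond:
            while self._exclusive_held:
                self._cond.wait()
            self._shared_holders += 1

    def release_shared(self):
        with self._cond:
            self._shared_holders -= 1
            self._cond.notify_all()

    def acquire_exclusive(self, timeout):
        """Return True if acquired, False if the wait timed out."""
        with self._cond:
            acquired = self._cond.wait_for(
                lambda: not self._exclusive_held and self._shared_holders == 0,
                timeout=timeout,
            )
            if acquired:
                self._exclusive_held = True
            return acquired

    def release_exclusive(self):
        with self._cond:
            self._exclusive_held = False
            self._cond.notify_all()


def untag_file(lock, timeout=0.2):
    """Stand-in for the synchronous untag step.

    Assumption: it needs the lock exclusively. A truly synchronous
    call would wait forever; the timeout only makes the hang visible.
    """
    if not lock.acquire_exclusive(timeout):
        return False
    lock.release_exclusive()
    return True


def pre_read_callback(lock):
    """Stand-in for the PreRead callback: recall the file, then untag it."""
    return untag_file(lock)


def paged_read(lock):
    """Stand-in for the read-ahead path: hold the lock shared, then hit the callback."""
    lock.acquire_shared()
    try:
        return pre_read_callback(lock)
    finally:
        lock.release_shared()


lock = SharedExclusiveLock()
untag_completed_during_read = paged_read(lock)  # shared lock still held
untag_completed_after_read = untag_file(lock)   # lock is free again
print(untag_completed_during_read, untag_completed_after_read)  # False True
```

The first call fails because the untag step waits for a lock that its own caller holds and cannot release until the untag step returns. The second call succeeds because no shared holder remains.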

Solution

The FSA driver has been modified so that a deadlock no longer occurs in this situation.

This issue has been addressed in the following releases:

Enterprise Vault 10.0.3 - Release Details
http://www.veritas.com/docs/000017186

Enterprise Vault 9.0.5 - Release Details
http://www.veritas.com/docs/000088251

Applies To

Enterprise Vault (EV) File System Archiving (FSA) installed on a Microsoft Windows Server 2008 R2 Failover Cluster

References

Etrack: 2926286
Etrack: 2926285
Etrack: 2804765
