Arctera Insight Information Governance Administrator's Guide
About scanning and event monitoring
Information Governance scans the file system hierarchy to collect information related to permissions and file system metadata from the monitored storage devices.
Event monitoring is an operation that keeps track of the access events happening on a file system. During event monitoring, if Information Governance detects an event such as a create, a write, or a file system ACL-level permission change, it uses this information to perform incremental scans of the paths on which the events are reported.
To collect access events, Information Governance uses asynchronous APIs, such as FPolicy for NetApp filers, the CEE framework for EMC filers, and a filter driver for Windows File Servers.
By default, Information Governance initiates event monitoring every 2 hours. You can disable event monitoring for individual storage devices. To turn off event monitoring, navigate to Settings > Filers. On the edit page for the filer, clear the Enable file system event monitoring option.
Note:
Information Governance scans only share-level permission changes when event monitoring is turned off.
To fetch file system metadata, Information Governance performs the following types of scans: full scans, incremental scans, and re-confirmation scans.
During a full scan, Information Governance scans the entire file system hierarchy. A full scan is typically run after a storage device is first added to the Information Governance configuration. Full scans can run for several hours, depending on the size of the shares. After the first full scan, you can perform full scans less frequently based on your preference. Ordinarily, you need to run a full scan only to scan those paths that might have been modified while file system auditing was not running for any reason.
For large shares, a full scan can take a long time to complete. If the Collector node has a 16-core CPU and 32 GB of RAM or higher, Information Governance automatically shifts to parallel scanning, which results in faster scanning of shares.
By default, each Collector node initiates a full scan at 7:00 P.M. on the last Friday of each month. For SharePoint, the default scan schedule is 11:00 P.M. each night.
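To make the default full scan schedule concrete, the following minimal Python sketch computes the date and time of "7:00 P.M. on the last Friday of each month" for a given month. It is only an illustration of the schedule rule; the function name `last_friday_full_scan` is hypothetical and is not part of Information Governance.

```python
import calendar
from datetime import datetime


def last_friday_full_scan(year: int, month: int) -> datetime:
    """Return 7:00 P.M. on the last Friday of the given month.

    Hypothetical helper for illustration only; not a product API.
    """
    # Weeks are lists of day numbers starting on Monday; days that
    # belong to an adjacent month are reported as 0.
    weeks = calendar.Calendar(firstweekday=calendar.MONDAY).monthdayscalendar(year, month)
    fridays = [week[calendar.FRIDAY] for week in weeks if week[calendar.FRIDAY] != 0]
    return datetime(year, month, fridays[-1], 19, 0)


if __name__ == "__main__":
    print(last_friday_full_scan(2025, 1))
```

The sketch ignores time zones and any pause schedules; it only shows which calendar day the default rule selects.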
During an incremental scan, Information Governance re-scans only those paths of a share that have been modified since the last full scan. It does so by monitoring incoming access events to see which paths had a create, write, or security event since the last scan. Incremental scans are much faster than full scans.
By default, an incremental scan is scheduled once every night at 7:00 P.M. You can initiate an on-demand incremental scan manually by using the command-line utility scancli.exe. It is recommended that you run the IScannerJob before you execute the utility.
See Scheduled Information Governance jobs.
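The path selection that an incremental scan relies on can be sketched as follows. This is a simplified model, not the product's implementation: the `AccessEvent` type, the event names in `RESCAN_EVENTS`, and the `paths_to_rescan` function are all assumptions introduced for illustration.

```python
from dataclasses import dataclass
from datetime import datetime

# Assumed event names: the kinds of events that mark a path as modified.
RESCAN_EVENTS = {"create", "write", "security"}


@dataclass(frozen=True)
class AccessEvent:
    """Hypothetical record of one access event on a monitored path."""
    path: str
    kind: str
    timestamp: datetime


def paths_to_rescan(events, last_scan):
    """Return the sorted paths that had a qualifying event after last_scan."""
    return sorted(
        {e.path for e in events if e.kind in RESCAN_EVENTS and e.timestamp > last_scan}
    )
```

Events such as reads, and events older than the last scan, do not add a path to the result, which is why an incremental scan touches far fewer paths than a full scan.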
After Information Governance completes indexing the full scan data, it computes the paths that no longer seem to be present on the file system. A re-confirmation scan confirms whether a path that is present in the indexes, but appears to be no longer present on the file system, has indeed been deleted. A re-confirmation scan is triggered automatically when Information Governance detects potentially missing paths on the file system during a full scan.
You can turn off re-confirmation scan for any Indexer, using the Advanced Setting for that Indexer. When the re-confirmation scan is turned off, Information Governance readily removes the missing paths from the indexes without carrying out a re-confirmation.
See Configuring advanced settings.
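The re-confirmation logic described above can be modeled roughly as a set difference followed by an optional existence check. The sketch below is an assumption-laden illustration: the function names and the `exists` callback (which stands in for a check against the file system) are hypothetical and do not reflect the Indexer's actual interfaces.

```python
def missing_candidates(indexed_paths, scanned_paths):
    """Paths present in the indexes but not seen in the full scan data."""
    return set(indexed_paths) - set(scanned_paths)


def paths_to_remove(indexed_paths, scanned_paths, exists, reconfirm=True):
    """Return the paths to drop from the indexes.

    exists: hypothetical callable that returns True if the path is still
    present on the file system.
    reconfirm: when False, every candidate is removed without a re-check,
    mirroring the behavior when the re-confirmation scan is turned off.
    """
    candidates = missing_candidates(indexed_paths, scanned_paths)
    if not reconfirm:
        return candidates
    # Keep only the candidates that the re-check confirms as deleted.
    return {path for path in candidates if not exists(path)}
```

With re-confirmation enabled, a path that the full scan missed but that still exists stays in the indexes; with it disabled, that path is removed along with the genuinely deleted ones.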
At a global level, full scans are scheduled for individual Collectors or Windows File Server agents. The table Entities having configurable scan schedules lists all the entities for which you can schedule a full scan.
Table: Entities having configurable scan schedules
Entity | Scan schedule settings location | Scope | Details |
---|---|---|---|
Collector or Windows File Server agents | Settings > Servers > Advanced Setting > File System Scanner settings. | Applies to all the storage devices associated with the Collector, for which a schedule is defined. | |
Filers, web applications, Object Storage Sources and cloud sources | In case of a filer, Settings > Filers > Add New Filer. In case of a SharePoint web application, Settings > SharePoint Sources > Add SharePoint Source > SharePoint Web Application. In case of a SharePoint Online account, Settings > SharePoint Sources > Add SharePoint Source > SharePoint Online Account. In case of an object storage source account, Settings > Cloud Sources > Add new Object Storage Source > Amazon S3. In case of a cloud storage account, Settings > Cloud Sources > Add New Cloud Source. Note: You can also configure scanning at the time of editing filers, web applications, and cloud sources. | Applies to filers, SharePoint web applications, SharePoint Online accounts, ECM sources, Object Storage Sources or cloud sources for which a schedule is defined. This setting overrides the scan schedule defined for the Collector associated with the filer, web application, or cloud source. | See Adding filers. See Configuring monitoring of cloud sources in Information Governance. |
Shares, site collections, buckets and repositories | Settings > Filers > Monitored Shares > Add New Share. Settings > SharePoint Sources > Web Applications > Monitored Site Collections > Add Site Collection. Settings > SharePoint Sources > Online Accounts > Monitored Site Collections > Add Site Collection. Settings > Object Storage Sources > Amazon S3 > Monitored Buckets > Add New Bucket. Note: You can also configure scanning at the time of editing shares and site collections. | Applies to the entire share or site collection for which a schedule is defined. Overrides the scan schedules defined for the filer or the web application associated with the share or the site collection. | See Adding shares. |
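The override order in the table (a share-level schedule overrides the filer-level schedule, which in turn overrides the Collector-level schedule) can be sketched as a most-specific-wins lookup. The function `effective_schedule` and the string schedule values are hypothetical and serve only to illustrate the precedence.

```python
from typing import Optional


def effective_schedule(
    collector: Optional[str],
    filer: Optional[str] = None,
    share: Optional[str] = None,
) -> Optional[str]:
    """Return the most specific schedule that is defined.

    Illustrative precedence only: share (or site collection) first, then
    filer (or web application), then Collector. None means "not defined".
    """
    for schedule in (share, filer, collector):
        if schedule is not None:
            return schedule
    return None
```

For example, `effective_schedule("monthly", filer="weekly")` returns `"weekly"`, because the filer-level schedule takes precedence over the Collector-level one.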
You can override all the full scan schedules and initiate an on-demand full scan for configured shares or site collections. See Managing shares.
Sometimes for maintenance and diagnostic purposes, you may need to disable all the scans. You can disable all scans:
At the time of adding or editing a storage device.
See Adding filers.
Or from the Settings > Scanning and Event Monitoring page of the Management Console.
If you disable scanning for any device, you will not be able to view any permissions data for that device. However, you may still see some stale metadata, such as size and permissions, which was collected before scanning was disabled. If you run a report on the paths for which scanning is disabled, the report may contain stale data.
You can specify pause schedules for both full and incremental scans to indicate when scanning should not be allowed to run. You can configure a pause schedule from the Settings > Servers > Advanced Settings page. To know more about configuring a pause schedule, see Configuring advanced settings.
You can view the details of the current and historical scan status for your entire environment from the scanning dashboard. To access the scanning dashboard, from the Information Governance Management Console, navigate to Settings > Scan Status > Overview. To know more about the scanning dashboard, see Viewing the scanning overview.