Arctera Insight Information Governance Classification Guide

Last Published:
Product(s): Data Insight (7.2)
Platform: Windows

About classification

As unstructured data continues to grow in the business environment, deciding whether to archive or delete content that has business or legal value is a challenge. You can simplify data remediation decisions by categorizing and organizing data based on tags and policies.

Information Governance integrates with Arctera Information Classifier to analyze the files that Information Governance monitors. Arctera Information Classifier uses built-in and user-defined policies to assign classification tags to files in your environment. After the files are classified, users of applications such as Information Governance can use the classification tags to filter the files for searches, reviews, and remediation.

Arctera Insight Information Governance integrates with Arctera Information Classifier to help you do the following:

  • Analyze: Improves content analytics by focusing risk analysis and remediation on a relevant, classified data set.

    Classification enables you to identify the type of data being stored in repositories (for example, Personally Identifiable Information), the purpose of the data, and the risk that is associated with it (whether sensitive or otherwise).

  • Decide: Lets you make informed decisions to retain, secure, move, delete, or monitor data and control permissions based on the classification tags.

  • Regulate: Helps ensure that the data complies with legal requirements.

  • Organize: Lets you categorize and tag content to make it more accessible, searchable, and usable, specifically for archiving, eDiscovery, and audits.

  • Visualize: Lets you view the classified data in Arctera Insight Information Governance and run custom Arctera Insight Information Governance Query Language (DQL) reports that group files based on the tags that are assigned to them.
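The grouping that a tag-based report performs can be illustrated with a minimal Python sketch. This is not DQL and does not use any Information Governance API; the file paths, tag names, and the group_by_tag helper are hypothetical and serve only to show how files map to the tags assigned to them.

```python
from collections import defaultdict


def group_by_tag(files):
    """Group file paths by classification tag.

    `files` is an iterable of (path, tags) pairs. A file with several
    tags appears under each of them; a file with no tags is listed
    under the placeholder key "(untagged)".
    """
    groups = defaultdict(list)
    for path, tags in files:
        if not tags:
            groups["(untagged)"].append(path)
            continue
        for tag in tags:
            groups[tag].append(path)
    return dict(groups)


# Hypothetical sample data: paths and tag names are illustrative only.
sample_files = [
    ("/share/hr/employees.xlsx", ["PII"]),
    ("/share/finance/q1.pdf", ["Financial", "PII"]),
    ("/share/misc/readme.txt", []),
]

for tag, paths in sorted(group_by_tag(sample_files).items()):
    print(tag, paths)
```

A file that carries more than one tag is counted once per tag, so group sizes can add up to more than the number of files.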

Supported file types for classification

For more information about the file types that Information Governance supports for classification, see the Apache Tika 1.27 documentation.

Supported file types for Optical Character Recognition (OCR)

For more information about the file types that Information Governance supports for OCR, see the Tesseract v5.0.1.20220118 documentation.

Information Governance supports classification of files that are stored on file servers, SharePoint web applications, SharePoint Online sources, OneDrive accounts, object storage sources such as Amazon S3, and Box accounts.

Note:

For Amazon S3, Information Governance supports classification of files in the following storage classes: STANDARD, INTELLIGENT_TIERING, STANDARD_IA, and ONEZONE_IA.
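
To estimate which objects in a bucket fall within the supported storage classes, you can check each object's storage class before you plan a classification run. The following is a minimal Python sketch, not part of the product. It assumes object records shaped like the entries that an S3 object listing returns (a Key and a StorageClass field); the split_by_support helper, the sample keys, and the choice to treat a missing StorageClass as STANDARD are assumptions made for illustration.

```python
# Storage classes that the note above lists as supported for classification.
SUPPORTED_STORAGE_CLASSES = {
    "STANDARD",
    "INTELLIGENT_TIERING",
    "STANDARD_IA",
    "ONEZONE_IA",
}


def split_by_support(objects):
    """Split S3 object records into (supported_keys, unsupported_keys)."""
    supported, unsupported = [], []
    for obj in objects:
        # Assumption: a record without a StorageClass field is STANDARD.
        storage_class = obj.get("StorageClass", "STANDARD")
        if storage_class in SUPPORTED_STORAGE_CLASSES:
            supported.append(obj["Key"])
        else:
            unsupported.append(obj["Key"])
    return supported, unsupported


# Hypothetical sample records; keys are illustrative only.
sample_objects = [
    {"Key": "reports/q1.docx", "StorageClass": "STANDARD"},
    {"Key": "archive/2019.zip", "StorageClass": "GLACIER"},
    {"Key": "hr/policy.pdf", "StorageClass": "ONEZONE_IA"},
]

supported_keys, unsupported_keys = split_by_support(sample_objects)
print("Supported:", supported_keys)
print("Not supported:", unsupported_keys)
```

In this sample, the object in the GLACIER storage class is reported as not supported, because GLACIER is not among the storage classes listed in the note.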