Veritas Data Insight Classification Guide

Last Published:
Product(s): Data Insight (7.0)
Platform: Windows

About classification

With the continuous growth of unstructured data in the business environment, taking decisions to archive and delete content of business or legal value is a challenge. You can simplify data remediation decisions by categorizing and organizing data based on tags and policies.

Data Insight integrates with Veritas Information Classifier to analyze the files that Data Insight monitors. Veritas Information Classifier uses built-in and user-defined policies to assign classification tags to files in your environment. After the files are classified, users of applications such as Data Insight can use the classification tags to filter the files for searches, reviews, and remediation.

Data Insight integrates with Veritas Information Classifier to help you do the following:

  • Analyze: Improves the content analytics by focusing on relevant and classified data set to perform risk analysis and remediation.

    Classification enables you to identify the type of data being stored in repositories (for example, Personally Identifiable Information), the purpose of the data, and the risk that is associated with it (whether sensitive or otherwise).

  • Decide: Lets you make informed decisions to retain, secure, move, delete, or monitor data and control permissions based on the classification tags.

  • Regulate: Ensures that the data complies with the legal requirements.

  • Organize: Provides the ability to categorize and tag content to make it more accessible, searchable, and usable, specifically for archiving, ediscovery, and audits.

  • Visualize: Lets you view the classified data in Data Insight and run custom Data Insight Query Language (DQL) reports that group files based on the tags that are assigned to them.

Supported file types for classification

To know more about the file types that Data Insight supports for Classification, check the Apache Tika 1.27 documentation.

Supported file types for Optical Character Recognition (OCR)

To know more about the file types that Data Insight supports for OCR, check the Tesseract v5.0.1.20220118 documentation.

Data Insight supports classification of files stored on file servers, SharePoint web applications, SharePoint online sources, OneDrive accounts, Object Storage Sources like Amazon S3, and Box accounts.

Note:

For Amazon S3, Data Insight supports classification for files under following storage classes: STANDARD, INTELLIGENT_TIERING, STANDARD_IA, and ONEZONE_IA