Veritas Data Insight Classification Guide

Last Published:
Product(s): Data Insight (6.4)
Platform: Windows

Configuring classification

You can enable and configure classification from Settings > Configuration under Classification.

Note:

If you enable classification, Data Insight automatically adds a rule to exclude the audit events generated by the saved credentials used for content scanning. This exclude rule prevents accesses by those credentials from being registered and ensures that the access time of a file is not modified.

Note:

Make sure that trust is established between the file server domain and the Management Server domain.

Note:

The AmazonS3 service is installed and configured on the Classification server by default.

Ensure that all prerequisites are met before you configure classification.
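If you plan to enable Optical Character Recognition, one check that you can script on each node is whether the Tesseract software is present in the default location named in this guide. The following Python snippet is a minimal sketch of such a check, not a Data Insight tool. The function name tesseract_present and the executable name tesseract.exe are assumptions made for illustration.

```python
from pathlib import Path

# Default install location stated in this guide; adjust if your nodes differ.
DEFAULT_TESSERACT_DIR = Path(r"C:\Program Files (x86)\Tesseract-OCR")


def tesseract_present(install_dir=DEFAULT_TESSERACT_DIR, exe_name="tesseract.exe"):
    """Return True if a file named exe_name exists directly under install_dir.

    exe_name is an assumption: the usual name of the Tesseract binary on Windows.
    """
    return (Path(install_dir) / exe_name).is_file()


if __name__ == "__main__":
    status = "found" if tesseract_present() else "not found"
    print(f"Tesseract {status} in {DEFAULT_TESSERACT_DIR}")
```

Run the script on the Management Server, Collector, and Classification server nodes. If it reports that Tesseract is not found, follow the manual installation steps in the Troubleshooting section of the Veritas Data Insight Administrator's Guide.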

To configure classification

  1. In the Management Console, click Settings > Configuration under Classification.
  2. On the Classification Configuration page, edit all or any of the following settings:

    Enable classification

    When the check box is selected, you can submit files for classification from the Workspace tab, Reports tab, and Settings tab > Classification > Requests.

    See Initiating classification.

    Enable Smart Classification

    Check the box to enable Smart Classification.

    When enabled, Data Insight intelligently analyzes the files to identify sensitive files and submits them for classification.

    See About Smart Classification.

    Enable Optical Character Recognition for classification of images

    When this check box is selected, you can classify images.

    To allow classification of images, the Tesseract software is installed on the Management Server, Collector, and Classification server nodes during the installation of Data Insight. The default location for the Tesseract installation is C:\Program Files (x86)\Tesseract-OCR. If the Tesseract installation fails, refer to the Troubleshooting section in the Veritas Data Insight Administrator's Guide to install the Tesseract software manually.

    Note:

    Optical character recognition (OCR) is a performance-intensive feature. Veritas recommends that you select only those file groups that contain the specific file extensions that you need to classify using OCR. By default, the Images for OCR file group is selected.

    Skip classification of files with size greater than

    You can set a limit on the size of the files that Data Insight submits for classification. Data Insight does not submit files that exceed the specified size.

  3. Enter Tenant Id, Client Id, Client Secret Key, and Microsoft Administrator Account details.

    Note:

    Use a Microsoft Administrator Account that is either a Global Administrator account or a Minimum Privilege Account with Information Protector reader and Sensitivity Label reader privileges. For more information, refer to the Creating Minimum Privilege Account Role in Compliance Center section in the Data Insight Administrator's Guide.

  4. Click Save.
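To make the effect of the size limit and the OCR file group settings in step 2 concrete, the following Python snippet sketches how the two settings could combine for a single file. It is an illustration under stated assumptions, not the Data Insight implementation. The function plan_file, the byte-based limit, and the extension list standing in for the Images for OCR file group are all hypothetical. The sketch also does not model how Data Insight treats image files when OCR is disabled; it only reports that OCR is not applied.

```python
from pathlib import PurePath

# Hypothetical stand-in for the extensions in the "Images for OCR" file group.
OCR_EXTENSIONS = frozenset({".png", ".jpg", ".jpeg", ".tif", ".tiff", ".bmp"})


def plan_file(name, size_bytes, max_size_bytes, ocr_enabled,
              ocr_extensions=OCR_EXTENSIONS):
    """Return (submit, use_ocr) for one file.

    submit  -- False only when the file exceeds the configured size limit.
    use_ocr -- True only when the file is submitted, OCR is enabled, and the
               extension belongs to a selected OCR file group.
    """
    submit = size_bytes <= max_size_bytes
    extension = PurePath(name).suffix.lower()
    use_ocr = submit and ocr_enabled and extension in ocr_extensions
    return submit, use_ocr


if __name__ == "__main__":
    limit = 10 * 1024 * 1024  # assumed limit of 10 MB, expressed in bytes
    for name, size in [("report.docx", 2048), ("scan.TIF", 4096),
                       ("archive.zip", 50 * 1024 * 1024)]:
        print(name, plan_file(name, size, limit, ocr_enabled=True))
```

Keeping the OCR extension set small mirrors the recommendation above: because OCR is expensive, only files whose extensions are in the selected file groups are marked for it.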

If you are deploying a Classification Server pool and mapping the Collector node to the server pool instead of to a single Classification Server, see Associating a Classification Server pool to a Collector in

See Configuring safeguard settings for Classification Server.

See Initiating classification.

See Viewing classification status.