Veritas Data Insight Classification Guide
- About this guide
- Getting Started
- Managing content classification from Data Insight
- Configuring classification
- Initiating classification
- Appendix A. Classification best practices
- Appendix B. Classification jobs
- Appendix C. Troubleshooting classification
Configuring classification
You can enable and configure classification from the Classification.
> underNote:
If you enable classification, Data Insight automatically adds a rule to exclude the audit events generated by the saved credentials used for content scanning. The exclude rule is added to prevent the accesses by the named credentials from being registered and to ensure that the access time of a file is not modified.
Note:
Make sure trust is established between file server domain and Management Server domain.
Note:
AmazonS3 service is installed and configured on the Classification server by default.
Ensure that all prerequisites are met before you configure classification.
To configure classification
- In the Management Console, click Settings > Configuration under Classification.
- On the Classification Configuration page, edit all or any of the following settings:
Enable classification
When the check box selected, you can submit files for classification from the Workspace tab, Reports tab, and Settings tab > Classification > Requests.
Enable Smart Classification
Check the box to enable Smart Classification.
When enabled, Data Insight intelligently analyzes the files to identify sensitive files and submits them for classification.
Enable Optical Character Recognition for classification of images
When this check box is selected, you can classify images.
To allow classification of images, a software called Tesseract is installed on the Management Server, Collector, and Classification server nodes during the installation of Data Insight. The default location for the Tesseract installation is
C:\Program Files (x86)\Tesseract-OCR
. In case the Tesseract installation fails, refer to the Troubleshooting section in the Veritas Data Insight Administrator's Guide to manually install the Tesseract software.Note:
Optical character recognition (OCR) is a performance-expensive feature. Veritas recommends that you select only those file groups that contain the specific file extensions that you need to classify for OCR. By default, the file group, Images for OCR is selected.
Skip classification of files with size greater than
You can set the limit on the size of files which Data Insight submits for classification. Data Insight does not submit those files that exceed the specified size.
- Enter Tenant Id, Client Id, Client Secret Key, and Microsoft Administrator Account details.
Note:
Use Microsoft Administrator Account which is either Global Administrator account or Minimum Privilege Account with Information Protector reader and Sensitivity Label reader privileges. For more information, refer to Creating Minimum Privilege Account Role in Compliance Center section in Data Insight Administrator's Guide.
- Click Save.
If you are deploying a Classification Server pool and map the Collector node to the server pool instead of to single Classification Server, see Associating a Classification Server pool to a Collector in
See Configuring safeguard settings for Classification Server.