Data Insight Classification job times are inflated when the Pause feature is enabled

Article: 100040584
Last Published: 2017-11-01
Ratings: 0 0
Product(s): Data Insight

Problem

Data Insight Classification job times are inflated when the Pause feature is enabled

Error Message

The jobs request would appear to take a long time when submitted within the pause window and be much kmore efecient when submitted outside or the window.

Cause

The calculation for the classification job request is the start time in epoch format subtracted form the end time and converted into HH:MM:SS which are then rolunded to whole minutes or hours in the time taken column.

Image

Solution

The classification of the data contents of file objects can be accomplished by submitting requests via the workspace through clicking to submit jobs, or adding a request via the Classification Request page.

However a request will not complete within the pause period, which is set by default, without the  pause being disabled or the request run after the pause window expires. To verify the window is enabled or disabled in the GUI go to the Image Tab and navigate to page:

Image

Data Insight Servers >>
Classification server, Advanced settings, Classification settings (all jobs and SharePoint)
WinNAS server , Advanced settings, Classification settings (all WinNAS agents will classify on the node itself)

 

For the classification server covering any device except Windows NAS filer or SharePoint requests which have there own pause schedule select the button to remove it within the GUI.


Image

For the SharePoint classification it is lower in the save setting:

Image

Remember to save the changes.

 

For the classification that happens on the winnas servers navigate to the servers window and click on the filer then under the advanced settings tab select the classification section and edit as above.

Image

Using the command line the settings will show only for disabled. If they are not found they are enabled and active.

On the Management Server in a DOS window run the command

C:\Program Files\DataInsight\bin>configdb -pT objattr |findstr pause
node:1:scanner.pause.enabled=false
node:1:scanner.pause.enabled.incremental=false

Note: these settings are for scanning Full or incremental settings

node:7:classify.fetch.pause.enabled=false
node:7:classify.fetch.sp.pause.enabled=false
node:5:classify.fetch.pause.enabled=false

 

Note: sp = SharePoint server settings

to match the node names to the servers run the command:

C:\Program Files\DataInsight\bin>configdb -pT node
#|(L)ocal/(R)emote|ID|Name|IP|Queryd_port|Commd_port|isConsole|isIndexer|isColle
ctor|Ctr|
|L|1|DIServer1.SAMGWIN.local|DIServer1.SAMGWIN.local|8282|8383|1|1|1|0|
|R|2|DIPortal1.SAMGWIN.local|DIPortal1.SAMGWIN.local|8282|8383|0|0|0|0|
|R|3|Indexer1.SAMGWIN.local|Indexer1.SAMGWIN.local|8282|8383|0|1|1|0|
|R|4|Collector1.SAMGWIN.local|Collector1.SAMGWIN.local|8282|8383|0|0|1|0|
|R|5|FileServer1.SAMGWIN.local|FileServer1.SAMGWIN.local|8282|8383|0|0|1|0|
|R|6|FileServer2.SAMGWIN.local|FileServer2.SAMGWIN.local|8282|8383|0|0|1|0|
|R|7|Classification.SAMGWIN.local|Classification.SAMGWIN.local|8282|8383|0|0|0|0

 

Caveat: there are considerations to additional workload on servers due to classification that could impact the ability of the machine to handle excessive loads. The setting is designed with this purpose in mind. Disable within your own discretion

 

 

 

Was this content helpful?