Smart Classification Examples
The following examples use custom metadata that must be created before you create the Smart Classification rule; a rule cannot be saved unless you specify which metadata field to set.
Identifying files less than 5 MB containing US social security numbers
Rule name | Tag files <5 MB with US social security numbers |
Automatic execution | Enable |
Filters | File size is less than [5 MB] |
Classifier | Default |
Classifier patterns | Match pattern by name [U. S. Social Security Number (SSN)] |
Classifier condition | Number of matches is greater than [0] |
Match action | Set metadata [SSN.found] to yes |
Non-match action | Set metadata [SSN.found] to no |
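The flow of a rule like this one (filter, count pattern matches, test the condition, set metadata) can be sketched in Python. This is only an illustration of the logic, not FileCloud's implementation. The regular expression is a simplified stand-in for the built-in SSN pattern, which is not shown here and may differ. The function name, the byte value used for 5 MB, and the choice to leave metadata unset for files excluded by the filter are all assumptions.

```python
import re

# Assumption: a simplified stand-in for the built-in SSN pattern (NNN-NN-NNNN).
SSN_PATTERN = re.compile(r"\b\d{3}-\d{2}-\d{4}\b")

# Assumption: 1 MB is treated as 1024 * 1024 bytes.
MAX_SIZE_BYTES = 5 * 1024 * 1024


def classify(text: str, size_bytes: int) -> dict:
    """Return the metadata that a rule like the one above would set (hypothetical helper)."""
    # Filter: the rule only applies to files smaller than 5 MB.
    if size_bytes >= MAX_SIZE_BYTES:
        return {}  # assumption: excluded files are left untouched

    # Classifier condition: number of matches is greater than 0.
    matches = len(SSN_PATTERN.findall(text))
    return {"SSN.found": "yes" if matches > 0 else "no"}
```

For example, `classify("SSN 123-45-6789", 1000)` yields the match action, while the same text in a 6 MB file is skipped by the filter.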
Identifying files with extensions .txt and .pdf containing US social security numbers
Rule name | Tag txt and pdf files with US social security numbers |
Automatic execution | Enable |
Filters | File extension is equal to [txt] OR File extension is equal to [pdf] Note: Click AND to change it to OR. |
Classifier | Default |
Classifier patterns | Match pattern by name [U. S. Social Security Number (SSN)] |
Classifier condition | Number of matches is greater than [0] |
Match action | Set metadata [SSN.found] to yes |
Non-match action | Set metadata [SSN.found] to no |
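The filter in this rule has to use OR because a file has only one extension, so an AND of the two conditions could never be true. A minimal Python sketch of the same check follows; the function name is hypothetical, and the case-insensitive comparison is an assumption rather than documented behavior.

```python
from pathlib import PurePosixPath

ALLOWED_EXTENSIONS = {"txt", "pdf"}


def passes_extension_filter(filename: str) -> bool:
    """True if the file extension is txt OR pdf (hypothetical helper)."""
    # suffix returns the last extension including the dot, e.g. ".pdf".
    extension = PurePosixPath(filename).suffix.lstrip(".").lower()
    return extension in ALLOWED_EXTENSIONS
```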
Identifying all files containing US social security numbers
Rule name | Tag all files with US social security numbers |
Automatic execution | Enable |
Filters | Anything |
Classifier | Default |
Classifier patterns | Match pattern by name [U. S. Social Security Number (SSN)] |
Classifier condition | Number of matches is greater than [0] |
Match action | Set metadata [SSN.found] to yes |
Non-match action | Set metadata [SSN.found] to no |
Identifying files in the Team Folder HumanResources containing US social security numbers
Rule name | Tag all Human Resources files containing US social security numbers |
Automatic execution | Enable |
Filters | File path starts with [TeamFolderAdmin/HumanResources] Note: See Identifying a FileCloud Specific Path for help writing FileCloud folder paths. |
Classifier | Default |
Classifier patterns | Match pattern by name [U. S. Social Security Number (SSN)] |
Classifier condition | Number of matches [is greater than 0] |
Match action | Set metadata [SSN.found] to yes |
Non-match action | Set metadata [SSN.found] to no |
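A "starts with" filter can be thought of as a plain prefix comparison, sketched below in Python. The function name is hypothetical, and the comparison is assumed to be case-sensitive. Note that in this sketch a plain prefix also matches a sibling folder whose name begins with the same characters; whether FileCloud behaves the same way is not stated here.

```python
HR_PREFIX = "TeamFolderAdmin/HumanResources"


def path_starts_with(path: str, prefix: str = HR_PREFIX) -> bool:
    """Plain prefix comparison on the file path (hypothetical helper)."""
    return path.startswith(prefix)


# A file inside the folder passes the filter.
inside = path_starts_with("TeamFolderAdmin/HumanResources/payroll/2023.xlsx")

# A sibling folder with a longer name also passes a plain prefix check.
sibling = path_starts_with("TeamFolderAdmin/HumanResourcesArchive/old.txt")
```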
Identifying files containing any pattern in the custom pattern group France ID numbers
Rule name | France ID numbers |
Automatic execution | Enable |
Filters | Anything |
Classifier | Default |
Classifier patterns | Match pattern by group [France ID numbers] |
Classifier condition | Number of matches [is greater than 0] |
Match action | Set metadata [ID.found] to yes |
Non-match action | Set metadata [ID.found] to no |
Identifying files with Singapore National Registration Identity Card (NRIC) numbers
Rule name | Tag files with Singapore NRIC |
Automatic execution | Enable |
Filters | Anything |
Classifier | Singapore NRIC |
Classifier condition | Number of matches [is greater than 0] |
Match action | Set metadata [ID.found] to yes |
Non-match action | Set metadata [ID.found] to no |
Identifying files with patterns matching American Express credit cards
Rule name | Tag files with American Express card numbers |
Automatic execution | Enable |
Filters | Anything |
Classifier | Default |
Classifier patterns | Match RegEx [3[47]{1}[0-9]{13}] OR Match RegEx [3[47]{1}[0-9]{2}-[0-9]{6}-[0-9]{5}] OR Match RegEx [3[47]{1}[0-9]{2} [0-9]{4} [0-9]{4} [0-9]{3}] Note: Smart Classification automatically inserts OR when you add multiple Classifier patterns. |
Classifier condition | Number of matches [is greater than 0] |
Match action | Set metadata [ID.found] to yes |
Non-match action | Set metadata [ID.found] to no |
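The three expressions in this rule cover the same 15-digit number written without separators, with hyphens (4-6-5 grouping), and with spaces (4-4-4-3 grouping). The Python sketch below tries them against sample strings. The helper name is hypothetical, and adding up the matches of the three patterns is an assumption about how ORed patterns are counted.

```python
import re

# The three expressions from the rule above, unchanged.
AMEX_PATTERNS = [
    r"3[47]{1}[0-9]{13}",
    r"3[47]{1}[0-9]{2}-[0-9]{6}-[0-9]{5}",
    r"3[47]{1}[0-9]{2} [0-9]{4} [0-9]{4} [0-9]{3}",
]


def count_amex_matches(text: str) -> int:
    """Total matches across all patterns (hypothetical helper)."""
    return sum(len(re.findall(pattern, text)) for pattern in AMEX_PATTERNS)


# 378282246310005 is a widely published American Express test number.
plain = count_amex_matches("Card: 378282246310005")
hyphenated = count_amex_matches("Card: 3782-822463-10005")
spaced = count_amex_matches("Card: 3782 8224 6310 005")
other_brand = count_amex_matches("Card: 4111111111111111")
```

Because the expressions are not anchored to word boundaries, they also match inside longer digit runs, so a stricter pattern may be preferable where false positives matter.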
Identifying files with the exact phrase "Confidential - for internal use only"
Rule name | Tag files marked as confidential |
Automatic execution | Enable |
Filters | Anything |
Classifier | Default |
Classifier patterns | Match RegEx [Confidential - for internal use only] |
Classifier condition | Number of matches [is greater than 0] |
Match action | Set [IncludesText.Confidential] to yes |
Non-match action | Set [IncludesText.Confidential] to no |
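This phrase works as a regular expression because it contains no characters with special meaning in that position. A phrase containing characters such as `.`, `(`, or `?` would need them escaped to be matched literally. The Python sketch below shows the idea with `re.escape`; the helper name is hypothetical, and the case-sensitive matching shown is Python's default, not necessarily Smart Classification's behavior.

```python
import re

PHRASE = "Confidential - for internal use only"

# re.escape makes every special character literal, so the phrase matches exactly.
PHRASE_PATTERN = re.compile(re.escape(PHRASE))


def count_phrase(text: str) -> int:
    """Number of exact occurrences of the phrase (hypothetical helper)."""
    return len(PHRASE_PATTERN.findall(text))
```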
Marking files with different tags depending on the number of matches
In this rule, a file containing 0 to 2 five-digit numbers is marked as having a low possibility of containing personal ID information, and a file containing more than 2 five-digit numbers is marked as having a high possibility. This enables you to perform different operations on files depending on how likely they are to contain a match. For example, you might choose to manually review files with low possibility, but automatically block files with high possibility.
Rule name | Tag files based on number of 5-digit numbers |
Automatic execution | Enable |
Filters | Anything |
Classifier | Default |
Classifier patterns | Match RegEx [[0-9]{5}] |
Classifier condition | Number of matches [is greater than 2] |
Match action | Set [ID.found] to high |
Non-match action | Set [ID.found] to low |
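The threshold logic of this rule can be sketched in Python as follows. The function name is hypothetical, and the counting uses Python's non-overlapping matching, which is an assumption about how Smart Classification counts matches.

```python
import re

FIVE_DIGITS = re.compile(r"[0-9]{5}")


def id_likelihood(text: str, threshold: int = 2) -> str:
    """Return the value the rule would write to ID.found (hypothetical helper)."""
    matches = len(FIVE_DIGITS.findall(text))
    # Classifier condition: number of matches is greater than 2.
    return "high" if matches > threshold else "low"
```

Because the expression is not anchored, a longer digit run contributes several matches in this sketch; for instance, a ten-digit number counts as two five-digit matches.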
Identifying files with a phrase that is the same or similar to "Confidential - for internal use only"
Rule name | Tag files with confidentiality phrases |
Automatic execution | Enable |
Filters | Anything |
Classifier | Solr Standard Query |
Classifier patterns | Match Standard Query ["Confidential - for internal use only"~4] (include quotation marks around the phrase) Note: ~4 indicates that all words in the phrase must appear, but they may be up to 4 words apart. For example, "Confidential - use for internal only" would be a match. |
Classifier condition | Number of matches [is greater than 0] |
Match action | Set [IncludesText.Confidential] to yes |
Non-match action | Set [IncludesText.Confidential] to no |
Identifying files with a word that matches or is one letter different from "Confidential"
Rule name | Tag files with words spelled similarly to confidential |
Automatic execution | Enable |
Filters | Anything |
Classifier | Solr Standard Query |
Classifier patterns | Match Standard Query [Confidential~1] (do not include quotation marks around the word) Note: ~1 indicates that the spelling may differ by 1 letter. For example, "Confidental" and "Confidentials" would match, but "Confidentail" would not. |
Classifier condition | Number of matches [is greater than 0] |
Match action | Set [Spelling.similar] to yes |
Non-match action | Set [Spelling.similar] to no |
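The "one letter different" idea in the note can be illustrated with an edit-distance calculation. The Python sketch below uses plain Levenshtein distance (insertions, deletions, and substitutions), chosen as an assumption because it reproduces the three examples in the note. A search engine that counts a swap of two adjacent letters as a single edit would treat "Confidentail" differently, so verify the behavior on your own system. The function names are hypothetical.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance between two strings."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution (or exact match)
            ))
        previous = current
    return previous[-1]


def is_similar(word: str, target: str = "confidential", max_edits: int = 1) -> bool:
    """True if word is within max_edits of target, ignoring case (assumption)."""
    return levenshtein(word.lower(), target) <= max_edits
```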
Identifying files with the word "classified" and not the word "declassified"
Rule name | Tag classified files |
Automatic execution | Enable |
Filters | Anything |
Classifier | Solr Standard Query |
Classifier patterns | Match Standard Query ["CLASSIFIED" NOT "DECLASSIFIED"] |
Classifier condition | Number of matches [is greater than 0] |
Match action | Set [Classified.found] to yes |
Non-match action | Set [Classified.found] to no |
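The NOT operator applies to the whole file: a file that contains both words is excluded, not just the sentence where "declassified" appears. The Python sketch below illustrates this under two assumptions: that text is split into whole-word tokens, so "declassified" does not count as an occurrence of "classified", and that matching ignores case. The function names are hypothetical.

```python
import re


def tokens(text: str) -> set:
    """Lowercased whole-word tokens (a simplified stand-in for index tokenization)."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def is_classified_only(text: str) -> bool:
    """True if the text has the word 'classified' and lacks 'declassified'."""
    words = tokens(text)
    return "classified" in words and "declassified" not in words
```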
Identifying files marked for blocking by ICAP-DLP
With the ICAP-DLP classifier, the pattern is checked by ICAP-DLP, which tags the file only if it is sensitive. A file tagged by ICAP-DLP as sensitive therefore counts as a match, and the following rule sets File.allowed to false, indicating that the file is not allowed to be downloaded, uploaded, or shared.
Rule name | Identifying files flagged by ICAP-DLP |
Automatic execution | Enable |
Filters | Anything |
Classifier | ICAP-DLP |
Classifier condition | Number of matches [is greater than 0] |
Match action | Set [File.allowed] to false |
Non-match action | Set [File.allowed] to true |
Identifying files with names or addresses (AI Classifier example)
Rule name | Tag files with names or addresses |
Automatic execution | Enable |
Filters | Anything |
Classifier | AI Classifier |
Classifier patterns | Match instances of [people names] OR Match instances of [addresses] |
Classifier condition | Number of matches [is greater than 0] |
Match action | Set [Personal Info.found] to yes |
Non-match action | Set [Personal Info.found] to no |
Identifying files with company names (AI Classifier example)
Rule name | Identify files with company names |
Automatic execution | Enable |
Filters | Anything |
Classifier | AI Classifier |
Classifier patterns | Match instances of [company names] |
Classifier condition | Number of matches [is greater than 0] |
Match action | Set [CompanyName.detected] to yes |
Non-match action | Set [CompanyName.detected] to no |
Identifying files with contact information (AI Classifier example)
Rule name | Identify files with contact information |
Automatic execution | Enable |
Filters | Anything |
Classifier | AI Classifier |
Classifier patterns | Match instances of [phone numbers] OR Match instances of [email addresses] |
Classifier condition | Number of matches [is greater than 0] |
Match action | Set [ContactInfo.detected] to yes |
Non-match action | Set [ContactInfo.detected] to no |