Use Pattern Search for GDPR

GDPR requires you to ensure that files containing protected information have not been shared inappropriately. You can meet this requirement by searching for sensitive information and reviewing who has accessed it.


  • Quickly search using built-in pattern identifiers, including e-mail addresses, phone numbers, and credit card numbers
  • Use templates to search for complex patterns such as license plate numbers, driver’s licenses and national identification numbers, which are different across countries

In the FileCloud Server Admin Portal, this feature is called PII Search (Personally Identifiable Information search).

Warning: Searching for protected information is only available to administrators on the Admin Portal. FileCloud users do not have access to PII Search on the User Portal or from any other client.


You can only use this feature if you have installed an Enterprise License.

View your license details


Content search must first be enabled in FileCloud. If content search is not enabled, contact FileCloud Support.

Index Managed Storage

To search documents for information, you must first index them. You can:

  • Create a new index
  • Check the status of an existing index to see if the number of files managed is the same as the number of files indexed
  • Update an existing index to add only the files that have not been indexed yet

Indexing Managed Storage


After you enable PII Search, a new section called Manage PII Patterns is displayed. In the Manage PII Patterns section, you can:

  • Add new patterns
  • Edit existing patterns
  • Search for patterns
  • Remove patterns you don't need

Tip: The list of patterns you configure here is shown as selectable options in the Advanced options dialog box on the Federated Search screen.


A pattern contains the following information:

Regex is short for regular expression: a sequence of characters that describes a search pattern.

If you need to create a new pattern or edit an existing one, use the following table to understand the Regex format used by FileCloud.


Description

  • Name: Identifies the type of protected information
  • Input Type: Enclosed in square brackets [ ]; can be a number range, an uppercase letter range, or a lowercase letter range
  • Length of Input: Enclosed in curly brackets { }; a number
  • RegEx Format: [Type]{Length}

Numerical example

  • Name: U.S. Social Security Number
  • Input Type: 0-9 (can contain any number 0-9)
  • Length of Input: 3 (for the first number), 2 (for the second number), 4 (for the third number)
  • RegEx Format: [0-9]{3}-[0-9]{2}-[0-9]{4}
  • Example match: 012-34-5678

Letter example

  • Name: Poland Passport
  • Input Type: a-zA-Z, 0-9
  • Length of Input: 2 letters, followed by 7 numbers
  • RegEx Format: [a-zA-Z]{2}[0-9]{7}
  • Example match: ME0123456
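Before adding a pattern, it can help to try the regex against sample text outside FileCloud. The following is a minimal sketch that runs the two example patterns above through Python's standard `re` module. It assumes that Python interprets these simple character-class patterns the same way FileCloud's search engine does, which this page does not confirm. The names `PATTERNS` and `find_matches` are hypothetical and are not part of FileCloud.

```python
import re

# Patterns copied from the examples above. The dictionary and its labels are
# illustrative only; they are not a FileCloud data structure.
PATTERNS = {
    "U.S. Social Security Number": r"[0-9]{3}-[0-9]{2}-[0-9]{4}",
    "Poland Passport": r"[a-zA-Z]{2}[0-9]{7}",
}


def find_matches(text):
    """Return {pattern name: [matched strings]} for each pattern found in text."""
    results = {}
    for name, regex in PATTERNS.items():
        hits = re.findall(regex, text)
        if hits:
            results[name] = hits
    return results


sample = "Applicant SSN 012-34-5678, passport ME0123456."
print(find_matches(sample))
```

Because these patterns are not anchored, they can also match inside longer strings of letters or digits. Testing against realistic sample text is a quick way to spot such false positives before saving a pattern.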


To enable and configure PII Search:

  1. Open a browser and log in to the Admin Portal.

  2. From the left navigation panel, click Settings.

  3. On the Manage Settings screen, click the Content Search tab.

  4. On the Content Search tab, after Solr Configuration, check the Enable PII Search checkbox.

  5. To add a new pattern to the list, in the Manage PII Patterns section, in the top right corner, click Add.
  6. To edit an existing pattern, in the Manage PII Patterns section, select the pattern and then click the edit icon.
  7. To remove an existing pattern, in the Manage PII Patterns section, select the pattern and then click the delete icon.
  8. Click Save.



To search the site's content for PII patterns, see Regex searches on the page Search in the Admin Portal.