Use Pattern Search for GDPR

GDPR requires you to ensure that files containing protected information have not been shared inappropriately. To meet this requirement, you can search for sensitive information and review who has accessed it.


  • Quickly search using built-in pattern identifiers including e-mail addresses, phone numbers, and credit cards
  • Use templates to search for complex patterns such as license plate numbers, driver’s license numbers, and national identification numbers, which differ from country to country

In FileCloud Server, this feature is available on the Admin Portal and is called PII Search (Personally Identifiable Information search).

Warning: Searching for protected information is only available to administrators on the Admin Portal. FileCloud users do not have access to PII Search on the User Portal or from any other client.


You can only use this feature if you have installed an Enterprise License.

View your license details



First, you must enable content search so that FileCloud can search within the contents of files.

Setting up content search for documents

Index Managed Storage

In order to search documents for information, you must first index them.

  • Create a new index
  • Check the status of an existing index to see if the number of files managed is the same as the number of files indexed
  • Update an existing index to add only the files that have not been indexed yet

Indexing Managed Storage


Index Network Folders

Unlike files in Managed Storage, files in network folders exist outside of FileCloud, so changes made in the network folders might not be propagated to FileCloud.

Monitoring such changes is important in the following scenarios:

  • Faster searching
  • Content Search
  • Automatic Realtime Syncing of Network Folders
  • Pattern Searches for GDPR

For these scenarios, you must index network folders and keep them indexed as files and folders change.

  • To index network folders, the FileCloud Helper service is required

Indexing Network Folders

After you enable PII Search, a new section called Manage PII Patterns is displayed. In the Manage PII Patterns section, you can:

  • Add new patterns
  • Edit existing patterns
  • Search for patterns
  • Remove patterns you no longer need

Tip: The list of patterns you configure here is shown as selectable options in the Advanced options dialog box on the Federated Search screen.


A pattern contains the following information:

Regex is short for regular expression. A regular expression is a special string of characters that describes a search pattern.

If you need to create a new pattern or edit an existing one, use the following table to understand the Regex format used by FileCloud.


RegEx Format: [Type]{Length}

  • Name: identifies the type of protected information
  • Input Type: enclosed in square brackets [ ]; can be a number range, an uppercase letter range, or a lowercase letter range
  • Length of Input: enclosed in curly brackets { }; a number

Numerical example

  • Name: U.S. Social Security Number
  • Input Type: 0-9 (can contain any number 0-9)
  • Length of Input: 3 (for the first number), 2 (for the second number), 4 (for the third number)
  • RegEx Format: [0-9]{3}-[0-9]{2}-[0-9]{4}
  • Example value: 012-34-5678

Letter example

  • Name: Poland Passport
  • Input Type: a-zA-Z, 0-9
  • Length of Input: 2 letters followed by 7 numbers
  • RegEx Format: [a-zA-Z]{2}[0-9]{7}
  • Example value: ME0123456
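
Before adding a pattern in the Admin Portal, it can help to try it against sample text. The following is a minimal sketch that uses Python's standard re module to check the two example patterns listed above. It is not part of FileCloud, the helper name find_matches and the sample text are hypothetical, and it assumes FileCloud's search engine interprets these simple patterns the same way Python does.

```python
import re

# The two example patterns from this page, keyed by pattern name.
PATTERNS = {
    "U.S. Social Security Number": r"[0-9]{3}-[0-9]{2}-[0-9]{4}",
    "Poland Passport": r"[a-zA-Z]{2}[0-9]{7}",
}


def find_matches(text):
    """Return {pattern name: [matched substrings]} for each pattern found in text.

    Hypothetical helper for local testing only; it does not call FileCloud.
    """
    results = {}
    for name, regex in PATTERNS.items():
        hits = re.findall(regex, text)
        if hits:
            results[name] = hits
    return results


# Illustrative sample text containing both example values.
sample = "SSN 012-34-5678 on file; passport ME0123456 expires soon."
print(find_matches(sample))
```

Because the patterns have no boundary markers, they also match inside longer strings of digits or letters, so expect some false positives when you review search results.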


To enable and configure PII Search:

  1. Open a browser and log in to the Admin Portal.

  2. From the left navigation panel, click Settings.

  3. On the Manage Settings screen, click the Content Search tab.

  4. On the Content Search tab, after Solr Configuration, check the Enable PII Search checkbox.

  5. To add a new pattern to the list, in the Manage PII Patterns section, in the top right corner, click Add.
  6. To edit an existing pattern, in the Manage PII Patterns section, select the pattern and then click the edit icon.
  7. To remove an existing pattern, in the Manage PII Patterns section, select the pattern and then click the delete icon.
  8. Click Save.



To search the site's content for PII patterns, see Regex searches on the page Search in the Admin Portal.