Inbuilt Types and Detectors
PII Types
โ Tokern can detect multiple types of PII. In-built types are: โ
- Phone
- Credit Card
- Address
- Person/Name
- Birth Date
- Gender
- Nationality
- SSN
- User Name
- Password
- Zip Code
- PO Box โ โ
ColumnNameRegexDetector
Tokern uses common regular expressions on column names to detect sensitive data. โ
- PERSON
^.*(firstname|fname|lastname|lname|
fullname|fname|maidenname|_name|
nickname|name_suffix|name).*$
โ
^.*(email|e-mail|mail).*$
โ
- BIRTH_DATE
^.*(date_of_birth|dateofbirth|dob|
birthday|date_of_death|dateofdeath|birthdate).*$
โ
- GENDER
^.*(gender).*$
- NATIONALITY
^.*(nationality).*$
โ
- ADDRESS
^.*(address|city|state|county|country|zone|borough).*$
โ
- ZIP_CODE
^.*(zipcode|zip_code|postal|postal_code|zip).*$
โ
- USER_NAME
^.*user(id|name|).*$
โ
- PASSWORD:
^.*pass.*$
โ
- US Social Security Number
^.*(ssn|social_number|social_security|
social_security_number|social_security_no).*$
โ
- PO_BOX
^.*(po_box|pobox).*$
โ โ
DatumRegexDetector
Tokern uses the CommonRegex Improved package to scan a sample set of rows for sensitive data. โ The following data categories are recognized using this method: โ
- Phone
- Credit Card
- Address
- US Social Security Number
- Zip Code
- PO Box