The Two Best Ways to Scan for PII in Your Data Warehouse

by Rajat Venkatesh, 6 min read, December 5th, 2021

Too Long; Didn't Read

An important requirement for data privacy and protection is finding and cataloging the tables and columns in a data warehouse that contain PII or PHI. Open source data catalogs like DataHub and Amundsen make it possible to catalog the information held in data warehouses. This post describes two strategies for scanning and detecting PII, and introduces PIICatcher, an open source application that scans data warehouses for PII. PII includes Social Security numbers, email addresses, phone numbers, login IDs, social media posts, digital images, geolocation, and more.
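To make the idea of scanning for PII concrete, here is a minimal Python sketch. The summary above does not spell out the two strategies, so this assumes one plausible pair: matching column names against patterns, and matching sampled column values against patterns. The pattern tables and the function names (`scan_column_name`, `scan_values`, `scan_table`) are hypothetical and for illustration only; this is not PIICatcher's API or implementation.

```python
import re

# Hypothetical name patterns: a column called "user_email" probably holds emails.
COLUMN_NAME_PATTERNS = {
    "email": re.compile(r"e?mail", re.IGNORECASE),
    "phone": re.compile(r"phone|mobile", re.IGNORECASE),
    "ssn": re.compile(r"ssn|social_?security", re.IGNORECASE),
}

# Hypothetical value patterns: deliberately simple, not production-grade.
VALUE_PATTERNS = {
    "email": re.compile(r"^[\w.+-]+@[\w-]+(\.[\w-]+)+$"),
    "ssn": re.compile(r"^\d{3}-\d{2}-\d{4}$"),
}


def scan_column_name(name):
    """Return the PII types suggested by a column's name."""
    return {pii for pii, pat in COLUMN_NAME_PATTERNS.items() if pat.search(name)}


def scan_values(values):
    """Return the PII types matched by at least one sampled value."""
    found = set()
    for value in values:
        for pii, pat in VALUE_PATTERNS.items():
            if pat.match(str(value)):
                found.add(pii)
    return found


def scan_table(columns):
    """Scan a table given as a dict of column name -> list of sampled values.

    Returns a dict of column name -> set of suspected PII types,
    omitting columns with no hits.
    """
    report = {}
    for name, sample in columns.items():
        hits = scan_column_name(name) | scan_values(sample)
        if hits:
            report[name] = hits
    return report


if __name__ == "__main__":
    table = {
        "user_email": ["a@b.com"],
        "notes": ["123-45-6789"],
        "id": [1, 2],
    }
    print(scan_table(table))
```

Name matching is cheap because it needs only metadata, but it misses PII stored in generically named columns such as `notes`. Value matching catches those cases at the cost of reading data. A real scanner would likely combine both and write the results to a data catalog.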

Rajat Venkatesh

@vrajat

Cloud Databases, Data Governance & Security
