Remove personally identifiable information (PII) from CSV files with OpenAI
$20/month : Unlimited workflows
2500 executions/month
THE #1 IN WEB SCRAPING
Scrape any website without limits
HOSTINGER 🎉 Early Black Friday Deal
DISCOUNT 20% Try free
DISCOUNT 20%
Self-hosted n8n
Unlimited workflows - from $4.99/mo
#1 hub for scraping, AI & automation
6000+ actors - $5 credits/mo
What this workflow does
- Monitors Google Drive: The workflow triggers whenever a new CSV file is uploaded.
- Uses AI to Identify PII Columns: The OpenAI node analyzes the data and identifies PII-containing columns (e.g., name, email, phone).
- Removes PII: The workflow filters out these columns from the dataset.
- Uploads Cleaned File: The sanitized file is renamed and re-uploaded to Google Drive, ensuring the original data remains intact.
How to customize this workflow to your needs
- Adjust PII Identification: Modify the prompt in the OpenAI node to align with your specific data compliance requirements.
- Include/Exclude File Types: Adjust the Google Drive Trigger settings to monitor specific file types (e.g., CSV only).
- Output Destination: Change the folder in Google Drive where the sanitized file is uploaded.
Setup
Prerequisites:
- A Google Drive account.
- An OpenAI API key.
Workflow Configuration:
- Configure the Google Drive Trigger to monitor a folder for new files.
- Configure the OpenAI Node to connect with your API
- Set the Google Drive Upload folder to a different location than the Trigger folder to prevent workflow loops.