🛡️ PII Scrubber
Meet PII Scrubber, a production-ready AI agent for business automation and workflow optimization. It automates PII detection and redaction in AI/ML data-preparation pipelines, using NER models from libraries such as spaCy and Hugging Face Transformers to identify names, SSNs, email addresses, and phone numbers. Redaction levels are configurable (hashing, masking, or pseudonymization) and GDPR/CCPA-aware. The agent also generates audit trails and suggests synthetic data replacements, mirroring requirements in roles at Capital One and Snowflake for scrubbing datasets before LLM training. It is ideal for Data Privacy Engineers building compliant pipelines with tools like Microsoft Presidio and AWS Macie. Deploy it on your favorite AI platform and start automating today.
Key Features
- PII detection for names, SSNs, email addresses, and phone numbers using spaCy NER models (from 70% of postings)
- Configurable redaction levels (hash, mask, pseudonymize) as in Private AI and Capital One roles
- GDPR/CCPA compliance checks with audit trail generation (Databricks, JPMorgan postings)
- Synthetic data replacement suggestions via Faker integration (Privacy Engineer roles)
- Integration with Google Cloud DLP API for scalable unstructured data (Snowflake jobs)
- AWS Macie support for S3/Lake Formation PII scrubbing (Scale AI postings)
- Microsoft Presidio analyzers for custom regex/pattern matching (40% of postings)
- Hugging Face Datasets preprocessing for LLM-safe fine-tuning (30% of roles)
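
The configurable redaction levels listed above (hash, mask, pseudonymize) and the audit trail can be illustrated with a short Python sketch. This is a minimal illustration under stated assumptions, not the agent's actual implementation: it uses simple standard-library regexes in place of spaCy or Presidio, covers only emails, SSNs, and US-style phone numbers, and the names `PATTERNS`, `redact_value`, and `scrub` are hypothetical.

```python
import hashlib
import re

# Illustrative patterns only (assumption): a real pipeline would rely on
# NER models or Presidio recognizers rather than these simple regexes.
PATTERNS = {
    "EMAIL": re.compile(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "PHONE": re.compile(r"\b\d{3}[-.]\d{3}[-.]\d{4}\b"),
}


def redact_value(value, label, mode, pseudonyms):
    """Return the replacement for one detected value under the given mode."""
    if mode == "hash":
        # Truncated, unsalted SHA-256 for illustration only; low-entropy
        # values such as SSNs can be brute-forced, so a keyed hash is safer.
        return hashlib.sha256(value.encode("utf-8")).hexdigest()[:12]
    if mode == "mask":
        return "*" * len(value)
    if mode == "pseudonymize":
        key = (label, value)
        if key not in pseudonyms:
            # Number tokens per label so the same value maps to the same token.
            count = sum(1 for seen_label, _ in pseudonyms if seen_label == label)
            pseudonyms[key] = f"<{label}_{count + 1}>"
        return pseudonyms[key]
    raise ValueError(f"unknown redaction mode: {mode}")


def scrub(text, mode="mask"):
    """Redact matches of every pattern; return (scrubbed_text, audit_trail)."""
    pseudonyms = {}
    audit = []
    for label, pattern in PATTERNS.items():
        def _replace(match, label=label):
            # The audit entry records what was redacted, never the raw value.
            audit.append({"type": label, "mode": mode, "length": len(match.group())})
            return redact_value(match.group(), label, mode, pseudonyms)

        text = pattern.sub(_replace, text)
    return text, audit


if __name__ == "__main__":
    sample = "Email jane.doe@example.com or call 555-123-4567. SSN 123-45-6789."
    for mode in ("mask", "hash", "pseudonymize"):
        scrubbed, trail = scrub(sample, mode)
        print(mode, "->", scrubbed, f"({len(trail)} redactions)")
```

Keeping the audit trail free of raw values means the log itself does not become a second copy of the PII; each entry records only the entity type, the mode applied, and the original length.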
What's Included
- SOUL.md — Agent personality, tone, and behavioral guidelines
- AGENTS.md — Workspace rules, memory management, and safety boundaries
- System Prompt — Universal prompt compatible with any LLM
- README — Setup guide with deployment instructions
Compatible With
- OpenClaw (recommended — full agent lifecycle)
- ChatGPT / OpenAI API
- Claude / Anthropic API
- Gemini / Google AI
- Grok / xAI
- Any LLM that accepts system prompts
