PII Exposure in AI Agent Conversations
PII exposure occurs when personally identifiable information — Social Security numbers, credit card numbers, phone numbers, home addresses — appears in AI agent conversations. This creates compliance risks (GDPR, CCPA, HIPAA), liability exposure, and potential for identity theft if the data is logged, cached, or exfiltrated.
How It Works
- Customers voluntarily share sensitive data ("my SSN is 123-45-6789") without realizing it's being logged
- The agent queries a database or API that returns records containing PII in the response
- An agent trained or cached on one user's data includes it in responses to another user
- Individual pieces of non-sensitive data combine to create an identifiable profile
Real-World Scenario
A healthcare scheduling agent asks for a patient's date of birth for verification. The patient also provides their SSN and insurance ID. All of this is logged in the conversation history, violating HIPAA regulations and creating a data breach liability.
Example Payload
My social security number is 123-45-6789 and my credit card is 4532 1234 5678 9012
This is an example for educational purposes. Rune detects and blocks payloads like this in real-time.
How Rune Detects This
Regex patterns detect SSNs (XXX-XX-XXXX), credit card numbers (Luhn validation), phone numbers, and email addresses in both inputs and outputs.
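Rune's internal detectors aren't public, but the regex-plus-Luhn approach described above can be sketched in a few lines. The pattern names and regexes here are illustrative assumptions, not Rune's actual rules:

```python
import re

# Illustrative patterns — production detectors are more permissive about
# formatting variants (dots, parentheses, international formats).
PATTERNS = {
    "ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "credit_card": re.compile(r"\b(?:\d[ -]?){13,16}\b"),
    "phone": re.compile(r"\b\d{3}[-.\s]?\d{3}[-.\s]?\d{4}\b"),
    "email": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.-]+\b"),
}

def luhn_valid(number: str) -> bool:
    """Checksum test that filters out random digit runs flagged as card numbers."""
    digits = [int(d) for d in re.sub(r"\D", "", number)]
    checksum = 0
    for i, d in enumerate(reversed(digits)):
        if i % 2 == 1:       # double every second digit from the right
            d *= 2
            if d > 9:
                d -= 9
        checksum += d
    return checksum % 10 == 0

def scan(text: str) -> list[tuple[str, str]]:
    """Return (type, match) pairs for PII found in an input or output string."""
    findings = []
    for kind, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            if kind == "credit_card" and not luhn_valid(match.group()):
                continue  # digit run fails the Luhn check — not a card number
            findings.append((kind, match.group()))
    return findings
```

Note that the Luhn check matters: the example payload's card number (4532 1234 5678 9012) fails the checksum, so a Luhn-validating scanner reports only the SSN, while a naive digit-run regex would flag both.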
Policies can flag or block conversations containing PII, force redaction before logging, or restrict which tools can access PII-containing data stores.
Real-time alerts notify security teams when PII is detected, with severity levels based on data type and context.
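A minimal sketch of severity-tiered alerting, assuming the `scan` output shape above. The tier values are hypothetical — a real deployment would also weigh context (e.g. a phone number in a healthcare conversation escalates):

```python
from dataclasses import dataclass

# Hypothetical severity tiers keyed by data type.
SEVERITY = {"ssn": 3, "credit_card": 3, "phone": 2, "email": 1}

@dataclass
class Alert:
    kind: str
    severity: int
    message: str

def build_alerts(findings: list[tuple[str, str]]) -> list[Alert]:
    """Turn (kind, match) pairs into alerts, highest severity first."""
    alerts = [
        Alert(kind, SEVERITY.get(kind, 1), f"PII detected: {kind}")
        for kind, _match in findings
    ]
    return sorted(alerts, key=lambda a: a.severity, reverse=True)
```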
Mitigations
- Scan all agent inputs and outputs for PII patterns and redact before logging
- Implement data classification policies that restrict PII access to authorized agents only
- Don't store raw conversation logs containing PII — tokenize or redact sensitive fields
- Train users not to share sensitive data with AI agents, and add pre-conversation warnings
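The first and third mitigations — redact before logging, never store raw PII — amount to a substitution pass that runs before any conversation turn is persisted. A minimal sketch, with illustrative patterns and placeholder-token formats (this pass is pattern-only and does not Luhn-validate):

```python
import re

# Order matters: the SSN pattern runs before the card pattern so a 9-digit
# SSN is never partially consumed by the broader digit-run regex.
REDACTIONS = [
    (re.compile(r"\b\d{3}-\d{2}-\d{4}\b"), "[SSN REDACTED]"),
    (re.compile(r"\b(?:\d[ -]?){13,16}\b"), "[CARD REDACTED]"),
    (re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.-]+\b"), "[EMAIL REDACTED]"),
]

def redact(text: str) -> str:
    """Replace PII matches with placeholder tokens before the text is logged."""
    for pattern, token in REDACTIONS:
        text = pattern.sub(token, text)
    return text
```

Tokenization (storing a reversible reference into a separate vault instead of a fixed placeholder) follows the same shape, but keeps an audited lookup path for the rare cases where the original value is legitimately needed.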
Related Threats
Secret Exposure
How API keys, passwords, and tokens leak through AI agent inputs and outputs. Detection and prevention strategies for production deployments.
Data Exfiltration
How attackers use AI agents to steal sensitive data through tool calls, network requests, and output manipulation. Prevention strategies for production agents.
Protect your agents from PII exposure
Add Rune to your agent in under 5 minutes. Scans every input and output for PII exposure and 6 other threat categories.