Guardrails Validator: Tonic Textual PII

PII detection and redaction validator for Guardrails AI, powered by Tonic Textual.

Uses transformer-based NER supporting 46+ entity types across 50+ languages.

Installation

pip install guardrails-tonic-textual

Or via Guardrails Hub:

guardrails hub install hub://tonic/textual_pii

Setup

Set the TONIC_TEXTUAL_API_KEY environment variable with your API key. See Creating and revoking Textual API keys for setup instructions.

export TONIC_TEXTUAL_API_KEY="your-key"

Quick Start

from guardrails import Guard
from guardrails.hub import TextualPII
# or: from validator import TextualPII

guard = Guard().use(TextualPII(on_fail="fix"))

result = guard.validate("My SSN is 123-45-6789, please help me file my taxes")
print(result.validated_output)
# "My SSN is [US_SSN_...], please help me file my taxes"

This works in both directions — scrub user input before it reaches the LLM, or scrub LLM output before it reaches the user.

Raise on PII instead of redacting:

guard = Guard().use(TextualPII(on_fail="exception"))
guard.validate("My SSN is 123-45-6789")  # raises ValidationError

Filter to specific entity types:

guard = Guard().use(
    TextualPII(entities=["US_SSN", "CREDIT_CARD", "EMAIL_ADDRESS"], on_fail="fix")
)

Wrap an LLM call:

result = guard(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Tell me about John Smith"}],
)
print(result.validated_output)  # PII auto-redacted

Configuration

PII Handling Modes

Control how detected PII is replaced in the fix_value via generator_default and per-entity generator_config. See the Tonic Textual entity type handling docs for details.

guard = Guard().use(
    TextualPII(
        on_fail="fix",
        generator_default="Synthesis",   # Replace with realistic fakes
        generator_config={
            "NAME_GIVEN": "Synthesis",    # Fake names
            "EMAIL_ADDRESS": "Redaction", # [EMAIL_ADDRESS] labels
            "PHONE_NUMBER": "Off",        # Leave unchanged
        },
    ),
)

Mode	Behavior
`Off`	PII is detected but left unchanged in the fix value
`Redaction`	PII is replaced with entity type labels (e.g., `[NAME_GIVEN]`)
`Synthesis`	PII is replaced with realistic synthetic values
`GroupingSynthesis`	Groups related entities and generates new names via LLM
`ReplacementSynthesis`	Redacts first, then uses LLM to generate contextual replacements

Allow Lists and Block Lists

Use regex patterns to force-tag or exclude specific values per entity type:

guard = Guard().use(
    TextualPII(
        on_fail="fix",
        # Force-tag text matching these regexes as the given entity type
        label_allow_lists={
            "ORGANIZATION": ["Acme Corp", "Initech"],
            "PHONE_NUMBER": [r"\+1\s?\(\d{3}\)\s?\d{3}-\d{4}"],
        },
        # Exclude values matching these regexes from detection
        label_block_lists={
            "NAME_FAMILY": [r"^Smith$"],
        },
    ),
)

Custom Entities

Include custom entity types defined in the Tonic Textual UI:

guard = Guard().use(
    TextualPII(
        on_fail="fix",
        custom_entities=["CUSTOM_INTERNAL_ID", "CUSTOM_ACCOUNT_NUMBER"],
    ),
)

Reproducible Output

Use random_seed for deterministic synthesis/tokenization across calls:

guard = Guard().use(
    TextualPII(
        on_fail="fix",
        generator_default="Synthesis",
        random_seed=42,
    ),
)

Self-Hosted Deployments

For self-hosted Textual instances, provide your deployment URL:

guard = Guard().use(
    TextualPII(
        base_url="https://textual.your-company.com",
        on_fail="fix",
    ),
)

How It Works

Unlike tool-based integrations (where an LLM actively calls a redaction tool), this validator operates as a passive PII firewall. It can be applied in two directions:

Input filtering (scrub user messages before they reach the LLM):

The user submits a message that may contain PII
guard.validate(user_input) scrubs PII from the message
The scrubbed text is sent to the LLM -- PII never leaves your perimeter

Output filtering (scrub LLM responses before they reach the user):

The LLM generates a response normally
The Guard intercepts the response before it reaches the user
Tonic Textual scans for PII and returns entity positions
The Guard applies the configured on_fail action

The on_fail strategies control what happens when PII is detected:

"fix": Replaces the text with the redacted version (using fix_value)
"exception": Raises a ValidationError blocking the text entirely
"noop": Logs the PII detection but passes through unchanged
"reask": Re-prompts the LLM asking it to remove the PII

API Reference

`TextualPII`

Parameter	Type	Default	Description
`entities`	`list[str] \| None`	`None`	PII types to detect (all if None)
`api_key`	`str \| None`	`None`	API key (falls back to env var)
`base_url`	`str \| None`	`None`	Self-hosted deployment URL
`generator_default`	`str \| None`	`None`	Default handling mode (`Off`, `Redaction`, `Synthesis`, `GroupingSynthesis`, `ReplacementSynthesis`)
`generator_config`	`dict[str, str] \| None`	`None`	Per-entity mode overrides
`label_allow_lists`	`dict[str, list[str]] \| None`	`None`	Per-entity regex patterns to force-tag as that entity type
`label_block_lists`	`dict[str, list[str]] \| None`	`None`	Per-entity regex patterns to exclude from detection
`custom_entities`	`list[str] \| None`	`None`	Custom entity types to include (defined in Textual UI)
`random_seed`	`int \| None`	`None`	Seed for reproducible synthesis/tokenization
`on_fail`	`str \| callable \| None`	`None`	Failure action

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
blogs		blogs
tests		tests
validator		validator
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Guardrails Validator: Tonic Textual PII

Installation

Setup

Quick Start

Configuration

PII Handling Modes

Allow Lists and Block Lists

Custom Entities

Reproducible Output

Self-Hosted Deployments

How It Works

API Reference

`TextualPII`

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Guardrails Validator: Tonic Textual PII

Installation

Setup

Quick Start

Configuration

PII Handling Modes

Allow Lists and Block Lists

Custom Entities

Reproducible Output

Self-Hosted Deployments

How It Works

API Reference

TextualPII

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`TextualPII`

Packages