AI Guard

AI Guard by OneTrust is a comprehensive classification tool for protecting critical AI systems. It acts as a sophisticated inspection layer between your users and AI tools, ensuring every interaction is safe, secure, and compliant.

What Is AI Guard?

AI Guard provides runtime classification capabilities for generative AI applications. It automatically detects and masks PII, secrets, proprietary code, and moreβ€”with support for 300+ classification patterns out of the box and the ability to define custom profiles.

AI Guard helps businesses develop AI applications with confidence, especially during the build phase. It identifies PII risk in AI payloads early in development by providing sophisticated PII identification and data classification. By using AI Guard, businesses can test AI agents and applications for PII generation and quantify PII risk before taking them to production.

πŸ“˜

Current Release Scope

As of this release, AI Guard is optimized for development and testing workloads. It is not recommended for large classification volumes generally seen in externally facing AI applications or agents.

Key Capabilities

CapabilityDescription
Real-Time ClassificationClassify user prompts and AI responses during inference with 300+ built-in classifiers
RedactionAutomatically mask sensitive data (PII, credentials, phone numbers, addresses) based on configurable policies
BlockingBlock entire messages when critical sensitive data is detected
Streaming SupportProcess LLM streaming responses with concurrent chunked classification and inline redaction
ObservabilityStream classification metrics and events to OneTrust AI Governance for compliance monitoring
AI Application InventoryRegister AI applications to OneTrust AI Governance for centralized governance

How It Works

AI Guard is an SDK-based solution, making it suitable for almost all environments with a Python runtime. Developers import the OneTrust AI Guard SDK into their applications and leverage its classification, redaction, and streaming functionalities.

Architecture

The system consists of four layers:

  1. SDK Layer β€” Python AI Guard Client that integrates with your GenAI agent runtime
  2. Service Layer β€” AI Guard classification service (deployed on the OneTrust Light Worker Node) that processes classification requests
  3. Backend Layer β€” OneTrust Data Discovery Classification System providing the classification engine and 300+ system classifiers
  4. Observability Layer β€” Metrics streamed from AI Guard to OneTrust AI Governance for monitoring and compliance

Call Flow

Your AI App (SDK) β†’ AI Guard Service β†’ Classification Engine β†’ Results β†’ SDK (Redact/Block/Pass)
  1. Your GenAI agent invokes the AI Guard SDK during inference
  2. The SDK sends a classification request to the AI Guard service
  3. AI Guard classifies the text using cached profiles and classifiers
  4. Results are returned to the SDK for real-time decision-making (redact, block, or pass through)
  5. Classification events are published to AI Governance via the metrics pipeline
πŸ“˜

Important

The OneTrust AI Governance Cloud acts as a control plane to define classification behavior. It does not collect prompts or responses. Summary classification metrics are sent from the Light Worker Node to AI Governance at regular intervals.

Primary Use Cases

AI Application Inventory

Import OneTrust AI Guard as an SDK within your applications and declare AI application metadata, thereby registering the application to OneTrust AI Governance. This allows businesses to centralize all AI applications under development and ensure governance actions can be extended to them.

Sensitive Data Protection

Automatically detect and classify PII, secrets, proprietary code, and more in both user prompts and AI-generated responses. Apply configurable policies to redact or block sensitive content before it reaches end users.

Supported Platforms

The SDK supports the following AI platforms for metrics attribution:

PlatformIdentifier
Amazon BedrockAMAZON_BEDROCK
Amazon SageMakerAMAZON_SAGEMAKER
Azure AI FoundryAZURE_FOUNDRY
DatabricksDATABRICKS
Google Cloud Vertex AIGCP_VERTEX

What's Next?

Get started with AI Guard by following the setup guides:


What’s Next

Get started with AI Guard by following the setup guides: