Guardrails
Guardrails are safety policies applied at the gateway level to protect against data leakage, injection attacks, and inappropriate content.
Available guardrails
DLP (Data Loss Prevention)
Detects and blocks sensitive data in prompts before they reach the AI.
Detects:
- API keys and secrets (AWS, GCP, Stripe, etc.)
- Database connection strings
- Private keys and certificates
- Custom patterns (regex-based)
Action: blocks the request and shows which patterns matched.
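A minimal sketch of the regex-based matching the DLP guardrail describes. The pattern names and expressions here are illustrative assumptions, not the gateway's actual rule set; a production DLP layer ships far broader, tuned patterns.

```python
import re

# Illustrative patterns only; real DLP rule sets are broader and tuned.
DLP_PATTERNS = {
    "aws_access_key": re.compile(r"\bAKIA[0-9A-Z]{16}\b"),
    "stripe_secret_key": re.compile(r"\bsk_live_[0-9a-zA-Z]{24,}\b"),
    "private_key_block": re.compile(r"-----BEGIN (?:RSA |EC )?PRIVATE KEY-----"),
    "postgres_conn_string": re.compile(r"postgres(?:ql)?://\S+:\S+@\S+"),
}

def scan_prompt(prompt: str) -> list[str]:
    """Return the names of every DLP pattern that matches the prompt."""
    return [name for name, pattern in DLP_PATTERNS.items() if pattern.search(prompt)]
```

A non-empty result means the request is blocked and the matched pattern names are surfaced to the developer, as described above.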
PII Detection
Identifies personally identifiable information in prompts.
Detects:
- Email addresses
- Phone numbers
- Social Security numbers
- Credit card numbers
- Custom PII patterns
Action: configurable — block, warn, or redact.
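The "redact" action can be sketched as a typed substitution pass. These patterns are simplified assumptions (for example, real credit card detection usually adds a Luhn checksum to cut false positives):

```python
import re

# Simplified illustration; production PII detectors validate matches
# (e.g. Luhn check for card numbers) rather than trusting raw regexes.
PII_PATTERNS = {
    "email": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.-]+\b"),
    "ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "credit_card": re.compile(r"\b(?:\d{4}[ -]?){3}\d{4}\b"),
}

def redact_pii(prompt: str) -> str:
    """Replace each PII match with a typed placeholder (the 'redact' action)."""
    for name, pattern in PII_PATTERNS.items():
        prompt = pattern.sub(f"[REDACTED:{name}]", prompt)
    return prompt
```

The "block" and "warn" actions would use the same matching step but reject the request or attach a warning instead of rewriting it.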
Prompt Injection Guard
Detects attempts to override system prompts or inject malicious instructions.
Detects:
- Common injection patterns (“ignore previous instructions”)
- Role-playing attacks (“you are now…”)
- Encoding-based evasion attempts
Action: blocks the request.
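A hedged sketch of the three detection classes above. The patterns are illustrative, and the base64 pass stands in for just one of the encoding-based evasions a real guard would normalize away:

```python
import base64
import re

# Assumed example patterns for the injection classes named above.
INJECTION_PATTERNS = [
    re.compile(r"ignore (?:all |the )?previous instructions", re.IGNORECASE),
    re.compile(r"\byou are now\b", re.IGNORECASE),
    re.compile(r"disregard (?:your|the) system prompt", re.IGNORECASE),
]

def is_injection(prompt: str) -> bool:
    """True if the prompt, raw or base64-decoded, matches an injection pattern."""
    candidates = [prompt]
    try:
        # One simple evasion check: a prompt that is itself valid base64
        # may decode to an injection string.
        candidates.append(base64.b64decode(prompt, validate=True).decode("utf-8", "ignore"))
    except Exception:
        pass
    return any(p.search(c) for p in INJECTION_PATTERNS for c in candidates)
```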
Content Filter
Filters requests and responses for inappropriate or off-topic content.
Detects:
- Content outside the coding domain
- Harmful or offensive requests
- Policy-violating responses
Action: configurable — block or flag for review.
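The block-or-flag decision can be illustrated with a toy topical check. This keyword list is purely an assumption for the sketch; content filters in practice typically rely on a classifier model rather than keyword matching:

```python
# Toy stand-in for a topical classifier; real filters use a model, not keywords.
CODING_HINTS = {"function", "class", "bug", "compile", "refactor", "test", "api"}

def filter_request(prompt: str, mode: str = "flag") -> str:
    """Return 'allow' for on-topic prompts, else 'block' or 'flag' per config."""
    on_topic = any(word in prompt.lower() for word in CODING_HINTS)
    if on_topic:
        return "allow"
    return "block" if mode == "block" else "flag"
```

The `mode` parameter mirrors the configurable action above: "block" rejects the request outright, while "flag" lets it through but marks it for review.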
Configuration
Configure guardrails from the dashboard:
- Navigate to Guardrails
- Enable/disable individual guardrails
- Set action for each (block, warn, or log-only)
- Add custom patterns for DLP or PII
- Save
Audit log
All guardrail triggers are logged:
- Timestamp and developer
- Which guardrail fired
- The pattern or rule that matched
- Action taken (blocked, warned, logged)
- Request summary (truncated for privacy)
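The fields above suggest a record shape like the following. This is a hypothetical schema for illustration, not the gateway's actual storage format; the 80-character truncation limit is likewise an assumed value:

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone

@dataclass
class GuardrailEvent:
    """Hypothetical audit-log record mirroring the fields listed above."""
    developer: str
    guardrail: str        # e.g. "dlp", "pii", "prompt_injection", "content_filter"
    matched_rule: str     # the pattern or rule that fired
    action: str           # "blocked", "warned", or "logged"
    request_summary: str  # truncated for privacy (limit assumed here)
    timestamp: datetime = field(default_factory=lambda: datetime.now(timezone.utc))

    def __post_init__(self):
        # Truncate so full prompts never land in the audit log.
        self.request_summary = self.request_summary[:80]
```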