Guardrails are a built-in content moderation layer that checks agent responses and user messages for prohibited topics. When a guardrail triggers, the prohibited content is automatically replaced with a safe placeholder message, keeping the call going without interruption.

How Guardrails Work

Guardrails apply in two ways:
  • Output guardrails — check what the agent says. If the response contains a prohibited topic, it is replaced with a placeholder before being spoken.
  • Input guardrails — check what the user says. If the user’s message contains a prohibited topic, the agent responds with a placeholder instead of processing the request.
In both cases, the call continues normally after the placeholder is delivered. Guardrails do not end the call, transfer the call, or trigger any other action — they only replace the problematic message.
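
The replacement behavior described above can be sketched in a few lines of Python. This is only an illustration of the logic, not the platform's implementation: `apply_guardrail`, `toy_detector`, and the `PLACEHOLDER` text are hypothetical names introduced here, and the keyword match stands in for whatever classifier the platform actually uses.

```python
from typing import Callable, Set

# Hypothetical placeholder text; the real message is defined by the platform.
PLACEHOLDER = "Sorry, I can't help with that topic."


def apply_guardrail(
    message: str,
    enabled_topics: Set[str],
    detect_topics: Callable[[str], Set[str]],
    placeholder: str = PLACEHOLDER,
) -> str:
    """Return the placeholder if the message hits an enabled topic, else the message."""
    flagged = detect_topics(message) & enabled_topics
    return placeholder if flagged else message


def toy_detector(message: str) -> Set[str]:
    """Stand-in classifier: flags 'gambling' on a keyword match (illustration only)."""
    return {"gambling"} if "casino" in message.lower() else set()


if __name__ == "__main__":
    enabled = {"gambling", "violence"}
    # Flagged message: replaced by the placeholder, and the call would continue.
    print(apply_guardrail("Try the casino downtown.", enabled, toy_detector))
    # Clean message: passed through unchanged.
    print(apply_guardrail("Your appointment is at 3 pm.", enabled, toy_detector))
```

The same function covers both directions: for an output guardrail, `message` is the agent's response; for an input guardrail, it is the user's message and the placeholder is what the agent says in reply. Nothing else happens to the call in either case.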

Configuring Guardrails

Configure guardrails when creating or updating an agent via the dashboard or API. In the dashboard, guardrail settings are under Security & Fallback Settings.
Guardrails add approximately 50 ms of latency to calls.

Output Topics

These categories detect prohibited content in agent responses:
| Topic | Description |
| --- | --- |
| harassment | Harassing or abusive language |
| self_harm | Content related to self-harm |
| sexual_exploitation | Sexually exploitative content |
| violence | Violent content |
| defense_and_national_security | Defense and national security topics |
| illicit_and_harmful_activity | Illicit or harmful activities |
| gambling | Gambling-related content |
| regulated_professional_advice | Regulated professional advice (legal, medical, financial) |
| child_safety_and_exploitation | Child safety and exploitation content |

Input Topics

One input topic is available — it detects attempts to manipulate or jailbreak the agent:
| Topic | Description |
| --- | --- |
| platform_integrity_jailbreaking | Attempts to jailbreak or manipulate the agent |
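
When configuring guardrails through the API, it can help to validate topic names on the client before sending them. The sketch below checks a selection against the topic identifiers listed on this page and builds a small configuration dictionary. The field names `output_topics` and `input_topics` and the helper `build_guardrail_config` are assumptions made for illustration; check the API reference for the actual request schema.

```python
import json
from typing import Dict, Iterable, List

# Topic identifiers as listed in the tables on this page.
OUTPUT_TOPICS = frozenset({
    "harassment",
    "self_harm",
    "sexual_exploitation",
    "violence",
    "defense_and_national_security",
    "illicit_and_harmful_activity",
    "gambling",
    "regulated_professional_advice",
    "child_safety_and_exploitation",
})
INPUT_TOPICS = frozenset({"platform_integrity_jailbreaking"})


def build_guardrail_config(
    output_topics: Iterable[str],
    input_topics: Iterable[str] = (),
) -> Dict[str, List[str]]:
    """Validate topic names and return a config dict (field names are hypothetical)."""
    out, inp = list(output_topics), list(input_topics)
    unknown = [t for t in out if t not in OUTPUT_TOPICS]
    unknown += [t for t in inp if t not in INPUT_TOPICS]
    if unknown:
        raise ValueError(f"Unknown guardrail topics: {unknown}")
    # De-duplicate and sort so the payload is stable across runs.
    return {
        "output_topics": sorted(set(out)),
        "input_topics": sorted(set(inp)),
    }


if __name__ == "__main__":
    config = build_guardrail_config(
        output_topics=["gambling", "regulated_professional_advice"],
        input_topics=["platform_integrity_jailbreaking"],
    )
    print(json.dumps(config, indent=2))
```

Rejecting unknown names locally catches typos such as `self-harm` (hyphen instead of underscore) or an output topic passed as an input topic before the request is sent.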