Guardrails are rules that control what your agent can and cannot discuss. They're your safety net. blocking harmful topics, redirecting off-topic conversations, and ensuring sensitive issues get human attention.
Find guardrails in your agent's Rules tab. Click + Add Restriction to create a new rule.
Restriction Fields
Choose how the system detects when a restriction should trigger.
Best for: Simple word blocklists, common phrases
Best for: Complex patterns, email/phone detection, multi-word variations
Best for: Broad topics, intent detection, sophisticated evasion attempts
What happens when a restriction matches. Choose based on severity.
Use for: Competitor names, explicit content, legal red lines
Use for: Off-topic questions, topics handled by other teams
Use for: Analytics, A/B testing restrictions, soft monitoring
Use for: Complaints, legal issues, high-value sales inquiries
When a message matches multiple restrictions, only the highest priority (lowest number) applies. Priority 1 is checked first, then 2, 3, etc.
Keywords:
competitor1, competitor2, alternative toResponse: "I can only discuss our products. Visit our comparison page for details."
Concept: "User is requesting a refund or wants their money back"
Response: "I'll connect you with our billing team who can help with refund requests. Please hold."
Keywords:
hiring, job, career, resume, applyResponse: "For career opportunities, please visit our careers page at company.com/careers"
Pattern:
\b(ssn|social security|credit card|password)\bResponse: "I cannot request or store personal information like passwords or financial details."
Use the Chat Test tab to verify your guardrails work correctly.
Testing Checklist