Skip to main content

Guardrails

Guardrails configure input and output guard settings to ensure a safe and controlled AI chat experience. There are two types of guards:

  • Input Guard – Controls and validates user inputs.
  • Output Guard – Regulates the agent's responses.
ButtonsDescription
ResetRestore the original condition
Save ChangesSave the guardrails changes

Table: Buttons in Guardrail Settings

guardrail_interface

Figure: Guardrail Interface

Input Guard

Input GuardDescription
Jailbreak DetectionDetects and blocks attempts to bypass or manipulate the agent's safety mechanisms
PII DetectionIdentifies any Personally Identifiable Information (PII) in the user input
Secret DetectionDetects sensitive data like credentials, API keys, or custom-defined secrets in the input
Table: Input Guard Settings

all_input_guards_are_enabled_in_guardrail_settings

Figure: All Input Guards Enabled in Guardrail Settings

Output Guard

Output GuardDescription
PII DetectionDetects and removes any Personally Identifiable Information from the agent's response
Secrets DetectionIdentifies and blocks responses containing confidential or custom-defined sensitive information
Competitor MentionPrevents the agent from mentioning competitor names or related information
ProfanityDetects and filters out offensive or vulgar language
ToxicityFlags and blocks toxic, harmful, or abusive content in the agent's response
Table: Output Guard Settings

all_output_guards_are_enabled_in_guardrail_settings

Figure: All Output Guards Enabled in Guardrail Settings