Configure LLM Options

LLM configuration options let you fine-tune how your agent processes and responds to conversations. Different settings can significantly impact behavior, reliability, and cost.

Not all options are available for every model. Check your dashboard for model-specific capabilities.

Temperature

Temperature controls the randomness and creativity of your agent’s responses — a value between 0 and 1 that affects how the model selects its next words.

Range	Behavior	Best for
0.0 – 0.3	Highly consistent, deterministic	Function calling, data collection, technical support
0.4 – 0.7	Balanced consistency and variation	General customer service, sales calls
0.8 – 1.0	Creative, varied responses	Creative brainstorming, casual conversation

Recommendations by use case:

Use case	Recommended temperature
Appointment booking	0.1 – 0.3
Customer support	0.3 – 0.5
Sales outreach	0.5 – 0.7
Virtual companion	0.7 – 0.9

Structured Output

Structured Output ensures LLM responses strictly follow predefined schemas — particularly important for reliable function calling. When enabled, the model is constrained to output only valid function calls with all required parameters. Benefits:

Eliminates missing or malformed function arguments
Prevents invalid function calls from being attempted
Ensures all outputs match expected schemas

Trade-offs:

Schema caching may delay agent configuration saves
Model cannot deviate from defined structures
First load after changes may be slower

Enable when

Production agents with critical function calls
Agents handling financial or medical data
Integration with strict API requirements

Consider disabling when

Development and testing phases
Agents with simple or flexible function needs
Rapid iteration is more important than reliability

Fast Tier

Fast Tier routes LLM calls through dedicated, high-priority infrastructure for superior performance and consistency. It eliminates latency variability you might experience with standard routing. Key benefits:

Consistent, predictable response times on every call
Priority access to compute resources
Minimal fluctuation in processing speeds
Smoother, more natural conversations

Fast Tier is priced at 1.5× the standard rate for your selected model. Calculate the ROI for your use case before enabling.

Ideal for

High-value customer interactions
Time-sensitive operations (emergency services, urgent support)
Premium service tiers
Demos and sales calls

May not be necessary for

Internal testing
Low-volume or non-critical calls
Cost-sensitive applications

Prompt Guide and Examples for Specific Situations Choose a Custom Voice

​Temperature

​Structured Output

Enable when

Consider disabling when

​Fast Tier

Ideal for

May not be necessary for

Temperature

Structured Output

Fast Tier