LLM configuration options let you fine-tune how your agent processes and responds to conversations. Different settings can significantly impact behavior, reliability, and cost.Documentation Index
Fetch the complete documentation index at: https://documentation.uponai.com/llms.txt
Use this file to discover all available pages before exploring further.
Not all options are available for every model. Check your dashboard for model-specific capabilities.
Temperature
Temperature controls the randomness and creativity of your agent’s responses — a value between 0 and 1 that affects how the model selects its next words.| Range | Behavior | Best for |
|---|---|---|
| 0.0 – 0.3 | Highly consistent, deterministic | Function calling, data collection, technical support |
| 0.4 – 0.7 | Balanced consistency and variation | General customer service, sales calls |
| 0.8 – 1.0 | Creative, varied responses | Creative brainstorming, casual conversation |
| Use case | Recommended temperature |
|---|---|
| Appointment booking | 0.1 – 0.3 |
| Customer support | 0.3 – 0.5 |
| Sales outreach | 0.5 – 0.7 |
| Virtual companion | 0.7 – 0.9 |
Structured Output
Structured Output ensures LLM responses strictly follow predefined schemas — particularly important for reliable function calling. When enabled, the model is constrained to output only valid function calls with all required parameters. Benefits:- Eliminates missing or malformed function arguments
- Prevents invalid function calls from being attempted
- Ensures all outputs match expected schemas
- Schema caching may delay agent configuration saves
- Model cannot deviate from defined structures
- First load after changes may be slower
Enable when
- Production agents with critical function calls
- Agents handling financial or medical data
- Integration with strict API requirements
Consider disabling when
- Development and testing phases
- Agents with simple or flexible function needs
- Rapid iteration is more important than reliability
Fast Tier
Fast Tier routes LLM calls through dedicated, high-priority infrastructure for superior performance and consistency. It eliminates latency variability you might experience with standard routing. Key benefits:- Consistent, predictable response times on every call
- Priority access to compute resources
- Minimal fluctuation in processing speeds
- Smoother, more natural conversations
Ideal for
- High-value customer interactions
- Time-sensitive operations (emergency services, urgent support)
- Premium service tiers
- Demos and sales calls
May not be necessary for
- Internal testing
- Low-volume or non-critical calls
- Cost-sensitive applications