LLM configuration options let you fine-tune how your agent processes and responds to conversations. Different settings can significantly impact behavior, reliability, and cost.
Not all options are available for every model. Check your dashboard for model-specific capabilities.

Temperature

Temperature controls the randomness and creativity of your agent’s responses. It is a value between 0 and 1 that affects how the model selects its next words.

| Range | Behavior | Best for |
| --- | --- | --- |
| 0.0 – 0.3 | Highly consistent, deterministic | Function calling, data collection, technical support |
| 0.4 – 0.7 | Balanced consistency and variation | General customer service, sales calls |
| 0.8 – 1.0 | Creative, varied responses | Creative brainstorming, casual conversation |

Recommendations by use case:

| Use case | Recommended temperature |
| --- | --- |
| Appointment booking | 0.1 – 0.3 |
| Customer support | 0.3 – 0.5 |
| Sales outreach | 0.5 – 0.7 |
| Virtual companion | 0.7 – 0.9 |
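
As a rough illustration, the recommendations above can be encoded as a lookup that picks a starting temperature for a given use case. This is only a sketch: the dictionary keys and the function names `clamp_temperature` and `suggested_temperature` are hypothetical and are not part of any platform API. Choosing the midpoint of each range is an assumption made for the example, not a documented rule.

```python
# Recommended ranges copied from the use-case recommendations above.
# The keys and helper names are hypothetical, chosen for this example only.
RECOMMENDED_TEMPERATURE = {
    "appointment_booking": (0.1, 0.3),
    "customer_support": (0.3, 0.5),
    "sales_outreach": (0.5, 0.7),
    "virtual_companion": (0.7, 0.9),
}


def clamp_temperature(value: float) -> float:
    """Keep a temperature inside the documented 0 to 1 range."""
    return max(0.0, min(1.0, value))


def suggested_temperature(use_case: str) -> float:
    """Return the midpoint of the recommended range (an assumed starting point)."""
    low, high = RECOMMENDED_TEMPERATURE[use_case]
    return clamp_temperature(round((low + high) / 2, 2))


if __name__ == "__main__":
    for name in RECOMMENDED_TEMPERATURE:
        print(name, suggested_temperature(name))
```

Starting from the midpoint and adjusting after listening to real calls is usually easier than guessing an exact value up front.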

Structured Output

Structured Output ensures LLM responses strictly follow predefined schemas, which is particularly important for reliable function calling. When enabled, the model is constrained to output only valid function calls with all required parameters.
Benefits:
  • Eliminates missing or malformed function arguments
  • Prevents invalid function calls from being attempted
  • Ensures all outputs match expected schemas
Trade-offs:
  • Schema caching may delay agent configuration saves
  • Model cannot deviate from defined structures
  • First load after changes may be slower
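
To make "missing or malformed function arguments" concrete, the sketch below shows the kind of check an application might otherwise run on each function call when Structured Output is disabled. It does not show how the platform enforces schemas internally. The schema layout, the `BOOK_APPOINTMENT_SCHEMA` example, and the `validate_arguments` helper are all hypothetical.

```python
from typing import Any

# Hypothetical function schema: parameter name -> (expected type, required?).
BOOK_APPOINTMENT_SCHEMA: dict[str, tuple[type, bool]] = {
    "customer_name": (str, True),
    "date": (str, True),
    "party_size": (int, False),
}


def validate_arguments(
    schema: dict[str, tuple[type, bool]], args: dict[str, Any]
) -> list[str]:
    """Return a list of problems; an empty list means the call matches the schema."""
    problems = []
    for name, (expected_type, required) in schema.items():
        if name not in args:
            if required:
                problems.append(f"missing required argument: {name}")
        elif not isinstance(args[name], expected_type):
            problems.append(
                f"wrong type for {name}: expected {expected_type.__name__}"
            )
    # Arguments the schema does not define are also treated as invalid.
    for name in args:
        if name not in schema:
            problems.append(f"unexpected argument: {name}")
    return problems


if __name__ == "__main__":
    bad_call = {"customer_name": "Ada", "party_size": "two"}
    print(validate_arguments(BOOK_APPOINTMENT_SCHEMA, bad_call))
```

With Structured Output enabled, calls like `bad_call` should not be produced in the first place, so a check of this kind becomes a safety net and not the primary defense.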

Enable when

  • Production agents with critical function calls
  • Agents handling financial or medical data
  • Integration with strict API requirements

Consider disabling when

  • Development and testing phases
  • Agents with simple or flexible function needs
  • Projects where rapid iteration matters more than reliability

Fast Tier

Fast Tier routes LLM calls through dedicated, high-priority infrastructure for faster and more consistent responses. It eliminates the latency variability you might experience with standard routing.
Key benefits:
  • Consistent, predictable response times on every call
  • Priority access to compute resources
  • Minimal fluctuation in processing speeds
  • Smoother, more natural conversations
Fast Tier is priced at 1.5× the standard rate for your selected model. Calculate the ROI for your use case before enabling.
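
The cost side of that ROI calculation is simple arithmetic and can be sketched as follows. Only the 1.5× multiplier comes from the pricing note above. The per-minute billing unit, the example rate, and the function names `llm_cost` and `fast_tier_premium` are assumptions for illustration, so substitute the actual rate and unit shown in your dashboard.

```python
FAST_TIER_MULTIPLIER = 1.5  # from the pricing note: 1.5x the standard rate


def llm_cost(standard_rate: float, units: float, fast_tier: bool = False) -> float:
    """Estimate LLM cost for a number of billed units (e.g. minutes, assumed)."""
    multiplier = FAST_TIER_MULTIPLIER if fast_tier else 1.0
    return standard_rate * units * multiplier


def fast_tier_premium(standard_rate: float, units: float) -> float:
    """Extra spend incurred by enabling Fast Tier for the same usage."""
    return llm_cost(standard_rate, units, fast_tier=True) - llm_cost(
        standard_rate, units
    )


if __name__ == "__main__":
    rate = 0.05   # hypothetical standard rate per billed unit
    usage = 1000  # hypothetical monthly usage in the same unit
    print(f"standard:  {llm_cost(rate, usage):.2f}")
    print(f"fast tier: {llm_cost(rate, usage, fast_tier=True):.2f}")
    print(f"premium:   {fast_tier_premium(rate, usage):.2f}")
```

Comparing the premium against the value of the calls it covers (for example, conversions on sales calls) gives a first-order answer on whether Fast Tier pays for itself.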

Ideal for

  • High-value customer interactions
  • Time-sensitive operations (emergency services, urgent support)
  • Premium service tiers
  • Demos and sales calls

May not be necessary for

  • Internal testing
  • Low-volume or non-critical calls
  • Cost-sensitive applications