Testing Overview

Available Testing Methods

1. LLM Playground

Purpose: Interactive text-based testing for rapid iteration and debugging.Key features: Real-time conversation testing, function call visualization, variable inspection, prompt debugging.Best for: Initial development, prompt refinement, debugging specific conversation paths.

2. LLM Simulation Testing

Purpose: Automated testing with predefined scenarios for consistent quality assurance.Key features: Batch testing, success metrics and scoring, scenario templates, regression testing.Best for: Quality assurance, regression testing, validating changes before deployment.

3. Web Call / Phone Call Testing

Purpose: Real-world testing with actual voice interactions to validate audio performance.Key features: Voice quality and latency testing, interruption handling, background noise, DTMF and telephony features.Best for: Final validation, voice quality testing, production readiness checks.

Feature	LLM Playground	LLM Simulation	Web/Phone Call
Setup effort	Medium	Low	High
Test speed	Fast	Very fast	Real-time
Response accuracy	✅	✅	✅
Function calls	✅	✅	✅
Background noise	❌	❌	✅
Interruptions	❌	❌	✅
Batch testing	❌	✅	❌
Cost	Per message	Per message	Call charges

Feature

LLM Playground

LLM Simulation

Web/Phone Call

Setup effort

Medium

Low

High

Test speed

Fast

Very fast

Real-time

Response accuracy

✅

Function calls

✅

Background noise

❌

✅

Interruptions

❌

✅

Batch testing

❌

✅

❌

Cost

Per message

Call charges

Recommended Testing Workflow

Phase 1: Development Testing

Tool: LLM Playground

Iterate on prompts and conversation flows
Debug function calling logic
Test edge cases interactively
Validate dynamic variables

Phase 2: Quality Assurance

Tool: LLM Simulation Testing

Create comprehensive test scenarios
Run regression tests after changes
Validate success metrics
Ensure consistent performance

Phase 3: Production Validation

Tool: Web / Phone Call Testing

Test actual voice interactions
Verify audio quality and latency
Check telephony features
Validate real-world performance

Phase 4: Continuous Testing

Tool: Batch Testing (Simulation)

Set up automated test suites
Monitor agent performance over time
Catch regressions early
Maintain quality standards

Create a test checklist covering all critical paths before each deployment. Include both happy paths and edge cases.

​Available Testing Methods

1. LLM Playground

2. LLM Simulation Testing

3. Web Call / Phone Call Testing

​Testing Method Comparison

​Recommended Testing Workflow

Available Testing Methods

Testing Method Comparison

Recommended Testing Workflow