Layercode
Back to blog
PodcastJanuary 6, 20263 min read

Why voice AI agents feel weird in production

Why do voice AI agents still feel 'off' to users, even when the tech works? Aidan and Jack break down the real production challenges that voice AI builders face.

Aidan Hornsby
Aidan Hornsby
@aidanhornsby
Jack Bridger
Jack Bridger
@jacksbridger

Why do voice AI agents still feel "off" to users, even when the tech works?

Aidan Hornsby and Jack Bridger break down the real production challenges that voice AI builders face: from subtle issues that break user trust to deep technical problems that don't surface until you're serving real calls.

This episode covers what we've learned talking to dozens of teams building voice AI systems in production.


What We Cover

Naturalness & Trust

  • Why mispronouncing one word in 100 makes your agent feel broken—especially names and numbers
  • The difference between voice and text agents: problems that don't exist in chat
  • Designing the right "vibe"—why Domino's intentionally made their voice AI "dead inside"
  • ChatGPT Voice quirks and why overly enthusiastic agents backfire

The Technical Reality

  • Stacked non-determinism: how STT → LLM → TTS errors compound across turns
  • Why voice calls are "one-shot"—users hang up, they don't reroll like with text
  • Background noise and "presence": why silence makes users think the call dropped
  • Browser audio gotchas (mobile Safari mic latency)
  • Testing voice systems: why traditional QA breaks down

Production & B2B Deployment

  • 60+ day onboarding cycles and why self-serve doesn't work yet
  • CRM integration nightmares in vertical markets
  • Why business owners are scared to deploy voice AI during business hours
  • Multi-agent handoffs, tool latency, and keeping conversations flowing

Latency & UX Patterns

  • Turn-taking tuning vs. perceived naturalness
  • Filler sounds, clicks, and other latency-masking techniques
  • Fast/slow agent architectures for background processing
  • How to give feedback in voice when there's no visual UI

Timestamps

  • — Intro: Voice AI Happy Hour
  • — Naturalness: mispronunciations, names & voice vs text
  • — Real-world calls: unexpected user behavior & production QA
  • — Testing voice systems: STT → LLM → TTS non-determinism
  • — One-shot UX: why users hang up and LLMs are "slot machines"
  • — Background noise, silence & presence cues
  • — Designing the right vibe: Domino's "dead inside" + ChatGPT voice
  • — B2B adoption: onboarding, integrations & CRM challenges
  • — Building complex agents: handoffs, workflows & tool latency
  • — Latency patterns: turn-taking + UX feedback techniques
  • — Browser audio gotchas: mobile Safari mic
  • — Demos vs reality (Sesame, Her) + wrap-up

Resources

Mentioned in This Episode


Have a topic you'd like us to cover? Reach out on X @uselayercode or email us at podcast@layercode.com

Related posts