June 17, 2026 - Farid Fadaie

LLM-as-a-Judge for Voice Agents: Testing Non-Deterministic AI with Simulated Callers

Post author:Farid Fadaie
Post published:June 17, 2026
Post category:AI Engineering
Post comments:0 Comments

You cannot unit-test a conversation. The testing playbook for production voice agents: a four-layer test pyramid, simulated callers over real audio, LLM-as-a-judge scoring calibrated to design intent, the transcript-integrity trap, and the 2-of-3 flake rule.

Hi, I’m Farid Fadaie, cofounder and CEO of Viva AI. I live in the San Francisco Bay Area and build products at the intersection of AI, dental operations, and healthcare.

My path started in engineering and moved into product leadership, company building, and operating dental businesses firsthand. Before Viva AI, I helped build privacy and peer-to-peer products at BitTorrent, led engineering and product teams at scale, and founded or helped grow companies in dental technology, including 2Dental and Soothing Dental.

This site is where I write about AI, dentistry, product, engineering, privacy, and the practical lessons that come from building software for real operations.