Voice AI Agents in Production: An Eval Harness Hackathon - #BOSTechWeek
Hosted by
ᵔ◡ᵔ
⚆◟⚆
Voice AI agents demo beautifully and fail in production. The gap between the two is closed by evaluation harnesses, and almost nobody teaches you how to build one. This hackathon brings AI practitioners and builders to Kendall Square for a day of building and breaking: teams will design eval harnesses for voice agents, stress-test them against the failure modes that actually matter in production (latency under load, barge-in and interruption handling, ASR errors, tool-call hallucinations, multi-turn state drift), and walk out with open-source code they can drop into their own stack. Expect a technical room of enterprise developers, and voice AI practitioners, with a short opening session from Rasa's co-founder and invited experts on what it actually takes to ship voice agents enterprises will trust. Prizes for the most rigorous harness, the most creative adversarial test suite, and the highest-scoring agent against a shared benchmark. Whether you ship voice agents daily or you're just trying to understand why they're so hard to productionize, you'll leave with working code, new collaborators, and a much clearer map of the stack. Open application. All skill levels welcome.
This event is a part of #BOSTechWeek—a week of events hosted by VCs and startups to bring together the tech ecosystem. Learn more at www.tech-week.com.
Guest List
0 on the list
⚆◟⚆
•ᴥ•
◉‿◉
Restricted Access
Must be on the list to view event activity & see list details