on demand
Building Production Audio AI with Agents, Automated Transcription & Diarization
Thu, Feb 12, 05:00 PM - 05:45 PM UTC
Building Production Audio AI
Learn how teams building speech, voice, and conversational AI systems design scalable pipelines for audio annotation, transcription, and model training.
This webinar dives into the workflows that turn raw audio into high-quality training data and how to use audio agents to accelerate every step.
You will learn how to:
- Design end-to-end workflows for speech-to-text and audio understanding, including waveform-based labeling and multi-speaker diarization.
- Deploy audio agents that assist with or automate annotation across large audio datasets, for example through automated transcription and pre-labeling.
- Implement validation workflows and accuracy checks to ensure high-fidelity training data for production speech and voice models.
- Use automated transcription alongside waveform visualization to quickly validate and refine annotations.
Register to participate in the session live or to get a recording.
Speakers

Diarmuid McGonagle
Lead Customer Support Engineer
Encord

Merric de Launey
Software Engineer
Encord
