Announcing our Series C with $110M in total funding. Read more →.

Back

on demand

on demand

Building Production Audio AI with Agents, Automated Transcription & Diarization

Thu, Feb 12, 05:00 PM - 05:45 PM UTC

Building Production Audio AI

Learn how teams building speech, voice, and conversational AI systems design scalable pipelines for audio annotation, transcription, and model training.


This webinar dives into the workflows that turn raw audio into high-quality training data and how to use audio agents to accelerate every step.

You will learn how to:

  • Design end-to-end workflows for speech-to-text and audio understanding, including waveform-based labeling and multi-speaker diarization.
  • Deploy audio agents that assist or automate annotation across large audio datasets, such as automated transcription & pre-labeling.
  • Implement validation workflows, and accuracy checks to ensure high-fidelity training data for production speech and voice models.
  • Use automated transcription alongside waveform visualization to quickly validate and refine annotations.

Register to participate in the session live or to get a recording.

Speakers

Diarmuid McGonagle
Diarmuid McGonagle
Lead Customer Support Engineer
Encord
Merric de Launey
Merric de Launey
Software Engineer
Encord

Other webinars that you may like