Announcing our Series C with $110M in total funding. Read more →.

Back

on demand

on demand

Building Production Audio AI with Agents, Automated Transcription & Diarization

Thu, Feb 12, 05:00 PM - 05:45 PM UTC

Building Production Audio AI

Learn how teams building speech, voice, and conversational AI systems design scalable pipelines for audio annotation, transcription, and model training.

This webinar dives into the workflows that turn raw audio into high-quality training data and how to use audio agents to accelerate every step.

You will learn how to:

Design end-to-end workflows for speech-to-text and audio understanding, including waveform-based labeling and multi-speaker diarization.
Deploy audio agents that assist or automate annotation across large audio datasets, such as automated transcription & pre-labeling.
Implement validation workflows, and accuracy checks to ensure high-fidelity training data for production speech and voice models.
Use automated transcription alongside waveform visualization to quickly validate and refine annotations.

Register to participate in the session live or to get a recording.

Speakers

Diarmuid McGonagle

Diarmuid McGonagle

Lead Customer Support Engineer

Encord

Merric de Launey

Merric de Launey

Software Engineer

Encord

Other webinars that you may like

video-thumbnail

Will World Models Eat Physical AI?

30 m

video-thumbnail

Deploy AI Agents in Minutes (Live Session)

30 m

video-thumbnail

How to De-Risk Model Performance in High Stakes Deployment

30 m

video-thumbnail

Building Robust 3D Data Pipelines: From Manual Cuboids to Scalable Workflows

45 m

video-thumbnail

Accelerate VLA Segmentation for Robotics with SAM 3

45 m

video-thumbnail

Accelerating Robotics Data Annotation: How to Build an Editor Agent with GPT-4o

45 m

video-thumbnail

SAM 3: Prompt once, segment faster with new annotation workflows in Encord

45 m

video-thumbnail

The 2026 Annotation Analytics Masterclass: How to Measure Efficiency & Quality

45 m

video-thumbnail

Multimodal AI: The World’s Largest Dataset of Images, Videos, Text, Audio and Point Clouds

45 m

video-thumbnail

From Raw LiDAR to Reliable Driving: How to Build Scalable 3D Data Workflows

45 m

video-thumbnail

Don’t Let Edge Cases Break Your Model: How to Implement Smarter Evaluation for CV Data

45 m

video-thumbnail

The Future of Smart Cities: Inside vialytics' AI Data Stack

45 m

video-thumbnail

Outside the Bounding Box: LiDAR Annotation for 3D Precision

20 m

video-thumbnail

Build vs. Buy: AI Infra Options for Production-Ready AI

45 m

video-thumbnail

Precision at Scale: Reimagining Generative AI Evaluation for Real-World Impact

45 m

video-thumbnail

Build Smarter VLMs, Faster: How to Bootstrap With Existing ML Solutions

45 m

video-thumbnail

DeepSeek R1: How it works, what it means, and what comes next?

45 m

video-thumbnail

How To Speedrun Your AI Data Pipeline With R1, Grok 3, Claude 3.7 and o3

45 m

video-thumbnail

Building Physical AI: How to Enable Multimodal Reasoning at Scale

45 m

video-thumbnail

Streamline computer vision data curation and labeling for Physical AI

45 m

video-thumbnail

World Models for Physical AI: How Intelligent Systems Learn and Adapt in the Real World

45 m

video-thumbnail

Garbage In Garbage Out: Poorly Curated Data is Killing Your Models

60 m

video-thumbnail

SAM 2 for Video: How to Fine-tune On Your Data

45 m

video-thumbnail

How To Build An AI Agent On Your Own Data

45 m

video-thumbnail

Encord Masterclass: Label Editor and Data Annotation Best Practices

60 m

video-thumbnail

How To Use Agents To Build A Multimodal Data Pipeline

45 m

video-thumbnail

Drowning in Data? How To Curate Your AI Data At Scale

60 m

video-thumbnail

Learn How to Fine-tune SAM 2 with Your Own Data

60 m

video-thumbnail

Fine-Tuning Text-to-Image Models: How to Optimise Your Embedding Spaces

45 m

video-thumbnail

Deep Learning Leaders Ep.3 : Luc Vincent - Creator of Google Street View and VP AI Meta

45 m

video-thumbnail

Using GPT-4o to Accelerate Your Model Development

60 m

video-thumbnail

Utilising Gemini: How to Automate your Data Pipelines

60 m

video-thumbnail

Encord Learn: How to Automate Your Workflow

60 m

video-thumbnail

Encord Learn: Consensus Workflows

60 m

video-thumbnail

From Big Data to Smart Data: How to Manage, Clean and Curate Your Visual Datasets for AI Development

60 m

video-thumbnail

From Data to Diamonds: Unearth the True Value of Quality Data

60 m

video-thumbnail

How to Fine Tune Foundation Models to Auto-Label Training Data

60 m

video-thumbnail

Tech Talk: The Rise of Multi-Modal AI

60 m

video-thumbnail

How to build Semantic Visual Search with ChatGPT and CLIP

55 m

video-thumbnail

The Future of ML Teams: Embracing Active Learning

20 m

video-thumbnail

Synthetic Data & Generative AI: Fireside chat with Synthesia Co-Founder & CEO Victor Riparbelli

20 m

video-thumbnail

Lessons from the Field: Fireside chat with Luc Vincent, VP of AI at Meta

20 m

video-thumbnail

Fireside Chat: AI for Augmented Reality in 2023 and Beyond

60 m

video-thumbnail

Are Visual Foundation Models (VFMs) on par with SOTA?

48 m

video-thumbnail

Learning Pack: The European AI Act's Impact on AI Developers

video-thumbnail

Vision Language Models: Powering the next chapter in AI

60 m

video-thumbnail

Multi-modal Foundational Models: The End of Data Labeling?

60 m

video-thumbnail

How to Create Workflows in Encord

7 m