WhisperSpeech makes its way to AI.dev

Mark Filion
December 07, 2023

Collabora is headed to San Jose, California, to take part in the inaugural edition of AI.dev: Open Source GenAI & ML Summit, a new event which aims to bring together the brightest developers from around the world to shape the trajectory of open source AI.

Join us on Tuesday, December 12, as Jakub Piotr Cłapa dives into findings from WhisperSpeech, a new Open Source text-to-speech model developed by Collabora. Based entirely on properly licensed speech datasets and unrestricted Open Source code, the model's focus is to deliver the best natural-sounding Open Source speech synthesis solution for improved communication.

In this talk, Jakub will look at how Collabora scaled its models and training pipelines from hundreds to 80K+ hours of speech recordings, and will share lessons learned along the way. He'll also discuss some of the challenges encountered, including:

Gone in 16 minutes: the importance of small scale experiments.
Full throttle: is 100% GPU utilization enough?
Do you need a fancy framework? From single- to multi-GPU training.
Are SSDs fast enough? WebDataset brings a 10x improvement.
Does bigger always mean better? How to effortlessly scale AI models.
Clouds, enthusiasts or clusters? How to hunt down GPUs.
Defending moats. How is a gaming 4090 different from an H100?

If you plan on attending, please make sure to come say hello! Note that you can also watch Jakub's talk remotely via the Room LL20D live stream.

Update: The video recording of the talk is now available.

Collabora @ AI.dev: Open Source GenAI & ML Summit

Tricks Learned from Scaling WhisperSpeech Models to 80k+ Hours of Speech
Presented by Jakub Piotr Cłapa - Tuesday, December 12


