Mark Filion
December 07, 2023
Reading time:
Collabora is headed to San Jose, California, to take part in the inaugural edition of AI.dev: Open Source GenAI & ML Summit, a new event which aims to bring together the brightest developers from around the world to shape the trajectory of open source AI.
Join us on Tuesday, December 12, as Jakub Piotr Cłapa dives into findings from WhisperSpeech, a new Open Source text-to-speech model developed by Collabora. Based entirely on properly licensed speech datasets and unrestricted Open Source code, the model's focus is to deliver the best natural-sounding Open Source speech synthesis solution for improved communication.
In this talk, Jakub will look at how Collabora scaled its models and training pipelines from hundreds to 80K+ hours of speech recordings, and will share lessons learned along the way. He'll also discuss some of the challenges encountered, including:
If you plan on attending, please make sure to come say hello! Note that so you can also watch Jakub's talk remotely via the Room LL20D live stream.
Update: The video recording is now available, click on the link below to start watching!
Collabora @ AI.dev: Open Source GenAI & ML Summit
Tricks Learned from Scaling WhisperSpeech Models to 80k+ Hours of Speech
Presented by Jakub Piotr Cłapa - Tuesday, December 12
22/04/2025
As of today, NVK is a conformant Vulkan 1.4 implementation for NVIDIA Maxwell, Pascal, and Volta GPUs, and will be enabled by default starting…
17/04/2025
Our commitment to open source extends beyond contributing code. We are dedicated to upholding the highest standards of license compliance…
15/04/2025
This May, Embedded Recipes 2025, co-sponsored by Collabora, heads to Nice, France with talks, workshops, and a PipeWire hackfest, all bookended…
Comments (0)
Add a Comment