Meta releases multilingual speech translation model
Illustration by Alex Castro / The Verge
Meta released a new speech-to-text model that can translate nearly 100 languages called SeamlessM4T, as the company continues to try to make a universal translator.
SeamlessM4T, which stands for Massively Multilingual and Multimodal Machine Translation, that the company said can translate speech-to-text and text-to-text for nearly 100 languages. For speech-to-speech and text-to-speech actions, it recognizes 100 input languages and converts them into 35 output languages.
It is released under a Creative Commons CC BY-NC 4.0 license, allowing researchers to iterate upon it.
Along with SeamlessM4T, Meta also released the metadata for its open translation dataset SeamlessAlign.
“Building a universal language translator, like the fictional…