Asset Details
MbrlCatalogueTitleDetail
Do you wish to reserve the book?
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
by
Goyal, Naman
, Baevski, Alexei
, Tjandra, Andros
, Auli, Michael
, Patrick von Platen
, Babu, Arun
, Wang, Changhan
, Xu, Qiantong
, Pino, Juan
, Singh, Kritika
, Lakhotia, Kushal
, Conneau, Alexis
, Saraf, Yatharth
in
Audio data
/ English language
/ Languages
/ Representation learning
/ Scale models
/ Speech
/ Speech processing
/ Speech recognition
/ Translating
2021
Hey, we have placed the reservation for you!
By the way, why not check out events that you can attend while you pick your title.
You are currently in the queue to collect this book. You will be notified once it is your turn to collect the book.
Oops! Something went wrong.
Looks like we were not able to place the reservation. Kindly try again later.
Are you sure you want to remove the book from the shelf?
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
by
Goyal, Naman
, Baevski, Alexei
, Tjandra, Andros
, Auli, Michael
, Patrick von Platen
, Babu, Arun
, Wang, Changhan
, Xu, Qiantong
, Pino, Juan
, Singh, Kritika
, Lakhotia, Kushal
, Conneau, Alexis
, Saraf, Yatharth
in
Audio data
/ English language
/ Languages
/ Representation learning
/ Scale models
/ Speech
/ Speech processing
/ Speech recognition
/ Translating
2021
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
Do you wish to request the book?
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
by
Goyal, Naman
, Baevski, Alexei
, Tjandra, Andros
, Auli, Michael
, Patrick von Platen
, Babu, Arun
, Wang, Changhan
, Xu, Qiantong
, Pino, Juan
, Singh, Kritika
, Lakhotia, Kushal
, Conneau, Alexis
, Saraf, Yatharth
in
Audio data
/ English language
/ Languages
/ Representation learning
/ Scale models
/ Speech
/ Speech processing
/ Speech recognition
/ Translating
2021
Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy
We have requested the book for you!
Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.
Oops! Something went wrong.
Looks like we were not able to place your request. Kindly try again later.
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
Paper
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
2021
Request Book From Autostore
and Choose the Collection Method
Overview
This paper presents XLS-R, a large-scale model for cross-lingual speech representation learning based on wav2vec 2.0. We train models with up to 2B parameters on nearly half a million hours of publicly available speech audio in 128 languages, an order of magnitude more public data than the largest known prior work. Our evaluation covers a wide range of tasks, domains, data regimes and languages, both high and low-resource. On the CoVoST-2 speech translation benchmark, we improve the previous state of the art by an average of 7.4 BLEU over 21 translation directions into English. For speech recognition, XLS-R improves over the best known prior work on BABEL, MLS, CommonVoice as well as VoxPopuli, lowering error rates by 14-34% relative on average. XLS-R also sets a new state of the art on VoxLingua107 language identification. Moreover, we show that with sufficient model size, cross-lingual pretraining can outperform English-only pretraining when translating English speech into other languages, a setting which favors monolingual pretraining. We hope XLS-R can help to improve speech processing tasks for many more languages of the world.
Publisher
Cornell University Library, arXiv.org
Subject
This website uses cookies to ensure you get the best experience on our website.