Asset Details
MbrlCatalogueTitleDetail
Do you wish to reserve the book?
An Overview of End-to-End Automatic Speech Recognition
by
Wang, Dong
, Wang, Xiaodong
, Lv, Shaohe
in
Acknowledgment
/ Acoustics
/ Advantages
/ Alignment
/ Alliances
/ Artificial neural networks
/ Automatic
/ Automatic speech recognition
/ Classification
/ Computer science
/ Continuous speech
/ Deep learning
/ Hypotheses
/ Laboratories
/ Machine learning
/ Markov analysis
/ Markov chains
/ Neural networks
/ Pattern recognition
/ Recurrent
/ Recurrent neural networks
/ Segmentation
/ Speech
/ Speech recognition
/ Symmetry
/ Training
/ Vocabulary
/ Voice recognition
2019
Hey, we have placed the reservation for you!
By the way, why not check out events that you can attend while you pick your title.
You are currently in the queue to collect this book. You will be notified once it is your turn to collect the book.
Oops! Something went wrong.
Looks like we were not able to place the reservation. Kindly try again later.
Are you sure you want to remove the book from the shelf?
An Overview of End-to-End Automatic Speech Recognition
by
Wang, Dong
, Wang, Xiaodong
, Lv, Shaohe
in
Acknowledgment
/ Acoustics
/ Advantages
/ Alignment
/ Alliances
/ Artificial neural networks
/ Automatic
/ Automatic speech recognition
/ Classification
/ Computer science
/ Continuous speech
/ Deep learning
/ Hypotheses
/ Laboratories
/ Machine learning
/ Markov analysis
/ Markov chains
/ Neural networks
/ Pattern recognition
/ Recurrent
/ Recurrent neural networks
/ Segmentation
/ Speech
/ Speech recognition
/ Symmetry
/ Training
/ Vocabulary
/ Voice recognition
2019
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
Do you wish to request the book?
An Overview of End-to-End Automatic Speech Recognition
by
Wang, Dong
, Wang, Xiaodong
, Lv, Shaohe
in
Acknowledgment
/ Acoustics
/ Advantages
/ Alignment
/ Alliances
/ Artificial neural networks
/ Automatic
/ Automatic speech recognition
/ Classification
/ Computer science
/ Continuous speech
/ Deep learning
/ Hypotheses
/ Laboratories
/ Machine learning
/ Markov analysis
/ Markov chains
/ Neural networks
/ Pattern recognition
/ Recurrent
/ Recurrent neural networks
/ Segmentation
/ Speech
/ Speech recognition
/ Symmetry
/ Training
/ Vocabulary
/ Voice recognition
2019
Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy
We have requested the book for you!
Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.
Oops! Something went wrong.
Looks like we were not able to place your request. Kindly try again later.
Journal Article
An Overview of End-to-End Automatic Speech Recognition
2019
Request Book From Autostore
and Choose the Collection Method
Overview
Automatic speech recognition, especially large vocabulary continuous speech recognition, is an important issue in the field of machine learning. For a long time, the hidden Markov model (HMM)-Gaussian mixed model (GMM) has been the mainstream speech recognition framework. But recently, HMM-deep neural network (DNN) model and the end-to-end model using deep learning has achieved performance beyond HMM-GMM. Both using deep learning techniques, these two models have comparable performances. However, the HMM-DNN model itself is limited by various unfavorable factors such as data forced segmentation alignment, independent hypothesis, and multi-module individual training inherited from HMM, while the end-to-end model has a simplified model, joint training, direct output, no need to force data alignment and other advantages. Therefore, the end-to-end model is an important research direction of speech recognition. In this paper we review the development of end-to-end model. This paper first introduces the basic ideas, advantages and disadvantages of HMM-based model and end-to-end models, and points out that end-to-end model is the development direction of speech recognition. Then the article focuses on the principles, progress and research hotspots of three different end-to-end models, which are connectionist temporal classification (CTC)-based, recurrent neural network (RNN)-transducer and attention-based, and makes theoretically and experimentally detailed comparisons. Their respective advantages and disadvantages and the possible future development of the end-to-end model are finally pointed out. Automatic speech recognition is a pattern recognition task in the field of computer science, which is a subject area of Symmetry.
Publisher
MDPI AG
Subject
This website uses cookies to ensure you get the best experience on our website.