Asset Details
MbrlCatalogueTitleDetail
Do you wish to reserve the book?
Whisper-to-speech conversion using restricted Boltzmann machine arrays
by
Li, Jing-jie
, McLoughlin, Ian V.
, Dai, Li-Rong
, Ling, Zhen-hua
in
Acoustical engineering
/ Arrays
/ artificial muffle
/ Boltzmann machines
/ Conversion
/ Gaussian mixture model
/ glottal‐induced pitch lead
/ human‐to‐human vocal communication mechanism
/ inherent noise‐like spectral distribution
/ Intelligibility
/ learning (artificial intelligence)
/ Linguistics
/ Machine learning
/ machine learning technique
/ pitch accuracy
/ pitch estimation
/ RBM arrays
/ restricted Boltzmann machine array
/ Spectra
/ Speech
/ speech intelligibility
/ speech processing
/ speech reconstruction
/ speech spectral envelope
/ State of the art
/ statistical analysis
/ statistical conversion model
/ unnatural prosody
/ vocal cord
/ Vocal cords
/ voiced region
/ whisper processing
/ whisper‐to‐speech conversion
2014
Hey, we have placed the reservation for you!
By the way, why not check out events that you can attend while you pick your title.
You are currently in the queue to collect this book. You will be notified once it is your turn to collect the book.
Oops! Something went wrong.
Looks like we were not able to place the reservation. Kindly try again later.
Are you sure you want to remove the book from the shelf?
Whisper-to-speech conversion using restricted Boltzmann machine arrays
by
Li, Jing-jie
, McLoughlin, Ian V.
, Dai, Li-Rong
, Ling, Zhen-hua
in
Acoustical engineering
/ Arrays
/ artificial muffle
/ Boltzmann machines
/ Conversion
/ Gaussian mixture model
/ glottal‐induced pitch lead
/ human‐to‐human vocal communication mechanism
/ inherent noise‐like spectral distribution
/ Intelligibility
/ learning (artificial intelligence)
/ Linguistics
/ Machine learning
/ machine learning technique
/ pitch accuracy
/ pitch estimation
/ RBM arrays
/ restricted Boltzmann machine array
/ Spectra
/ Speech
/ speech intelligibility
/ speech processing
/ speech reconstruction
/ speech spectral envelope
/ State of the art
/ statistical analysis
/ statistical conversion model
/ unnatural prosody
/ vocal cord
/ Vocal cords
/ voiced region
/ whisper processing
/ whisper‐to‐speech conversion
2014
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
Do you wish to request the book?
Whisper-to-speech conversion using restricted Boltzmann machine arrays
by
Li, Jing-jie
, McLoughlin, Ian V.
, Dai, Li-Rong
, Ling, Zhen-hua
in
Acoustical engineering
/ Arrays
/ artificial muffle
/ Boltzmann machines
/ Conversion
/ Gaussian mixture model
/ glottal‐induced pitch lead
/ human‐to‐human vocal communication mechanism
/ inherent noise‐like spectral distribution
/ Intelligibility
/ learning (artificial intelligence)
/ Linguistics
/ Machine learning
/ machine learning technique
/ pitch accuracy
/ pitch estimation
/ RBM arrays
/ restricted Boltzmann machine array
/ Spectra
/ Speech
/ speech intelligibility
/ speech processing
/ speech reconstruction
/ speech spectral envelope
/ State of the art
/ statistical analysis
/ statistical conversion model
/ unnatural prosody
/ vocal cord
/ Vocal cords
/ voiced region
/ whisper processing
/ whisper‐to‐speech conversion
2014
Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy
We have requested the book for you!
Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.
Oops! Something went wrong.
Looks like we were not able to place your request. Kindly try again later.
Whisper-to-speech conversion using restricted Boltzmann machine arrays
Journal Article
Whisper-to-speech conversion using restricted Boltzmann machine arrays
2014
Request Book From Autostore
and Choose the Collection Method
Overview
Whispers are a natural vocal communication mechanism, in which vocal cords do not vibrate normally. Lack of glottal-induced pitch leads to low energy, and an inherent noise-like spectral distribution reduces intelligibility. Much research has been devoted to processing of whispers, including conversion of whispers to speech. Unfortunately, among several approaches, the best reconstructed speech to date still contains obviously artificial muffles and suffers from an unnatural prosody. To address these issues, the novel use of multiple restricted Boltzmann machines (RBMs) is reported as a statistical conversion model between whisper and speech spectral envelopes. Moreover, the accuracy of estimated pitch is improved using machine learning techniques for pitch estimation within only voiced (V) regions. Both objective and subjective evaluations show that this new method improves the quality of whisper-reconstructed speech compared with the state-of-the-art approaches.
Publisher
The Institution of Engineering and Technology,John Wiley & Sons, Inc
This website uses cookies to ensure you get the best experience on our website.