Asset Details
MbrlCatalogueTitleDetail
Do you wish to reserve the book?
Improving speech depression detection using transfer learning with wav2vec 2.0 in low-resource environments
by
Li, Chenlong
, Yu, Chengyuan
, Zhang, Xu
, Zhang, Xiangcheng
, Chen, Weisi
in
639/705/1042
/ 692/700/139
/ Algorithms
/ Classification
/ Datasets
/ Deep Learning
/ Depression - diagnosis
/ Feature selection
/ Humanities and Social Sciences
/ Humans
/ Machine learning
/ Mental depression
/ Mental disorders
/ Methods
/ multidisciplinary
/ Neural Networks, Computer
/ Pandemics
/ Science
/ Science (multidisciplinary)
/ Speech
/ Transfer learning
2024
Hey, we have placed the reservation for you!
By the way, why not check out events that you can attend while you pick your title.
You are currently in the queue to collect this book. You will be notified once it is your turn to collect the book.
Oops! Something went wrong.
Looks like we were not able to place the reservation. Kindly try again later.
Are you sure you want to remove the book from the shelf?
Improving speech depression detection using transfer learning with wav2vec 2.0 in low-resource environments
by
Li, Chenlong
, Yu, Chengyuan
, Zhang, Xu
, Zhang, Xiangcheng
, Chen, Weisi
in
639/705/1042
/ 692/700/139
/ Algorithms
/ Classification
/ Datasets
/ Deep Learning
/ Depression - diagnosis
/ Feature selection
/ Humanities and Social Sciences
/ Humans
/ Machine learning
/ Mental depression
/ Mental disorders
/ Methods
/ multidisciplinary
/ Neural Networks, Computer
/ Pandemics
/ Science
/ Science (multidisciplinary)
/ Speech
/ Transfer learning
2024
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
Do you wish to request the book?
Improving speech depression detection using transfer learning with wav2vec 2.0 in low-resource environments
by
Li, Chenlong
, Yu, Chengyuan
, Zhang, Xu
, Zhang, Xiangcheng
, Chen, Weisi
in
639/705/1042
/ 692/700/139
/ Algorithms
/ Classification
/ Datasets
/ Deep Learning
/ Depression - diagnosis
/ Feature selection
/ Humanities and Social Sciences
/ Humans
/ Machine learning
/ Mental depression
/ Mental disorders
/ Methods
/ multidisciplinary
/ Neural Networks, Computer
/ Pandemics
/ Science
/ Science (multidisciplinary)
/ Speech
/ Transfer learning
2024
Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy
We have requested the book for you!
Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.
Oops! Something went wrong.
Looks like we were not able to place your request. Kindly try again later.
Improving speech depression detection using transfer learning with wav2vec 2.0 in low-resource environments
Journal Article
Improving speech depression detection using transfer learning with wav2vec 2.0 in low-resource environments
2024
Request Book From Autostore
and Choose the Collection Method
Overview
Depression, a pervasive global mental disorder, profoundly impacts daily lives. Despite numerous deep learning studies focused on depression detection through speech analysis, the shortage of annotated bulk samples hampers the development of effective models. In response to this challenge, our research introduces a transfer learning approach for detecting depression in speech, aiming to overcome constraints imposed by limited resources. In the context of feature representation, we obtain depression-related features by fine-tuning wav2vec 2.0. By integrating 1D-CNN and attention pooling structures, we generate advanced features at the segment level, thereby enhancing the model's capability to capture temporal relationships within audio frames. In the realm of prediction results, we integrate LSTM and self-attention mechanisms. This incorporation assigns greater weights to segments associated with depression, thereby augmenting the model's discernment of depression-related information. The experimental results indicate that our model has achieved impressive F1 scores, reaching 79% on the DAIC-WOZ dataset and 90.53% on the CMDC dataset. It outperforms recent baseline models in the field of speech-based depression detection. This provides a promising solution for effective depression detection in low-resource environments.
This website uses cookies to ensure you get the best experience on our website.