15 results for "Paulmann, Silke"
How neurotypical listeners recognize emotions expressed through vocal cues by speakers with high-functioning autism
We conducted an investigation to explore how neurotypical (NT) listeners perceive the emotional tone of voice in sentences spoken by individuals with high-functioning autism spectrum disorders (ASD) and NT speakers. The investigation included both male and female speakers from both groups. In Study 1, NT listeners were asked to identify the emotional prosody (anger, fear, happiness, surprise or neutral) conveyed by the speakers. Results revealed that emotional expressions produced by male ASD speakers were generally recognized less accurately than those produced by male NT speakers. In contrast, emotions expressed by female ASD speakers were categorized more accurately than those expressed by female NT speakers, except when expressing fear. This suggests that female ASD speakers may not express emotional prosody in the same way as their male counterparts. In Study 2, a subset of the produced materials was rated for valence, voice modulation, and voice control to supplement the Study 1 results: female ASD speakers sounded less negative when expressing fear than female NT speakers did. Male ASD speakers were perceived as less positive than NT speakers when expressing happiness. Voice modulation also differed between groups, showing a tendency for ASD speakers to follow different display rules for both positive emotions (happiness and surprise) tested. Finally, male ASD speakers were rated as using voice cues less appropriately than male NT speakers, an effect less pronounced for female ASD speakers. Together, the results imply that difficulties in social interactions among individuals with high-functioning ASD could be due to non-prototypical voice use of male ASD speakers, and they emphasize that female individuals do not show the same effects.
Different stages of emotional prosody processing in healthy ageing–evidence from behavioural responses, ERPs, tDCS, and tRNS
Past research suggests that the ability to recognise the emotional intent of a speaker decreases as a function of age. Yet, few studies have looked at the underlying cause for this effect in a systematic way. This paper builds on the view that emotional prosody perception is a multi-stage process and explores which step of the recognition processing line is impaired in healthy ageing using time-sensitive event-related brain potentials (ERPs). Results suggest that early processes linked to salience detection, as reflected in the P200 component, and the initial build-up of emotional representation, as linked to a subsequent negative ERP component, are largely unaffected in healthy ageing. The two groups do, however, differ in emotional prosody recognition: older participants recognise emotional intentions of speakers less well than younger participants do. These findings were followed up by two neuro-stimulation studies specifically targeting the inferior frontal cortex to test whether recognition improves during active stimulation relative to sham. Overall, results suggest that neither tDCS nor high-frequency tRNS stimulation at 2 mA for 30 minutes facilitates emotional prosody recognition rates in healthy older adults.
Human voices communicating trustworthy intent: A demographically diverse speech audio dataset
The multi-disciplinary field of voice perception and trustworthiness lacks accessible speech audio datasets that represent diverse speaker demographics, including age, ethnicity, and sex. Existing datasets primarily feature white, younger adult speakers, limiting generalisability. This paper introduces a novel open-access speech audio dataset with 1,152 utterances from 96 untrained speakers across white, black and South Asian backgrounds, divided into younger (N = 60, ages 18–45) and older (N = 36, ages 60+) adults. Each speaker recorded both their natural speech patterns (i.e. “neutral”, no intent) and their attempt to convey trustworthy intent as they perceive it during speech production. The dataset is described and evaluated through classification of neutral versus trustworthy speech. Specifically, extracted acoustic and voice quality features were analysed using linear and non-linear classification models, achieving accuracies of around 70%. This dataset aims to close a crucial gap in the existing literature and provide additional research opportunities that can contribute to the generalisability and applicability of future research results in this field.
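The classification analysis mentioned in this abstract (linear and non-linear models over extracted acoustic and voice-quality features) can be illustrated with a minimal sketch. This is not the authors' published pipeline: the file name features.csv, its column layout, and the choice of logistic regression versus an RBF SVM in scikit-learn are assumptions made purely for illustration.

    # Minimal sketch only: compare a linear and a non-linear classifier on
    # pre-extracted acoustic features. File name, columns, and model choices
    # are illustrative assumptions, not the dataset's documented pipeline.
    import pandas as pd
    from sklearn.model_selection import cross_val_score
    from sklearn.pipeline import make_pipeline
    from sklearn.preprocessing import StandardScaler
    from sklearn.linear_model import LogisticRegression
    from sklearn.svm import SVC

    df = pd.read_csv("features.csv")      # hypothetical per-utterance feature table
    X = df.drop(columns=["label"])        # acoustic / voice-quality features
    y = df["label"]                       # "neutral" vs. "trustworthy"

    models = {
        "linear (logistic regression)": make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000)),
        "non-linear (RBF SVM)": make_pipeline(StandardScaler(), SVC(kernel="rbf")),
    }

    for name, model in models.items():
        scores = cross_val_score(model, X, y, cv=5)   # 5-fold cross-validated accuracy
        print(f"{name}: mean accuracy {scores.mean():.2f}")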
Perceived Comfort and Blinding Efficacy in Randomised Sham-Controlled Transcranial Direct Current Stimulation (tDCS) Trials at 2 mA in Young and Older Healthy Adults
tDCS studies typically find that the lowest levels of comfort occur at stimulation onset; that young adult participants experience less comfort than older participants; and that participant blinding seems effective at low current strengths. At 2 mA, conflicting results have been reported, questioning the effectiveness of blinding in sham-controlled paradigms using higher current strengths. Investigator blinding is rarely reported. Using a protocol with 30 min of 2 mA stimulation, we sought to: (a) investigate the level of perceived comfort in young and older adults, ranging in age from 19 to 29 years and 63 to 76 years, respectively; (b) test investigator and participant blinding; (c) assess comfort over a longer stimulation duration; and (d) add to the literature on protocols using a 2 mA current strength. A two-session experiment was conducted in which sham and active stimulation were administered to the frontal cortex at the F8/FP1 sites in a within-subjects manner. Levels of perceived comfort were measured, using a visual analogue scale, at the start and end of stimulation in young and older adults. Post-stimulation, participants and investigators judged whether or not active stimulation was used. Comfort scores were lower at stimulation onset in both age groups. Older adults reported: (i) more comfort than young participants overall; (ii) comparable levels of comfort in sham and active stimulation; and (iii) significantly more comfort than the young participants during active stimulation. Stimulation mode was correctly identified above chance in the second of the two sessions; 65% of all participants correctly identified the stimulation mode, resulting in a statistical trend. Similarly, the experimenter correctly identified stimulation mode significantly above chance, with 62% of all investigator judgements correct across 120 judgements. Using a 2 mA current strength over 30 minutes, tDCS stimulation comfort is lower at stimulation onset in young and older adults and, overall, lower for young participants. Investigators and participants may be able to identify active stimulation at above-chance levels, although accuracy never exceeded 65% for either participants or the experimenter. Further research into blinding efficacy is recommended.
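The "above chance" blinding results reported in this abstract lend themselves to a simple worked check. The sketch below assumes a one-sided exact binomial test against chance (50%) applied to the reported 62% of 120 investigator judgements; the abstract does not specify the statistical procedure actually used, which may differ.

    # Illustrative only: is 62% correct out of 120 blinding judgements above chance?
    # The choice of an exact binomial test is an assumption, not the paper's stated method.
    from scipy.stats import binomtest

    n_judgements = 120
    n_correct = round(0.62 * n_judgements)   # roughly 74 correct, per the reported 62%

    result = binomtest(n_correct, n_judgements, p=0.5, alternative="greater")
    print(f"{n_correct}/{n_judgements} correct, one-sided p = {result.pvalue:.3f}")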
How Psychological Stress Affects Emotional Prosody
We explored how experimentally induced psychological stress affects the production and recognition of vocal emotions. In Study 1a, we demonstrate that sentences spoken by stressed speakers are judged by naïve listeners as sounding more stressed than sentences uttered by non-stressed speakers. In Study 1b, negative emotions produced by stressed speakers are generally less well recognized than the same emotions produced by non-stressed speakers. Multiple mediation analyses suggest this poorer recognition of negative stimuli was due to a mismatch between the variation of volume voiced by speakers and the range of volume expected by listeners. Together, this suggests that the stress level of the speaker affects judgments made by the receiver. In Study 2, we demonstrate that participants who were induced with a feeling of stress before carrying out an emotional prosody recognition task performed worse than non-stressed participants. Overall, findings suggest detrimental effects of induced stress on interpersonal sensitivity.
Morphological encoding beyond slots and fillers: An ERP study of comparative formation in English
One important organizational property of morphology is competition: different means of expression are in conflict with each other for encoding the same grammatical function. In the current study, we examined the nature of this control mechanism by testing the formation of comparative adjectives in English during language production. Event-related brain potentials (ERPs) were recorded during cued silent production; this is the first study of its kind for comparative adjective formation. We specifically examined the ERP correlates of producing synthetic relative to analytic comparatives, e.g. angrier vs. more angry. A frontal, bilaterally distributed, enhanced negative-going waveform for analytic comparatives (vis-à-vis synthetic ones) emerged approximately 300 ms after the (silent) production cue. We argue that this ERP effect reflects a control mechanism that constrains grammatically-based computational processes (viz. more comparative formation). We also address the possibility that this particular ERP effect may belong to a family of previously observed negativities reflecting cognitive control monitoring, rather than morphological encoding processes per se.
Dynamic Facial Expressions Prime the Processing of Emotional Prosody
Evidence suggests that emotion is represented supramodally in the human brain. Emotional facial expressions, which often precede vocally expressed emotion in real life, can modulate event-related potentials (N100 and P200) during emotional prosody processing. To investigate these cross-modal emotional interactions, two lines of research have been put forward: cross-modal integration and cross-modal priming. In cross-modal integration studies, visual and auditory channels are temporally aligned, while in priming studies they are presented consecutively. Here we used cross-modal emotional priming to study the interaction of dynamic visual and auditory emotional information. Specifically, we presented dynamic facial expressions (angry, happy, neutral) as primes and emotionally intoned pseudo-speech sentences (angry, happy) as targets. We were interested in how prime-target congruency would affect early auditory event-related potentials, i.e., N100 and P200, in order to shed more light on how dynamic facial information is used in cross-modal emotional prediction. Results showed enhanced N100 amplitudes for incongruently primed compared to congruently and neutrally primed emotional prosody, while the latter two conditions did not significantly differ. However, N100 peak latency was significantly delayed in the neutral condition compared to the other two conditions. Source reconstruction revealed that the right parahippocampal gyrus was activated in incongruent compared to congruent trials in the N100 time window. No significant ERP effects were observed in the P200 range. Our results indicate that dynamic facial expressions influence vocal emotion processing at an early point in time, and that an emotional mismatch between a facial expression and its ensuing vocal emotional signal induces additional processing costs in the brain, potentially because the cross-modal emotional prediction mechanism is violated in the case of emotional prime-target incongruency.
Emotional Speech Perception Unfolding in Time: The Role of the Basal Ganglia
The basal ganglia (BG) have repeatedly been linked to emotional speech processing in studies involving patients with neurodegenerative and structural changes of the BG. However, the majority of previous studies did not consider (i) that emotional speech processing entails multiple processing steps, and (ii) the possibility that the BG may engage in one rather than another of these processing steps. In the present study, we investigate three different stages of emotional speech processing (emotional salience detection, meaning-related processing, and identification) in the same patient group to verify whether lesions to the BG affect these stages in a qualitatively different manner. Specifically, we explore early implicit emotional speech processing (probe verification) in an ERP experiment followed by an explicit behavioral emotional recognition task. In both experiments, participants listened to emotional sentences expressing one of four emotions (anger, fear, disgust, happiness) or neutral sentences. In line with previous evidence, patients and healthy controls showed differentiation of emotional and neutral sentences in the P200 component (emotional salience detection) and in a following negative-going brain wave (meaning-related processing). However, the behavioral recognition (identification stage) of emotional sentences was impaired in BG patients, but not in healthy controls. The current data provide further support that the BG are involved in late, explicit rather than early emotional speech processing stages.
ERP correlates of motivating voices: quality of motivation and time-course matters
Here, we conducted the first study to explore how motivations expressed through speech are processed in real time. Participants listened to sentences spoken in two types of well-studied motivational tones (autonomy-supportive and controlling), or a neutral tone of voice. To examine this, listeners were presented with sentences that either signaled motivations through prosody (tone of voice) and words simultaneously (e.g. ‘You absolutely have to do it my way’ spoken in a controlling tone of voice), or lacked motivationally biasing words (e.g. ‘Why don’t we meet again tomorrow’ spoken in a motivational tone of voice). Event-related brain potentials (ERPs) in response to motivations conveyed through words and prosody showed that listeners rapidly distinguished between motivational and neutral forms of communication, as reflected in enhanced P2 amplitudes in response to motivational compared with neutral speech. This early detection mechanism is argued to help determine the importance of incoming information. Once assessed, motivational language is continuously monitored and thoroughly evaluated. When compared with neutral speech, listening to controlling (but not autonomy-supportive) speech led to enhanced late potential ERP mean amplitudes, suggesting that listeners are particularly attuned to controlling messages. The importance of controlling motivation for listeners is mirrored in the effects observed for motivations expressed through prosody only. Here, an early rapid appraisal, as reflected in enhanced P2 amplitudes, is found only for sentences spoken in controlling (but not autonomy-supportive) prosody. Once identified as sounding pressuring, the message seems to be preferentially processed, as shown by enhanced late potential amplitudes in response to controlling prosody. Taken together, the results suggest that motivational and neutral language are processed differently; further, the data suggest that listening to cues signaling pressure and control cannot be ignored and leads to preferential, more in-depth processing.
Is there an advantage for recognizing multi-modal emotional stimuli?
Emotions can be recognized whether conveyed by facial expressions, linguistic cues (semantics), or prosody (voice tone). However, few studies have empirically documented the extent to which multi-modal emotion perception differs from uni-modal emotion perception. Here, we tested whether emotion recognition is more accurate for multi-modal stimuli by presenting stimuli with different combinations of facial, semantic, and prosodic cues. Participants judged the emotion conveyed by short utterances in six channel conditions. Results indicated that emotion recognition is significantly better in response to multi-modal versus uni-modal stimuli. When stimuli contained only one emotional channel, recognition tended to be higher in the visual modality (i.e., facial expressions, semantic information conveyed by text) than in the auditory modality (prosody), although this pattern was not uniform across emotion categories. The advantage for multi-modal recognition may reflect the automatic integration of congruent emotional information across channels, which enhances the accessibility of emotion-related knowledge in memory.