Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
1,768
result(s) for
"Reversal Learning"
Sort by:
Dissociable effects of acute SSRI (escitalopram) on executive, learning and emotional functions in healthy humans
by
Sahakian, Barbara J
,
Rowe, James B
,
Passamonti, Luca
in
Antidepressants
,
Citalopram
,
Cognitive ability
2018
Serotonin is implicated in multiple executive functions including goal-directed learning, cognitive flexibility, response inhibition and emotional regulation. These functions are impaired in several psychiatric disorders, such as depression and obsessive–compulsive disorder. We tested the cognitive effects of the selective serotonin reuptake inhibitor escitalopram, using an acute and clinically relevant dose (20 mg), in 66 healthy male and female volunteers in a double-blind, placebo-controlled study. Participants performed a cognitive test battery including a probabilistic and reversal learning task, the CANTAB intra-dimensional/extra-dimensional shift test of cognitive flexibility, a response inhibition task with interleaved stop-signal and No-Go trials and tasks measuring emotional processing. We showed that acute escitalopram administration impaired learning and cognitive flexibility, but improved the ability to inhibit responses in stop-signal trials while leaving unaffected acute emotional processing. Our findings suggest a dissociation of effects of acute escitalopram on cognitive functions, possibly mediated by differential modulation of brain serotonin levels in distinct functional neural circuits.
Journal Article
Computational modelling reveals contrasting effects on reinforcement learning and cognitive flexibility in stimulant use disorder and obsessive-compulsive disorder: remediating effects of dopaminergic D2/3 receptor agents
by
Ersche, Karen D
,
Cardinal, Rudolf N
,
Fineberg, Naomi A
in
Bayesian analysis
,
Cognitive ability
,
Computer applications
2019
RationaleDisorders of compulsivity such as stimulant use disorder (SUD) and obsessive-compulsive disorder (OCD) are characterised by deficits in behavioural flexibility, some of which have been captured using probabilistic reversal learning (PRL) paradigms.ObjectivesThis study used computational modelling to characterise the reinforcement learning processes underlying patterns of PRL behaviour observed in SUD and OCD and to show how the dopamine D2/3 receptor agonist pramipexole and the D2/3 antagonist amisulpride affected these responses.MethodsWe applied a hierarchical Bayesian method to PRL data across three groups: individuals with SUD, OCD, and healthy controls. Participants completed three sessions where they received placebo, pramipexole, and amisulpride, in a double-blind placebo-controlled, randomised design. We compared seven models using a bridge sampling estimate of the marginal likelihood.ResultsStimulus-bound perseveration, a measure of the degree to which participants responded to the same stimulus as before irrespective of outcome, was significantly increased in SUD, but decreased in OCD, compared to controls (on placebo). Individuals with SUD also exhibited reduced reward-driven learning, whilst both the SUD and OCD groups showed increased learning from punishment (nonreward). Pramipexole and amisulpride had similar effects on the control and OCD groups; both increased punishment-driven learning. These D2/3-modulating drugs affected the SUD group differently, remediating reward-driven learning and reducing aspects of perseverative behaviour, amongst other effects.ConclusionsWe provide a parsimonious computational account of how perseverative tendencies and reward- and punishment-driven learning differentially contribute to PRL in SUD and OCD. D2/3 agents modulated these processes and remediated deficits in SUD in particular, which may inform therapeutic effects.
Journal Article
Effect of lysergic acid diethylamide (LSD) on reinforcement learning in humans
2023
The non-selective serotonin 2A (5-HT
) receptor agonist lysergic acid diethylamide (LSD) holds promise as a treatment for some psychiatric disorders. Psychedelic drugs such as LSD have been suggested to have therapeutic actions through their effects on learning. The behavioural effects of LSD in humans, however, remain incompletely understood. Here we examined how LSD affects probabilistic reversal learning (PRL) in healthy humans.
Healthy volunteers received intravenous LSD (75
g in 10 mL saline) or placebo (10 mL saline) in a within-subjects design and completed a PRL task. Participants had to learn through trial and error which of three stimuli was rewarded most of the time, and these contingencies switched in a reversal phase. Computational models of reinforcement learning (RL) were fitted to the behavioural data to assess how LSD affected the updating ('learning rates') and deployment of value representations ('reinforcement sensitivity') during choice, as well as 'stimulus stickiness' (choice repetition irrespective of reinforcement history).
Raw data measures assessing sensitivity to immediate feedback ('win-stay' and 'lose-shift' probabilities) were unaffected, whereas LSD increased the impact of the strength of initial learning on perseveration. Computational modelling revealed that the most pronounced effect of LSD was the enhancement of the reward learning rate. The punishment learning rate was also elevated. Stimulus stickiness was decreased by LSD, reflecting heightened exploration. Reinforcement sensitivity differed by phase.
Increased RL rates suggest LSD induced a state of heightened plasticity. These results indicate a potential mechanism through which revision of maladaptive associations could occur in the clinical application of LSD.
Journal Article
Activation of D1 receptors affects human reactivity and flexibility to valued cues
2020
Reward-predicting cues motivate goal-directed behavior, but in unstable environments humans must also be able to flexibly update cue-reward associations. While the capacity of reward cues to trigger motivation (‘reactivity’) as well as flexibility in cue-reward associations have been linked to the neurotransmitter dopamine in humans, the specific contribution of the dopamine D1 receptor family to these behaviors remained elusive. To fill this gap, we conducted a randomized, placebo-controlled, double-blind pharmacological study testing the impact of three different doses of a novel D1 agonist (relative to placebo) on reactivity to reward-predicting cues (Pavlovian-to-instrumental transfer) and flexibility of cue-outcome associations (reversal learning). We observed that the impact of the D1 agonist crucially depended on baseline working memory functioning, which has been identified as a proxy for baseline dopamine synthesis capacity. Specifically, increasing D1 receptor stimulation strengthened Pavlovian-to-instrumental transfer in individuals with high baseline working memory capacity. In contrast, higher doses of the D1 agonist improved reversal learning only in individuals with low baseline working memory functioning. Our findings suggest a crucial and baseline-dependent role of D1 receptor activation in controlling both cue reactivity and the flexibility of cue-reward associations.
Journal Article
Dorsal and ventral striatal dopamine D1 and D2 receptors differentially modulate distinct phases of serial visual reversal learning
by
Alsiö Johan
,
Boulos, Maria
,
McKenzie, Colin
in
Animal behavior
,
Caudate-putamen
,
Cognitive ability
2020
Impaired cognitive flexibility in visual reversal-learning tasks has been observed in a wide range of neurological and neuropsychiatric disorders. Although both human and animal studies have implicated striatal D2-like and D1-like receptors (D2R; D1R) in this form of flexibility, less is known about the contribution they make within distinct sub-regions of the striatum and the different phases of visual reversal learning. The present study investigated the involvement of D2R and D1R during the early (perseverative) phase of reversal learning as well as in the intermediate and late stages (new learning) after microinfusions of D2R and D1R antagonists into the nucleus accumbens core and shell (NAcC; NAcS), the anterior and posterior dorsomedial striatum (DMS) and the dorsolateral striatum (DLS) on a touchscreen visual serial reversal-learning task. Reversal learning was improved after dopamine receptor blockade in the nucleus accumbens; the D1R antagonist, SCH23390, in the NAcS and the D2R antagonist, raclopride, in the NAcC selectively reduced early, perseverative errors. In contrast, reversal learning was impaired by D2R antagonism, but not D1R antagonism, in the dorsal striatum: raclopride increased errors in the intermediate phase after DMS infusions, and increased errors across phases after DLS infusions. These findings indicate that D1R and D2R modulate different stages of reversal learning through effects localised to different sub-regions of the striatum. Thus, deficits in behavioral flexibility observed in disorders linked to dopamine perturbations may be attributable to specific D1R and D2R dysfunction in distinct striatal sub-regions.
Journal Article
Serotonin depletion impairs both Pavlovian and instrumental reversal learning in healthy humans
by
Price, Annabel
,
Sahakian, Barbara J
,
Yellowlees Robyn
in
Autonomic nervous system
,
Cognitive ability
,
Mental disorders
2021
Serotonin is involved in updating responses to changing environmental circumstances. Optimising behaviour to maximise reward and minimise punishment may require shifting strategies upon encountering new situations. Likewise, autonomic responses to threats are critical for survival yet must be modified as danger shifts from one source to another. Whilst numerous psychiatric disorders are characterised by behavioural and autonomic inflexibility, few studies have examined the contribution of serotonin in humans. We modelled both processes, respectively, in two independent experiments (N = 97). Experiment 1 assessed instrumental (stimulus-response-outcome) reversal learning whereby individuals learned through trial and error which action was most optimal for obtaining reward or avoiding punishment initially, and the contingencies subsequently reversed serially. Experiment 2 examined Pavlovian (stimulus-outcome) reversal learning assessed by the skin conductance response: one innately threatening stimulus predicted receipt of an uncomfortable electric shock and another did not; these contingencies swapped in a reversal phase. Upon depleting the serotonin precursor tryptophan—in a double-blind randomised placebo-controlled design—healthy volunteers showed impairments in updating both actions and autonomic responses to reflect changing contingencies. Reversal deficits in each domain, furthermore, were correlated with the extent of tryptophan depletion. Initial Pavlovian conditioning, moreover, which involved innately threatening stimuli, was potentiated by depletion. These results translate findings in experimental animals to humans and have implications for the neurochemical basis of cognitive inflexibility.
Journal Article
Value-guided remapping of sensory cortex by lateral orbitofrontal cortex
2020
Adaptive behaviour crucially depends on flexible decision-making, which in mammals relies on the frontal cortex, specifically the orbitofrontal cortex (OFC)
1
–
9
. How OFC encodes decision variables and instructs sensory areas to guide adaptive behaviour are key open questions. Here we developed a reversal learning task for head-fixed mice, monitored the activity of neurons of the lateral OFC using two-photon calcium imaging and investigated how OFC dynamically interacts with primary somatosensory cortex (S1). Mice learned to discriminate ‘go’ from ‘no-go’ tactile stimuli
10
,
11
and adapt their behaviour upon reversal of stimulus–reward contingency (‘rule switch’). Imaging individual neurons longitudinally across all behavioural phases revealed a distinct engagement of S1 and lateral OFC, with S1 neural activity reflecting initial task learning, whereas lateral OFC neurons responded saliently and transiently to the rule switch. We identified direct long-range projections from lateral OFC to S1 that can feed this activity back to S1 as value prediction error. This top-down signal updated sensory representations in S1 by functionally remapping responses in a subpopulation of neurons that was sensitive to reward history. Functional remapping crucially depended on top-down feedback as chemogenetic silencing of lateral OFC neurons disrupted reversal learning, as well as plasticity in S1. The dynamic interaction of lateral OFC with sensory cortex thus implements computations critical for value prediction that are history dependent and error based, providing plasticity essential for flexible decision-making.
Dynamic interaction of neurons in lateral orbitofrontal cortex with the sensory cortex implements value-prediction computations that are history dependent and error based, providing plasticity essential for flexible decision-making.
Journal Article
Abnormal modulation of reward versus punishment learning by a dopamine D2-receptor antagonist in pathological gamblers
by
Geurts, Dirk Everdina Maria
,
Cools, Roshan
,
Hashemi, Mahur Melina
in
Adult
,
Biomedical and Life Sciences
,
Biomedicine
2015
Rationale
Pathological gambling has been associated with dopamine transmission abnormalities, in particular dopamine D2-receptor deficiency, and reversal learning deficits. Moreover, pervasive theoretical accounts suggest a key role for dopamine in reversal learning. However, there is no empirical evidence for a direct link between dopamine, reversal learning and pathological gambling.
Objective
The aim of the present study is to triangulate dopamine, reversal learning, and pathological gambling.
Methods
Here, we assess the hypothesis that pathological gambling is accompanied by dopamine-related problems with learning from reward and punishment by investigating effects of the dopamine D2-receptor antagonist sulpiride (400 mg) on reward- and punishment-based reversal learning in 18 pathological gamblers and 22 healthy controls, using a placebo-controlled, double-blind, counter-balanced design.
Results
In line with previous studies, blockade of D2 receptors with sulpiride impaired reward versus punishment reversal learning in controls. By contrast, sulpiride did not have any outcome-specific effects in gamblers.
Conclusion
These data demonstrate that pathological gambling is associated with a dopamine-related anomaly in reversal learning from reward and punishment.
Journal Article
Striatal Dopamine and Reward Prediction Error Signaling in Unmedicated Schizophrenia Patients
by
Kaminski, Jakob
,
Schlagenhauf, Florian
,
Katthagen, Teresa
in
Adult
,
Corpus Striatum - diagnostic imaging
,
Corpus Striatum - metabolism
2020
Abstract
Increased striatal dopamine synthesis capacity has consistently been reported in patients with schizophrenia. However, the mechanism translating this into behavior and symptoms remains unclear. It has been proposed that heightened striatal dopamine may blunt dopaminergic reward prediction error signaling during reinforcement learning. In this study, we investigated striatal dopamine synthesis capacity, reward prediction errors, and their association in unmedicated schizophrenia patients (n = 19) and healthy controls (n = 23). They took part in FDOPA-PET and underwent functional magnetic resonance imaging (fMRI) scanning, where they performed a reversal-learning paradigm. The groups were compared regarding dopamine synthesis capacity (Kicer), fMRI neural prediction error signals, and the correlation of both. Patients did not differ from controls with respect to striatal Kicer. Taking into account, comorbid alcohol abuse revealed that patients without such abuse showed elevated Kicer in the associative striatum, while those with abuse did not differ from controls. Comparing all patients to controls, patients performed worse during reversal learning and displayed reduced prediction error signaling in the ventral striatum. In controls, Kicer in the limbic striatum correlated with higher reward prediction error signaling, while there was no significant association in patients. Kicer in the associative striatum correlated with higher positive symptoms and blunted reward prediction error signaling was associated with negative symptoms. Our results suggest a dissociation between striatal subregions and symptom domains, with elevated dopamine synthesis capacity in the associative striatum contributing to positive symptoms while blunted prediction error signaling in the ventral striatum related to negative symptoms.
Journal Article
Ghrelin decreases sensitivity to negative feedback and increases prediction-error related caudate activity in humans, a randomized controlled trial
by
Gauffin, Emelie
,
Heilig, Markus
,
Löfberg, Andreas
in
Addictive behaviors
,
Blindness
,
Caudate-putamen
2024
The stomach-derived hormone ghrelin plays not only a role in feeding, starvation, and survival, but it has been suggested to also be involved in the stress response, in neuropsychiatric conditions, and in alcohol and drug use disorders. Mechanisms related to reward processing might mediate ghrelin’s broader effects on complex behaviors, as indicated by animal studies and mostly correlative human studies. Here, using a within-subject double-blind placebo-controlled design with intravenous ghrelin infusion in healthy volunteers ( n = 30), we tested whether ghrelin alters sensitivity to reward and punishment in a reward learning task. Parameters were derived from a computational model of participants’ task behavior. The reversal learning task with monetary rewards was performed during functional brain imaging to investigate ghrelin effects on brain signals related to reward prediction errors. Compared to placebo, ghrelin decreased punishment sensitivity ( t = −2.448, p = 0.021), while reward sensitivity was unaltered ( t = 0.8, p = 0.43). We furthermore found increased prediction-error related activity in the dorsal striatum during ghrelin administration (region of interest analysis: t- values ≥ 4.21, p -values ≤ 0.044). Our results support a role for ghrelin in reward processing that extends beyond food-related rewards. Reduced sensitivity to negative outcomes and increased processing of prediction errors may be beneficial for food foraging when hungry but could also relate to increased risk taking and impulsivity in the broader context of addictive behaviors.
Journal Article