Catalogue Search | MBRL

Vector-based navigation using grid-like representations in artificial agents

by Lillicrap, Timothy , Sadik, Amir , Hadsell, Raia in 631/378/116/2396 , 639/705/117 , Agents (artificial intelligence)

2018

Deep neural networks have achieved impressive successes in fields ranging from object recognition to complex games such as Go 1 , 2 . Navigation, however, remains a substantial challenge for artificial agents, with deep neural networks trained by reinforcement learning 3 – 5 failing to rival the proficiency of mammalian spatial behaviour, which is underpinned by grid cells in the entorhinal cortex 6 . Grid cells are thought to provide a multi-scale periodic representation that functions as a metric for coding space 7 , 8 and is critical for integrating self-motion (path integration) 6 , 7 , 9 and planning direct trajectories to goals (vector-based navigation) 7 , 10 , 11 . Here we set out to leverage the computational functions of grid cells to develop a deep reinforcement learning agent with mammal-like navigational abilities. We first trained a recurrent network to perform path integration, leading to the emergence of representations resembling grid cells, as well as other entorhinal cell types 12 . We then showed that this representation provided an effective basis for an agent to locate goals in challenging, unfamiliar, and changeable environments—optimizing the primary objective of navigation through deep reinforcement learning. The performance of agents endowed with grid-like representations surpassed that of an expert human and comparison agents, with the metric quantities necessary for vector-based navigation derived from grid-like units within the network. Furthermore, grid-like representations enabled agents to conduct shortcut behaviours reminiscent of those performed by mammals. Our findings show that emergent grid-like representations furnish agents with a Euclidean spatial metric and associated vector operations, providing a foundation for proficient navigation. As such, our results support neuroscientific theories that see grid cells as critical for vector-based navigation 7 , 10 , 11 , demonstrating that the latter can be combined with path-based strategies to support navigation in challenging environments. Grid-like representations emerge spontaneously within a neural network trained to self-localize, enabling the agent to take shortcuts to destinations using vector-based navigation.

Journal Article

Share this book

Add to My Shelf

Quantifying the effects of environment and population diversity in multi-agent reinforcement learning

by Everett, Richard , McKee, Kevin R , Beattie, Charlie in Learning , Multiagent systems , Training

2022

Generalization is a major challenge for multi-agent reinforcement learning. How well does an agent perform when placed in novel environments and in interactions with new co-players? In this paper, we investigate and quantify the relationship between generalization and diversity in the multi-agent domain. Across the range of multi-agent environments considered here, procedurally generating training levels significantly improves agent performance on held-out levels. However, agent performance on the specific levels used in training sometimes declines as a result. To better understand the effects of co-player variation, our experiments introduce a new environment-agnostic measure of behavioral diversity. Results demonstrate that population size and intrinsic motivation are both effective methods of generating greater population diversity. In turn, training with a diverse set of co-players strengthens agent performance in some (but not all) cases.

Paper

Share this book

Add to My Shelf

A scalable and real-time neural decoder for topological quantum codes

by Zhang, Lei M , Boixo, Sergio , Blackwell, Sam in Accuracy , Codes , Color

2025

Fault-tolerant quantum computing will require error rates far below those achievable with physical qubits. Quantum error correction (QEC) bridges this gap, but depends on decoders being simultaneously fast, accurate, and scalable. This combination of requirements has not yet been met by a machine-learning decoder, nor by any decoder for promising resource-efficient codes such as the colour code. Here we introduce AlphaQubit 2, a neural-network decoder that achieves near-optimal logical error rates for both surface and colour codes at large scales under realistic noise. For the colour code, it is orders of magnitude faster than other high-accuracy decoders. For the surface code, we demonstrate real-time decoding faster than 1 microsecond per cycle up to distance 11 on current commercial accelerators with better accuracy than leading real-time decoders. These results support the practical application of a wider class of promising QEC codes, and establish a credible path towards high-accuracy, real-time neural decoding at the scales required for fault-tolerant quantum computation.

Paper

Share this book

Add to My Shelf

Improving cosmological reach of a gravitational wave observatory using Deep Loop Shaping

by Huber, Andrea , Betzwieser, Joseph , Rollins, Jameson in Binary stars , Black holes , Gravitational waves

2025

Improved low-frequency sensitivity of gravitational wave observatories would unlock study of intermediate-mass black hole mergers, binary black hole eccentricity, and provide early warnings for multi-messenger observations of binary neutron star mergers. Today's mirror stabilization control injects harmful noise, constituting a major obstacle to sensitivity improvements. We eliminated this noise through Deep Loop Shaping, a reinforcement learning method using frequency domain rewards. We proved our methodology on the LIGO Livingston Observatory (LLO). Our controller reduced control noise in the 10--30Hz band by over 30x, and up to 100x in sub-bands surpassing the design goal motivated by the quantum limit. These results highlight the potential of Deep Loop Shaping to improve current and future GW observatories, and more broadly instrumentation and control systems.

Paper

Share this book

Add to My Shelf

A multi-agent reinforcement learning model of reputation and cooperation in human groups

by Chadwick, Martin J , Zhu, Tina O , Botvinick, Matt in Collective action , Cooperation , Deep learning

2023

Collective action demands that individuals efficiently coordinate how much, where, and when to cooperate. Laboratory experiments have extensively explored the first part of this process, demonstrating that a variety of social-cognitive mechanisms influence how much individuals choose to invest in group efforts. However, experimental research has been unable to shed light on how social cognitive mechanisms contribute to the where and when of collective action. We build and test a computational model of human behavior in Clean Up, a social dilemma task popular in multi-agent reinforcement learning research. We show that human groups effectively cooperate in Clean Up when they can identify group members and track reputations over time, but fail to organize under conditions of anonymity. A multi-agent reinforcement learning model of reputation demonstrates the same difference in cooperation under conditions of identifiability and anonymity. In addition, the model accurately predicts spatial and temporal patterns of group behavior: in this public goods dilemma, the intrinsic motivation for reputation catalyzes the development of a non-territorial, turn-taking strategy to coordinate collective action.

Paper

Share this book

Add to My Shelf

Inferring a Continuous Distribution of Atom Coordinates from Cryo-EM Images using VAEs

by Huber, Andrea , Zielinski, Michal , Ronneberger, Olaf in Coders , Heterogeneity , Proteins

2021

Cryo-electron microscopy (cryo-EM) has revolutionized experimental protein structure determination. Despite advances in high resolution reconstruction, a majority of cryo-EM experiments provide either a single state of the studied macromolecule, or a relatively small number of its conformations. This reduces the effectiveness of the technique for proteins with flexible regions, which are known to play a key role in protein function. Recent methods for capturing conformational heterogeneity in cryo-EM data model it in volume space, making recovery of continuous atomic structures challenging. Here we present a fully deep-learning-based approach using variational auto-encoders (VAEs) to recover a continuous distribution of atomic protein structures and poses directly from picked particle images and demonstrate its efficacy on realistic simulated data. We hope that methods built on this work will allow incorporation of stronger prior information about protein structure and enable better understanding of non-rigid protein structures.

Paper

Share this book

Add to My Shelf

British Society of Gastroenterology guidance for management of inflammatory bowel disease during the COVID-19 pandemic

by Younge, Lisa , Lees, Charlie W , Sebastian, Shaji in Antiviral Agents - adverse effects , Antiviral Agents - therapeutic use , Betacoronavirus

2020

The COVID-19 pandemic is putting unprecedented pressures on healthcare systems globally. Early insights have been made possible by rapid sharing of data from China and Italy. In the UK, we have rapidly mobilised inflammatory bowel disease (IBD) centres in order that preparations can be made to protect our patients and the clinical services they rely on. This is a novel coronavirus; much is unknown as to how it will affect people with IBD. We also lack information about the impact of different immunosuppressive medications. To address this uncertainty, the British Society of Gastroenterology (BSG) COVID-19 IBD Working Group has used the best available data and expert opinion to generate a risk grid that groups patients into highest, moderate and lowest risk categories. This grid allows patients to be instructed to follow the UK government’s advice for shielding, stringent and standard advice regarding social distancing, respectively. Further considerations are given to service provision, medical and surgical therapy, endoscopy, imaging and clinical trials.

Journal Article

Share this book

Add to My Shelf

I'm; not angry with Sinead, it is her feelings I don't share ; After a troubled family life, pop star's; writer brother Joe O'Connor says he can forgive

by Report: JILLY BEATTIE Pictures: CHARLIE COLLINS and BARRY BLAND in Casey, Anne-Marie , O Connor, Joe , O Connor, Sinead

1999

RELAXED: Author [Joe O'Connor] says learning to forgive has made him feel more at ease; FORGIVING: happily married, Joe has moved on and is enjoying life; CONTROVERSIAL: singer [Sinead O'Connor] as the Virgin Mary for the film The Butcher Boy OUTRAGE: Sinead tore up a picture of the Pope on American television; PENSIVE: An author's lot can be lonely, but Joe says his wife [Anne-Marie Casey] is well able to cope

Newspaper Article

Share this book

Add to My Shelf

Adaptations to the British Society of Gastroenterology guidelines on the management of acute severe UC in the context of the COVID-19 pandemic: a RAND appropriateness panel

by Kent, Alexandra , Tozer, Phil , Younge, Lisa in Acute Disease , Adaptation , Azathioprine

2020

ObjectiveManagement of acute severe UC (ASUC) during the novel COVID-19 pandemic presents significant dilemmas. We aimed to provide COVID-19-specific guidance using current British Society of Gastroenterology (BSG) guidelines as a reference point.DesignWe convened a RAND appropriateness panel comprising 14 gastroenterologists and an IBD nurse consultant supplemented by surgical and COVID-19 experts. Panellists rated the appropriateness of interventions for ASUC in the context of severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) infection. Median scores and disagreement index (DI) were calculated. Results were discussed at a moderated meeting prior to a second survey.ResultsPanellists recommended that patients with ASUC should be isolated throughout their hospital stay and should have a SARS-CoV-2 swab performed on admission. Patients with a positive swab should be discussed with COVID-19 specialists. As per BSG guidance, intravenous hydrocortisone was considered appropriate as initial management; only in patients with COVID-19 pneumonia was its use deemed uncertain. In patients requiring rescue therapy, infliximab with continuing steroids was recommended. Delaying colectomy because of COVID-19 was deemed inappropriate. Steroid tapering as per BSG guidance was deemed appropriate for all patients apart from those with COVID-19 pneumonia in whom a 4–6 week taper was preferred. Post-ASUC maintenance therapy was dependent on SARS-CoV-2 status but, in general, biologics were more likely to be deemed appropriate than azathioprine or tofacitinib. Panellists deemed prophylactic anticoagulation postdischarge to be appropriate in patients with a positive SARS-CoV-2 swab.ConclusionWe have suggested COVID-19-specific adaptations to the BSG ASUC guideline using a RAND panel.

Journal Article

Share this book

Add to My Shelf

Epidemiology, management and outcome of ultrashort bowel syndrome in infancy

by Beattie, Robert Mark , Johnson, Mark John , Keys, Simon Charlie in Children & youth , Clinical decision making , Congenital diseases

2017

Ultrashort bowel syndrome (USBS) is a group of heterogeneous disorders where the length of small bowel is less than 10 cm or 10% of expected for the age. It is caused by massive loss of the gut which in the neonatal period can be a result of vanishing gastroschisis or surgical resection following mid-gut volvulus, jejunoileal atresia and/or extensive necrotising enterocolitis. The exact prevalence of USBS is not known although there is a clear trend towards increasing numbers because of increased incidence and improved survival. Long-term parenteral nutrition (PN) is the mainstay of treatment and is best delivered by a multidisciplinary intestinal rehabilitation team. Promoting adaptation is vital to improving long-term survival and can be achieved by optimising feeds, reducing intestinal failure liver disease and catheter-related bloodstream infections. Surgical techniques that can promote enteral tolerance and hence improve outcome include establishing intestinal continuity and bowel lengthening procedures. The outcome for USBS is similar to patients with intestinal failure due to other causes and only a small proportion of children who develop irreversible complications of PN and will need intestinal transplantation. In this review, we will summarise the available evidence focusing particularly on the epidemiology, management strategies and outcome.

Journal Article

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter