Asset Details
MbrlCatalogueTitleDetail
Do you wish to reserve the book?
Open-Ended Learning Leads to Generally Capable Agents
by
Deck, Charlie
, Mahajan, Anuj
, Czarnecki, Wojciech Marian
, Bauer, Jakob
, Wong, Nathaniel
, Hughes-Fitt, Steph
, Porcel, Nicolas
, Raileanu, Roberta
, Open Ended Learning Team
, Dalibard, Valentin
, Sygnowski, Jakub
, Trebacz, Maja
, Stooke, Adam
, Bradley-Schmieg, Nathalie
, Barros, Catarina
, McAleese, Nat
, Mathieu, Michael
, Jaderberg, Max
in
Behavior
/ Experimentation
/ Learning
/ Multiagent systems
/ Training
/ Universe
2021
Hey, we have placed the reservation for you!
By the way, why not check out events that you can attend while you pick your title.
You are currently in the queue to collect this book. You will be notified once it is your turn to collect the book.
Oops! Something went wrong.
Looks like we were not able to place the reservation. Kindly try again later.
Are you sure you want to remove the book from the shelf?
Open-Ended Learning Leads to Generally Capable Agents
by
Deck, Charlie
, Mahajan, Anuj
, Czarnecki, Wojciech Marian
, Bauer, Jakob
, Wong, Nathaniel
, Hughes-Fitt, Steph
, Porcel, Nicolas
, Raileanu, Roberta
, Open Ended Learning Team
, Dalibard, Valentin
, Sygnowski, Jakub
, Trebacz, Maja
, Stooke, Adam
, Bradley-Schmieg, Nathalie
, Barros, Catarina
, McAleese, Nat
, Mathieu, Michael
, Jaderberg, Max
in
Behavior
/ Experimentation
/ Learning
/ Multiagent systems
/ Training
/ Universe
2021
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
Do you wish to request the book?
Open-Ended Learning Leads to Generally Capable Agents
by
Deck, Charlie
, Mahajan, Anuj
, Czarnecki, Wojciech Marian
, Bauer, Jakob
, Wong, Nathaniel
, Hughes-Fitt, Steph
, Porcel, Nicolas
, Raileanu, Roberta
, Open Ended Learning Team
, Dalibard, Valentin
, Sygnowski, Jakub
, Trebacz, Maja
, Stooke, Adam
, Bradley-Schmieg, Nathalie
, Barros, Catarina
, McAleese, Nat
, Mathieu, Michael
, Jaderberg, Max
in
Behavior
/ Experimentation
/ Learning
/ Multiagent systems
/ Training
/ Universe
2021
Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy
We have requested the book for you!
Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.
Oops! Something went wrong.
Looks like we were not able to place your request. Kindly try again later.
Paper
Open-Ended Learning Leads to Generally Capable Agents
2021
Request Book From Autostore
and Choose the Collection Method
Overview
In this work we create agents that can perform well beyond a single, individual task, that exhibit much wider generalisation of behaviour to a massive, rich space of challenges. We define a universe of tasks within an environment domain and demonstrate the ability to train agents that are generally capable across this vast space and beyond. The environment is natively multi-agent, spanning the continuum of competitive, cooperative, and independent games, which are situated within procedurally generated physical 3D worlds. The resulting space is exceptionally diverse in terms of the challenges posed to agents, and as such, even measuring the learning progress of an agent is an open research problem. We propose an iterative notion of improvement between successive generations of agents, rather than seeking to maximise a singular objective, allowing us to quantify progress despite tasks being incomparable in terms of achievable rewards. We show that through constructing an open-ended learning process, which dynamically changes the training task distributions and training objectives such that the agent never stops learning, we achieve consistent learning of new behaviours. The resulting agent is able to score reward in every one of our humanly solvable evaluation levels, with behaviour generalising to many held-out points in the universe of tasks. Examples of this zero-shot generalisation include good performance on Hide and Seek, Capture the Flag, and Tag. Through analysis and hand-authored probe tasks we characterise the behaviour of our agent, and find interesting emergent heuristic behaviours such as trial-and-error experimentation, simple tool use, option switching, and cooperation. Finally, we demonstrate that the general capabilities of this agent could unlock larger scale transfer of behaviour through cheap finetuning.
Publisher
Cornell University Library, arXiv.org
Subject
This website uses cookies to ensure you get the best experience on our website.