Asset Details
MbrlCatalogueTitleDetail
Do you wish to reserve the book?
Adaptive Evolutionary Reinforcement Learning with Policy Direction
by
Li, Dazi
, Dong, Caibo
in
Ablation
/ Adaptive learning
/ Artificial Intelligence
/ Complex Systems
/ Computational Intelligence
/ Computer Science
/ Convergence
/ Efficiency
/ Entropy
/ Evolution
/ Evolutionary algorithms
/ Feedback
/ Genetic algorithms
/ Learning
/ Modules
/ Normal distribution
/ Optimization
/ Performance evaluation
/ Regularization
/ Training
2024
Hey, we have placed the reservation for you!
By the way, why not check out events that you can attend while you pick your title.
You are currently in the queue to collect this book. You will be notified once it is your turn to collect the book.
Oops! Something went wrong.
Looks like we were not able to place the reservation. Kindly try again later.
Are you sure you want to remove the book from the shelf?
Adaptive Evolutionary Reinforcement Learning with Policy Direction
by
Li, Dazi
, Dong, Caibo
in
Ablation
/ Adaptive learning
/ Artificial Intelligence
/ Complex Systems
/ Computational Intelligence
/ Computer Science
/ Convergence
/ Efficiency
/ Entropy
/ Evolution
/ Evolutionary algorithms
/ Feedback
/ Genetic algorithms
/ Learning
/ Modules
/ Normal distribution
/ Optimization
/ Performance evaluation
/ Regularization
/ Training
2024
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
Do you wish to request the book?
Adaptive Evolutionary Reinforcement Learning with Policy Direction
by
Li, Dazi
, Dong, Caibo
in
Ablation
/ Adaptive learning
/ Artificial Intelligence
/ Complex Systems
/ Computational Intelligence
/ Computer Science
/ Convergence
/ Efficiency
/ Entropy
/ Evolution
/ Evolutionary algorithms
/ Feedback
/ Genetic algorithms
/ Learning
/ Modules
/ Normal distribution
/ Optimization
/ Performance evaluation
/ Regularization
/ Training
2024
Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy
We have requested the book for you!
Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.
Oops! Something went wrong.
Looks like we were not able to place your request. Kindly try again later.
Adaptive Evolutionary Reinforcement Learning with Policy Direction
Journal Article
Adaptive Evolutionary Reinforcement Learning with Policy Direction
2024
Request Book From Autostore
and Choose the Collection Method
Overview
Evolutionary Reinforcement Learning (ERL) has garnered widespread attention in recent years due to its inherent robustness and parallelism. However, the integration of Evolutionary Algorithms (EAs) and Reinforcement Learning (RL) remains relatively rudimentary and lacks dynamism, which can impact the convergence performance of ERL algorithms. In this study, a dynamic adaptive module is introduced to balance the Evolution Strategies (ES) and RL training within ERL. By incorporating elite strategies, this module leverages advantageous individuals to elevate the overall population's performance. Additionally, RL strategy updates often lack guidance from the population. To address this, we incorporate the strategies of the best individuals from the population, providing valuable policy direction. This is achieved through the formulation of a loss function that employs either L1 or L2 regularization to facilitate RL training. The proposed framework is referred to as Adaptive Evolutionary Reinforcement Learning (AERL). The effectiveness of our framework is evaluated by adopting Soft Actor-Critic (SAC) as the RL algorithm and comparing it with other algorithms in the
MuJoCo
environment. The results underscore the outstanding convergence performance of our proposed Adaptive Evolutionary Soft Actor-Critic (AESAC) algorithm. Furthermore, ablation experiments are conducted to emphasize the necessity of these two improvements. It is worth noting that the enhancements in AESAC are realized at the population level, enabling broader exploration and effectively reducing the risk of falling into local optima.
This website uses cookies to ensure you get the best experience on our website.