Multi-robot hierarchical safe reinforcement learning autonomous decision-making strategy based on uniformly ultimate boundedness constraints
by Sun, Huihui; Wu, Changlin; Qian, Sen; Zhang, Long; Jiang, Hui
in 639/166/987 / 639/166/988 / 639/705/258 / Decision making / Humanities and Social Sciences / Lagrange multiplier / Learning / Multi-robot / multidisciplinary / Reinforcement / Reinforcement learning / Robots / Safety / Science / Science (multidisciplinary) / Security constraint / Uniformly ultimate boundedness
2025
Journal Article
Overview
Deep reinforcement learning has shown strong capabilities across a variety of sequential decision-making problems, providing a standardized learning paradigm for developing intelligent multi-robot systems. In dynamic and unstructured environments, however, the security of decision-making strategies faces serious challenges. Without security, multi-robot systems are susceptible to unknown risks and potential physical damage. To tackle the safety challenges in autonomous decision-making of multi-robot systems, this manuscript presents a uniformly ultimately bounded constrained hierarchical safety reinforcement learning strategy (UBSRL). First, the approach proposes an event-triggered hierarchical safety reinforcement learning framework based on the constrained Markov decision process. The integrated framework improves both decision-making security and efficiency through collaboration between the upper-tier evolutionary network and the lower-tier restoration network. Next, by incorporating supplementary Lyapunov safety cost networks, a strategy optimization mechanism that includes multiple safety cost constraints is devised, and the Lagrange multiplier method is employed to identify the optimal strategy. Finally, the stability of the autonomous decision-making system is analyzed using the principles of uniformly ultimate boundedness. This analysis shows that the action trajectories of multiple robots can be returned to a safe space within finite time from any unsafe state, which theoretically substantiates the efficacy of the safety constraints embedded in the proposed strategy.
After extensive training and evaluation across multiple standardized scenarios, the results indicate that the UBSRL strategy effectively restricts the safety indicators to remain below the threshold, markedly improving the stability and task completion rate of the motion strategy.
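The abstract mentions using Lagrange multipliers to handle safety cost constraints. The sketch below is not the UBSRL algorithm; it is a minimal, generic illustration of the primal-dual idea on a toy problem. All names and numbers are assumptions made for illustration: a single scalar "speed" stands in for the policy, the reward `-(speed - 2)^2` prefers a speed of 2, and the safety cost is the speed itself with a hypothetical limit of 1.

```python
def primal_dual(cost_limit=1.0, lr_policy=0.05, lr_multiplier=0.05, steps=2000):
    """Toy primal-dual (Lagrangian) optimization.

    Maximize reward(speed) = -(speed - 2)^2
    subject to cost(speed) = speed <= cost_limit.
    The problem, names, and step sizes are illustrative assumptions only.
    """
    speed = 0.0  # stand-in for the policy parameters
    lam = 0.0    # Lagrange multiplier for the safety constraint (kept >= 0)
    for _ in range(steps):
        reward_grad = -2.0 * (speed - 2.0)  # d reward / d speed
        cost_grad = 1.0                     # d cost / d speed
        violation = speed - cost_limit      # > 0 means the constraint is violated

        # Primal step: gradient ascent on the Lagrangian
        # L(speed, lam) = reward(speed) - lam * (cost(speed) - cost_limit).
        new_speed = speed + lr_policy * (reward_grad - lam * cost_grad)

        # Dual step: raise lam while the constraint is violated, lower it
        # otherwise, and project back onto lam >= 0.
        lam = max(0.0, lam + lr_multiplier * violation)
        speed = new_speed
    return speed, lam


if __name__ == "__main__":
    speed, lam = primal_dual()
    print(f"speed={speed:.4f}, multiplier={lam:.4f}")
```

In this toy setting the unconstrained optimum (speed 2) violates the limit, so the multiplier grows until the iterate settles on the constraint boundary. In a reinforcement learning setting the gradients would instead be estimated from sampled trajectories and learned cost estimates.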
Publisher
Nature Publishing Group UK, Nature Publishing Group, Nature Portfolio