Asset Details
MbrlCatalogueTitleDetail
Do you wish to reserve the book?
Research on the Multiagent Joint Proximal Policy Optimization Algorithm Controlling Cooperative Fixed-Wing UAV Obstacle Avoidance
by
Miao, Xikui
, Chu, Hairong
, Liang, Dongxin
, Guo, Lihong
, Zhao, Weiwei
, Shen, Honghai
, Zhu, Chenhao
, Zhang, Feng
in
Algorithms
/ Collaboration
/ Learning
/ Markov analysis
/ Medical research
/ multiagent cooperative
/ multiple unmanned aerial vehicles (multi-UAV) formation
/ obstacle avoidance
/ Optimization
/ proximal policy optimization (PPO)
/ reinforcement learning
/ the joint state-value function
/ Unmanned aerial vehicles
2020
Hey, we have placed the reservation for you!
By the way, why not check out events that you can attend while you pick your title.
You are currently in the queue to collect this book. You will be notified once it is your turn to collect the book.
Oops! Something went wrong.
Looks like we were not able to place the reservation. Kindly try again later.
Are you sure you want to remove the book from the shelf?
Research on the Multiagent Joint Proximal Policy Optimization Algorithm Controlling Cooperative Fixed-Wing UAV Obstacle Avoidance
by
Miao, Xikui
, Chu, Hairong
, Liang, Dongxin
, Guo, Lihong
, Zhao, Weiwei
, Shen, Honghai
, Zhu, Chenhao
, Zhang, Feng
in
Algorithms
/ Collaboration
/ Learning
/ Markov analysis
/ Medical research
/ multiagent cooperative
/ multiple unmanned aerial vehicles (multi-UAV) formation
/ obstacle avoidance
/ Optimization
/ proximal policy optimization (PPO)
/ reinforcement learning
/ the joint state-value function
/ Unmanned aerial vehicles
2020
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
Do you wish to request the book?
Research on the Multiagent Joint Proximal Policy Optimization Algorithm Controlling Cooperative Fixed-Wing UAV Obstacle Avoidance
by
Miao, Xikui
, Chu, Hairong
, Liang, Dongxin
, Guo, Lihong
, Zhao, Weiwei
, Shen, Honghai
, Zhu, Chenhao
, Zhang, Feng
in
Algorithms
/ Collaboration
/ Learning
/ Markov analysis
/ Medical research
/ multiagent cooperative
/ multiple unmanned aerial vehicles (multi-UAV) formation
/ obstacle avoidance
/ Optimization
/ proximal policy optimization (PPO)
/ reinforcement learning
/ the joint state-value function
/ Unmanned aerial vehicles
2020
Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy
We have requested the book for you!
Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.
Oops! Something went wrong.
Looks like we were not able to place your request. Kindly try again later.
Research on the Multiagent Joint Proximal Policy Optimization Algorithm Controlling Cooperative Fixed-Wing UAV Obstacle Avoidance
Journal Article
Research on the Multiagent Joint Proximal Policy Optimization Algorithm Controlling Cooperative Fixed-Wing UAV Obstacle Avoidance
2020
Request Book From Autostore
and Choose the Collection Method
Overview
Multiple unmanned aerial vehicle (UAV) collaboration has great potential. To increase the intelligence and environmental adaptability of multi-UAV control, we study the application of deep reinforcement learning algorithms in the field of multi-UAV cooperative control. Aiming at the problem of a non-stationary environment caused by the change of learning agent strategy in reinforcement learning in a multi-agent environment, the paper presents an improved multiagent reinforcement learning algorithm—the multiagent joint proximal policy optimization (MAJPPO) algorithm with the centralized learning and decentralized execution. This algorithm uses the moving window averaging method to make each agent obtain a centralized state value function, so that the agents can achieve better collaboration. The improved algorithm enhances the collaboration and increases the sum of reward values obtained by the multiagent system. To evaluate the performance of the algorithm, we use the MAJPPO algorithm to complete the task of multi-UAV formation and the crossing of multiple-obstacle environments. To simplify the control complexity of the UAV, we use the six-degree of freedom and 12-state equations of the dynamics model of the UAV with an attitude control loop. The experimental results show that the MAJPPO algorithm has better performance and better environmental adaptability.
Publisher
MDPI AG,MDPI
This website uses cookies to ensure you get the best experience on our website.