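Authors: TANG Lei; LIU Guangzhong [1] (College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China)
Source: Computer Engineering and Applications (《计算机工程与应用》), 2021, No. 11, pp. 254-259 (6 pages)
Funding: National Natural Science Foundation of China (61202370); China Postdoctoral Science Foundation (2014M561512); Scientific Research Innovation Project of the Shanghai Municipal Education Commission (14YZ110).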
Abstract: To improve the intelligent obstacle avoidance performance of Unmanned Aerial Vehicle (UAV) systems, an improved algorithm based on Twin Delayed Deep Deterministic Policy Gradient (TD3) is proposed, called Improved Twin Delayed Deep Deterministic Policy Gradient (I-TD3). The algorithm sets up two experience buffer pools to separate successful flight experience from failed flight experience and, according to the different purposes of the two pools, combines them with Prioritized Experience Replay and Experience Replay, respectively. This raises the sampling efficiency of effective experience and alleviates the low training efficiency caused by an excessive share of invalid experience. The reward function is also improved to address the poor training results caused by unreasonable reward settings. Simulation experiments on the AirSim platform show that, for quadrotor UAV obstacle avoidance, I-TD3 outperforms both TD3 and the Deep Deterministic Policy Gradient (DDPG) algorithm.
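Keywords: Twin Delayed Deep Deterministic Policy Gradient (TD3); Prioritized Experience Replay; obstacle avoidance; quadrotor UAV
Classification: TP391 [Automation and Computer Technology: Computer Application Technology]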
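The two-pool replay scheme described in the abstract might be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the abstract does not say which pool uses prioritized sampling, how priorities are computed, or how the two pools are mixed in a batch. The class name `DualReplayBuffer`, the parameters `alpha` and `success_fraction`, and the choice to prioritize the success pool are all assumptions made for this example.

```python
import random
from collections import deque


class DualReplayBuffer:
    """Hypothetical two-pool replay buffer (names and defaults are assumptions).

    Successful-flight transitions are sampled in proportion to priority**alpha;
    failed-flight transitions are sampled uniformly.
    """

    def __init__(self, capacity=10000, alpha=0.6, success_fraction=0.5):
        # Both success deques share one maxlen, so items and priorities stay aligned.
        self.success = deque(maxlen=capacity)
        self.success_prio = deque(maxlen=capacity)
        self.failure = deque(maxlen=capacity)
        self.alpha = alpha                        # assumed priority exponent
        self.success_fraction = success_fraction  # assumed batch mixing ratio

    def add(self, transition, succeeded, priority=1.0):
        """Route a transition to the pool matching the flight outcome."""
        if succeeded:
            self.success.append(transition)
            self.success_prio.append(priority)
        else:
            self.failure.append(transition)

    def sample(self, batch_size):
        """Draw a mixed batch (with replacement) from both pools."""
        n_success = int(batch_size * self.success_fraction)
        if not self.success:
            n_success = 0           # nothing to prioritize yet
        elif not self.failure:
            n_success = batch_size  # only successes are available
        n_failure = batch_size - n_success if self.failure else 0

        batch = []
        if n_success:
            # A small epsilon keeps every weight strictly positive.
            weights = [(abs(p) + 1e-6) ** self.alpha for p in self.success_prio]
            batch += random.choices(list(self.success), weights=weights, k=n_success)
        if n_failure:
            batch += random.choices(list(self.failure), k=n_failure)
        return batch
```

In a TD3-style training loop, the priority passed to `add` would typically be derived from the magnitude of the temporal-difference error, though the abstract does not confirm that choice.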