Data-driven offline reinforcement learning approach for quadrotor's motion and path planning

作　　者：Haoran ZHAO Hang FU Fan YANG Che QU Yaoming ZHOU

机构地区：[1]School of Aeronautic Science and Engineering,Beihang University,Beijing 100191,China [2]Beijing Advanced Discipline Center for Unmanned Aircraft System,Beihang University,Beijing 100191,China [3]Tianmushan Laboratory,Hangzhou 311115,China

出　　处：《Chinese Journal of Aeronautics》2024年第11期386-397,共12页中国航空学报（英文版）

基　　金：supported by the National Natural Science Foundation of China(No.52272382);the Aeronautical Science Foundation of China(No.20200017051001);the Fundamental Research Funds for the Central Universities,China。

摘　　要：Non-learning based motion and path planning of an Unmanned Aerial Vehicle(UAV)is faced with low computation efficiency,mapping memory occupation and local optimization problems.This article investigates the challenge of quadrotor control using offline reinforcement learning.By establishing a data-driven learning paradigm that operates without real-environment interaction,the proposed workflow offers a safer approach than traditional reinforcement learning,making it particularly suited for UAV control in industrial scenarios.The introduced algorithm evaluates dataset uncertainty and employs a pessimistic estimation to foster offline deep reinforcement learning.Experiments highlight the algorithm's superiority over traditional online reinforcement learning methods,especially when learning from offline datasets.Furthermore,the article emphasizes the importance of a more general behavior policy.In evaluations,the trained policy demonstrated versatility by adeptly navigating diverse obstacles,underscoring its real-world applicability.

关键词：Motion planning Unmanned aerial vehicle Reinforcement learning Data-driven learning Markov decision process

分类号：V279[航空宇航科学与技术—飞行器设计]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

Data-driven offline reinforcement learning approach for quadrotor's motion and path planning

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

Data-driven offline reinforcement learning approach for quadrotor's motion and path planning

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索