Curricular Robust Reinforcement Learning via GAN-Based Perturbation Through Continuously Scheduled Task Sequence  被引量:1

在线阅读下载全文

作  者:Yike Li Yunzhe Tian Endong Tong Wenjia Niu Yingxiao Xiang Tong Chen Yalun Wu Jiqiang Liu 

机构地区:[1]Beijing Key Laboratory of Security and Privacy in Intelligent Transportation,Beijing Jiaotong University,Beijing 100044,China

出  处:《Tsinghua Science and Technology》2023年第1期27-38,共12页清华大学学报(自然科学版(英文版)

基  金:supported by the National Natural Science Foundation of China (Nos.61972025,61802389,61672092,U1811264,and 61966009);the National Key R&D Program of China (Nos.2020YFB1005604 and 2020YFB2103802).

摘  要:Reinforcement learning(RL),one of three branches of machine learning,aims for autonomous learning and is now greatly driving the artificial intelligence development,especially in autonomous distributed systems,such as cooperative Boston Dynamics robots.However,robust RL has been a challenging problem of reliable aspects due to the gap between laboratory simulation and real world.Existing efforts have been made to approach this problem,such as performing random environmental perturbations in the learning process.However,one cannot guarantee to train with a positive perturbation as bad ones might bring failures to RL.In this work,we treat robust RL as a multi-task RL problem,and propose a curricular robust RL approach.We first present a generative adversarial network(GAN)based task generation model to iteratively output new tasks at the appropriate level of difficulty for the current policy.Furthermore,with these progressive tasks,we can realize curricular learning and finally obtain a robust policy.Extensive experiments in multiple environments demonstrate that our method improves the training stability and is robust to differences in training/test conditions.

关 键 词:robust reinforcement learning generative adversarial network(GAN)based model curricular learning 

分 类 号:H31[语言文字—英语]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象