检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:张庭瑜 曾颖 李楠[3] 黄洪钟[1,2] ZHANG Tingyu;ZENG Ying;LI Nan;HUANG Hongzhong(School of Mechanical and Electrical Engineering,University of Electronic Science and Technology of China,Chengdu 611731,China;Center for System Reliability and Safety,University of Electronic Science and Technology of China,Chengdu 611731,China;The 3rd Research Institute of China Electronics Technology Group Corporation,Beijing 100016,China)
机构地区:[1]电子科技大学机械与电气工程学院,四川成都611731 [2]电子科技大学系统可靠性与安全性研究中心,四川成都611731 [3]中国电子科技集团公司第三研究所,北京100016
出 处:《系统工程与电子技术》2024年第9期3060-3069,共10页Systems Engineering and Electronics
基 金:中央高校基本科研业务费项目(ZYGX2020ZB023)资助课题。
摘 要:为了实现航天器电源系统的灵活高效并网,最大化有限能量的利用,提出一种基于深度强化学习(deep reinforcement learning,DRL)的功率传输与信号传输复合网络拓扑优化模型,并使用知识蒸馏原理的多种可解释组件模型对优化过程进行剖析。首先,分析在轨运行阶段航天器母线电压调节控制域变换规律,并结合节点传播性参数,建立功率传输与信号通信的复合网络拓扑模型。然后,利用A3C(asynchronous advantage actor-critic)算法,对信号传输网络路由分布、拓扑结构等方面潜在的运行可靠性风险进行自适应性优化。最后,结合多种可解释组件对已训练的DRL模型进行知识蒸馏,形成一种可解释的量化分析方法。所提方法可以指导空间电源在随机阴影影响下选择最佳并网方案,并为更高任务要求和复杂环境下空间电源控制器设计提供理论支持。To maximize the utilization of limited energy and achieve flexible and efficient grid connection for spacecraft power supply systems,a composite grid topology optimization model for power transmission and signal communication is proposed based on deep reinforcement learning(DRL).Various interpretable component models are employed based on knowledge distillation principles to analyze the optimization mechanism.Firstly,the transformation law of the control domain of the spacecraft bus voltage regulation in the on-orbit operation stage is analyzed,and the composite network topology model of power transmission and signal communication is established by combining the node propagation parameters.Secondly,asynchronous advantage actor-critic(A3C)is utilized to adaptively optimize potential operational reliability risks in routing distribution and topology of the electrical signal transmission network.Finally,various interpretable components are used to perform knowledge distillation on the trained DRL model,forming an interpretable quantitative analysis method.The proposed method theoretically predicts optimal grid-connected processes of space power supply under random shadow effects,providing theoretical support and reference for designing space power supply controllers under higher task requirements and complex environments.
关 键 词:空间电源系统 复杂网络 深度强化学习 可靠性优化 可解释性分析
分 类 号:V423[航空宇航科学与技术—飞行器设计]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.135.184.166