检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:柏林[1] 何牧耕 陈兵奎[1] 刘小峰[1] BO Lin;HE Mugeng;CHEN Bingkui;LIU Xiaofeng(College of Mechanical and Transportation Engineering,Chongqing University,Chongqing 400044)
机构地区:[1]重庆大学机械与运载工程学院,重庆400044
出 处:《机械工程学报》2024年第22期165-178,共14页Journal of Mechanical Engineering
基 金:国家科技重大专项(J2019-IV-0001-0068);国家自然科学基金(52175077,51975067)资助项目。
摘 要:针对深度强化学习对交互环境的依赖性导致的其在跨工况设备故障诊断中可移植性差的问题,提出一种D3QN(Dueling double deep Q network,D3QN)域泛化的故障诊断方法。采用自适应权值的最大相关最小冗余特征筛选方法进行特征优化选择,实现数据环境去冗余精化处理;在竞争网络和双Q网络基础上引入了域识别网络,实现工况环境掩蔽下的故障状态信息分离提取;构建基于故障模式类间距的量化奖励矩阵,并结合域辨识奖励设置分治奖励策略,增强智能体对跨工况混叠故障模式的辨识决策能力。齿轮箱故障与轴承故障的跨工况诊断结果表明,能够较好地解决深度强化学习网络对交互环境的依赖性和其在跨工况故障诊断中与环境独立性之间的矛盾问题,实现深度强化模型在不同工况环境中的复用移植,提高深度强化学习在跨域故障诊断中的适用性。To address the problem of poor portability of deep reinforcement learning model in cross-condition fault diagnosis due to its dependence on the interaction environment,a domain generalization D3QN(Domain generalization dueling double deep Q network,DGD3QN)model is proposed for the machinery fault diagnosis across different working conditions.To realize the de-redundancy and refinement of data environment,the adaptive weighted max-relevance-min-redundancy method is utilized to optimize feature selection.The domain recognition network branch is introduced into D3QN network to separate and extract the fault state information from multi-conditions.To enhance the agent’s ability of identifying the overlapping failure modes in the multi-condition,the graded reward strategy is set by combining the domain recognition reward and the quantitative reward matrix constructed based on the inter-class distance of multi-condition failure modes.The experimental results of cross-condition diagnosis of gearbox fault and bearing fault showed that the proposed DGD3QN can better solve the contradiction between the environment dependence of DQN and the independence of cross-condition fault diagnosis on environmental conditions,realize the multiplexing and transplantation of D3QN models in different operating environments and enhance the applicability of DQN in the cross-domain fault diagnosis accuracy.
关 键 词:故障诊断 域泛化 特征筛选 分治奖励 深度强化学习
分 类 号:TH133[机械工程—机械制造及自动化]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.249