异步策略的强化因果发现方法

A reinforcement causal discovery approach for asynchronous strategy

作　　者：张英郭辉 ZHANG Ying;GUO Hui(College of Information Engineering,Ningxia University,Yinchuan,Ningxia 750021,China)

出　　处：《燕山大学学报》2024年第4期356-368,共13页Journal of Yanshan University

基　　金：宁夏自然科学基金资助项目(2021AAC03117)。

摘　　要：研究和发掘事物之间的因果关系是数据科学的核心问题之一。针对因果发现面临着搜索空间超指数量级增长、评价指标低、收敛速度慢且效果差等问题,本文提出一种基于异步策略的强化因果发现方法。首先采用自注意力机制的编码器和单层解码器模型探索数据之间的因果关系;其次,改进强化学习模型中的结构约束,并基于异步优势算法更新网络模型参数;最后,搜索、输出最大奖励的有向无环图。通过实验对比验证了该方法的良好性能。The research and discovery of causality between things is one of the core issues in data science.Causal discovery usually faces problems such as super exponential growth of the search space,low evaluation index,slow rate of convergence and poor effect.To solve them,a reinforcement causal discovery method is proposed for asynchronous strategy.Firstly,a self⁃attentional encoder and a single⁃layer decoder model are used to explore the causal relationship between the data.Secondly,the structural constraints in the reinforcement learning model are improved,and the parameters of the network model are updated based on the asynchronous dominance algorithm.Finally,the directed acyclic graph with the maximum reward is given by searching.The good performance of this method has been verified through experimental comparison.

关键词：因果关系有向无环图强化因果发现结构约束异步优势算法

分类号：TP391[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

异步策略的强化因果发现方法

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

异步策略的强化因果发现方法

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索