基于信息熵函数的启发式贝叶斯因果推理  被引量:8

Heuristic Bayesian Causal Inference based on Information Entropy Function

在线阅读下载全文

作  者:刘洋[1] 王利民[1,2] 孙铭会[1] LIU Yang;WANG Li-Min;SUN Ming-Hui(College of Computer Science and Technology,Jilin University,Changchun 130012;Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education,Jilin University,Changchun 130012)

机构地区:[1]吉林大学计算机科学与技术学院,长春130012 [2]吉林大学符号计算与知识工程教育部重点实验室,长春130012

出  处:《计算机学报》2021年第10期2135-2147,共13页Chinese Journal of Computers

基  金:国家重点研发计划(No.2019YFC1804804);吉林省科技发展计划项目(No.20200201281JC)资助.

摘  要:贝叶斯网络分类器(BNC)由于其优越的分类性能和可解释性在数据挖掘和人工智能等领域有着广泛的应用.信息论为其迅速发展奠定了坚实的数学理论基础,例如条件互信息被用来度量BNC拓扑结构中属性间的条件依赖关系.然而,贝叶斯网络又被称为因果网络,但目前人工智能等领域中有关贝叶斯网络因果关系的研究是一个很有争议性的课题.属性间因果性的定义远比相关性的定义复杂微妙很多.而条件互信息可能不适用于度量BNC整体拓扑结构对数据的拟合性,并且其表达式的对称性决定了其只能描述属性之间的无向相关性,而非有向因果性.本文从信息熵的角度对贝叶斯网络中的因果关系进行了探索性的研究,首先基于对似然函数定义了联合熵函数与贝叶斯网络拓扑结构中联合概率分布的映射关系,然后在此基础上提出了类条件熵和局部条件熵函数来识别拓扑结构中属性间的因果关系.最后提出了一种基于类标签驱动的启发式结构学习方法来构建可以兼顾有标签数据拟合和无标签数据泛化的BNC(记为HBN).对美国加州大学欧文分校(UCI)机器学习数据库中35个数据集的实验评估表明,本文所提出算法与其它算法相比在分类性能上具有显著优势,例如HBN在0-1损失函数上明显优于CFWNB(17优5劣)、SKDB(14优5劣)、AIWNB(17优7劣);在偏差上HBN与CFWNB相比26优6劣,与SKDB相比10优5劣,与WAODE相比15优7劣,与RF相比29优4劣,与AIWNB相比22优6劣.由于CFWNB、WAODE、AIWNB没有结构学习过程,其拓扑结构不受训练数据扰动的影响.这三种算法的方差显著低于其它算法.而HBN的局部拓扑结构能充分体现测试实例中隐含的因果关系,在一定程度上减轻训练数据过拟合带来的负面影响.因此,与SKDB和RF相比,HBN的方差结果均明显占优(20优9劣,26优3劣).与其他算法相比,HBN的0-1损失函数和偏差结果分别平均提高了6.06%�Bayesian network classifier(BNC)has been widely used in the data mining,artificial intelligence and other fields due to its excellent classification performance and interpretability.Information theory has established a strong mathematical and theoretical basis for its rapid development.For example,conditional mutual information is widely used to measure the conditional dependence between attributes in the topology structure of BNC.However,Bayesian network is also called causal network,the research on causality in the Bayesian network is a controversial topic in the artificial intelligence and other fields.The definition of causality between attributes is much more complex and subtler than that of correlation.Conditional mutual information may be not suitable for measuring the extent to which the global topology structureof BNC fits data,and the symmetry of its expression determines that it can only describe the undirected correlation between attributes,not the directed causality.An exploratory research is carried out in the causal relationship of Bayesian networks from the perspective of information entropy.This paper firstly defines the mapping relationship between the joint entropy function and the joint probability distribution within the Bayesian networks from the perspective of the log-likelihood function,and then proposes the class conditional entropy function and local conditional entropy function based on the joint entropy function to identify the causal relationships between attributes in the topology structure.Finally,a label-driven heuristic structure learning method is proposed to build a BNC that can balance labeled data fitting and unlabeled data generalization,which is named HBN.Experimental evaluation on 35 datasets from the UCI machine learning repository shows that the proposed algorithm enjoys significant advantages in terms of classification performance over other state-of-the-art algorithms.For example,in terms of 0-1 loss function,HBN beats the algorithm of correlation-based feature weightin

关 键 词:贝叶斯网络分类器 对数似然函数 联合熵 条件熵 交叉熵 

分 类 号:TP18[自动化与计算机技术—控制理论与控制工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象