检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:石凯[1,2] 聂富强 孙峰[2] SHI Kai;NIE Fuqiang;SUN Feng(School of Statistics,Southwest University of Finance and Economics,Chengdu 611130,China;College of Mathematics and Information Science,Leshan Normal University,Leshan,Sichuan 614000,China)
机构地区:[1]西南财经大学统计学院,成都611130 [2]乐山师范学院数学与信息科学学院,四川乐山614000
出 处:《计算机工程与应用》2019年第6期8-12,30,共6页Computer Engineering and Applications
基 金:国家自然科学基金青年项目(No.11701245);四川省教育厅项目(No.18SB0223)
摘 要:判别分析在数据挖掘、识别中有着广泛的应用,其中充分利用训练集的信息,改进判别规则算法,降低误判率一直是众多研究关注的焦点。传统的一些判别算法中,往往事先假定数据的分布类型来建立判别规则,但多维数据结构往往存在违背假定的情形,从而导致较高的误判率。针对此类问题,提出采用非参核密度算法建立多维数据的判别规则,同时通过Iris数据和Seeds数据进行实证分析。结果表明,与现有的判别分析算法相比较,所提判别算法利用样本资料信息更充分,显著提高了多维数据的判别精度,并且该算法不受分布假定的限制,具有广泛的适用性。Discriminant analysis is widely used in data mining and recognition.How to make full use of the information of training sets,and how to improve the algorithm of discriminant rules and reduce the rate of misjudgement has always been the focus for many researches.In some traditional algorithms,the distribution type of data is often assumed firstly,but the structures of multidimensional data often violate the assumptions and lead to a higher rate of misjudgment.Aiming at such problems,this paper proposes to establish discriminant rules by the algorithm of nonparametric kernel density,and carries out empirical analysis through Iris and Seeds data.The results show that compared with the existing discriminant analysis algorithms,the proposed algorithm uses the information of data more fully,and significantly improves the accuracy of the multidimensional data.At the same time,this algorithm is not restricted by the distribution assumption,so it has wide applicability.
关 键 词:多维数据 判别分析 非参数统计 核函数 概率密度
分 类 号:TP274.2[自动化与计算机技术—检测技术与自动化装置]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222