A Classification Algorithm and Its Application  Cited by: 1

Authors: Zhao Zhihong (赵志宏)[1], Luo Bin (骆斌)[1], Lin Hai (林海)[1]

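Affiliation: [1] State Key Laboratory for Novel Software Technology, Nanjing University; Department of Computer Science and Technology, Nanjing University, Nanjing 210093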

Source: Journal of Nanjing University (Natural Science), 2001, No. 2, pp. 142-147 (6 pages)

Funding: National Natural Science Foundation of China (60003010)

Abstract: An incremental hybrid classification mining algorithm is proposed that combines probability-based symbolic learning with neural network learning. It can effectively classify multiple concepts described by both discrete and continuous attributes, and it has strong incremental mining capability. The algorithm has been applied in a court decision support system with good results.

Classification is a principal method of data mining. Its purpose is to find the common characteristics of the objects stored in a database and then use the resulting patterns to classify them. Artificial neural networks, decision trees and genetic algorithms are common classification methods. This paper proposes an incremental compound classification algorithm that combines artificial neural network learning with symbolic learning based on probability theory. The main idea is as follows. First, the symbolic learning algorithm classifies a set of training instances by their discrete attributes. When instances that cannot be classified accurately are encountered, an FTART network processes them, using the continuous attributes to learn new patterns. Incremental learning rests on the component decision tree and the FTART network: when new instances are added, the algorithm needs only a single pass of incremental learning, and neither the decision tree nor the neural network has to be regenerated. The algorithm can classify the new instances correctly by slightly adjusting the existing structure. Another important advantage is that when a new input pattern is presented to a trained FTART network, the new network structure can be generated simply by adding nodes to the second layer of the network, unlike the traditional BP algorithm. The efficiency and speed of learning are therefore greatly improved. The main classes of the algorithm are class ROOT and class NODE. The ROOT class holds the learning algorithm and the attributes in which the decision tree is stored. The NODE class is contained in the ROOT class and defines the nodes of the decision tree. The algorithm is built on these two classes.
The detailed steps of the algorithm are discussed in the paper. The incremental compound classification algorithm can process multi-concept collections that contain bot
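
The two-stage idea described in the abstract (classify by discrete attributes first, fall back to a network on the continuous attributes only when that is ambiguous, and update both in a single incremental pass) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the class names `PrototypeStage` and `HybridClassifier`, the `radius` parameter, and the sample data are hypothetical. The second stage is a simple nearest-prototype learner that grows by adding nodes; it is a stand-in and not the FTART network used in the paper, and the count table is a simplification of the paper's decision tree (its ROOT and NODE classes are not reproduced).

```python
from collections import Counter, defaultdict
import math


class PrototypeStage:
    """Stand-in second stage (NOT FTART): labelled prototypes over continuous attributes.

    A new prototype (node) is added when no same-label prototype lies within `radius`,
    so learning a new pattern never requires retraining the existing ones.
    """

    def __init__(self, radius=1.0):
        self.radius = radius
        self.prototypes = []  # entries are (vector, label, count)

    def learn_one(self, x, label):
        for i, (p, lab, n) in enumerate(self.prototypes):
            if lab == label and math.dist(p, x) <= self.radius:
                # Move the matching prototype toward x (running mean).
                moved = tuple((pi * n + xi) / (n + 1) for pi, xi in zip(p, x))
                self.prototypes[i] = (moved, lab, n + 1)
                return
        self.prototypes.append((tuple(x), label, 1))  # grow by one node

    def predict(self, x):
        if not self.prototypes:
            return None
        return min(self.prototypes, key=lambda t: math.dist(t[0], x))[1]


class HybridClassifier:
    """Stage 1: class counts per discrete-attribute tuple. Stage 2: PrototypeStage."""

    def __init__(self, radius=1.0):
        self.counts = defaultdict(Counter)  # discrete key -> class frequencies
        self.fallback = defaultdict(lambda: PrototypeStage(radius))

    def learn_one(self, discrete, continuous, label):
        # Single incremental pass: update counts and prototypes, rebuild nothing.
        key = tuple(discrete)
        self.counts[key][label] += 1
        self.fallback[key].learn_one(continuous, label)

    def predict(self, discrete, continuous):
        key = tuple(discrete)
        classes = self.counts.get(key)
        if not classes:
            return None  # discrete pattern never seen
        if len(classes) == 1:
            return next(iter(classes))  # discrete attributes alone decide
        # Ambiguous on discrete attributes: use the continuous ones.
        return self.fallback[key].predict(continuous)


if __name__ == "__main__":
    clf = HybridClassifier(radius=1.0)
    clf.learn_one(("x",), (1.0,), "A")
    clf.learn_one(("y",), (0.5,), "B")
    clf.learn_one(("y",), (9.0,), "C")
    print(clf.predict(("x",), (100.0,)))  # decided by the discrete stage
    print(clf.predict(("y",), (8.0,)))    # decided by the prototype stage
```

In this sketch a new training instance touches only one counter and at most one prototype, which mirrors the abstract's point that new instances are absorbed by adjusting the existing structure.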

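Keywords: data mining; symbolic learning; artificial neural network; classification mining algorithm; incremental mining capability; database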

Classification code: TP311.131 [Automation and Computer Technology: Computer Software and Theory]

 
