检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]成都理工大学信息工程学院,四川成都610059
出 处:《计算机仿真》2011年第6期219-222,283,共5页Computer Simulation
摘 要:研究文本分类、提高文本检索效率问题,针对文本特征维数过高导致神经网络收敛速度慢、文本分类精度低的难题,结合粗糙集的属性约简和神经网络的文本分类优点,提出了一种粗糙集(RS)结合BP神经网络的文本自动分类算法(RS-BPNN)。RS-BPNN首先应用粗糙集理论的属性约简对文本特征预处理,降低向量维数,然后把冗余的属性从决策表中删去,最后利用神经网络进行分类。并在MATLAB环境中进行了仿真实验,仿真结果表明,RS-BPNN方法的识别精度比传统的BP神经网络高4%左右,提高了文本分类的精度和检索效率。Although Rough Set can get obviously categorization rules with information reduction under the premise of not influeneing the aceuraey of Text Categorization,it is sensitive to noise data.Neural Network has a strong ability to learn fuzzy data,but it can not remove uncertain and vague information and its performance is weakened because the vectors of text are very huge.A hybrid classifier is presented based on the combination of rough set theory and BP neural network.Firstly,the documents are denoted by vector space model.Secondly,the feature vector were reduced by using rough sets.Finally,the documents were classed by BP neural network.Experimental results show that the algorithm of Rough-ANN is effective for the texts classification,and has the better performance in classification precision,stability and fault-tolerance compared with the traditional BP neural networks.
分 类 号:TP183[自动化与计算机技术—控制理论与控制工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.229