Authors: WU Zhen (吴震) [1], RAN Xiaoyan (冉晓燕), MIAO Quan (苗权) [1,2], LIU Chunyan (刘纯艳), ZHANG Dong (张栋), WEI Na (魏娜)
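Affiliations: [1] National Computer Network Emergency Response Technical Team/Coordination Center of China, Beijing 100029, China; [2] Beijing Branch of National Computer Network Emergency Response Technical Team/Coordination Center of China, Beijing 100055, China; [3] Great Wall Computer Software & System Inc., Beijing 100190, China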
Source: Journal of Beijing University of Aeronautics and Astronautics (《北京航空航天大学学报》), 2022, Issue 2, pp. 193-198 (6 pages)
Abstract: With the rapid development of China's economy and the continuous improvement of its capacity for technological innovation, efficiently organizing and classifying information is the basis for personalized industry management and tracking analysis. Based on the characteristics and development patterns of industry information, this paper proposes a Chinese industry classification model based on the fastText algorithm. First, a keyword database for industry classification is constructed, and word segmentation and weight calculation are carried out using the feature lexicon. Then, a classifier model is built to classify Chinese industry text automatically. Finally, the experiment uses 80,000 test documents covering business scope, enterprise information, and public opinion information. The results show that the classification accuracy of the proposed model is higher than that of Bayes, decision tree, KNN, and other classification algorithms, and that the model works well in application.
Keywords: natural language processing; industry classification; fastText algorithm; keywords; grammar model
Classification code: TP391.1 [Automation and Computer Technology - Computer Application Technology]