检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:赵天锐 ZHAO Tian-rui(Luoyang Campus,Information Engineering University of PLA Strategic Support Forces,Luoyang 471003,China)
机构地区:[1]战略支援部队信息工程大学洛阳校区,河南洛阳471000
出 处:《电脑知识与技术》2021年第4期204-206,共3页Computer Knowledge and Technology
摘 要:机器学习在诸多学科领域的定量分析中都已经显现出了巨大价值。本文借助sklearn机器学习库,以韩国国立国语院2015年发布的《新词调查报告书》中收录的新造词为对象,根据报告中出现的分类标准为词汇建立特征矩阵。而后运用多种机器学习算法进行特征选择,最终筛选出对韩国语新造词词义理解影响较强的因素。实验结果表明:如果该词为派生词或外来词,该词呈现低透明度的概率更高。Machine learning has shown great value in quantitative analysis in many disciplines.This article uses the sklearn ma⁃chine learning library provided by Python to build a feature matrix for the vocabulary based on the newly coined words included in the"New Word Survey Report"issued by the National Academy of Korean Language in 2015.Then,a variety of machine learning algorithms are used for feature selection,and finally the factors that have a strong influence on the understanding of the meaning of new Korean words are screened out.The experimental results show that if the word is a derived word or a foreign word,the word has a higher probability of showing low transparency.
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.149.4.109