Authors: Wu Yaxuan (武雅萱); Wang Yuexin (王悦欣)[1]; Li Yang (李洋)[1]; Wang Bochen (王博晨)
Source: Journal of Science and Education (《科教文汇》), 2017, No. 17, pp. 50-52, 57 (4 pages)
Abstract: With spam reviews becoming an increasingly serious problem, this paper studies the identification of product reviews. For word segmentation, the reverse maximum matching algorithm is improved: neutral high-frequency words and useless words are removed from the sentence first, which reduces the number of loop iterations and improves computational efficiency. Word weights are reset, and a smoothing factor is added to the definition of similarity so that synonyms can be recognized. The experimental results show that the new identification technique substantially improves the accuracy and recall of product review identification.
Classification number: G642 [Culture and Science: Higher Education]
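The segmentation step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not give their dictionary, stopword list, or maximum word length, so `strip_stopwords`, `reverse_max_match`, `max_len`, and the sample data below are all hypothetical. The sketch only shows the general idea of removing filtered words before running reverse maximum matching, so that fewer characters remain to be scanned.

```python
def strip_stopwords(sentence, stopwords):
    """Remove every stopword occurrence from the sentence before segmentation."""
    # Remove longer stopwords first so a short one does not split a longer one.
    for word in sorted(stopwords, key=len, reverse=True):
        sentence = sentence.replace(word, "")
    return sentence


def reverse_max_match(sentence, dictionary, max_len=4):
    """Segment from the end of the sentence, taking the longest dictionary word each time."""
    tokens = []
    end = len(sentence)
    while end > 0:
        # Try the longest candidate ending at `end`, then shrink it.
        for size in range(min(max_len, end), 0, -1):
            candidate = sentence[end - size:end]
            # A single character is always accepted as a fallback token.
            if size == 1 or candidate in dictionary:
                tokens.append(candidate)
                end -= size
                break
    tokens.reverse()
    return tokens


# Hypothetical sample data, for illustration only.
dictionary = {"手机", "质量", "很好", "屏幕"}
stopwords = {"的", "这个"}

cleaned = strip_stopwords("这个手机的质量很好", stopwords)
tokens = reverse_max_match(cleaned, dictionary)
print(tokens)
```

Plain substring removal is a simplification: it can delete a stopword that happens to sit inside a longer word, and it can join characters that were not adjacent in the original sentence. A practical version would need to guard against both cases.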