检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]东北林业大学信息与计算机工程学院,哈尔滨150040
出 处:《科学技术与工程》2012年第21期5181-5186,共6页Science Technology and Engineering
基 金:国家自然科学基金(71001023);中央高校基本科研业务费专项资金(DL11BB25)资助
摘 要:目前互联网已经成为信息和观点的交换主要媒介,因此也成为了手机用户对于产品观点的最佳来源。但是目前为止研究中文文本的评论挖掘问题的研究还比较少。为了进一步发展这一领域的研究,旨在从中文客户评论中得到用户关心的产品特征。方法基于关联规则理论中的Apriori算法。主要通过计算频繁特征项的各分量在文本中出现位置的概率,从而确定挖掘到的候选产品特征中词汇的语序,使挖掘结果满足中文的正规语法要求。采用因特网上的评论数据作为语料,通过实验结果表明所提出的方法使得中文评论中的产品特征挖掘性能有所提高。The Internet become used as a main medium for exchange of information and opinions, so Web has become an excellent source for gathering consumer opinions about products. However, up to now there are very few researches conducted on online reviews mining for Chinese text. In order to remedy this deficiency how to automatically mine product features is studied. The proposed method based on Apriori algorithm in the theory of association rules. The method computed the location probability value of words in frequent itemsets appeared in sentences, and then corrected the words sequence of the candidate product features. This made the mining results meet the requirements of standard syntax in Chinese language. The customer reviews from several popular website as the corpus dataset, and experimental findings indicated that the proposed method improves the performance of the product features extraction from Chinese customer reviews are downloaded.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222