融合异质网络与主题模型的方面分预测  被引量:22

Aspect rating prediction based on heterogeneous network and topic model

在线阅读下载全文

作  者:吉余岗 李依桐[1,2] 石川 

机构地区:[1]北京邮电大学计算机学院,北京100876 [2]智能通信软件与多媒体北京市重点实验室(北京邮电大学),北京100876

出  处:《计算机应用》2017年第11期3201-3206,共6页journal of Computer Applications

基  金:国家自然科学基金资助项目(61375058);国家973计划项目(2013cb329606);北京市教育委员会共建项目~~

摘  要:针对传统方面分预测模型只考虑内容信息而缺乏对评论网络结构的分析,提出了融合异质信息网络和主题模型构建方面分预测算法(HINToAsp)。首先,从意见短语角度构建了评论主题挖掘模型(Phrase-PLSA),有效整合评论信息和评分信息进行方面主题挖掘;进而,考虑用户、评论和商品之间的结构信息,提出了在"用户评论商品"异质信息网络上的主题传播模型模型,用于刻画用户特性、商品属性;最后,基于随机游走框架有效整合内容信息和结构信息,进行精准的方面分预测。通过在大众点评(Dianping)和TripAdvisor数据集上和四元组PLSA(QPLSA)、高斯分布的情绪评估(GRAOS)模型及情绪均衡主题模型(SATM)的准确度对比实验,证明了HINToAsp算法的有效性,可以更好地用于商品的推荐系统。Concerning the problem that traditional aspect rating prediction methods just pay attention to textual information while ignoring the structural information in the review network, a novel Aspect rating prediction method based on Heterogeneous Idormation Network and Topic model (HINToAsp) was proposed for effectively integering textual information and structural information. Firstly, a new review topic model of opinion phrases called Phrase-PLSA (Phrase-based Probabilistic Latent Semantic Analysis) was put forward to integrate textual information of reviews and ratings for mining aspect topics. And then, considering the rich structural information among users, reviews, and items, a topic propagation model was designed by the aid of constructing "User-Review-Item" heterogeneous information network. Finally, a random walk framework was used to combine textual information and structural information effectively, which insured an accurate aspect rating prediction. The experimental results on both Dianping corpora and TripAdvisor corpora demonstrate that HINToAsp is more effective than recent methods like the Quad-tuples PLSA (QPLSA) model, the Gaussian distribution for RAting Over Sentiments (GRAOS) model and the Sentiment-Aligned Topic Model (SATM), and has better performance on recommendation system.

关 键 词:方面分预测 异质信息网络 主题模型 结构信息 推荐系统 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象