检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:贺瑞芳[1,2] 王浩成 刘宏宇 王博 HE Rui-Fang;WANG Hao-Cheng;LIU Hong-Yu;WANG Bo(College of Intelligence and Computing,Tianjin University,Tianjin 300350,China;Tianjin Key Laboratory of Cognitive Computing and Applications,Tianjin 300350,China)
机构地区:[1]天津大学智能与计算学部,天津300350 [2]天津市认知计算与应用重点实验室,天津300350
出 处:《软件学报》2023年第11期5162-5178,共17页Journal of Software
基 金:国家自然科学基金(61976154);国家重点研发计划(2019YFC1521200)。
摘 要:社交媒体主题检测旨在从大规模短帖子中挖掘潜在的主题信息.由于帖子形式简短、表达非正规化,且社交媒体中用户交互复杂多样,使得该任务具有一定的挑战性.前人工作仅考虑了帖子的文本内容,或者同时对同构情境下的社交上下文进行建模,忽略了社交网络的异构性.然而,不同的用户交互方式,如转发,评论等,可能意味着不同的行为模式和兴趣偏好,其反映了对主题的不同的关注与理解;此外,不同用户对同一主题的发展和演化具有不同影响,社区中处于引领地位的权威用户相对于普通用户对主题推断会产生更重要的作用.因此,提出一种新的多视图主题模型(multi-view topic model,MVTM),通过编码微博会话网络中的异构社交上下文来推断更加完整、连贯的主题.首先根据用户之间的交互关系构建一个属性多元异构会话网络,并将其分解为具有不同交互语义的多个视图;接着,考虑不同交互方式与不同用户的重要性,借助邻居级注意力和交互级注意力机制,得到特定视图的嵌入表示;最后,设计一个多视图驱动的神经变分推理方法,以捕捉不同视图之间的深层关联,并自适应地平衡它们的一致性和独立性,从而产生更连贯的主题.在3个月新浪微博数据集上的实验结果证明所提方法的有效性.Social media topic detection aims to mine latent topic information from large-scale short posts.It is a challenging task as posts are short in form and informal in expression and user interactions in social media are complex and diverse.Previous studies only consider the textual content of posts or simultaneously model social contexts in homogeneous situations,ignoring the heterogeneity of social networks.However,different types of user interactions,such as forwarding and commenting,could suggest different behavior patterns and interest preferences and reflect different attention to the topic and understanding of the topic.In addition,different users have different influences on the development and evolution of the same topic.Specifically,compared with ordinary users,the leading authoritative users in a community play a more important role in topic inference.For the above reasons,this study proposes a novel multi-view topic model(MVTM)to infer more complete and coherent topics by encoding heterogeneous social contexts in the microblog conversation network.For this purpose,an attributed multiplex heterogeneous conversation network is built according to the interaction relationships among users and decomposed into multiple views with different interaction semantics.Then,the embedded representation of specific views is obtained by leveraging neighbor-level and interaction-level attention mechanisms,with due consideration given to different types of interactions and the importance of different users.Finally,a multi-view neural variational inference method is designed to capture the deep correlations among different views and adaptively balance their consistency and independence,thereby obtaining more coherent topics.Experiments are conducted on a Sina Weibo dataset covering three months,and the results reveal the effectiveness of the proposed method.
关 键 词:社交媒体主题检测 异构社交上下文 多视图 注意力机制 神经变分推理
分 类 号:TP18[自动化与计算机技术—控制理论与控制工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.36