检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]北京大学计算语言学研究所,北京100871 [2]计算语言学教育部重点实验室(北京大学),北京100871
出 处:《计算机研究与发展》2015年第9期2114-2122,共9页Journal of Computer Research and Development
基 金:国家"八六三"高技术研究发展计划基金项目(2015AA015402);国家自然科学基金项目(61370117;61333018);国家社会科学基金重大项目(12&ZD227)
摘 要:中文零指代消解问题包括零指代项的识别和零指代项的消解2个相互关联的子任务.传统的方法在解决该问题时,往往不考虑2个子任务间的关联关系,比如识别出的零指代项必须被消解以及发生消解的必须是零指代项等约束.基于马尔可夫逻辑网络模型可以将零指代项的识别和零指代项的消解2个子任务融合在统一的机器学习框架下进行联合推断与联合学习,采用局部规则分别针对零指代项的识别和消解进行预测,采用全局规则描述这2个子任务间的关联关系.基于OntoNotes3.0的中文数据集上的实验结果显示,基于马尔可夫逻辑网络的联合学习模型相比于独立学习模型以及多个baseline方法能够获得更好的实验效果.Chinese zero anaphora resolution includes two subtasks:zero pronoun detection and zero anaphora resolution,which are correlated with each other.Zero pronoun detection means to recognize all the zero anaphors in a given text,which mainly include null subject or null object,and exist widely in Chinese,Japanese and Italian.Zero anaphora resolution means to determine the antecedent for each recognized zero anaphor,which has already appeared as a noun,pronoun or common noun phrase before the detected zero anaphora in the previous text.Traditional methods to solve Chinese zero anaphora resolution problem generally employ some common-used learning features to construct independent classifiers for zero pronoun detection and zero anaphora resolution,but it cannot capture association relationship between these two subtasks,e.g.recognized zero anaphora must be resolved or the one to be resolved must be zero anaphora and so on.In our method,these two subtasks are combined into a unified machine learning framework with Markov logic to make joint inference and joint learning.We use local formulas to describe zero pronoun detection and zero anaphora resolution respectively,and use global formulas to represent the association relationship between these two subtasks.We find that joint learning model which makes learning with inference can acquire more effective feature weights than independent learning model which just makes learning without inference.Experimental results on OntoNotes3.0Chinese dataset show that our joint learning model can achieve better results compared with independent learning model and other baseline methods.
关 键 词:马尔可夫逻辑网络 中文零指代消解 零指代项识别 联合学习 全局规则 局部规则
分 类 号:TP301[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:13.58.229.23