检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]武汉职业技术学院电子信息工程学院,湖北武汉430074 [2]华中师范大学计算机科学系,湖北武汉430079
出 处:《微计算机信息》2009年第28期23-25,共3页Control & Automation
摘 要:查询导向式自动文摘是近年来文本挖掘领域的一个热点研究课题,它以自动生成偏向用户查询需求的个性化简洁摘要为目的。本文从优化问题的角度提出一种基于遗传算法的句子抽取型文摘选择策略和方法,可以满足摘要长度限制的不同句子集合构成的随机摘要作为初始种群,将文摘的综合特性评价函数作为适应函数,通过遗传算法的全局寻优能力搜索到整体特性接近最优的句子集合作为摘要。该方法将摘要的查询偏好性与冗余性无缝地集成到遗传算法的适应函数中,因而能使生成的摘要具有更优的综合质量。在新浪网上随机抽取100个不同主题的新闻文本作为摘要测试文本,通过实验,验证了该策略和方法的有效性。Query-oriented summarization is a hot research issue in text mining, which aims to generate a query-biased concise summary in accordance with user needs. This paper proposes a sentence extractive summarization approach based on genetic algorithm from the perspective of optimization problem. In the method, different sentence sets constituting the random summaries and conforming to specific length limit are selected as the initial population and the evaluation function for a summary's comprehensive characteristics is considered as the fitness function. With the global optimization ability of genetic algorithm, the sentence set with the best overall performance is selected to create the summary. This method seamlessly integrates the query preference and redundancy into the fitness function of the genetic algorithm to ensure the created summary a better quality. Experimental results on one hundred of news documents with different topics randomly selected from Sina website have demonstrated the effectiveness of the proposed approach.
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.249