中文简历自动解析及推荐算法  被引量:6

Chinese resume information automatic extraction and recommendation algorithm

在线阅读下载全文

作  者:谷楠楠 冯筠[1] 孙霞[1] 赵妍 张蕾[1] GU Nannan;FENG Jun;SUN Xia;ZHAO Yan;ZHANG Lei(School of Information Science and Technology, Northwest University, Xi’an 710127, China)

机构地区:[1]西北大学信息科学与技术学院,西安710127

出  处:《计算机工程与应用》2017年第18期141-148,270,共9页Computer Engineering and Applications

基  金:陕西省教育厅自然科学基金(No.JD11258);陕西省教育厅科学研究计划自然科学专项项目(No.15JK1738);陕西省自然科学基础研究计划项目支撑(No.2015JQ6240);西北大学研究生课程建设项目(No.YJD15003)

摘  要:为解决企业人工筛选电子简历效率低等问题,提出一种简历自动解析及推荐方案。对中文简历中的句子进行分词、词性标注等预处理,表示为特征向量,并利用SVM分类算法将所有句子划分成预定义的六个通用类别,包括个人基本信息、求职意向和工作经历等。利用个人基本信息的词法和语法特征,手工构建规则来实现姓名、性别及联系方式等关键信息抽取;对复杂的工作经历等文本用HMM模型进一步抽取详细信息,从而形成基于规则和统计相结合的简历文本信息抽取方法。考虑企业和求职者双方偏好,提出基于内容的互惠推荐算法(Content-Based Reciprocal Recommender algorithm,CBRR)。实验结果表明,整个方案能有效处理电子简历,提高简历筛选效率,辅助企业进行人才招聘。In order to solve the problem of laborious and time-consuming artificial selection from mass electronic resumes,a solution to resumes automatic extraction and recommendation is proposed.Firstly,the sentences in Chinese resume arerepresented as vectors through word segmentation,part-of-speech tagging and other preprocessing steps,then SVM classificationalgorithm is used to classify the sentences into six predefined general classes,such as personal basic information,job intension,working experience and so on.Secondly,according to the lexical and grammatical features of personalbasic information block,the rules are constructed by hand to extract the key information like Name,Gender,and Contactinformation.While the HMM model is used to extract the detailed information in complex information blocks,and putsforward rules and statistics based resume information extraction method.Finally,a Content-Based Reciprocal Recommenderalgorithm(CBRR)is proposed,which takes into account the preferences of both enterprise and job seekers.Theexperiment results show that the solution proposed in this paper can assist enterprises in recruitment,improve screeningefficiency and save recruitment costs.

关 键 词:信息抽取 推荐 协同过滤 规则 统计 简历 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象