A new similarity computing method based on concept similarity in Chinese text processing  被引量:4

A new similarity computing method based on concept similarity in Chinese text processing

在线阅读下载全文

作  者:PENG Jing YANG DongQing TANG ShiWei WANG TengJiao GAO Jun 

机构地区:[1]School of Electronics Engineering and Computer Science, Peking University, Beijing 100871, China [2]Department of Science and Technology, Chengdu Municipal Public Security, Bureau, Chengdu 610017, China

出  处:《Science in China(Series F)》2008年第9期1215-1230,共16页中国科学(F辑英文版)

基  金:Supported by the China Postdoctoral Science Foundation (Grant No. 20060400002);the Sichuan Youth Science and Technology Foundation of China (Grant No. 08JJ0109);the National Natural Science Foundation of China (Grant Nos.60473051, 60503037);the National High-tech Re- search and Development of China (Grant No. 2006AA01Z230);the Natural Science Foundation of Beijing Natural Science Foundation (Grant No. 4062018)

摘  要:The paper proposes a new text similarity computing method based on concept similarity in Chinese text processing. The new method converts text to words vector space model at first, and then splits words into a set of concepts. Through computing the inner products between concepts, it obtains the similarity between words. The new method computes the similarity of text based on the similarity of words at last. The contributions of the paper include: 1) propose a new computing formula between words; 2) propose a new text similarity computing method based on words similarity; 3) successfully use the method in the application of similarity computing of WEB news; and 4) prove the validity of the method through extensive experiments.The paper proposes a new text similarity computing method based on concept similarity in Chinese text processing. The new method converts text to words vector space model at first, and then splits words into a set of concepts. Through computing the inner products between concepts, it obtains the similarity between words. The new method computes the similarity of text based on the similarity of words at last. The contributions of the paper include: 1) propose a new computing formula between words; 2) propose a new text similarity computing method based on words similarity; 3) successfully use the method in the application of similarity computing of WEB news; and 4) prove the validity of the method through extensive experiments.

关 键 词:concept similarity similarity computing vector space inner product space 

分 类 号:TP3[自动化与计算机技术—计算机科学与技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象