Cluster Analysis Based on Contextual Features Extraction for Conversational Corpus  

Cluster Analysis Based on Contextual Features Extraction for Conversational Corpus

在线阅读下载全文

作  者:Qi Chen Yue Chen Minghu Jiang 

机构地区:[1]College of Computer Science and Technology, Shandong University, Shandong, China [2]Department of Chinese Language and Literature, School of Humanities, Tsinghua University, Beijing, China [3]Lab of Computational Linguistics, School of Humanities, Tsinghua University, Beijing, China

出  处:《Journal of Computer and Communications》2015年第5期33-37,共5页电脑和通信(英文)

摘  要:Cluster analysis related to computational linguistics seldom concerned with Pragmatics level. Features of corpus on Pragmatics level related to specific situations, including backgrounds, titles and habits. To improve the accuracy of clustering for conversations collected from international students in Tsinghua University, it required contextual features. Here, we collected four-hundred conversations as a corpus and built it to Vector Space Model. With the Oxford-Duden Dictionary and other methods we modified the model and concluded into three groups. We testified our hypothesis through self-organizing map neural network. The result suggested that the modified model had a better outcome.Cluster analysis related to computational linguistics seldom concerned with Pragmatics level. Features of corpus on Pragmatics level related to specific situations, including backgrounds, titles and habits. To improve the accuracy of clustering for conversations collected from international students in Tsinghua University, it required contextual features. Here, we collected four-hundred conversations as a corpus and built it to Vector Space Model. With the Oxford-Duden Dictionary and other methods we modified the model and concluded into three groups. We testified our hypothesis through self-organizing map neural network. The result suggested that the modified model had a better outcome.

关 键 词:CONVERSATIONAL CORPUS CONTEXTUAL FEATURES VSM SOM 

分 类 号:R73[医药卫生—肿瘤]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象