汉语共时语料库与追踪语料库:语料库语言学的新方向  被引量:7

Chinese Synchronous Corpus and Monitoring Corpus:A New Direction of Corpus Linguistics

在线阅读下载全文

作  者:邹嘉彦[1,2] 邝蔼儿[2] 路斌[1,2] 蔡永富[1] 

机构地区:[1]香港教育学院语言资讯科学研究中心 [2]香港城市大学中文,翻译及语言学系

出  处:《中文信息学报》2011年第6期38-45,共8页Journal of Chinese Information Processing

基  金:香港研究资助局优配研究基金(9040092;9040386;149607)

摘  要:随着信息技术的不断提升、互联网的普及,汉语自然语言处理的难题不断得到解决,汉语语料库的发展和语料库语言学的应用也面临着新的契机。如何持续充分应用庞大的多种语料库,并协同与配合语言学和人文、社会科学多个领域,来追踪了解各种语言现象及其背后的社会文化深层含义,是语料库语言学可以承担的新任务。LIVAC汉语共时语料库持续处理和分析泛华语七个地区十七年四亿字的语料,可真正起到"时间锦囊"的作用,为紧密追踪、科学观察泛华地区语言现象及有关社会文化演变,提供了坚实的基础和科学依据。该文介绍LIVAC如何由汉语"共时语料库"演变为"追踪语料库"。The advancement of information technology and the Internet has offered important solutions to many classical problems in Chinese natural language processing.It has also opened up new opportunities for corpus linguistics,particularly the cultivation and utilization of large corpora for monitoring and tracking various language phenomena from the linguistic perspective,and investigating such language development in relation to the underlying social and cultural implications traditionally studied by humanities and social sciences.Over the past 17 years,the LIVAC corpus has grown into a very large corpus of its kind,containing results from the analysis of about 400 million Chinese characters drawn from news media from 7 communities of pan-Chinese regions.The long-term effort behind LIVAC has enabled it to function as serial time capsules,which provide a solid foundation for scientifically tracking and monitoring various phenomena of language changes together with the associated social and cultural developments within and across pan-Chinese regions.This paper introduces how the LIVAC synchronous corpus has evolved into a monitoring corpus of Chinese communities.

关 键 词:语料库语言学 LIVAC汉语语料库 共时语料库 追踪语料库 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象