吐鲁番文献与汉语语料库建设的若干思考  被引量:2

Thinking about the Documents of Turfan and the Construction of the Corpus of Ancient Chinese Linguistics

在线阅读下载全文

作  者:赵红[1] 

机构地区:[1]南京师范大学文学院,江苏南京210097

出  处:《南京师范大学文学院学报》2014年第3期155-158,共4页Journal of School of Chinese Language and Culture Nanjing Normal University

基  金:国家社科基金重大招标项目"汉语史语料库建设研究"(编号:10&ZD117)

摘  要:中古汉语熟语料库建设不能仅仅满足于古文献的收录,还应该保留普遍存在于传世文献和出土文献当中的诸多异文,实现异文自动检索、自动发现。国家社科基金重大招标项目"汉语史语料库建设研究"将收录一批中古时期的吐鲁番出土文献。针对吐鲁番出土文献众多的异体字,还应通过链接等技术手段保留原字形,进行考释意见的标注关联及文字属性的标注。通过采用通用置标语言,实现语料共享,避免重复建设而产生资源浪费。The construction of phrase corpus should not be hmlted to the collection of ancient Chinese documents, a variety of texts handed down from ancient times or unearthed recently should also be included, and attain the purpose of automatic searching and indexing. As one of the significant bidding project of National Social Science Foundation, the project of "the research of the construction of the corpus of ancient Chinese language" will include a batch of documents unearthed in Turfan. Aiming at the large number of variant Chinese characters in documents of Turfan , the link technology will be used as a means to reserve the original form of these characters, as well as the opinions of philological studies of this texts and the marked character attributes. By adopting Standard Generalized Markup Language, it will reach the goal of corpus sharing, and avoid repeated construction and the waste of resource.

关 键 词:语料库 吐鲁番文献 异文 

分 类 号:H0-09[语言文字—语言学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象