中文文本的地理空间关系标注  被引量:23

Annotation for Geographical Spatial Relations in Chinese Text

在线阅读下载全文

作  者:张雪英[1] 张春菊[1] 朱少楠[1] 

机构地区:[1]南京师范大学虚拟地理环境教育部重点实验室,江苏南京210046

出  处:《测绘学报》2012年第3期468-474,共7页Acta Geodaetica et Cartographica Sinica

基  金:国家自然科学基金(40971231);江苏省研究生创新项目(CXLX11_0874)

摘  要:为有效地解决当前相关标准和标准数据匮乏的问题,通过分析中文文本中地理空间关系描述的语言特点,提出中文文本的地理空间关系标注体系,并以GATE(General Architecture for Text Engineering)为标注工具,以《中国大百科全书中国地理》为文本数据源,采用交叉校验方式建立了地理空间关系标注语料库。实现了中文文本中地理空间关系描述的结构化表达,提供了地理空间关系信息抽取的标准化测试数据。Corpus annotation is a task to provide both reference and training material for method development and benchmark data sets annotated witha given annotation scheme. After analysis of the linguistic characteristics, an annotation scheme is proposed for markup linguistic expressions for spatial relations in Chinese text. And then a natural language processing software-GATE(General Architecture for Text Engineering) is introduced as the anno- tation tool. Based on the proposed annotation scheme, a corpus with "Encyclopedia of China Geography" as the source data is annotated by means of cross-validation to so^ve the problem of annotation inconsistency, In order to realize the structurized representation of geographical spatial relations described in natural language, and to provide standard training and test data for their extraction.

关 键 词:自然语言 中文文本 地理空间关系 标注体系 标注语料库 

分 类 号:P208[天文地球—地图制图学与地理信息工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象