检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:于亚秀[1] 李欣 YU Yaxiu;LI Xin(East China Normal University Library,Shanghai 200062,China;School of Data Science and Engineering,East China Normal University,Shanghai 200062,China)
机构地区:[1]华东师范大学图书馆,上海200062 [2]华东师范大学数据科学与工程学院,上海200062
出 处:《大数据》2022年第6期15-25,共11页Big Data Research
基 金:中央高校基本科研业务费项目(No.2022ECNU-XWK-ZX05)。
摘 要:文本标注是文本分析挖掘中的重要一步,面对大规模古籍资源,人工标注无法满足人文研究需求,且古籍语法结构和语言特点特殊,现代文本标注技术很难直接用于古籍研究。在分析人文研究者进行古籍文本标注中面临的难点和痛点的基础上,提出普适性的古籍标注标准流程,给出基于MARKUS的文本标注模型,并通过具体实践,探索基于该模型的古籍文本标注方法,旨在助推借助数字人文工具改变古籍人文研究方式,拓宽研究规模的应用深度。Text annotation is an important step in text analysis and mining.Manual labeling can no longer meet the needs of humanistic research faced with large-scale text resources,and due to the special grammatical structure and language characteristics of ancient works,the text annotation technology on modern corpora cannot be directly applied to the ancient works.Based on the analysis of the challenges faced by humanities researchers,a universal standard text annotation process of ancient works was proposed,and a model based on MARKUS was given.And ancient works annotation method based on this model through specific example was explored,to promote using tools to change the research methods in digital humanities and to expand the scale of research.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.49