Author: LONG Congjun (龙从军)
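Affiliations: [1] School of Literature, University of Chinese Academy of Social Sciences (中国社会科学院大学文学院); [2] Laboratory of Ethnic Language, Culture and Behavior, Institute of Ethnology and Anthropology, Chinese Academy of Social Sciences (中国社会科学院民族学与人类学研究所民族语言文化行为实验室)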
Source: Jinan Journal (Philosophy and Social Sciences) (《暨南学报(哲学社会科学版)》), 2024, No. 6, pp. 15-30 (16 pages)
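Funding: Major Project of the National Social Science Fund of China, "Research on the Development and Construction of an Online Retrieval System for Large-Scale Grammatically Annotated Texts of China's Ethnic Languages" (21&ZD304); Laboratory Incubation Special Funding Project of the Chinese Academy of Social Sciences, "Computational Research on Common Features Based on Multimodal Data of Ethnic Languages" (2024SYFH008).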
Abstract: Before the era of large language models, ethnolinguists obtained research data mainly by manually searching the various works that record the words and sentences of ethnic languages. From data acquisition to data collection, they often faced practical problems: data were difficult to collect, the information obtained was incomplete, and the data were scattered and unsystematic. Language data processing has now entered the era of large models, and these problems can be effectively addressed by collecting, sorting, and saving data and organizing them systematically in a database. However, ethnic language resources are relatively scarce, access channels are not smooth, and interpreting and analyzing ethnic language data requires strong linguistic expertise. As a result, large models do not yet process ethnic language data well, and academia still has no publicly available large-scale professional database of China's ethnic languages. To make effective use of large models for ethnic languages and to solve the problems they face, actively building ethnic language data and language knowledge services with human-computer collaboration at the core should be an effective way to carry out ethnic language research and inheritance in the Internet era.

Ethnic language data and language knowledge services play an important role in humanities and social research, traditional ethnic science and technology, cultural protection and inheritance, and the exploration of Chinese cultural genes. Taking ethnic language data and knowledge services as its starting point, this paper constructs professional data resources and a series of knowledge bases for research on ethnic languages and cultures. Digital humanities technology is used to digitize important literature in the field of ethnolinguistics, and knowledge graph technology is used to link knowledge across domains, forming a literature retrieval and knowledge service platform. Data are collected in seven categories: ethnic language dictionaries, language sketches (语言简志), endangered languages, grammatically annotated texts, reference grammars, papers, and other materials. The literature database contains more than 150 works and links more than 200 grammatical category concepts across the ethnic languages, and the knowledge-linking results for the case category are analyzed. Preliminary findings indicate that the accuracy, consistency, and standardization of ethnic language data deserve attention; that China's ethnic languages are typologically very rich, with linguistic diversity carrying cultural diversity; and that the connections among items of linguistic knowledge reveal commonalities and differences among the languages and cultures of different ethnic groups, prompting researchers to reflect on and explore the genetic relationships and cultural exchange among ethnic languages.
Classification number: H2 [Language and Script: Minority Languages]