AI+专家驱动的科技文献信息资源消费端数据体系建设研究  

Building Consumption Data Systems Driven by AI Plus Expert for Scientific and Technical Literature Information Resources

在线阅读下载全文

作  者:叶光辉[1] 涂凯 胡丽娜 韩丽 冯智敏 YE Guanghui;TU Kai;HU Lina;HAN Li;FENG Zhiming(School of information management,Central China Normal University,Wuhan 430079)

机构地区:[1]华中师范大学信息管理学院,武汉430079

出  处:《农业图书情报学报》2024年第9期18-31,共14页Journal of Library and Information Science in Agriculture

基  金:教育部人文社会科学项目“面向共景治理的突发事件舆情演化计算与决策耦合模型研究”(23YJC870011)。

摘  要:[目的/意义]受限于传统文献分类体系局限,用户产生的高价值消费端标注数据还不能作为数据要素融入科技文献服务,致使科技文献服务无法顺应开放科学时代背景与满足用户读者各类知识需求。本研究旨在挖掘AI提供技术突破潜力,构建AI+专家驱动的科技文献信息资源消费端数据体系,以期推动科技文献服务优化进程。[方法/过程]首先分析了科技文献信息资源消费端数据体系建设价值表征,然后提出了科技文献信息资源消费端数据体系建设原则,再者解构与剖析了AI介入科技文献信息资源消费端数据体系建设风险。最后,根据AI介入数据标注工作的程度,设计了3种AI+专家协同用户科技文献信息资源数据标注创新模式。[结果/结论]聚焦于引领用户协同完成数据标注工作,AI+专家辅助型数据标注模式下,AI充当工具角色根据专家制定处理规则完成表层信息处理,协助用户完成数据标注;AI+专家合作型数据标注模式下,AI完成科技文献预标注标签审查工作,用户从自生成标签模式转变为评判与挑选AI生成的数据标签模式,专家辅助审核最终数据标签质量;AI+专家主导型数据标注模式下,用户提供数据标注需求,专家进行过程操作指导,数据标注由AI4S平台自动化完成。[Purpose/Significance]Limited by the constraints of traditional literature classification systems,scientific and technical literature information resources face problems such as inadequate disclosure and resource utilization.At the same time,high-quality user-generated data cannot yet be integrated as data elements into services related to scientific and technical literature services,which prevents these services from adapting to the context of the open science and meeting the diverse knowledge needs of readers.This study aims to harness the technological breakthrough potential of AI to build a consumer-end data system for scientific and technical literature information resources driven by AI and experts.This will help to overcome the shortcomings of traditional services,such as the lack of supporting reading information and low interactivity between users,with the hope of promoting the optimization process of scientific and technical literature information resource services.[Method/Process]First,the study analyzes the four-dimensional value representation of the consumer-end data systems for scientific and technical literature information resources,including the intrinsic value,the tool value,the academic value,and the future value of annotation data.Then,following the processing flow of consumer-end data,namely the collection phase,utilization phase,and management phase,the paper proposes principles for the construction of consumer-end data systems.Furthermore,the paper deconstructs and analyzes the risks associated with the involvement of AI in the construction of consumer-end data systems,including four types of risks:machine algorithm risks,annotation content risks,annotation data risks and application risks.Finally,based on the degree of AI involvement in data annotation work,three innovative models of AI plus expert collaborates with user to accomplish data annotation for scientific and technical literature information resources are designed:the AI plus expert-assisted data annotation model,the AI plus exp

关 键 词:科技文献信息资源 AI 体系建设 数据标注 模式设计 

分 类 号:G251[文化科学—图书馆学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象