基于“AI+众包”的文献数字化实践探析——以Transkribus平台为例  

Typical Cases and Enlightenments of Document Digitization Based on“AI+Crowdsourcing”:Taking Transkribus Platform as an Example

在线阅读下载全文

作  者:于英香[1] 程裕 Yu Yingxiang;Cheng Yu(College of Cultural Heritage and Information Management,Shanghai University,Shanghai 200444,China)

机构地区:[1]上海大学文化遗产与信息管理学院,上海200444

出  处:《图书情报研究》2025年第1期12-21,共10页Library and Information Studies

基  金:国家社科基金一般项目“档案服务智慧化转型的实现机制与推进策略研究”(项目编号:24BTQ024)的研究成果之一。

摘  要:[目的/意义]近年来,人工智能的飞速发展使得机器对海量历史文献数字化成为可能,但我国以手写OCR识别为代表的基于AI的数字化实践仍处于发展阶段,且融合众包模式的平台较少。本文以国外文献数字化平台Transkribus为例,探讨其在“AI+众包”文献数字化方面的经验,以期为我国开展以AI为核心的文献资源数字化研究和实践提供借鉴。[方法/过程]结合文献和网络调研,采用案例分析法,从建设动因、基础要素、保障条件几个方面深入剖析目前持续运营且不断更新的“AI+众包”文献数字化平台Transkribus,并探讨对我国文献数字化的启示。[结果/结论]总结Transkribus平台的成功经验,提出我国应强化AI技术在数字化转型中的核心作用,提升数字化工作效率与质量;引入众包模式助力“AI+”文献数字化工作,完善数字化工作模式与流程;加强机构协同与合作,推动文献数字化和文化遗产保护工作。[Purpose/significance]In recent years,the rapid development of artificial intelligence has made it possible for machines to digitize massive historical documents,but the digitization practice based on AI,represented by handwritten OCR recognition,is still in the development stage in China,and there are few platforms based on crowdsourcing.This study takes Transkribus,a foreign document digitization platform,as an e xample to d iscuss i ts d ocument d igitization e xperience i n“AI+c rowdsourcing”,i n order to provide reference for China to carry out AI-centered document digitization research and practice.[Method/process]Combined with literature and network research,this study uses case analysis method to deeply analyze Transkribus,the“AI+crowdsourcing”document digitization platform that is continuously operating and constantly updated,from the aspects of platform construction motivation,basic elements and guarantee conditions.And this study discusses its enlightenment to China’s document digitization.[Result/conclusion]After summarizing the successful experience of Transkribus platform,it is proposed that China should strengthen the core role of AI technology in digital transformation and improve the efficiency and quality of digital work;Introduce crowdsourcing mode to help“AI+”literature digitization work,and improve digital work mode and process;Strengthen institutional coordination and cooperation to promote the digitization of documents and the protection of cultural heritage.

关 键 词:文献数字化 人工智能 众包 Transkribus 

分 类 号:G270.7[文化科学—档案学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象