检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:臧国全[1]
出 处:《图书情报知识》2010年第3期62-67,共6页Documentation,Information & Knowledge
基 金:河南省高校科技创新人才支持计划(2008-551)资助
摘 要:基于英国国家图书馆的Reshelp和Burney两个古旧英文报纸数字化项目,作者对文本型数字图像的OCR识别的准确度进行测试实验,结果显示整体准确度不高,且从高到低依次为字符、单词、重要单词、大写字母开头的重要单词。然后,将OCR识别周期划分为数字扫描对象的获取、数字图像的生产、数字图像的处理和文本识别等四个阶段,分析每个阶段影响准确度的因素,探讨提高准确度的具体措施。The following two aspects are discussed in this paper: ( 1 ) based on Reshelp and Burney historic English newspaper digitization projects in British Library, the author does an experiment on OCR accuracy measurement, and the result shows that the overall accuracies are not very good, and the sequence from high to low is characters, words, significant words and words start with capital letter; (2) based on the four stages of OCR period which are digital scanning object obtainment, digital image production, digital image process and text recognition, the author analyses the accuracy influencing factors and discusses the measures for improving the accuracy.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.28