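Authors: Bai Hao (白浩)[1], Zhang Xiwen (张习文)[2], Fu Yonggang (付永刚)[2], An Weihua (安维华)[2]
Affiliations: [1] College of Advanced Chinese Training, Beijing Language and Culture University, Beijing 100083; [2] Digital Media Laboratory, School of Information Science, Beijing Language and Culture University, Beijing 100083
Source: Computer Engineering and Applications (《计算机工程与应用》), 2012, No. 15, pp. 153-158 (6 pages)
Funding: National Natural Science Foundation of China (No. 60970158); Beijing Language and Culture University Youth Independent Research Support Program (No. 09JBT014); Beijing Language and Culture University Youth Independent Research Support Program, funded by the Fundamental Research Funds for the Central Universities (No. 10JBT02)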
Abstract: The segmentation result of Chinese digital ink text contains objects at three levels: characters, text lines, and paragraphs. Characters account for a large share of these objects, and their configurations are often complex. Automatic segmentation methods can rarely deliver fully correct character extraction, so the results must be corrected through human-computer interaction, and a well-designed visualization can greatly improve the efficiency of that correction. To support interactive correction of erroneous character extraction results, this paper presents a self-adaptive visualization method that handles adjacency and overlap between extracted characters. The method first generates the upright minimum bounding rectangle of each character; if adjacent rectangles overlap, it switches to convex hulls; if the hulls still overlap, it assigns colors to the character results. Applied to character extraction results from several kinds of digital ink text, the method achieved good results.
Classification: TP391.41 [Automation and Computer Technology: Computer Application Technology]
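
The three-stage fallback described in the abstract (bounding rectangle, then convex hull, then color) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the all-pairs comparison (the paper speaks of adjacent characters), the use of a separating-axis test for hull overlap, and the assumption that each character is a list of at least three non-collinear (x, y) ink points are all choices made here for illustration.

```python
from typing import List, Tuple

Point = Tuple[float, float]
Box = Tuple[float, float, float, float]  # (min_x, min_y, max_x, max_y)


def bounding_box(points: List[Point]) -> Box:
    """Upright (axis-aligned) minimum bounding rectangle of the ink points."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return (min(xs), min(ys), max(xs), max(ys))


def boxes_overlap(a: Box, b: Box) -> bool:
    """True if two rectangles share interior area (touching edges do not count)."""
    return a[0] < b[2] and b[0] < a[2] and a[1] < b[3] and b[1] < a[3]


def convex_hull(points: List[Point]) -> List[Point]:
    """Convex hull via Andrew's monotone chain, counter-clockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts

    def cross(o: Point, a: Point, b: Point) -> float:
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    lower: List[Point] = []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    upper: List[Point] = []
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]


def hulls_overlap(h1: List[Point], h2: List[Point]) -> bool:
    """Separating-axis test for two convex polygons (assumes >= 3 vertices each)."""
    for poly in (h1, h2):
        n = len(poly)
        for i in range(n):
            x1, y1 = poly[i]
            x2, y2 = poly[(i + 1) % n]
            nx, ny = y1 - y2, x2 - x1  # normal of the edge
            if nx == 0 and ny == 0:
                continue  # skip degenerate zero-length edges
            proj1 = [nx * x + ny * y for x, y in h1]
            proj2 = [nx * x + ny * y for x, y in h2]
            if max(proj1) <= min(proj2) or max(proj2) <= min(proj1):
                return False  # found an axis that separates the hulls
    return True


def choose_styles(characters: List[List[Point]]) -> List[str]:
    """Pick 'box', 'hull', or 'color' per character.

    Assumption: every pair is compared; the paper only mentions adjacent characters.
    """
    boxes = [bounding_box(c) for c in characters]
    hulls = [convex_hull(c) for c in characters]
    styles = []
    for i in range(len(characters)):
        style = "box"
        for j in range(len(characters)):
            if i == j or not boxes_overlap(boxes[i], boxes[j]):
                continue
            if hulls_overlap(hulls[i], hulls[j]):
                style = "color"  # hulls still collide: fall back to color
                break
            style = "hull"  # rectangles collide but hulls do not
        styles.append(style)
    return styles
```

In this sketch a character keeps the simplest outline that does not collide with any other character, so color is used only where neither rectangles nor hulls can separate two results.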