Printed Image Layout Segmentation Method Based on Chinese Character Connected Components  Cited by: 3


Authors: Fu Lujing (付芦静)[1], Qian Junhao (钱军浩)[1], Zhong Yunfei (钟云飞)[2]

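Affiliations: [1] School of Internet of Things Engineering, Jiangnan University, Wuxi, Jiangsu 214122, China; [2] School of Packaging and Materials Engineering, Hunan University of Technology, Zhuzhou, Hunan 412007, China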

Source: Computer Engineering and Applications (《计算机工程与应用》), 2015, No. 5, pp. 178-182 (5 pages)

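Funding: Key Project of the Natural Science Foundation of Hunan Province (No. 10JJ2048); Natural Science Research Project of Hunan University of Technology (No. 2011HZX03); Provincial-Municipal Joint Fund Project of the Natural Science Foundation of Hunan Province (No. 12JJ9043)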

Abstract: Color printed images have richly colored backgrounds, and a single Chinese character often consists of several connected components, so connected-domain text segmentation algorithms cannot extract text accurately. To address this, a layout segmentation method for color printed images based on Chinese character connected components is proposed. The image is first preprocessed with a pyramid-transform inverse halftoning algorithm. Image colors are then segmented through color sampling and mean shift, and the text connected components are labeled. The connected components belonging to each Chinese character are reconstructed according to character structure and connected-component features. Finally, the connection relations among text connected components are analyzed to determine the text orientation and complete the text segmentation. Experimental results show that the method effectively reconstructs Chinese character connected components and segments text of different fonts, font sizes, and colors in color printed images.
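The reconstruction step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not state the merging rule, so the rule used here (merge bounding boxes whose gap is at most `max_gap` pixels) is an assumption, as are the function names `label_components`, `box_gap`, and `merge_boxes`. The input is assumed to be an already binarized grid in which 1 marks a text pixel.

```python
from collections import deque


def label_components(grid):
    """Return bounding boxes (top, left, bottom, right) of 8-connected regions of 1s."""
    rows, cols = len(grid), len(grid[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill from an unvisited text pixel.
            seen[r][c] = True
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and grid[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def box_gap(a, b):
    """Number of empty pixels separating two boxes (0 if they touch or overlap)."""
    dy = max(a[0] - b[2] - 1, b[0] - a[2] - 1, 0)
    dx = max(a[1] - b[3] - 1, b[1] - a[3] - 1, 0)
    return max(dy, dx)


def merge_boxes(boxes, max_gap):
    """Merge boxes closer than max_gap until no pair qualifies (assumed rule)."""
    boxes = list(boxes)
    merged = True
    while merged:
        merged = False
        for i in range(len(boxes)):
            for j in range(i + 1, len(boxes)):
                if box_gap(boxes[i], boxes[j]) <= max_gap:
                    a, b = boxes[i], boxes[j]
                    boxes[i] = (min(a[0], b[0]), min(a[1], b[1]),
                                max(a[2], b[2]), max(a[3], b[3]))
                    del boxes[j]
                    merged = True
                    break
            if merged:
                break
    return boxes


# Two toy "characters", each made of two separate horizontal strokes.
grid = [
    [1, 1, 1, 0, 0, 0, 0, 1, 1, 1],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
    [1, 1, 1, 0, 0, 0, 0, 1, 1, 1],
]
components = label_components(grid)
characters = merge_boxes(components, max_gap=1)
print(len(components), characters)
```

In this toy grid the four strokes are labeled as four components and then grouped into two character boxes, because the one-pixel gap between strokes is within `max_gap` while the four-pixel gap between characters is not. A fixed pixel threshold is only a stand-in; a rule derived from character structure, as the abstract describes, would presumably depend on stroke and character size.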

Keywords: text segmentation; connected component reconstruction; inverse halftoning; color sampling; mean shift; cluster center

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]

 
