A Window-Based Skew Detection Method for OCR Page Images (基于视窗的OCR页面图像倾斜检测方法)  Cited by: 2

Skew Document Image Detection Method Based on Windows Transform


Authors: Jin Cong (靳从)[1], Wei Zhilai (魏之来)[1], Yang Jingyu (杨静宇)[1]

Affiliation: [1] Department of Computer Science, Nanjing University of Science and Technology, Nanjing 210094

Source: Journal of Image and Graphics, Series A (《中国图象图形学报(A辑)》), 2004, No. 11, pp. 1290-1293 (4 pages)

Abstract: During OCR (optical character recognition) scanning, document images are usually skewed to some extent. When the skew angle is too large, it degrades subsequent layout analysis and lowers recognition accuracy, because layout analysis and character recognition algorithms are very sensitive to page skew. Skew detection is therefore an important preprocessing step in document analysis. To detect the skew angle quickly and accurately while reducing computation, this paper presents a skew detection method based on window analysis. First, suitable windows are chosen that lie within the page layout rather than in the margins. Second, each window image is preprocessed with a method suited to its content type, such as tables, text lines, or images. Third, to reduce the computational load, the details of the text lines and images in the window are blurred. Fourth, the edges of the blurred regions are detected. Finally, a straight line is fitted to the edges to obtain the skew angle. Experimental results show that the method detects the skew angles of many kinds of document images efficiently and accurately, and that it is sufficiently adaptable.
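The last three steps named in the abstract (blur the window, take the edges of the blurred region, fit a straight line) can be illustrated with a short sketch. This is not the authors' implementation, which the abstract does not specify. As stand-ins, it assumes horizontal run-length smearing for the blurring step, the first ink pixel in each column for the edge step, and a least-squares fit with `numpy.polyfit` for the line fitting. The function names and the `max_gap` parameter are hypothetical, and the input is assumed to be a binary window (nonzero = ink) containing a single text line.

```python
import numpy as np


def smear_rows(window, max_gap):
    """Blur stand-in: fill background runs of at most max_gap pixels
    that lie between two ink pixels on the same row."""
    out = np.asarray(window).astype(bool)  # astype returns a fresh copy
    for row in out:  # each row is a view, so edits land in `out`
        ink = np.flatnonzero(row)
        for start, step in zip(ink[:-1], np.diff(ink)):
            if 1 < step <= max_gap + 1:
                row[start + 1:start + step] = True
    return out


def top_edge(blurred):
    """Edge stand-in: for every column that contains ink, return the
    column index and the row index of its first (topmost) ink pixel."""
    cols = np.flatnonzero(blurred.any(axis=0))
    rows = blurred[:, cols].argmax(axis=0)  # first True in each column
    return cols, rows


def estimate_skew_degrees(window, max_gap=8):
    """Fit a straight line to the top edge of the smeared window.

    Rows grow downward in image coordinates, so a positive result
    means the text line descends from left to right.
    """
    blurred = smear_rows(window, max_gap)
    cols, rows = top_edge(blurred)
    if cols.size < 2:
        raise ValueError("window contains too little ink to fit a line")
    slope, _intercept = np.polyfit(cols, rows, 1)
    return float(np.degrees(np.arctan(slope)))
```

Calling `estimate_skew_degrees(window)` on a binarized window returns the estimated angle in degrees. Smearing before edge extraction is what keeps the fit cheap: the line is fitted to one point per column instead of to every character pixel.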

Keywords: page; skew detection; window; image; OCR; skew angle; document; adverse effects; rapid detection

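Classification: TB66 [General Industrial Technology: Refrigeration Engineering]; TP391 [Automation and Computer Technology: Computer Application Technology]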

 
