基于笔划合并的手写体信函地址汉字切分识别  被引量:8

Handwritten Chinese address segmentation and recognition based on merging strokes

在线阅读下载全文

作  者:王嵘[1] 丁晓青[1] 刘长松[1] 

机构地区:[1]清华大学电子工程系智能技术与系统国家重点实验室,北京100084

出  处:《清华大学学报(自然科学版)》2004年第4期498-502,共5页Journal of Tsinghua University(Science and Technology)

基  金:国家"八六三"高技术项目(2001AA114081)

摘  要:为了自动地处理存在着大量的笔划交叉与粘连的实际信函地址行,采用了一种基于笔划提取合并的手写体汉字切分识别方法。对于从实际信函中提取出的单行地址文本图像,首先提取出字符的横、竖、撇、捺等笔划,再根据一定的准则将笔划合并成字根,最终应用与地址解释相结合的动态规划算法得到最终的切分结果,获得投递区域。用从邮政分拣机上获得的443个信函地址行二值图像样本进行测试,省市一级和市县一级投递地址的正确识别率已经达到了66%。The recognition accuracy of Chinese characters in the handwritten address line of letters for automatic mail processing, especially for characters with overlapped or crossed strokes, was improved using a segmentation method to extract and merge strokes. The strokes were extracted from the address line image and classified into four direction types, horizontal, vertical, right slanting, and left slanting strokes. Then, the strokes were merged into radicals. After the dynamic interpretation of the address, the final segmentation result and the sorting area were interpreted. An experiment was then performed on 443 unconstrained handwritten address lines, which were extracted from a real postal sorting machine. The algorithm gave correct sorting rates for the province and city names of up to 66%.

关 键 词:笔划合并 文字识别 汉字切分 手写体汉字 信函地址 自动处理方式 信函处理 

分 类 号:TP391.43[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象