结合MSCRs与MSERs的自然场景文本检测  被引量:18

Natural scene text detection method by integrating MSCRs into MSERs

在线阅读下载全文

作  者:易尧华[1] 申春辉[1] 刘菊华[1] 卢利琼[1] 

机构地区:[1]武汉大学印刷与包装系,武汉430072

出  处:《中国图象图形学报》2017年第2期154-160,共7页Journal of Image and Graphics

基  金:国家自然科学基金项目(61601335);国家科技支撑计划资助项目(2013BAH03B01);国家测绘地理信息局卫星测绘技术与应用重点实验室经费资助项目(KLSMTA-2016-04);中国博士后科学基金项目(2015M582277);中央高校基本科研业务费专项基金项目(2042015kf0059)~~

摘  要:目的目前,基于MSERs(maximally stable extremal regions)的文本检测方法是自然场景图像文本检测的主流方法。但是自然场景图像中部分文本的背景复杂多变,MSERs算法无法将其准确提取出来,降低了该类方法的鲁棒性。本文针对自然场景图像文本背景复杂多变的特点,将MSCRs(maximally stable color regions)算法用于自然场景文本检测,提出一种结合MSCRs与MSERs的自然场景文本检测方法。方法首先采用MSCRs算法与MSERs算法提取候选字符区域;然后利用候选字符区域的纹理特征训练随机森林字符分类器,对候选字符区域进行分类,从而得到字符区域;最后,依据字符区域的彩色一致性和几何邻接关系对字符进行合并,得到最终文本检测结果。结果本文方法在ICDAR 2013上的召回率、准确率和F值分别为71.9%、84.1%和77.5%,相对于其他方法的召回率和F值均有所提高。结论本文方法对自然场景图像文本检测具有较强的鲁棒性,实验结果验证了本文方法的有效性。Objective Text detection methods based on the maximally stable extremal regions (MSERs) algorithm are now widely used in natural scene text detection. However, text regions in natural scene images can have complex backgrounds that differ from those in documents and business cards, which cannot be accurately extracted by the MSERs algorithm. A text detection method is proposed for natural scene images by integrating the maximally stable color regions (MSCRs) into MSERs in this study to overcome the said problem. Method The character candidates are first extracted with both the MSCRs and MSERs algorithms. Parts of the non-character candidates are then eliminated according to the geometric information. The texture features are exploited to distinguish the character and non-character candidates, and a random forest character classifier is trained. The non-character candidates are then eliminated according to the classification result of the character classifier. Finally, the single character candidates are grouped into text regions according to the color similarityand geometric adjacency information. Result The proposed natural scene text detection method achieved 71.9% , 84. 1% , and 77.5% in recall rate, precision rate, and f-score on the ICDAR 2013 database, respectively. The recall rate and f-score improved, unlike other state-of-the-art methods. Conclusion The proposed text detection method is robust for natural scene images, and experimental results show the effectiveness of the proposed method.

关 键 词:自然场景 复杂背景 文本检测 MSCRs MSERs 

分 类 号:TP391.4[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象