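Authors: Fang Tianhong (方天红) [1,2], Chen Qinghu (陈庆虎) [2], Yan Yuchen (鄢煜尘) [2], Zhou Qianjin (周前进) [2]
Affiliations: [1] School of Physics and Electronic-Information Engineering, Hubei Engineering University, Xiaogan 432000, Hubei; [2] School of Electronic Information, Wuhan University, Wuhan 430072, Hubei
Source: Engineering Journal of Wuhan University (《武汉大学学报(工学版)》), 2016, No. 1, pp. 154-160 (7 pages)
Funding: Major Project of the Ministry of Public Security (No. 2014JSYJA017); Science and Technology Research Project of the Hubei Provincial Department of Education (No. B2015033); Scientific Research Project of Hubei Engineering University (No. 201511)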
Abstract: To address the automatic identification of computer-printed documents, a print document identification algorithm is proposed that is based on statistical texture features computed from the gray-level co-occurrence matrix (GLCM) of microscopically magnified images of Chinese characters. First, the influence of the laser printer's transmission system on the latent image of a printed character is analyzed using a theoretical model. Next, 22 GLCM statistical texture features are computed for each character image, and the ReliefF algorithm is used for feature selection. Finally, the GLCM texture features of the microscopic character image are extracted along the laser scanning direction and the paper feed direction and then fused, and two classifiers, a nearest neighbor classifier and a support vector machine, are used for classification and identification. Experiments on two sample sets show that feature fusion improves identification performance and that the support vector machine outperforms the nearest neighbor classifier. On the same-character sample set without repeated samples, the classification accuracy and average recall are 96.5% and 96.64%, respectively; on the same-character sample set with repeated samples, they are 98% and 98.18%, respectively. The classification accuracy for laser printer brand is 98%. These results indicate that the method identifies and classifies printed documents well.
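Keywords: printed documents; gray-level co-occurrence matrix; feature selection; support vector machine; statistical texture features
Classification code: TP391.4 [Automation and Computer Technology, Computer Application Technology]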
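
The pipeline summarized in the abstract (directional GLCM texture features, fusion of the two directions, then classification) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that the laser scanning direction corresponds to the horizontal image axis and the paper feed direction to the vertical axis. It computes only four common GLCM statistics rather than the paper's 22, skips ReliefF feature selection, and uses only a nearest neighbor classifier. The function names (`glcm`, `texture_features`, `fused_features`, `nearest_neighbor`) and the quantization to 8 gray levels are hypothetical choices made for illustration.

```python
import numpy as np

def glcm(image, dx, dy, levels=8):
    """Normalized, symmetric gray-level co-occurrence matrix for a
    non-negative pixel offset (dx, dy). Assumes 8-bit gray values."""
    img = np.asarray(image)
    # Quantize 0..255 gray values into `levels` bins (illustrative choice).
    q = (img.astype(np.int64) * levels) // 256
    h, w = q.shape
    # Pair each pixel with its neighbor at (row + dy, col + dx).
    src = q[0:h - dy, 0:w - dx]
    dst = q[dy:h, dx:w]
    m = np.zeros((levels, levels), dtype=np.float64)
    np.add.at(m, (src.ravel(), dst.ravel()), 1.0)
    m = m + m.T  # count every pair in both orders
    return m / m.sum()

def texture_features(p):
    """Four GLCM statistics: contrast, energy, homogeneity, entropy."""
    i, j = np.indices(p.shape)
    contrast = np.sum(p * (i - j) ** 2)
    energy = np.sum(p ** 2)
    homogeneity = np.sum(p / (1.0 + (i - j) ** 2))
    nz = p[p > 0]
    entropy = -np.sum(nz * np.log2(nz))
    return np.array([contrast, energy, homogeneity, entropy])

def fused_features(image, levels=8):
    """Concatenate features from the two directions (simple fusion).
    Assumption: horizontal = laser scan, vertical = paper feed."""
    scan = texture_features(glcm(image, dx=1, dy=0, levels=levels))
    feed = texture_features(glcm(image, dx=0, dy=1, levels=levels))
    return np.concatenate([scan, feed])

def nearest_neighbor(train_x, train_y, query):
    """Return the label of the training vector closest to `query`."""
    d = np.linalg.norm(np.asarray(train_x) - query, axis=1)
    return train_y[int(np.argmin(d))]
```

In practice one would likely standardize the features before measuring distances and evaluate on held-out character images. The sketch omits both steps for brevity.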