Chinese Text Recognition in Electromagnetic Emission Reconstructed Images  Cited by: 1


Authors: LV Zhiqiang [1,2]; ZHANG Lei; XIA Yuqi [1,2]; ZHANG Ning

Affiliations: [1] The 4th Laboratory, Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China; [2] School of Cyber Security, University of Chinese Academy of Sciences, Beijing 100093, China

Source: Journal of Cyber Security (《信息安全学报》), 2021, No. 3, pp. 212-226 (15 pages)

Funding: Supported by the National Key R&D Program of China (No. 2018YFF01014303).

Abstract: Electromagnetic leakage occurs during display signal transmission in modern computers, so an eavesdropper who receives and restores the emitted signal can reconstruct the information shown on the target computer's display. Images reconstructed this way are heavily corrupted by noise, which makes their text content difficult to recognize. This paper proposes a new model, the Feature Enhancement based Neural Network (FENN), to recognize Chinese text lines in images reconstructed from electromagnetic leakage. The model combines a denoising autoencoder (DAE) with a convolutional neural network (CNN) to enhance text features and suppress noise interference, and feeds the resulting robust features into a subsequent recurrent neural network (RNN) without losing the information in the original image. Finally, the Connectionist Temporal Classification (CTC) loss and the mean squared error (MSE) loss are combined into a joint loss used to train the model jointly, so that Chinese text can be recognized without denoising or other conventional preprocessing. The model was tested on real-scene data reconstructed from electromagnetic leakage and on the public datasets RCTW17 and CASIA-10k. Compared with common mainstream recognition models, FENN improves the Chinese recognition rate on reconstructed images by up to 5.4%, a clear advantage.
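The joint loss mentioned in the abstract can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation. The abstract does not say how the two terms are weighted or what the MSE term is computed against, so the sketch assumes a simple weighted sum with a hypothetical weight `lam`, and an MSE taken between a reconstructed image and a clean reference image. The function names `ctc_loss`, `mse_loss`, and `joint_loss` are illustrative only.

```python
import math


def mse_loss(recon, clean):
    """Mean squared error between two equally sized flat lists of pixel values."""
    return sum((r - c) ** 2 for r, c in zip(recon, clean)) / len(clean)


def _logsumexp(*xs):
    """Numerically stable log(sum(exp(x))) that tolerates -inf entries."""
    m = max(xs)
    if m == -math.inf:
        return -math.inf
    return m + math.log(sum(math.exp(x - m) for x in xs))


def ctc_loss(log_probs, target, blank=0):
    """Negative log-likelihood of `target` under CTC (forward algorithm).

    log_probs: T x C nested list of per-frame log-probabilities.
    target:    list of label indices without blanks.
    """
    # Interleave blanks: [a, b] -> [blank, a, blank, b, blank]
    ext = [blank]
    for label in target:
        ext += [label, blank]
    S, T = len(ext), len(log_probs)

    alpha = [-math.inf] * S
    alpha[0] = log_probs[0][ext[0]]
    if S > 1:
        alpha[1] = log_probs[0][ext[1]]

    for t in range(1, T):
        prev, alpha = alpha, [-math.inf] * S
        for s in range(S):
            cands = [prev[s]]
            if s >= 1:
                cands.append(prev[s - 1])
            # Skipping a blank is allowed only between two different labels.
            if s >= 2 and ext[s] != blank and ext[s] != ext[s - 2]:
                cands.append(prev[s - 2])
            alpha[s] = _logsumexp(*cands) + log_probs[t][ext[s]]

    total = _logsumexp(alpha[S - 1], alpha[S - 2]) if S > 1 else alpha[0]
    return -total


def joint_loss(log_probs, target, recon, clean, lam=1.0):
    """Assumed form of the joint objective: CTC + lam * MSE (lam is hypothetical)."""
    return ctc_loss(log_probs, target) + lam * mse_loss(recon, clean)


if __name__ == "__main__":
    # Two frames, two classes (blank=0, one character=1), uniform predictions.
    frames = [[math.log(0.5), math.log(0.5)]] * 2
    print(joint_loss(frames, [1], recon=[0.0, 1.0], clean=[0.0, 0.0], lam=2.0))
```

In a setup like this, the MSE term would push the autoencoder branch toward clean reconstructions while the CTC term trains the recognizer, with both gradients flowing through the shared feature extractor. How FENN actually balances the two terms is not stated in this record.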

Keywords: electromagnetic leakage; denoising autoencoder; feature enhancement; Chinese text recognition; neural network

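Classification No.: TP183 [Automation and Computer Technology / Control Theory and Control Engineering]; TP309.2 [Automation and Computer Technology / Control Science and Engineering]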

 
