Network Blog Copyright Protection Dual Watermarking Algorithm Based on Natural Language Processing Technology  (Cited by: 2)


Authors: Zhu Qian (朱倩)[1], Cheng Xianyi (程显毅)[1], Ding Liu (丁镠)[1], Gao Fei (高飞)[1]

Affiliation: [1] School of Computer Science and Communication Engineering, Jiangsu University

Source: Journal of Nanjing University (Natural Science), 2010, No. 2, pp. 140-148 (9 pages)

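Funding: National Natural Science Foundation of China (60702056); 2009 Jiangsu Province Graduate Innovation Program (CX09B-204Z); Jiangsu University Senior Talent Start-up Fund (07JDG046)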

Abstract: This paper proposes a text watermarking algorithm based on knowledge of Chinese characters. It belongs to the class of natural-language text watermarking algorithms and leaves the semantics of each sentence unchanged during embedding. A sentence is made up of words, a word of one or more characters, and a character of radicals and strokes. The algorithm first segments each sentence into words according to its semantics, then performs calculations on each word's character count, stroke count, and similar properties to obtain a feature value for the sentence, and finally embeds the watermark information on that basis. For text images, the watermark is embedded in the visually important components, which yields better robustness. To address the illegal copying, appropriation, and distribution of blog articles and images, a dual-watermark copyright protection algorithm is proposed that combines natural language processing with electronic signature technology. The basic idea is to embed the processed copyright authentication information twice, with the second embedding building on the first. Encryption additionally makes the embedded information harder to crack or tamper with. Experiments show that the algorithm is robust and resistant to detection, and that when an article or image is illegally copied or distributed, or when infringement occurs, its copyright ownership can be identified quickly and easily.
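The sentence-feature step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual feature formula or embedding rule, so everything below is an assumption made for illustration: the tiny `STROKES` table, the formula in `sentence_feature` (character count times stroke count, summed over words), the parity rule in `feature_bit`, and the idea in `embed_bit` of picking a semantically equivalent variant whose bit matches the watermark bit. Sentences are assumed to be already segmented into words.

```python
# Toy stroke-count table for simplified Chinese characters.
# Hypothetical and deliberately tiny; a real system needs a full dictionary.
STROKES = {"我": 7, "爱": 10, "中": 4, "国": 8, "人": 2}


def sentence_feature(words):
    """Combine each word's character count and stroke count into one integer.

    The formula (len(word) * strokes, summed) is an assumed stand-in,
    not the formula from the paper.
    """
    total = 0
    for word in words:
        strokes = sum(STROKES[ch] for ch in word)
        total += len(word) * strokes
    return total


def feature_bit(words):
    """Reduce the sentence feature value to a single bit (assumed: parity)."""
    return sentence_feature(words) % 2


def embed_bit(candidates, bit):
    """Return the first candidate sentence whose feature bit equals `bit`.

    `candidates` is a list of pre-segmented sentence variants that are
    assumed to carry the same meaning. Returns None if no variant fits.
    """
    for words in candidates:
        if feature_bit(words) == bit:
            return words
    return None


if __name__ == "__main__":
    original = ["我", "爱", "中国"]
    variant = ["爱", "中国"]  # stand-in for a same-meaning rewrite
    print(sentence_feature(original), feature_bit(original))
    print(embed_bit([original, variant], 0))
```

Because extraction only recomputes `feature_bit` on the received sentences, a scheme of this shape needs no side channel beyond the shared segmentation rules and stroke table, which is one plausible reason for building the feature from intrinsic properties of the characters.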

Keywords: copyright protection; blog; dual watermark; digital watermarking; natural language processing

Classification number: TP391 [Automation and Computer Technology: Computer Application Technology]

 
