
Network blog copyright protection dual watermarking algorithm based on natural language processing technology (cited by: 3)
Abstract: This paper proposes a text watermarking algorithm based on knowledge of Chinese characters. It belongs to the class of natural language text watermarking algorithms and keeps the semantics of each sentence unchanged during embedding. A sentence is first divided into words, each word into Chinese characters, and each character into its radical components. The algorithm segments each sentence into words according to its semantics, performs calculations on properties of the segmented words such as character count and stroke count, derives a feature value for the sentence, and then embeds the watermark information. For text images, the watermarking algorithm embeds the watermark in visually important components, which yields better robustness. To address the illegal copying, misappropriation, and redistribution of articles and images on network blogs, a dual watermark copyright protection algorithm is proposed that combines natural language processing with electronic signature technology. The basic idea is to embed the processed copyright authentication information twice, with the second embedding building on the first. In addition, encryption makes the information harder to crack and tamper with. Experiments show that the algorithm has good robustness and strong resistance to detection, and that when an article or image is illegally copied or distributed, or when infringement occurs, its copyright ownership can be identified quickly and conveniently.
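Source: Journal of Nanjing University (Natural Science) (《南京大学学报(自然科学版)》); indexed in CAS, CSCD, and the Peking University Core Journals list; 2010, No. 2, pp. 140-148 (9 pages)
Funding: National Natural Science Foundation of China (60702056); 2009 Jiangsu Province Graduate Innovation Program (CX09B-204Z); Jiangsu University Senior Talent Start-up Fund (07JDG046)
Keywords: copyright protection, blog, double watermark, digital watermark, natural language processing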
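
The feature-value step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the formula, so everything here is an assumption made for illustration: the word value (character count plus total stroke count), the use of parity as the sentence's feature bit, the tiny `STROKES` table, and all function names. Word segmentation is assumed to have been done already.

```python
# Hypothetical stroke table covering only the characters used in this sketch.
STROKES = {"我": 7, "们": 5, "保": 9, "护": 7, "版": 8, "权": 6}


def word_value(word):
    """Assumed per-word value: character count plus total stroke count."""
    return len(word) + sum(STROKES[ch] for ch in word)


def sentence_feature(words):
    """Assumed sentence feature: sum of the values of its segmented words."""
    return sum(word_value(w) for w in words)


def feature_bit(words):
    """Assumed mapping from feature value to one watermark bit (parity)."""
    return sentence_feature(words) % 2


def sentences_to_rewrite(sentences, bits):
    """Indices of sentences whose feature bit differs from the wanted bit.

    In a scheme of this kind, these are the sentences that would need a
    semantics-preserving rewrite before they carry the watermark bit.
    """
    return [
        i
        for i, (words, bit) in enumerate(zip(sentences, bits))
        if feature_bit(words) != bit
    ]


if __name__ == "__main__":
    segmented = [["我们", "保护", "版权"], ["保护", "权"]]
    print([sentence_feature(s) for s in segmented])
    print(sentences_to_rewrite(segmented, [1, 1]))
```

Using parity keeps the sketch short. The paper's actual feature computation and embedding procedure may differ, and the second embedding, electronic signature, and encryption stages are not modeled here.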

