期刊文献+

基于语义的毕业论文题目相似性分析 被引量:1

Similarity analysis of graduation thesis topic based on semantic
在线阅读 下载PDF
导出
摘要 在统计计算机学科专业7年毕业论文题目的基础上,从语义的角度对毕业论文题目的相似性进行了分析,为论文题目的查重及归类提供依据.首先,对毕业论文题目进行分词以得到特征词汇,进而计算特征词的权重构成论文题目的向量表示;其次,使用PLSA方法对得到的题目向量进行语义提取;最后,对语义向量进行相似度比较得出论文题目的相似性.实验结果表明,与传统的VSM方法相比,从语义角度对论文题目进行相似性比较更加合理、有效. Based on seven years of statistics for graduation thesis topic,we analyzed the similarity of graduation thesis from semantic perspective,which could provid a basis for duplicate checking and classification of graduation thesis topic.Firstly,the word segmentation operation was adopted to obtain feature words and then the graduation thesis topic's vector representation was gotten by calculating the weight of feature words.Secondly,the topics' semantic were extracted by the PLSA method.Finally,comparison of the topics' semantic vector was implemented to test the similarity of graduation thesis topic.Experimental results show that semantic similarity was more reasonable and effective compared with the traditional VSM method.
出处 《延边大学学报(自然科学版)》 CAS 2013年第2期129-133,共5页 Journal of Yanbian University(Natural Science Edition)
基金 延边大学科技发展计划项目(延大科合字(2011)第43号)
关键词 语义 毕业论文题目 概率潜在语义分析 相似性分析 semantic graduation thesis topic PLSA similarity analysis
  • 相关文献

参考文献7

  • 1彭泽映,俞晓明,许洪波,刘春阳.大规模短文本的不完全聚类[J].中文信息学报,2011,25(1):54-59. 被引量:35
  • 2路荣,项亮,刘明荣,杨青.基于隐主题分析和文本聚类的微博客中新闻话题的发现[J].模式识别与人工智能,2012,25(3):382-387. 被引量:68
  • 3王细薇,樊兴华,赵军.一种基于特征扩展的中文短文本分类方法[J].计算机应用,2009,29(3):843-845. 被引量:36
  • 4Bollegala D, Matsuo Y, Ishizuka M. Measuring se- mantic similarity between words using web search engines[C// Proe of the 16th International Confer- ence on World Wide Web. Canada: Banff, 2007: 757-766.
  • 5Hofmann T. Probabilistic latent semantic analysis [-C// Proc of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. USA: Berkeley, 1999:289- 296.
  • 6Deerwester S, Dumais S T, Landauer T K. Inde- xing by latent semantic analysis[J. Journal of the American Society for Information Science, 1990, 41 (6) : 391-407.
  • 7Hofmann T. Unsupervised learning by probabilis- tic latent semantic analysisJ-J. Machine Learning, 2001,42(1) :177 196.

二级参考文献27

共引文献132

同被引文献6

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部