期刊文献+

基于语义事务信息聚类的用户概貌构建 被引量:1

Building user profile based on semantic transaction information clustering
在线阅读 下载PDF
导出
摘要 为了揭示用户的访问模式,对传统的基于聚类技术构建用户概貌方法进行了研究,同时引入语义事务分析的观点,提出一种基于潜在语义模型构建用户概貌的方法。通过语义分析中的奇异值分解(SVD)算法,将构建的用户会话-浏览页面矩阵向量空间投影到潜在语义向量空间;利用扩展的K-means聚类算法,对潜在语义向量空间聚类生成用户会话聚类;计算浏览页面均值向量,构建以加权浏览页面集表示的用户概貌;最后采用加权平均访问百分比(WAVP)方法评价构建的用户概貌,表明了该方法的有效性。 To reveal the user’s access pattern,a method of building user profile based on clustering technology is researched,meanwhile,the semantic analysis view and an approach based on latent semantic analysis is introduced to construct user profilesd.Firstly,user session-pageview matrix vector space is constructed and singular value decomposition(SVD) algorithm is applied to project into latent semantic vector space.An extended K-means clustering algorithm is performed on latent semantic vector space to generate user session clusters.Then,the pageview mean vector is computed to construct user profiles expressed in the form of a weighted pageview collection.Finally,the weighted average visit percentage(WAVP) is adopted to evaluate the constructed user profiles,and the validity and efficiency of the presented method is demonstrated.
出处 《计算机工程与设计》 CSCD 北大核心 2010年第20期4497-4499,4523,共4页 Computer Engineering and Design
基金 江苏省高等科学基金项目(09KJB52003)
关键词 潜在语义分析 用户概貌 聚类 用户事务信息 语义向量空间 奇异值分解 latent semantic analysis user profile clustering user transaction information semantic vector space singular value decomposition
  • 相关文献

参考文献8

  • 1Dimitrios Pierrakos,Georgios Paliouras,Christos Papatheodorou,et al.User modeling and user-adapted interaction[C].Netherlands:Khiwer Academic Publishnrs,2003:311-372.
  • 2Shokry RA,Saad AA,El-Makkey NM,et al.Using new soft clustering technique in adaptive web site[C].Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.Washington.DC,USA:IEEE Computer Society,2006:281-286.
  • 3Chad A Williams,Bamshad Mobasher,Robin Burke,et al.Advances in web mining and web usage analysis[M].Network,USA:Springer Berlin/Heidelberg,2007:167-186.
  • 4刘云峰,齐欢,代建民.潜在语义分析在中文信息处理中的应用[J].计算机工程与应用,2005,41(3):91-93. 被引量:18
  • 5Jin Xiaoqing,Wei Yimin.Numerical linear algebra and application[M].Beijing:Science Press,2005:55-60.
  • 6Pawan Lingras,Chad? West.Interval set clustering of web users with rough K-means[J].journal of Intelligent Information Systems,2004,23(1):5-16.
  • 7Ian Davidson,Ravi S S.Using instance-level constraints in agglomerative hierarchical clustering:Theoretical and empirical resuits[J].Data Mining and Knowledge Discovery,2009,18 (2):257-282.
  • 8Ai-BoSong,Mao-XianZhao,Zuo-PengLiang,Yi-ShengDong,Jun-ZhouLuo.Discovering User Profiles for Web Personalized Recommendation[J].Journal of Computer Science & Technology,2004,19(3):320-328. 被引量:2

二级参考文献8

  • 1Deerweter S,Dumais S T,Furnas G W et al. Indexing by Latent Semantic Analysis[J].Journal of the American Society for Information Science, 1990 ;41 (6): 391~407.
  • 2Ding C.A Similarity-Based Probability Model For Latent Semantic Indexing[C].In:Proc of ACM SIGIR Conf, 1999.
  • 3Graesser A C et al.A Simulation of A Human Tutor[J].Journal of Cognitive Systems Research, 1999; 1: 35~51.
  • 4Peter W Foltz ,Darrell Laham,Thomas K Landauer. The Intelligent Essay Assessor:Applications to Educational Technology[J].Interactive Multimedia Electronic Journal of Computer-Enhanced Learning,1999;1(2).
  • 5Papadimitriou C,Raghavan P,Tamaki H et al. Latent Semantic Indexing:A Probabilistic Analysis[J].Journal of Computer and System Sciences, 2000; 61 (2): 217~235.
  • 6Golub G,Loan C V.Matrix Computations(2-nd ed)[M]John-Hopkins,Baltimore, 1986.
  • 7宋擒豹,沈钧毅.Web日志的高效多能挖掘算法[J].计算机研究与发展,2001,38(3):328-333. 被引量:115
  • 8陈宁,陈安,周龙骧,CHEN Ning.大规模交易数据库的一种有效聚类算法(英文)[J].软件学报,2001,12(4):475-484. 被引量:17

共引文献18

同被引文献14

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部