期刊文献+

一种基于会话聚类算法的Web使用挖掘方法 被引量:4

A novel web usage mining method based on web session clustering
在线阅读 下载PDF
导出
摘要 Web使用挖掘作为数据挖掘的一个重要任务,有助于了解用户群体的特征,从而为其提供个性化服务.提出了一种基于用户会话聚类的Web使用挖掘算法.首先,对Web日志预处理采用基于时间窗的用户会话识别方法,提出了一种基于三元组的用户会话表示方法,并在此基础上给出了基于网页语义相似性的会话处理方法,该方法能够在保持用户兴趣不变的情况下有效降低会话维度;其次,提出了一种基于时间及频次的用户会话相似性度量方法;最后,设计了一种两阶段PS-KM会话聚类算法,先用PSO方法进行全局搜索再转入基于K-means方法的局部聚类过程.仿真表明了算法的有效性. Web usage mining has been an important task of data mining. It helps to understand the user group's identity, thus provides personalized service. A novel web usage mining algorithm based on the user sessions clustering is proposed in this paper. Firstly, a time-based user session identification method is used for Web log preprocessing. Furthermore, a 3-tuple data structure is designed to represent web sessions, and a session dimensionality reduction method based on web page semantic similarity is proposed, which could deduce the length of the session effectively with user's interest retaining. Secondly, a new session similarity measure is designed based on both time and frequency. Finally, a two-stage PS-KM session clustering algorithm is proposed. The algorithm first uses PSO method to make a global search, and then uses the local clustering process based on the K-means method. Experimental results show that the algorithm has highly effective.
出处 《系统工程学报》 CSCD 北大核心 2012年第1期129-136,共8页 Journal of Systems Engineering
基金 国家自然科学基金资助项目(71101103 70925005 61074152 70771074) 教育部博士点专项科研基金资助项目(20100032120086 20090032110065 20100032110036)
关键词 WEB使用挖掘 WEB日志 用户会话 聚类 web usage mining web log user session clustering
  • 相关文献

参考文献11

  • 1Shahabi C,Zarkesh A,Adibi J,et al.Knowledge discovery from users Web-page navigation[C]// Proceedings of the 7th International Workshop on Research Issues of Data Mining,Birmingham:IEEE Computer Society,1997:20-29.
  • 2Chen M,Yu P.Data mining for path traversal patterns in a Web environment[C]// Proceedings of the 16th International Conference on Distributed Computing Systems.Washington:IEEE Computer Society,1996:27-30.
  • 3Yan T,Jacobsen M,Garcia-Molina H,et al.From user access patterns to dynamic hypertext linking[J].Computer Networks and ISDN Systems,1996,28(7):1007-1014.
  • 4Smith K,Ng A.Web page clustering using a self-organizing map of user navigation patterns[J].Decision Support Systems.2003, 35(2):245-256.
  • 5Baldi P,Frasconi P,Smyth P.Modeling the Internet and the Web[M].New York:Wiley,2003.
  • 6Alam S,Dobbie G,Riddle P.Particle swarm otimization based clustering of Web usage data[C]// Proceedings of IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.Washington:IEEE Computer Society.2008.3: 451-454.
  • 7吴瑞,宁玉富,郭长友.基于模糊粗糙近似的web浏览模式的聚类[J].系统工程学报,2010,25(1):132-136. 被引量:3
  • 8Niknam T,Amiri B.An effcient hybrid approach based on PSO,ACO and K-means for cluster analysis[J].Applied Soft Computing. 2010,10(1):183-197.
  • 9Poli R,Kennedy J,Blackwell T.Particle swarm optimization:An overview[J].Swarm Intelligence,2007,1(1):33-57.
  • 10Spiliopoulou M,Mobasher B,Berendt B,et al.A framework for the evaluation of session reconstruction heuristics in Web-usage analysis[J].Informs Journal on Computing,2003,15(2):171-190.

二级参考文献8

  • 1Pawan Lingras,Chad West.Interval Set Clustering of Web Users with Rough K-Means[J]. Journal of Intelligent Information Systems . 2004 (1)
  • 2Arotaritei Dragos,Mitra Sushmita.Web mining: a survey in the fuzzy framework. Fuzzy Sets and Systems . 2004
  • 3Supriya K D,Krishna P R.Clustering web transactions using rough approximation. Fuzzy Sets and Systems . 2004
  • 4Liu Baoding,Liu Y K.Expected value of fuzzy variable and fuzzy expected value model. IEEE Transactions on Fuzzy Systems . 2002
  • 5Liu B.Theory and Practice of Uncertain Programming. . 2002
  • 6T.Runkler,J.Beadek.Web mining with relational clustering. International Journal of Approximate Reasoning . 2003
  • 7S. Mitra.An evolutionary rough partitive clustering. Pattern Recognition . 2004
  • 8Wang X,Ha M.Note on maxmin u/E estimation. Fuzzy Sets and Systems . 1998

共引文献2

同被引文献43

引证文献4

二级引证文献29

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部