期刊文献+
共找到616篇文章
< 1 2 31 >
每页显示 20 50 100
A New Algorithm for Mining Frequent Pattern 被引量:2
1
作者 李力 靳蕃 《Journal of Southwest Jiaotong University(English Edition)》 2002年第1期10-20,共11页
Mining frequent pattern in transaction database, time series databases, and many other kinds of databases have been studied popularly in data mining research. Most of the previous studies adopt Apriori like candidat... Mining frequent pattern in transaction database, time series databases, and many other kinds of databases have been studied popularly in data mining research. Most of the previous studies adopt Apriori like candidate set generation and test approach. However, candidate set generation is very costly. Han J. proposed a novel algorithm FP growth that could generate frequent pattern without candidate set. Based on the analysis of the algorithm FP growth, this paper proposes a concept of equivalent FP tree and proposes an improved algorithm, denoted as FP growth * , which is much faster in speed, and easy to realize. FP growth * adopts a modified structure of FP tree and header table, and only generates a header table in each recursive operation and projects the tree to the original FP tree. The two algorithms get the same frequent pattern set in the same transaction database, but the performance study on computer shows that the speed of the improved algorithm, FP growth * , is at least two times as fast as that of FP growth. 展开更多
关键词 data mining algorithm frequent pattern set FP growth
在线阅读 下载PDF
A Novel Incremental Mining Algorithm of Frequent Patterns for Web Usage Mining 被引量:1
2
作者 DONG Yihong ZHUANG Yueting TAI Xiaoying 《Wuhan University Journal of Natural Sciences》 CAS 2007年第5期777-782,共6页
Because data warehouse is frequently changing, incremental data leads to old knowledge which is mined formerly unavailable. In order to maintain the discovered knowledge and patterns dynamically, this study presents a... Because data warehouse is frequently changing, incremental data leads to old knowledge which is mined formerly unavailable. In order to maintain the discovered knowledge and patterns dynamically, this study presents a novel algorithm updating for global frequent patterns-IPARUC. A rapid clustering method is introduced to divide database into n parts in IPARUC firstly, where the data are similar in the same part. Then, the nodes in the tree are adjusted dynamically in inserting process by "pruning and laying back" to keep the frequency descending order so that they can be shared to approaching optimization. Finally local frequent itemsets mined from each local dataset are merged into global frequent itemsets. The results of experimental study are very encouraging. It is obvious from experiment that IPARUC is more effective and efficient than other two contrastive methods. Furthermore, there is significant application potential to a prototype of Web log Analyzer in web usage mining that can help us to discover useful knowledge effectively, even help managers making decision. 展开更多
关键词 incremental algorithm association rule frequent pattern tree web usage mining
在线阅读 下载PDF
Hybrid Reliability Parameter Selection Method Based on Text Mining, Frequent Pattern Growth Algorithm and Fuzzy Bayesian Network 被引量:1
3
作者 SHUAI Yon SONG Tailian +1 位作者 WANG Jianping ZHAN Wenbin 《Journal of Shanghai Jiaotong university(Science)》 EI 2018年第3期423-428,共6页
Reliability parameter selection is very important in the period of equipment project design and demonstration. In this paper, the problem in selecting the reliability parameters and their number is proposed. In order ... Reliability parameter selection is very important in the period of equipment project design and demonstration. In this paper, the problem in selecting the reliability parameters and their number is proposed. In order to solve this problem, the thought of text mining is used to extract the feature and curtail feature sets from text data firstly, and frequent pattern tree (FPT) of the text data is constructed to reason frequent item-set between the key factors by frequent patter growth (FPC) algorithm. Then on the basis of fuzzy Bayesian network (FBN) and sample distribution, this paper fuzzifies the key attributes, which forms associated relationship in frequent item-sets and their main parameters, eliminates the subjective influence factors and obtains condition mutual information and maximum weight directed tree among all the attribute variables. Furthermore, the hybrid model is established by reason fuzzy prior probability and contingent probability and concluding parameter learning method. Finally, the example indicates the model is believable and effective. 展开更多
关键词 reliability parameter text mining frequent pattern growth(FPG) fuzzy Bayesian network(FBN)
原文传递
Mining Maximal Frequent Patterns in a Unidirectional FP-tree 被引量:1
4
作者 宋晶晶 刘瑞新 +1 位作者 王艳 姜保庆 《Journal of Donghua University(English Edition)》 EI CAS 2006年第6期105-109,共5页
Because mining complete set of frequent patterns from dense database could be impractical, an interesting alternative has been proposed recently. Instead of mining the complete set of frequent patterns, the new model ... Because mining complete set of frequent patterns from dense database could be impractical, an interesting alternative has been proposed recently. Instead of mining the complete set of frequent patterns, the new model only finds out the maximal frequent patterns, which can generate all frequent patterns. FP-growth algorithm is one of the most efficient frequent-pattern mining methods published so far. However, because FP-tree and conditional FP-trees must be two-way traversable, a great deal memory is needed in process of mining. This paper proposes an efficient algorithm Unid_FP-Max for mining maximal frequent patterns based on unidirectional FP-tree. Because of generation method of unidirectional FP-tree and conditional unidirectional FP-trees, the algorithm reduces the space consumption to the fullest extent. With the development of two techniques: single path pruning and header table pruning which can cut down many conditional unidirectional FP-trees generated recursively in mining process, Unid_FP-Max further lowers the expense of time and space. 展开更多
关键词 data mining frequent pattern the maximal frequent pattern Unid _ FP-tree conditional Unid _ FP-tree.
在线阅读 下载PDF
Quantum Algorithm for Mining Frequent Patterns for Association Rule Mining 被引量:1
5
作者 Abdirahman Alasow Marek Perkowski 《Journal of Quantum Information Science》 CAS 2023年第1期1-23,共23页
Maximum frequent pattern generation from a large database of transactions and items for association rule mining is an important research topic in data mining. Association rule mining aims to discover interesting corre... Maximum frequent pattern generation from a large database of transactions and items for association rule mining is an important research topic in data mining. Association rule mining aims to discover interesting correlations, frequent patterns, associations, or causal structures between items hidden in a large database. By exploiting quantum computing, we propose an efficient quantum search algorithm design to discover the maximum frequent patterns. We modified Grover’s search algorithm so that a subspace of arbitrary symmetric states is used instead of the whole search space. We presented a novel quantum oracle design that employs a quantum counter to count the maximum frequent items and a quantum comparator to check with a minimum support threshold. The proposed derived algorithm increases the rate of the correct solutions since the search is only in a subspace. Furthermore, our algorithm significantly scales and optimizes the required number of qubits in design, which directly reflected positively on the performance. Our proposed design can accommodate more transactions and items and still have a good performance with a small number of qubits. 展开更多
关键词 Data mining Association Rule mining frequent pattern Apriori Algorithm Quantum Counter Quantum Comparator Grover’s Search Algorithm
在线阅读 下载PDF
High Utility Periodic Frequent Pattern Mining in Multiple Sequences
6
作者 Chien-Ming Chen Zhenzhou Zhang +1 位作者 Jimmy Ming-Tai Wu Kuruva Lakshmanna 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第10期733-759,共27页
Periodic patternmining has become a popular research subject in recent years;this approach involves the discoveryof frequently recurring patterns in a transaction sequence. However, previous algorithms for periodic pa... Periodic patternmining has become a popular research subject in recent years;this approach involves the discoveryof frequently recurring patterns in a transaction sequence. However, previous algorithms for periodic patternmining have ignored the utility (profit, value) of patterns. Additionally, these algorithms only identify periodicpatterns in a single sequence. However, identifying patterns of high utility that are common to a set of sequencesis more valuable. In several fields, identifying high-utility periodic frequent patterns in multiple sequences isimportant. In this study, an efficient algorithm called MHUPFPS was proposed to identify such patterns. To addressexisting problems, three new measures are defined: the utility, high support, and high-utility period sequenceratios. Further, a new upper bound, upSeqRa, and two new pruning properties were proposed. MHUPFPS usesa newly defined HUPFPS-list structure to significantly accelerate the reduction of the search space and improvethe overall performance of the algorithm. Furthermore, the proposed algorithmis evaluated using several datasets.The experimental results indicate that the algorithm is accurate and effective in filtering several non-high-utilityperiodic frequent patterns. 展开更多
关键词 Decision making frequent periodic pattern multi-sequence database sequential rules utility mining
在线阅读 下载PDF
Association RuleMining Frequent-Pattern-Based Intrusion Detection in Network
7
作者 S.Sivanantham V.Mohanraj +1 位作者 Y.Suresh J.Senthilkumar 《Computer Systems Science & Engineering》 SCIE EI 2023年第2期1617-1631,共15页
In the network security system,intrusion detection plays a significant role.The network security system detects the malicious actions in the network and also conforms the availability,integrity and confidentiality of da... In the network security system,intrusion detection plays a significant role.The network security system detects the malicious actions in the network and also conforms the availability,integrity and confidentiality of data informa-tion resources.Intrusion identification system can easily detect the false positive alerts.If large number of false positive alerts are created then it makes intrusion detection system as difficult to differentiate the false positive alerts from genuine attacks.Many research works have been done.The issues in the existing algo-rithms are more memory space and need more time to execute the transactions of records.This paper proposes a novel framework of network security Intrusion Detection System(IDS)using Modified Frequent Pattern(MFP-Tree)via K-means algorithm.The accuracy rate of Modified Frequent Pattern Tree(MFPT)-K means method infinding the various attacks are Normal 94.89%,for DoS based attack 98.34%,for User to Root(U2R)attacks got 96.73%,Remote to Local(R2L)got 95.89%and Probe attack got 92.67%and is optimal when it is compared with other existing algorithms of K-Means and APRIORI. 展开更多
关键词 IDS K-MEANS frequent pattern tree false alert mining L1-norm
在线阅读 下载PDF
SWFP-Miner: an efficient algorithm for mining weighted frequent pattern over data streams
8
作者 Wang Jie Zeng Yu 《High Technology Letters》 EI CAS 2012年第3期289-294,共6页
Previous weighted frequent pattern (WFP) mining algorithms are not suitable for data streams for they need multiple database scans. In this paper, we present an efficient algorithm SWFP-Miner to mine weighted freque... Previous weighted frequent pattern (WFP) mining algorithms are not suitable for data streams for they need multiple database scans. In this paper, we present an efficient algorithm SWFP-Miner to mine weighted frequent pattern over data streams. SWFP-Miner is based on sliding window and can discover important frequent pattern from the recent data. A new refined weight definition is proposed to keep the downward closure property, and two pruning strategies are presented to prune the weighted infrequent pattern. Experimental studies are performed to evaluate the effectiveness and efficiency of SWFP-Miner. 展开更多
关键词 weighted frequent pattern (WFP) mining data streams data mining slidingwindow SWFP-Miner
在线阅读 下载PDF
A related degree-based frequent pattern mining algorithm for railway fault data
9
作者 Jiaxu Guo Ding Ding +2 位作者 Peihan Yang Qi Zou Yaping Huang 《High-Speed Railway》 2024年第2期101-109,共9页
It is of great significance to improve the efficiency of railway production and operation by realizing the fault knowledge association through the efficient data mining algorithm.However,high utility quantitative freq... It is of great significance to improve the efficiency of railway production and operation by realizing the fault knowledge association through the efficient data mining algorithm.However,high utility quantitative frequent pattern mining algorithms in the field of data mining still suffer from the problems of low time-memory performance and are not easy to scale up.In the context of such needs,we propose a related degree-based frequent pattern mining algorithm,named Related High Utility Quantitative Item set Mining(RHUQI-Miner),to enable the effective mining of railway fault data.The algorithm constructs the item-related degree structure of fault data and gives a pruning optimization strategy to find frequent patterns with higher related degrees,reducing redundancy and invalid frequent patterns.Subsequently,it uses the fixed pattern length strategy to modify the utility information of the item in the mining process so that the algorithm can control the length of the output frequent pattern according to the actual data situation and further improve the performance and practicability of the algorithm.The experimental results on the real fault dataset show that RHUQI-Miner can effectively reduce the time and memory consumption in the mining process,thus providing data support for differentiated and precise maintenance strategies. 展开更多
关键词 High utility QUANTITATIVE frequent pattern mining Related degree pruning Fixed pattern length
在线阅读 下载PDF
Novel Algorithm for Mining Frequent Patterns of Moving Objects Based on Dictionary Tree Improvement
10
作者 Yi Chen Yulan Dong Dechang Pi 《国际计算机前沿大会会议论文集》 2018年第1期20-20,共1页
在线阅读 下载PDF
Fast Discovering Frequent Patterns for Incremental XML Queries
11
作者 PENGDun-lu QIUYang 《Wuhan University Journal of Natural Sciences》 EI CAS 2004年第5期638-646,共9页
It is nontrivial to maintain such discovered frequent query patterns in real XML-DBMS because the transaction database of queries may allow frequent updates and such updates may not only invalidate some existing frequ... It is nontrivial to maintain such discovered frequent query patterns in real XML-DBMS because the transaction database of queries may allow frequent updates and such updates may not only invalidate some existing frequent query patterns but also generate some new frequent query patterns. In this paper, two incremental updating algorithms, FUX-QMiner and FUXQMiner, are proposed for efficient maintenance of discovered frequent query patterns and generation the new frequent query patterns when new XMI, queries are added into the database. Experimental results from our implementation show that the proposed algorithms have good performance. Key words XML - frequent query pattern - incremental algorithm - data mining CLC number TP 311 Foudation item: Supported by the Youthful Foundation for Scientific Research of University of Shanghai for Science and TechnologyBiography: PENG Dun-lu (1974-), male, Associate professor, Ph.D, research direction: data mining, Web service and its application, peerto-peer computing. 展开更多
关键词 XML frequent query pattern incremental algorithm data mining
在线阅读 下载PDF
Adaptive associative classification with emerging frequent patterns
12
作者 Wang Xiaofeng Zhang Dapeng Shi Zhongzhi 《High Technology Letters》 EI CAS 2012年第1期38-44,共7页
In this paper, we propose an enhanced associative classification method by integrating the dynamic property in the process of associative classification. In the proposed method, we employ a support vector machine(SVM... In this paper, we propose an enhanced associative classification method by integrating the dynamic property in the process of associative classification. In the proposed method, we employ a support vector machine(SVM) based method to refine the discovered emerging ~equent patterns for classification rule extension for class label prediction. The empirical study shows that our method can be used to classify increasing resources efficiently and effectively. 展开更多
关键词 associative classification RULE frequent pattern mining emerging frequent pattern supportvector machine (SVM)
在线阅读 下载PDF
Hybrid Recommender System Using Systolic Tree for Pattern Mining
13
作者 S.Rajalakshmi K.R.Santha 《Computer Systems Science & Engineering》 SCIE EI 2023年第2期1251-1262,共12页
A recommender system is an approach performed by e-commerce for increasing smooth users’experience.Sequential pattern mining is a technique of data mining used to identify the co-occurrence relationships by taking in... A recommender system is an approach performed by e-commerce for increasing smooth users’experience.Sequential pattern mining is a technique of data mining used to identify the co-occurrence relationships by taking into account the order of transactions.This work will present the implementation of sequence pattern mining for recommender systems within the domain of e-com-merce.This work will execute the Systolic tree algorithm for mining the frequent patterns to yield feasible rules for the recommender system.The feature selec-tion's objective is to pick a feature subset having the least feature similarity as well as highest relevancy with the target class.This will mitigate the feature vector's dimensionality by eliminating redundant,irrelevant,or noisy data.This work pre-sents a new hybrid recommender system based on optimized feature selection and systolic tree.The features were extracted using Term Frequency-Inverse Docu-ment Frequency(TF-IDF),feature selection with the utilization of River Forma-tion Dynamics(RFD),and the Particle Swarm Optimization(PSO)algorithm.The systolic tree is used for pattern mining,and based on this,the recommendations are given.The proposed methods were evaluated using the MovieLens dataset,and the experimental outcomes confirmed the efficiency of the techniques.It was observed that the RFD feature selection with systolic tree frequent pattern mining with collaborativefiltering,the precision of 0.89 was achieved. 展开更多
关键词 Recommender systems hybrid recommender systems frequent pattern mining collaborativefiltering systolic tree river formation dynamics particle swarm optimization
在线阅读 下载PDF
Mining Cross-Transaction Web Usage Patterns
14
作者 Jian Chen Jian Yin Jin Huang Liangyi Ou 《通讯和计算机(中英文版)》 2005年第5期6-11,81,共7页
关键词 WEB系统 存贮器 交叉处理器 计算机技术
在线阅读 下载PDF
Fast FP-Growth for association rule mining 被引量:1
15
作者 杨明 杨萍 +1 位作者 吉根林 孙志挥 《Journal of Southeast University(English Edition)》 EI CAS 2003年第4期320-323,共4页
In this paper, we propose an efficient algorithm, called FFP-Growth (shortfor fast FP-Growth) , to mine frequent itemsets. Similar to FP-Growth, FFP-Growth searches theFP-tree in the bottom-up order, but need not cons... In this paper, we propose an efficient algorithm, called FFP-Growth (shortfor fast FP-Growth) , to mine frequent itemsets. Similar to FP-Growth, FFP-Growth searches theFP-tree in the bottom-up order, but need not construct conditional pattern bases and sub-FP-trees,thus, saving a substantial amount of time and space, and the FP-tree created by it is much smallerthan that created by TD-FP-Growth, hence improving efficiency. At the same time, FFP-Growth can beeasily extended for reducing the search space as TD-FP-Growth (M) and TD-FP-Growth (C). Experimentalresults show that the algorithm of this paper is effective and efficient. 展开更多
关键词 data mining frequent itemsets association rules frequent pattern tree(FP-tree)
在线阅读 下载PDF
SFPMax——基于排序FP树的最大频繁模式挖掘算法 被引量:26
16
作者 秦亮曦 史忠植 《计算机研究与发展》 EI CSCD 北大核心 2005年第2期217-223,共7页
FP-growth算法是目前较高效的频繁模式挖掘算法之一 ,但将它用于最大频繁模式挖掘时却不能获得较高的效率 深入分析了造成低效的原因 ,提出了利用排序FP 树挖掘最大频繁模式的算法SFP- Max 算法的主要思想如下 :①基于排序FP 树 ;②利... FP-growth算法是目前较高效的频繁模式挖掘算法之一 ,但将它用于最大频繁模式挖掘时却不能获得较高的效率 深入分析了造成低效的原因 ,提出了利用排序FP 树挖掘最大频繁模式的算法SFP- Max 算法的主要思想如下 :①基于排序FP 树 ;②利用最大频繁模式的性质 ,减小产生的候选最大模式的规模 ;③设置中间结果集 ,缩小检验的范围 ,从而减少检验候选最大模式的时间 实验表明 ,SFP -Max是一个高效的最大频繁模式的挖掘算法 ,对于测试的数据集 ,SFP 展开更多
关键词 数据挖掘 关联规则 排序FP-树 最大频繁模式
在线阅读 下载PDF
基于实体感知预训练语言模型的矿物知识图谱补全
17
作者 季晓慧 杨中基 +7 位作者 张占昊 杨眉 许博 吕国诚 刘敏 张招崇 张静 王春宁 《地学前缘》 北大核心 2026年第4期203-210,共8页
矿物知识图谱的完备性对其下游应用效能具有关键影响。为提升矿物知识图谱的完整性,本文提出一种基于实体感知预训练语言模型的知识图谱补全方法。首先,收集并整合最新矿物资料,扩展已有知识库,并将新增矿物实体转化为图谱中的节点。为... 矿物知识图谱的完备性对其下游应用效能具有关键影响。为提升矿物知识图谱的完整性,本文提出一种基于实体感知预训练语言模型的知识图谱补全方法。首先,收集并整合最新矿物资料,扩展已有知识库,并将新增矿物实体转化为图谱中的节点。为补全实体间缺失的关系,本文使用矿物领域数据对语言模型进行微调,以增强其矿物知识理解与推理能力。在此基础上,通过频繁模式挖掘自动生成结构化提示词,并结合检索增强生成技术,在提示中引入相关上下文信息,从而进一步提升模型在知识三元组判定任务上的准确性与可靠性。本文采用Python实现了上述方法,并与现有相关模型进行了对比实验,结果表明本方法在Hits@5、Hits@10指标上显著优于对比模型,补全后矿物知识图谱总关系数提升8.4%,验证了本方法在矿物知识图谱补全任务上的有效性,为完善矿物知识体系提供了可行的技术路径与工具支持。 展开更多
关键词 矿物 知识图谱 知识图谱补全 实体感知预训练语言模型 提示词 频繁模式挖掘
在线阅读 下载PDF
一种面向快速Web漏洞扫描的网页爬取方法
18
作者 王金翔 朱亚运 +3 位作者 刘万大山 姜琳 刘林彬 李俊娥 《计算机应用与软件》 北大核心 2026年第1期370-376,共7页
随着Web应用规模的不断扩大,对网站进行漏洞扫描的时间成本也不断增加。为此,提出一种面向快速Web漏洞扫描的网页爬取方法。该方法在传统的面向Web漏洞扫描的爬虫的基础上,利用增量闭频繁项集挖掘算法对网站页面进行阶段性聚类,并基于... 随着Web应用规模的不断扩大,对网站进行漏洞扫描的时间成本也不断增加。为此,提出一种面向快速Web漏洞扫描的网页爬取方法。该方法在传统的面向Web漏洞扫描的爬虫的基础上,利用增量闭频繁项集挖掘算法对网站页面进行阶段性聚类,并基于页面聚簇和爬虫日志构建页面分类模型,以过滤由同一个服务处理程序生成的冗余页面。实验表明,该方法能有效减少漏洞扫描系统在网站目录遍历和页面聚类上消耗的时间,从而提升Web漏洞扫描的效率。 展开更多
关键词 爬虫 Web漏洞扫描 页面聚类 频繁模式挖掘
在线阅读 下载PDF
频繁模式增量维护算法IM-FPM 被引量:1
19
作者 林晓勇 朱群雄 《仪器仪表学报》 EI CAS CSCD 北大核心 2008年第7期1517-1521,共5页
数据挖掘是当今研究的一个热点,传感器实时收集大量的数据,将数据收集与数据挖掘技术结合起来,是现代数据处理技术发展的重要趋势。频繁模式挖掘是数据挖掘中的核心问题,本文针对数据库发生变化时频繁模式挖掘中普遍存在的重复扫描、遍... 数据挖掘是当今研究的一个热点,传感器实时收集大量的数据,将数据收集与数据挖掘技术结合起来,是现代数据处理技术发展的重要趋势。频繁模式挖掘是数据挖掘中的核心问题,本文针对数据库发生变化时频繁模式挖掘中普遍存在的重复扫描、遍历和计算问题,提出了频繁模式的增量维护算法IM-FPM。该算法充分利用已有挖掘结果来提高效率但又完全独立于上次采用的挖掘方法,并且只需对原始数据库进行一次扫描。实验结果表明,该算法能有效地解决数据库发生变化时的频繁模式增量维护问题。 展开更多
关键词 数据挖掘 关联规则 增量维护 频繁模式 数据处理
在线阅读 下载PDF
面向大规模图数据的参数化模式挖掘技术及应用
20
作者 蒋星 王欣 +1 位作者 潘海洋 胡滨 《南京大学学报(自然科学版)》 北大核心 2026年第1期110-124,共15页
大规模图上的频繁模式挖掘(Frequent Pattern Mining,FPM)因其在社交分析等各类应用中的广泛应用,受到高度关注.受模式语义的约束,传统的FPM技术难以满足多样化的数据分析需求,基于此,提出有参模式(Parameterized Patterns,p⁃模式)及其... 大规模图上的频繁模式挖掘(Frequent Pattern Mining,FPM)因其在社交分析等各类应用中的广泛应用,受到高度关注.受模式语义的约束,传统的FPM技术难以满足多样化的数据分析需求,基于此,提出有参模式(Parameterized Patterns,p⁃模式)及其挖掘技术.通过在模式中引入参数来拓展匹配语义,实现图中复杂关系的有效捕获;设计并实现了高效的挖掘算法(PMiner),用于从大图中发现频繁p⁃模式;提出基于p⁃模式的图关联规则(Graph Association Rule,GAR),并设计GAR挖掘算法GARGen,以发现图中节点间的潜在关联.在真实图数据上的实验不仅验证了上述算法的运行效率,还揭示了p⁃模式与传统模式的差异以及定义在p⁃模式上的GAR在链接预测等任务上的有效性. 展开更多
关键词 有参模式 频繁模式挖掘 关联规则挖掘 图关联规则
在线阅读 下载PDF
上一页 1 2 31 下一页 到第
使用帮助 返回顶部