期刊文献+
共找到38篇文章
< 1 2 >
每页显示 20 50 100
NIA2: A fast indirect association mining algorithm
1
作者 倪旻 徐晓飞 +1 位作者 邓胜春 问晓先 《Journal of Harbin Institute of Technology(New Series)》 EI CAS 2005年第5期511-516,共6页
Indirect association is a high level relationship between items and frequent item sets in data. There are many potential applications for indirect associations, such as database marketing, intelligent data analysis, w... Indirect association is a high level relationship between items and frequent item sets in data. There are many potential applications for indirect associations, such as database marketing, intelligent data analysis, web -log analysis, recommended system, etc. Existing indirect association mining algorithms are mostly based on the notion of post - processing of discovery of frequent item sets. In the mining process, all frequent item sets need to be generated first, and then they are fihered and joined to form indirect associations. We have presented an indirect association mining algorithm (NIA) based on anti -monotonicity of indirect associations whereas k candidate indirect associations can be generated directly from k - 1 candidate indirect associations, without all frequent item sets generated. We also use the frequent itempair support matrix to reduce the time and memory space needed by the algorithm. In this paper, a novel algorithm (NIA2) is introduced based on the generation of indirect association patterns between itempairs through one item mediator sets from frequent itempair support matrix. A notion of mediator set support threshold is also presented. NIA2 mines indirect association patterns directly from the dataset, without generating all frequent item sets. The frequent itempair support matrix and the notion of using tm as the support threshold for mediator sets can significantly reduce the cost of joint operations and the search process compared with existing algorithms. Results of experiments on a real - word web log dataset have proved NIA2 one order of magnitude faster than existing algorithms. 展开更多
关键词 data mining association rule mining indirect association frequent itempair support matrix mediator set support threshold
在线阅读 下载PDF
Anti-epileptic medication induced disturbed calcium-vitamin D metabolism:A behavioral analysis using association rule mining technique
2
作者 Pradeep K Dabla Kamal Upreti +5 位作者 Divakar Singh Anju Singh Vinod Puri Adina E Stanciu Nafija Serdarevic Damien Gruson 《World Journal of Experimental Medicine》 2025年第3期145-158,共14页
BACKGROUND There is a lack of study on vitamin D and calcium levels in epileptic patients receiving therapy,despite the growing recognition of the importance of bone health in individuals with epilepsy.Associations on... BACKGROUND There is a lack of study on vitamin D and calcium levels in epileptic patients receiving therapy,despite the growing recognition of the importance of bone health in individuals with epilepsy.Associations one statistical method for finding correlations between variables in big datasets is called association rule mining(ARM).This technique finds patterns of common items or events in the data set,including associations.Through the analysis of patient data,including demographics,genetic information,and reactions with previous treatments,ARM can identify harmful drug reactions,possible novel combinations of medicines,and trends which connect particular individual features to treatment outcomes.AIM To investigate the evidence on the effects of anti-epileptic drugs(AEDs)on calcium metabolism and supplementing with vitamin D to help lower the likelihood of bone-related issues using ARM technique.METHODS ARM technique was used to analyze patients’behavior on calcium metabolism,vitamin D and anti-epileptic medicines.Epileptic sufferers of both sexes who attended neurological outpatient and in patient department clinics were recruited for the study.There were three patient groups:Group 1 received one AED,group 2 received two AEDs,and group 3 received more than two AEDs.The researchers analyzed the alkaline phosphatase,ionized calcium,total calcium,phosphorus,vitamin D levels,or parathyroid hormone values.RESULTS A total of 150 patients,aged 12 years to 60 years,were studied,with 50 in each group(1,2,and 3).60%were men,this gender imbalance may affect the study’s findings,as women have different bone metabolism dynamics influenced by hormonal variations,including menopause.The results may not fully capture the distinct effects of AEDs on female patients.A greater equal distribution of women should be the goal of future studies in order to offer a complete comprehension of the metabolic alterations brought on by AEDs.86 patients had generalized epilepsy,64 partial.42%of patients had AEDs for>5 years.Polytherapy reduced calcium and vitamin D levels compared to mono and dual therapy.Polytherapy elevated alkaline phosphatase and phosphorus levels.CONCLUSION ARM revealed the possible effects of variables like age,gender,and polytherapy on parathyroid hormone levels in individuals taking antiepileptic medication. 展开更多
关键词 Anti-epileptic drugs HOTSPOT EPILEPSY association rule mining Transaction and metabolism
暂未订购
Raw materials consumption reduction for practical electric arc furnace steelmaking: a data association rules mining approach with improved evaluation indicator
3
作者 Yu-chi Zou Ling-zhi Yang +5 位作者 Hang Hu Guan-nan Li Zeng Feng Shuai Wang Feng Chen Yu-feng Guo 《Journal of Iron and Steel Research International》 2025年第10期3308-3327,共20页
Reducing raw materials consumption(RMC)in electric arc furnace(EAF)steelmaking process is beneficial to the reduction in resource and energy consumption.The conventional indicator of evaluating RMC only focuses on EAF... Reducing raw materials consumption(RMC)in electric arc furnace(EAF)steelmaking process is beneficial to the reduction in resource and energy consumption.The conventional indicator of evaluating RMC only focuses on EAF inputs and outputs,neglecting the associations between smelting operations and RMC.Traditional methods of reducing RMC rely on manual experience and lack a standard operation guidance.A method based on association rules mining and metallurgical mechanism(ARM-MM)was proposed.ARM-MM proposed an improved evaluation indicator of RMC and the indicator independently showed the associations between smelting operations and RMC.On the basis,1265 heats of real EAF data were used to obtain the operation guidance for RMC reduction.According to the ratio of hot metal(HM)in charge metals,data were divided into all dataset,low HM ratio dataset,medium HM ratio dataset,and high HM ratio dataset.ARM algorithm was used in each dataset to obtain specific operation guidance.The real average RMC under all dataset,medium HM ratio dataset,and high HM ratio dataset was reduced by 279,486,and 252 kg/heat,respectively,when obtained operation guidance was applied. 展开更多
关键词 Electric arc furnace steelmaking Raw materials consumption Evaluation indicator association rules mining Operation guidance
原文传递
Study on association rules mining based on semantic relativity 被引量:2
4
作者 张磊 夏士雄 +1 位作者 周勇 夏战国 《Journal of Southeast University(English Edition)》 EI CAS 2008年第3期358-360,共3页
An association rules mining method based on semantic relativity is proposed to solve the problem that there are more candidate item sets and higher time complexity in traditional association rules mining.Semantic rela... An association rules mining method based on semantic relativity is proposed to solve the problem that there are more candidate item sets and higher time complexity in traditional association rules mining.Semantic relativity of ontology concepts is used to describe complicated relationships of domains in the method.Candidate item sets with less semantic relativity are filtered to reduce the number of candidate item sets in association rules mining.An ontology hierarchy relationship is regarded as a directed acyclic graph rather than a hierarchy tree in the semantic relativity computation.Not only direct hierarchy relationships,but also non-direct hierarchy relationships and other typical semantic relationships are taken into account.Experimental results show that the proposed method can reduce the number of candidate item sets effectively and improve the efficiency of association rules mining. 展开更多
关键词 ONTOLOGY association rules mining semantic relativity
在线阅读 下载PDF
A New Method Based on Association Rules Mining and Geo-filter for Mining Spatial Association Knowledge 被引量:6
5
作者 LIU Yaolin XIE Peng +3 位作者 HE Qingsong ZHAO Xiang WEI Xiaojian TAN Ronghui 《Chinese Geographical Science》 SCIE CSCD 2017年第3期389-401,共13页
Association rule mining methods, as a set of important data mining tools, could be used for mining spatial association rules of spatial data. However, applications of these methods are limited for mining results conta... Association rule mining methods, as a set of important data mining tools, could be used for mining spatial association rules of spatial data. However, applications of these methods are limited for mining results containing large number of redundant rules. In this paper, a new method named Geo-Filtered Association Rules Mining(GFARM) is proposed to effectively eliminate the redundant rules. An application of GFARM is performed as a case study in which association rules are discovered between building land distribution and potential driving factors in Wuhan, China from 1995 to 2015. Ten sets of regular sampling grids with different sizes are used for detecting the influence of multi-scales on GFARM. Results show that the proposed method can filter 50%–70% of redundant rules. GFARM is also successful in discovering spatial association pattern between building land distribution and driving factors. 展开更多
关键词 data mining association rules rules spatial visualization driving factors analysis land use change
在线阅读 下载PDF
Quantum Algorithm for Mining Frequent Patterns for Association Rule Mining 被引量:1
6
作者 Abdirahman Alasow Marek Perkowski 《Journal of Quantum Information Science》 CAS 2023年第1期1-23,共23页
Maximum frequent pattern generation from a large database of transactions and items for association rule mining is an important research topic in data mining. Association rule mining aims to discover interesting corre... Maximum frequent pattern generation from a large database of transactions and items for association rule mining is an important research topic in data mining. Association rule mining aims to discover interesting correlations, frequent patterns, associations, or causal structures between items hidden in a large database. By exploiting quantum computing, we propose an efficient quantum search algorithm design to discover the maximum frequent patterns. We modified Grover’s search algorithm so that a subspace of arbitrary symmetric states is used instead of the whole search space. We presented a novel quantum oracle design that employs a quantum counter to count the maximum frequent items and a quantum comparator to check with a minimum support threshold. The proposed derived algorithm increases the rate of the correct solutions since the search is only in a subspace. Furthermore, our algorithm significantly scales and optimizes the required number of qubits in design, which directly reflected positively on the performance. Our proposed design can accommodate more transactions and items and still have a good performance with a small number of qubits. 展开更多
关键词 Data mining association Rule mining Frequent Pattern Apriori Algorithm Quantum Counter Quantum Comparator Grover’s Search Algorithm
在线阅读 下载PDF
A Fast Distributed Algorithm for Association Rule Mining Based on Binary Coding Mapping Relation
7
作者 CHEN Geng NI Wei-wei +1 位作者 ZHU Yu-quan SUN Zhi-hui 《Wuhan University Journal of Natural Sciences》 EI CAS 2006年第1期27-30,共4页
Association rule mining is an important issue in data mining. The paper proposed an binary system based method to generate candidate frequent itemsets and corresponding supporting counts efficiently, which needs only ... Association rule mining is an important issue in data mining. The paper proposed an binary system based method to generate candidate frequent itemsets and corresponding supporting counts efficiently, which needs only some operations such as "and", "or" and "xor". Applying this idea in the existed distributed association rule mining al gorithm FDM, the improved algorithm BFDM is proposed. The theoretical analysis and experiment testify that BFDM is effective and efficient. 展开更多
关键词 frequent itemsets distributed association rule mining relation of itemsets-binary data
在线阅读 下载PDF
Comparative Analysis of the Factors Influencing Metro Passenger Arrival Volumes in Wuhan, China, and Lagos, Nigeria: An Application of Association Rule Mining and Neural Network Models
8
作者 Bello Muhammad Lawan Jabir Abubakar Shuyang Zhang 《Journal of Transportation Technologies》 2024年第4期607-653,共47页
This study explores the factors influencing metro passengers’ arrival volume in Wuhan, China, and Lagos, Nigeria, by examining weather, time of day, waiting time, travel behavior, arrival patterns, and metro satisfac... This study explores the factors influencing metro passengers’ arrival volume in Wuhan, China, and Lagos, Nigeria, by examining weather, time of day, waiting time, travel behavior, arrival patterns, and metro satisfaction. It addresses a significant research gap in understanding metro passengers’ dynamics across cultural and geographical contexts. It employs questionnaires, field observations, and advanced data analysis techniques like association rule mining and neural network modeling. Key findings include a correlation between rainy weather, shorter waiting times, and higher arrival volumes. Neural network models showed high predictive accuracy, with waiting time, metro satisfaction, and weather being significant factors in Lagos Light Rail Blue Line Metro. In contrast, arrival patterns, weather, and time of day were more influential in Wuhan Metro Line 5. Results suggest that improving metro satisfaction and reducing waiting times could increase arrival volumes in Lagos Metro while adjusting schedules for weather and peak times could optimize flow in Wuhan Metro. These insights are valuable for transportation planning, passenger arrival volume management, and enhancing user experiences, potentially benefiting urban transportation sustainability and development goals. 展开更多
关键词 Metro Passenger Arrival volume Influencing Factor Analysis Wuhan and Lagos Metro Neural Network Modeling association Rule mining Technique
在线阅读 下载PDF
Automatically Mining Application Signatures for Lightweight Deep Packet Inspection
9
作者 鲁刚 张宏莉 +3 位作者 张宇 Mahmoud T. Qassrawi 余翔湛 彭立志 《China Communications》 SCIE CSCD 2013年第6期86-99,共14页
Automatic signature generation approaches have been widely applied in recent traffic classification.However,they are not suitable for LightWeight Deep Packet Inspection(LW_DPI) since their generated signatures are mat... Automatic signature generation approaches have been widely applied in recent traffic classification.However,they are not suitable for LightWeight Deep Packet Inspection(LW_DPI) since their generated signatures are matched through a search of the entire application data.On the basis of LW_DPI schemes,we present two Hierarchical Clustering(HC) algorithms:HC_TCP and HC_UDP,which can generate byte signatures from TCP and UDP packet payloads respectively.In particular,HC_TCP and HC_ UDP can extract the positions of byte signatures in packet payloads.Further,in order to deal with the case in which byte signatures cannot be derived,we develop an algorithm for generating bit signatures.Compared with the LASER algorithm and Suffix Tree(ST)-based algorithm,the proposed algorithms are better in terms of both classification accuracy and speed.Moreover,the experimental results indicate that,as long as the application-protocol header exists,it is possible to automatically derive reliable and accurate signatures combined with their positions in packet payloads. 展开更多
关键词 traffic classification automatic signature generation association mining hierarchical clustering LW_ DPI
在线阅读 下载PDF
Examining patterns of traditional chinese medicine use in pediatric oncology: A systematic review, meta-analysis and data-mining study 被引量:7
10
作者 Chun Sing Lam Li Wen Peng +5 位作者 Lok Sum Yang Ho Wing Janessa Chou Chi-Kong Li Zhong Zuo Ho-Kee Koon Yin Ting Cheung 《Journal of Integrative Medicine》 SCIE CAS CSCD 2022年第5期402-415,共14页
Background Traditional Chinese medicine(TCM)is becoming a popular complementary approach in pediatric oncology.However,few or no meta-analyses have focused on clinical studies of the use of TCM in pediatric oncology.O... Background Traditional Chinese medicine(TCM)is becoming a popular complementary approach in pediatric oncology.However,few or no meta-analyses have focused on clinical studies of the use of TCM in pediatric oncology.Objective We explored the patterns of TCM use and its efficacy in children with cancer,using a systematic review,meta-analysis and data mining study.Search strategy We conducted a search of five English(Allied and Complementary Medicine Database,Embase,PubMed,Cochrane Central Register of Controlled Trials,and ClinicalTrials.gov)and four Chinese databases(Wanfang Data,China National Knowledge Infrastructure,Chinese Biomedical Literature Database,and VIP Chinese Science and Technology Periodicals Database)for clinical studies published before October 2021,using keywords related to“pediatric,”“cancer,”and“TCM.”Inclusion criteria We included studies which were randomized controlled trials(RCTs)or observational clinical studies,focused on patients aged<19 years old who had been diagnosed with cancer,and included at least one group of subjects receiving TCM treatment.Data extraction and analysis The methodological quality of RCTs and observational studies was assessed using the six-item Jadad scale and the Effective Public Healthcare Panacea Project Quality Assessment Tool,respectively.Meta-analysis was used to evaluate the efficacy of combining TCM with chemotherapy.Study outcomes included the treatment response rate and occurrence of cancer-related symptoms.Association rule mining(ARM)was used to investigate the associations among medicinal herbs and patient symptoms.Results The fifty-four studies included in this analysis were comprised of RCTs(63.0%)and observational studies(37.0%).Most RCTs focused on hematological malignancies(41.2%).The study outcomes included chemotherapy-induced toxicities(76.5%),infection rate(35.3%),and response,survival or relapse rate(23.5%).The methodological quality of most of the RCTs(82.4%)and observational studies(80.0%)was rated as“moderate.”In studies of leukemia patients,adding TCM to conventional treatment significantly improved the clinical response rate(odds ratio[OR]=2.55;95%confidence interval[CI]=1.49-4.36),lowered infection rate(OR=0.23;95%CI=0.13-0.40),and reduced nausea and vomiting(OR=0.13;95%CI=0.08-0.23).ARM showed that Radix Astragali,the most commonly used medicinal herb(58.0%),was associated with treating myelosuppression,gastrointestinal complications,and infection.Conclusion There is growing evidence that TCM is an effective adjuvant therapy for children with cancer.We proposed a checklist to improve the quality of TCM trials in pediatric oncology.Future work will examine the use of ARM techniques on real-world data to evaluate the efficacy of medicinal herbs and drug-herb interactions in children receiving TCM as a part of integrated cancer therapy. 展开更多
关键词 Traditional Chinese Medicine Herbal medicine Pediatric oncology Data mining Associate rule mining CHEMOTHERAPY
原文传递
Hydraulic metal structure health diagnosis based on data mining technology 被引量:3
11
作者 Guang-ming Yang Xiao Feng Kun Yang 《Water Science and Engineering》 EI CAS CSCD 2015年第2期158-163,共6页
In conjunction with association rules for data mining, the connections between testing indices and strong and weak association rules were determined, and new derivative rules were obtained by further reasoning. Associ... In conjunction with association rules for data mining, the connections between testing indices and strong and weak association rules were determined, and new derivative rules were obtained by further reasoning. Association rules were used to analyze correlation and check consistency between indices. This study shows that the judgment obtained by weak association rules or non-association rules is more accurate and more credible than that obtained by strong association rules. When the testing grades of two indices in the weak association rules are inconsistent, the testing grades of indices are more likely to be erroneous, and the mistakes are often caused by human factors. Clustering data mining technology was used to analyze the reliability of a diagnosis, or to perform health diagnosis directly. Analysis showed that the clustering results are related to the indices selected, and that if the indices selected are more significant, the characteristics of clustering results are also more significant, and the analysis or diagnosis is more credible. The indices and diagnosis analysis function produced by this study provide a necessary theoretical foundation and new ideas for the development of hydraulic metal structure health diagnosis technology. 展开更多
关键词 Hydraulic metal structure Health diagnosis Data mining technology Clustering model association rule
在线阅读 下载PDF
Discovering hidden patterns:Association rules for cardiovascular diseases in type 2 diabetes mellitus 被引量:1
12
作者 Pradeep Kumar Dabla Kamal Upreti +2 位作者 Dharmsheel Shrivastav Vimal Mehta Divakar Singh 《World Journal of Methodology》 2024年第2期97-106,共10页
BACKGROUND It is increasingly common to find patients affected by a combination of type 2 diabetes mellitus(T2DM)and coronary artery disease(CAD),and studies are able to correlate their relationships with available bi... BACKGROUND It is increasingly common to find patients affected by a combination of type 2 diabetes mellitus(T2DM)and coronary artery disease(CAD),and studies are able to correlate their relationships with available biological and clinical evidence.The aim of the current study was to apply association rule mining(ARM)to discover whether there are consistent patterns of clinical features relevant to these diseases.ARM leverages clinical and laboratory data to the meaningful patterns for diabetic CAD by harnessing the power help of data-driven algorithms to optimise the decision-making in patient care.AIM To reinforce the evidence of the T2DM-CAD interplay and demonstrate the ability of ARM to provide new insights into multivariate pattern discovery.METHODS This cross-sectional study was conducted at the Department of Biochemistry in a specialized tertiary care centre in Delhi,involving a total of 300 consented subjects categorized into three groups:CAD with diabetes,CAD without diabetes,and healthy controls,with 100 subjects in each group.The participants were enrolled from the Cardiology IPD&OPD for the sample collection.The study employed ARM technique to extract the meaningful patterns and relationships from the clinical data with its original value.RESULTS The clinical dataset comprised 35 attributes from enrolled subjects.The analysis produced rules with a maximum branching factor of 4 and a rule length of 5,necessitating a 1%probability increase for enhancement.Prominent patterns emerged,highlighting strong links between health indicators and diabetes likelihood,particularly elevated HbA1C and random blood sugar levels.The ARM technique identified individuals with a random blood sugar level>175 and HbA1C>6.6 are likely in the“CAD-with-diabetes”group,offering valuable insights into health indicators and influencing factors on disease outcomes.CONCLUSION The application of this method holds promise for healthcare practitioners to offer valuable insights for enhancing patient treatment targeting specific subtypes of CAD with diabetes.Implying artificial intelligence techniques with medical data,we have shown the potential for personalized healthcare and the development of user-friendly applications aimed at improving cardiovascular health outcomes for this high-risk population to optimise the decision-making in patient care. 展开更多
关键词 Coronary artery disease Type 2 diabetes mellitus Coronary angiography association rule mining Artificial intelligence
暂未订购
A data mining approach to characterize road accident locations 被引量:1
13
作者 Sachin Kumar Durga Toshniwal 《Journal of Modern Transportation》 2016年第1期62-72,共11页
Data mining has been proven as a reliable technique to analyze road accidents and provide productive results. Most of the road accident data analysis use data mining techniques, focusing on identifying factors that af... Data mining has been proven as a reliable technique to analyze road accidents and provide productive results. Most of the road accident data analysis use data mining techniques, focusing on identifying factors that affect the severity of an accident. However, any damage resulting from road accidents is always unacceptable in terms of health, property damage and other economic factors. Sometimes, it is found that road accident occurrences are more frequent at certain specific locations. The analysis of these locations can help in identifying certain road accident features that make a road accident to occur frequently in these locations. Association rule mining is one of the popular data mining techniques that identify the correlation in various attributes of road accident. In this paper, we first applied k-means algorithm to group the accident locations into three categories, high-frequency, moderate-frequency and low-frequency accident locations. k-means algorithm takes accident frequency count as a parameter to cluster the locations. Then we used association rule mining to characterize these locations. The rules revealed different factors associated with road accidents at different locations with varying accident frequencies. Theassociation rules for high-frequency accident location disclosed that intersections on highways are more dangerous for every type of accidents. High-frequency accident locations mostly involved two-wheeler accidents at hilly regions. In moderate-frequency accident locations, colonies near local roads and intersection on highway roads are found dangerous for pedestrian hit accidents. Low-frequency accident locations are scattered throughout the district and the most of the accidents at these locations were not critical. Although the data set was limited to some selected attributes, our approach extracted some useful hidden information from the data which can be utilized to take some preventive efforts in these locations. 展开更多
关键词 Road accidents Accident analysis Datamining k-Means association rule mining
在线阅读 下载PDF
Association Rule Analysis-Based Identification of Influential Users in the Social Media
14
作者 Saqib Iqbal Rehan Khan +3 位作者 Hikmat Ullah Khan Fawaz Khaled Alarfaj Abdullah Mohammed Alomair Muzamil Ahmed 《Computers, Materials & Continua》 SCIE EI 2022年第12期6479-6493,共15页
The exchange of information is an innate and natural process that assist in content dispersal.Social networking sites emerge to enrich their users by providing the facility for sharing information and social interacti... The exchange of information is an innate and natural process that assist in content dispersal.Social networking sites emerge to enrich their users by providing the facility for sharing information and social interaction.The extensive adoption of social networking sites also resulted in user content generation.There are diverse research areas explored by the researchers to investigate the influence of social media on users and confirmed that social media sites have a significant impact on markets,politics and social life.Facebook is extensively used platform to share information,thoughts and opinions through posts and comments.The identification of influential users on the social web has grown as hot research field because of vast applications in diverse areas for instance political campaigns marketing,e-commerce,commercial and,etc.Prior research studies either uses linguistic content or graph-based representation of social network for the detection of influential users.In this article,we incorporate association rule mining algorithms to identify the top influential users through frequent patterns.The association rules have been computed using the standard evaluation measures such as support,confidence,lift,and conviction.To verify the results,we also involve conventional metrics for example accuracy,precision,recall and F1-measure according to the association rules perspective.The detailed experiments are carried out using the benchmark College-Msg dataset extracted by Facebook.The obtained results validate the quality and visibility of the proposed approach.The outcome of propose model verify that the association rule mining is able to generate rules to identify the temporal influential users on Facebook who are consistent on regular basis.The preparation of rule set help to create knowledge-based systems which are efficient and widely used in recent era for decision making to solve real-world problems. 展开更多
关键词 association rule mining RANKING social web influential users social media
在线阅读 下载PDF
Examining data visualization pitfalls in scientific publications
15
作者 Vinh T Nguyen Kwanghee Jung Vibhuti Gupta 《Visual Computing for Industry,Biomedicine,and Art》 EI 2021年第1期268-282,共15页
Data visualization blends art and science to convey stories from data via graphical representations.Considering different problems,applications,requirements,and design goals,it is challenging to combine these two comp... Data visualization blends art and science to convey stories from data via graphical representations.Considering different problems,applications,requirements,and design goals,it is challenging to combine these two components at their full force.While the art component involves creating visually appealing and easily interpreted graphics for users,the science component requires accurate representations of a large amount of input data.With a lack of the science component,visualization cannot serve its role of creating correct representations of the actual data,thus leading to wrong perception,interpretation,and decision.It might be even worse if incorrect visual representations were intentionally produced to deceive the viewers.To address common pitfalls in graphical representations,this paper focuses on identifying and understanding the root causes of misinformation in graphical representations.We reviewed the misleading data visualization examples in the scientific publications collected from indexing databases and then projected them onto the fundamental units of visual communication such as color,shape,size,and spatial orientation.Moreover,a text mining technique was applied to extract practical insights from common visualization pitfalls.Cochran’s Q test and McNemar’s test were conducted to examine if there is any difference in the proportions of common errors among color,shape,size,and spatial orientation.The findings showed that the pie chart is the most misused graphical representation,and size is the most critical issue.It was also observed that there were statistically significant differences in the proportion of errors among color,shape,size,and spatial orientation. 展开更多
关键词 Data visualization Graphical representations MISINFORMATION Visual encodings association rule mining Word cloud Cochran’s Q test McNemar’s test
在线阅读 下载PDF
Mining φ-Frequent Itemset Using FP-Tree
16
作者 李天瑞 《Journal of Modern Transportation》 2001年第1期67-74,共8页
The problem of association rule mining has gained considerable prominence in the data mining community for its use as an important tool of knowledge discovery from large scale databases. And there has been a spurt of... The problem of association rule mining has gained considerable prominence in the data mining community for its use as an important tool of knowledge discovery from large scale databases. And there has been a spurt of research activities around this problem. However, traditional association rule mining may often derive many rules in which people are uninterested. This paper reports a generalization of association rule mining called φ association rule mining. It allows people to have different interests on different itemsets that arethe need of real application. Also, it can help to derive interesting rules and substantially reduce the amount of rules. An algorithm based on FP tree for mining φ frequent itemset is presented. It is shown by experiments that the proposed methodis efficient and scalable over large databases. 展开更多
关键词 data processing DATABASES φ association rule mining φ frequent itemset FP tree data mining
在线阅读 下载PDF
Association discovery and outlier detection of air pollution emissions from industrial enterprises driven by big data
17
作者 Zhen Peng Yunxiao Zhang +1 位作者 Yunchong Wang Tianle Tang 《Data Intelligence》 EI 2023年第2期438-456,共19页
Air pollution is a major issue related to national economy and people's livelihood.At present,the researches on air pollution mostly focus on the pollutant emissions in a specific industry or region as a whole,and... Air pollution is a major issue related to national economy and people's livelihood.At present,the researches on air pollution mostly focus on the pollutant emissions in a specific industry or region as a whole,and is a lack of attention to enterprise pollutant emissions from the micro level.Limited by the amount and time granularity of data from enterprises,enterprise pollutant emissions are stll understudied.Driven by big data of air pollution emissions of industrial enterprises monitored in Beijing-Tianjin-Hebei,the data mining of enterprises pollution emissions is carried out in the paper,including the association analysis between different features based on grey association,the association mining between different data based on association rule and the outlier detection based on clustering.The results show that:(1)The industries affecting NOx and SO2 mainly are electric power,heat production and supply industry,metal smelting and processing industries in Beijing-Tianjin-Hebei;(2)These districts nearby Hengshui and Shijiazhuang city in Hebei province form strong association rules;(3)The industrial enterprises in Beijing-Tianjin-Hebei are divided into six clusters,of which three categories belong to outliers with excessive emissions of total vOCs,PM and NH3 respectively. 展开更多
关键词 Air Pollution Emissions of Enterprises Outlier detection based on clustering association Rule mining Grey association Analysis Big data
原文传递
Short Text Mining for Classifying Educational Objectives and Outcomes
18
作者 Yousef Asiri 《Computer Systems Science & Engineering》 SCIE EI 2022年第4期35-50,共16页
Most of the international accreditation bodies in engineering education(e.g.,ABET)and outcome-based educational systems have based their assess-ments on learning outcomes and program educational objectives.However,map... Most of the international accreditation bodies in engineering education(e.g.,ABET)and outcome-based educational systems have based their assess-ments on learning outcomes and program educational objectives.However,map-ping program educational objectives(PEOs)to student outcomes(SOs)is a challenging and time-consuming task,especially for a new program which is applying for ABET-EAC(American Board for Engineering and Technology the American Board for Engineering and Technology—Engineering Accreditation Commission)accreditation.In addition,ABET needs to automatically ensure that the mapping(classification)is reasonable and correct.The classification also plays a vital role in the assessment of students’learning.Since the PEOs are expressed as short text,they do not contain enough semantic meaning and information,and consequently they suffer from high sparseness,multidimensionality and the curse of dimensionality.In this work,a novel associative short text classification tech-nique is proposed to map PEOs to SOs.The datasets are extracted from 152 self-study reports(SSRs)that were produced in operational settings in an engineering program accredited by ABET-EAC.The datasets are processed and transformed into a representational form appropriate for association rule mining.The extracted rules are utilized as delegate classifiers to map PEOs to SOs.The proposed asso-ciative classification of the mapping of PEOs to SOs has shown promising results,which can simplify the classification of short text and avoid many problems caused by enriching short text based on external resources that are not related or relevant to the dataset. 展开更多
关键词 ABET accreditation association rule mining educational data mining engineering education program educational objectives student outcomes associative classification
在线阅读 下载PDF
Effective Diagnosis of Lung Cancer via Various Data-Mining Techniques
19
作者 Subramanian Kanageswari D.Gladis +2 位作者 Irshad Hussain Sultan S.Alshamrani Abdullah Alshehri 《Intelligent Automation & Soft Computing》 SCIE 2023年第4期415-428,共14页
One of the leading cancers for both genders worldwide is lung cancer.The occurrence of lung cancer has fully augmented since the early 19th century.In this manuscript,we have discussed various data mining techniques t... One of the leading cancers for both genders worldwide is lung cancer.The occurrence of lung cancer has fully augmented since the early 19th century.In this manuscript,we have discussed various data mining techniques that have been employed for cancer diagnosis.Exposure to air pollution has been related to various adverse health effects.This work is subject to analysis of various air pollutants and associated health hazards and intends to evaluate the impact of air pollution caused by lung cancer.We have introduced data mining in lung cancer to air pollution,and our approach includes preprocessing,data mining,testing and evaluation,and knowledge discovery.Initially,we will eradicate the noise and irrelevant data,and following that,we will join the multiple informed sources into a common source.From that source,we will designate the information relevant to our investigation to be regained from that assortment.Following that,we will convert the designated data into a suitable mining process.The patterns are abstracted by utilizing a relational suggestion rule mining process.These patterns have revealed information,and this information is categorized with the help of an Auto Associative Neural Network classification method(AANN).The proposed method is compared with the existing method in various factors.In conclusion,the projected Auto associative neural network and relational suggestion rule mining methods accomplish a high accuracy status. 展开更多
关键词 Relational association rule mining auto associative neural network PREPROCESSING data mining biological neural network
在线阅读 下载PDF
Time series data analysis and association rule mining in financial recommendation systems using Hadoop and Spark
20
作者 Yaoyu Chen Yichen Xu 《Advances in Engineering Innovation》 2025年第1期35-39,共5页
Increasing amounts of financial data demand sophisticated analytics to develop sound recommendation models.This article discusses combining time series analysis and association rule mining for big data in Hadoop and S... Increasing amounts of financial data demand sophisticated analytics to develop sound recommendation models.This article discusses combining time series analysis and association rule mining for big data in Hadoop and Spark to enrich financial product recommendation engines.The paper is an integrated analysis of two types of prediction algorithms:AutoRegressive Integrated Moving Average(ARIMA)and Long Short-Term Memory(LSTM)networks to forecast user behavior and demand for financial services in the future from transactional history.The ARIMA model is used as the default while the LSTM model is used to represent non-linear dependencies and give a more dynamic forecast.association rule mining–in particular the Apriori algorithm–is used to find latent patterns and relationships between user transactions and financial products.This article illustrates how time series forecasting and association rule mining can be merged to bring a more useful financial recommendation.The hybrid approach,which combines both approaches,proves to increase user interaction and recommendation accuracy by 20%compared to the previous systems,according to experiments.The paper emphasises the possibilities of using big data in the construction of scalable,individualized financial recommendation systems. 展开更多
关键词 Time Series Analysis Financial Recommendation Systems HADOOP SPARK association Rule mining
在线阅读 下载PDF
上一页 1 2 下一页 到第
使用帮助 返回顶部