Distributed Data Mining is expected to discover preciously unknown, implicit and valuable information from massive data set inherently distributed over a network. In recent years several approaches to distributed data...Distributed Data Mining is expected to discover preciously unknown, implicit and valuable information from massive data set inherently distributed over a network. In recent years several approaches to distributed data mining have been developed, but only a few of them make use of intelligent agents. This paper provides the reason for applying Multi-Agent Technology in Distributed Data Mining and presents a Distributed Data Mining System based on Multi-Agent Technology that deals with heterogeneity in such environment. Based on the advantages of both the CS model and agent-based model, the system is being able to address the specific concern of increasing scalability and enhancing performance.展开更多
The authors designed the spatial data mining system for ore-forming prediction based on the theory and methods of data mining as well as the technique of spatial database,in combination with the characteristics of geo...The authors designed the spatial data mining system for ore-forming prediction based on the theory and methods of data mining as well as the technique of spatial database,in combination with the characteristics of geological information data.The system consists of data management,data mining and knowledge discovery,knowledge representation.It can syncretize multi-source geosciences data effectively,such as geology,geochemistry,geophysics,RS.The system digitized geological information data as data layer files which consist of the two numerical values,to store these files in the system database.According to the combination of the characters of geological information,metallogenic prognosis was realized,as an example from some area in Heilongjiang Province.The prospect area of hydrothermal copper deposit was determined.展开更多
The existing data mining methods are mostly focused on relational databases and structured data, but not on complex structured data (like in extensible markup language(XML)). By converting XML document type descriptio...The existing data mining methods are mostly focused on relational databases and structured data, but not on complex structured data (like in extensible markup language(XML)). By converting XML document type description to the relational semantic recording XML data relations, and using an XML data mining language, the XML data mining system presents a strategy to mine information on XML.展开更多
In order to improve the effect of financial data mining,this paper analyzes the financial data mining system combined with fuzzy clustering multimedia tool algorithm,builds an intelligent model to intelligently proces...In order to improve the effect of financial data mining,this paper analyzes the financial data mining system combined with fuzzy clustering multimedia tool algorithm,builds an intelligent model to intelligently process financial data,and studies a multi-layer financial data protection structure model.According to the propagation characteristics of the financial data signal at the uniform cutoff and the theoretical knowledge of the electron miroscopic motion,this paper analyzes the calculation formulas of the reflected power and the absorbed power of the financial data to the cutoff.Moreover,this paper improves the effect of fuzzy analysis of financialdata and builds a financial data mining system based on fuzzy clustering multimedia tool.The simulation study shows that the financial data mining system based on fuzzy clustering multimedia tool proposed in this paper can effectively improve the effect of financial data mining.展开更多
With the gradual acceleration of information construction in colleges and universities,digital campus and smart campus have gradually become important means for colleges and universities to scientifically manage the c...With the gradual acceleration of information construction in colleges and universities,digital campus and smart campus have gradually become important means for colleges and universities to scientifically manage the campus.They have been applied to teaching,scientific research,student management,and other fields,improving the quality and efficiency of management.This paper mainly studies the intelligent educational administration management system based on data mining technology.Firstly,this paper introduces the application process of data mining technology,and builds an intelligent educational administration management system based on data mining technology.Then,this paper optimizes the application of the Apriori algorithm in educational administration management through transaction compression and frequent sampling.Compared with the traditional Apriori algorithm,the optimized Apriori algorithm in this paper has a shorter execution time under the same minimum support.展开更多
Background Traditional Chinese medicine(TCM)is becoming a popular complementary approach in pediatric oncology.However,few or no meta-analyses have focused on clinical studies of the use of TCM in pediatric oncology.O...Background Traditional Chinese medicine(TCM)is becoming a popular complementary approach in pediatric oncology.However,few or no meta-analyses have focused on clinical studies of the use of TCM in pediatric oncology.Objective We explored the patterns of TCM use and its efficacy in children with cancer,using a systematic review,meta-analysis and data mining study.Search strategy We conducted a search of five English(Allied and Complementary Medicine Database,Embase,PubMed,Cochrane Central Register of Controlled Trials,and ClinicalTrials.gov)and four Chinese databases(Wanfang Data,China National Knowledge Infrastructure,Chinese Biomedical Literature Database,and VIP Chinese Science and Technology Periodicals Database)for clinical studies published before October 2021,using keywords related to“pediatric,”“cancer,”and“TCM.”Inclusion criteria We included studies which were randomized controlled trials(RCTs)or observational clinical studies,focused on patients aged<19 years old who had been diagnosed with cancer,and included at least one group of subjects receiving TCM treatment.Data extraction and analysis The methodological quality of RCTs and observational studies was assessed using the six-item Jadad scale and the Effective Public Healthcare Panacea Project Quality Assessment Tool,respectively.Meta-analysis was used to evaluate the efficacy of combining TCM with chemotherapy.Study outcomes included the treatment response rate and occurrence of cancer-related symptoms.Association rule mining(ARM)was used to investigate the associations among medicinal herbs and patient symptoms.Results The fifty-four studies included in this analysis were comprised of RCTs(63.0%)and observational studies(37.0%).Most RCTs focused on hematological malignancies(41.2%).The study outcomes included chemotherapy-induced toxicities(76.5%),infection rate(35.3%),and response,survival or relapse rate(23.5%).The methodological quality of most of the RCTs(82.4%)and observational studies(80.0%)was rated as“moderate.”In studies of leukemia patients,adding TCM to conventional treatment significantly improved the clinical response rate(odds ratio[OR]=2.55;95%confidence interval[CI]=1.49-4.36),lowered infection rate(OR=0.23;95%CI=0.13-0.40),and reduced nausea and vomiting(OR=0.13;95%CI=0.08-0.23).ARM showed that Radix Astragali,the most commonly used medicinal herb(58.0%),was associated with treating myelosuppression,gastrointestinal complications,and infection.Conclusion There is growing evidence that TCM is an effective adjuvant therapy for children with cancer.We proposed a checklist to improve the quality of TCM trials in pediatric oncology.Future work will examine the use of ARM techniques on real-world data to evaluate the efficacy of medicinal herbs and drug-herb interactions in children receiving TCM as a part of integrated cancer therapy.展开更多
Bioinformatic analysis of large and complex omics datasets has become increasingly useful in modern day biology by providing a great depth of information,with its application to neuroscience termed neuroinformatics.Da...Bioinformatic analysis of large and complex omics datasets has become increasingly useful in modern day biology by providing a great depth of information,with its application to neuroscience termed neuroinformatics.Data mining of omics datasets has enabled the generation of new hypotheses based on differentially regulated biological molecules associated with disease mechanisms,which can be tested experimentally for improved diagnostic and therapeutic targeting of neurodegenerative diseases.Importantly,integrating multi-omics data using a systems bioinformatics approach will advance the understanding of the layered and interactive network of biological regulation that exchanges systemic knowledge to facilitate the development of a comprehensive human brain profile.In this review,we first summarize data mining studies utilizing datasets from the individual type of omics analysis,including epigenetics/epigenomics,transcriptomics,proteomics,metabolomics,lipidomics,and spatial omics,pertaining to Alzheimer's disease,Parkinson's disease,and multiple sclerosis.We then discuss multi-omics integration approaches,including independent biological integration and unsupervised integration methods,for more intuitive and informative interpretation of the biological data obtained across different omics layers.We further assess studies that integrate multi-omics in data mining which provide convoluted biological insights and offer proof-of-concept proposition towards systems bioinformatics in the reconstruction of brain networks.Finally,we recommend a combination of high dimensional bioinformatics analysis with experimental validation to achieve translational neuroscience applications including biomarker discovery,therapeutic development,and elucidation of disease mechanisms.We conclude by providing future perspectives and opportunities in applying integrative multi-omics and systems bioinformatics to achieve precision phenotyping of neurodegenerative diseases and towards personalized medicine.展开更多
Complex repairable system is composed of thousands of components.Some maintenance management and decision problems in maintenance management and decision need to classify a set of components into several classes based...Complex repairable system is composed of thousands of components.Some maintenance management and decision problems in maintenance management and decision need to classify a set of components into several classes based on data mining.Furthermore,with the complexity of industrial equipment increasing,the managers should pay more attention to the key components and carry out the lean management is very important.Therefore,the idea"customer segmentation"of"precise marketing"can be used in the maintenance management of the multi-component system.Following the idea of segmentation,the components of multicomponent systems should be subdivied into groups based on specific attributes relevant to maintenance,such as maintenance cost,mean time between failures,and failure frequency.For the target specific groups of parts,the optimal maintenance policy,health assessment and maintenance scheduling can be determined.The proposed analysis framework will be given out.In order to illustrate the effectiveness of this method,a numerical example is given out.展开更多
This paper first puts forward a case based system framework based on data mining techniques. Then the paper examines the possibility of using neural networks as a method of retrieval in such a case based system. In ...This paper first puts forward a case based system framework based on data mining techniques. Then the paper examines the possibility of using neural networks as a method of retrieval in such a case based system. In this system we propose data mining algorithms to discover case knowledge and other algorithms.展开更多
This paper tries to characterize volcanic rocks through the development and application of an empirical geomechanical system. Geotechnical information was collected from the samples from several Atlantic Ocean islands...This paper tries to characterize volcanic rocks through the development and application of an empirical geomechanical system. Geotechnical information was collected from the samples from several Atlantic Ocean islands including Madeira, Azores and Canarias archipelagos. An empirical rock classification system termed as the volcanic rock system(VRS) is developed and presented in detail. Results using the VRS are compared with those obtained using the traditional rock mass rating(RMR) system. Data mining(DM) techniques are applied to a database of volcanic rock geomechanical information from the islands.Different algorithms were developed and consequently approaches were followed for predicting rock mass classes using the VRS and RMR classification systems. Finally, some conclusions are drawn with emphasis on the fact that a better performance was achieved using attributes from VRS.展开更多
With the economic development and the popularity of application of electronic computer, electronic commerce has rapid development. More and more commerce and key business has been carried on the lnternet because Inter...With the economic development and the popularity of application of electronic computer, electronic commerce has rapid development. More and more commerce and key business has been carried on the lnternet because Internet has the features of interaction, openness, sharing and so on. However, during the daily commerce, people worry about the security of the network system. So a new technology which can detect the unusual behavior in time has been invented in order to protect the security of network system. The system of intrusion detection needs a lot of new technology to protect the data of the network system. The application of data mining technology in the system of intrusion detection can provide a better assistant to the users to analyze the data and improve the accuracy of the checking system.展开更多
In the present study,data mining and network pharmacology were utilized to explore the principles and mechanisms of traditional Chinese medicine(TCM)in treating acute appendicitis.The goal was to provide a scientific ...In the present study,data mining and network pharmacology were utilized to explore the principles and mechanisms of traditional Chinese medicine(TCM)in treating acute appendicitis.The goal was to provide a scientific basis for clinical treatment and further research on this disease.First,we searched the National Patent Database for Chinese herbal compound prescriptions used to treat acute appendicitis.We then applied frequency analysis,character and taste meridian analysis,association rule analysis,and hierarchical cluster analysis to identify the patterns of TCM treatment for acute appendicitis,selecting key combinations of Chinese medicines.Next,we screened the main active components of these key TCM based on quality markers.Using databases such as SwissTargetPrediction,SymMap,ETCM,and STRING,we analyzed the pharmacological mechanisms of these key TCM in treating acute appendicitis.Key active components and targets were further verified through molecular docking.We identified a total of 129 patents involving 316 Chinese medicines,with 24 being frequently used.The results indicated that most Chinese herbs used for acute appendicitis were heat-clearing drugs,blood-activating and stasis-removing drugs,and purging drugs.The primary active ingredients of the Rhubarb-cortex moutan-flos lonicerae combination for treating acute appendicitis included Emodin,Paeonol,Physcion,Chlorogenic acid,Chrysophanol,Rhein acid,and Aloe-emodin.These ingredients targeted key proteins such as ALB,TP53,BCL2,STAT3,IL-6,and TNF,and were involved in cellular responses to lipopolysaccharides,cell composition,and various cytokine-mediated biological processes.They also interacted with signaling pathways like AGE-RAGE,TNF,IL-17,and FoxO.Based on patent data,this study analyzed medication patterns in the treatment of acute appendicitis,discussed the possible mechanisms of key TCM combinations,and provided a scientific basis and new perspectives for the diagnosis and treatment of the disease.展开更多
Objective To identify core acupoint patterns and elucidate the molecular mechanisms of acupuncture for primary depressive disorder(PDD)through data mining and network analysis.Methods A comprehensive literature search...Objective To identify core acupoint patterns and elucidate the molecular mechanisms of acupuncture for primary depressive disorder(PDD)through data mining and network analysis.Methods A comprehensive literature search was conducted across PubMed,Embase,Ovid Technologies(OVID),Web of Science,Cochrane Library,China National Knowledge Infrastructure(CNKI),China National Knowledge Infrastructure Database(VIP),Wanfang Data,and SinoMed Database from database foundation to January 31,2025,for clinical studies on acupuncture treatment of PDD.Descriptive statistics,high-frequency acupoint analysis,degree and betweenness centrality evaluation,and core acupoint prescription mining identified predominant therapeutic combinations for PDD.Network acupuncture was used to predict therapeutic target for the core acupoint prescription.Subsequent protein-protein interaction(PPI)network and molecular complex detection(MCODE)analyses were conducted to identify the key targets and functional modules.Gene Ontology(GO)and Kyoto Encyclopedia of Genes and Genomes(KEGG)analyses explored the underlying biological mechanisms of the core acupoint prescription in treating PDD.Results A total of 57 acupoint prescriptions underwent systematic analysis.The core therapeutic combinations comprised Baihui(GV20),Yintang(GV29),Neiguan(PC6),Hegu(LI4),and Shenmen(HT7).Network acupuncture analysis identified 88 potential therapeutic targets(79 overlapping with PDD),while PPI network analysis revealed central regulatory nodes,including interleukin(IL)-6,IL-1β,tumor necrosis factor(TNF)-α,toll-like receptor 4(TLR4),IL-10,brain-derived neurotrophic factor(BDNF),transforming growth factor(TGF)-β1,C-XC motif chemokine ligand 10(CXCL10),mitogen-activated protein kinase 3(MAPK3),and nitric oxide synthase 1(NOS1).MCODE-based modular analysis further elucidated three functionally coherent clusters:inflammation-homeostasis(score=6.571),plasticity-neurotransmission(score=3.143),and oxidative stress(score=3.000).GO and KEGG analyses demonstrated significant enrichment of the MAPK,phosphoinositide 3-kinase/protein kinase B(PI3K/Akt),and hypoxia-inducible factor(HIF)-1 signaling pathways.These mechanistic insights suggested that the antidepressant effects mediated through mechanisms of neuroinflammatory regulation,neuroplasticity restoration,and immune-oxidative stress homeostasis.Conclusion This study reveals that acupuncture alleviates depression through a multi-level mechanism,primarily involving the neuroinflammation suppression,neuroplasticity enhancement,and oxidative stress regulation.These findings systematically clarify the underlying mechanisms of acupuncture’s antidepressant effects and identify novel therapeutic targets for further mechanistic research.展开更多
Introduction Neurosurgical emergencies such as spontaneous intracerebral hemorrhage(ICH),traumatic brain injury(TBI),and acute brain herniation are among the most time-sensitive and high-stakes conditions in modern me...Introduction Neurosurgical emergencies such as spontaneous intracerebral hemorrhage(ICH),traumatic brain injury(TBI),and acute brain herniation are among the most time-sensitive and high-stakes conditions in modern medicine.Clinical decisions often must be made within minutes,yet these decisions are traditionally guided by limited information,heuristic reasoning,and past experience.In this context,the rise of medical data mining and real-time analytics offers a transformative opportunity:to extract actionable intelligence from the flood of clinical,imaging,and physiological data already being collected,and to use this intelligence to guide care in real time[1–3](Figure 1).展开更多
Objective To explore the optimization and principles of acupoint selection and coordination in the treatment of adult abdominal obesity using acupuncture and moxibustion over the past decade using data mining.Methods ...Objective To explore the optimization and principles of acupoint selection and coordination in the treatment of adult abdominal obesity using acupuncture and moxibustion over the past decade using data mining.Methods Clinical studies of abdominal obesity treated with acupuncture and moxibustion,collected in the past 10 years,were searched from China Biology Medicine disc(CBMdisc),China National knowledge infrastructure(CNKI),Wanfang,China Science and Technology Journal Database(VIP),Pubmed,Embase,Google Scholar,Web of Science,(The Cumulative Index to Nursing and Allied Health Literature)CINAHL,Psyclnfo and Scopus,dated from March 1,2013 to March 31,2023.Using IBM SPSS Modeler 18.0 and other software,the frequency analysis,association-rules analysis and cluster analysis were conducted on interventions,traditional Chinese medicine(TCM)patterns,use frequency of acupoint,meridian attribution of acupoint,acupoint location,etc.Results A total of 55 articles were included,with 102 prescriptions and 71 acupoints involved.The top 3 interventions were acupoint embedding method,simple electroacupuncture and simple filiform needling.Seventeen patterns/syndromes of TCM differentiation were collected,dominated by spleen deficiency and damp blockage,spleen and kidney yang deficiency and heat accumulation in stomach and intestines.The acupoints in clinical practice were mostly at the foot-yangming stomach meridian,the conception vessel and the foot-taiyin spleen meridian,and located at the abdominal region.The top 5 acupoints of high frequency were Tianshu(ST25),Zhongwan(CV12),Daheng(SP15),Zusanli(ST36),Huaroumen(ST24)and Daimai(GB26).The specific points of the high frequency were the crossing points and front-mu points,of which,ST25 and CV12 were the most prominent.After association-rules analysis on the high-frequency acupoints,20 groups of associated acupoints were obtained,in which,the core acupoints included ST25,CV12,SP15 and ST36.Conclusion In recent 10 years,abdominal obesity is treated by the acupoints of foot-yangming stomach meridian,the conception vessel and the foot-taiyin spleen meridian.Compared with the regimen for simple obesity,the acupoints at the abdominal region are specially selected in treatment of abdominal obesity,such as ST25,CV12,SP15 and ST36.Supplementary acupoints are selected based on syndrome differentiation to simultaneously address both the disease manifestations and root causes.展开更多
Objective:To explore the core acupuncture acupoints and pattern-adapted acupoint combination rules for autism spectrum disorder(ASD)complicated with sleep disorder using clinical data mining technology.Methods:A retro...Objective:To explore the core acupuncture acupoints and pattern-adapted acupoint combination rules for autism spectrum disorder(ASD)complicated with sleep disorder using clinical data mining technology.Methods:A retrospective analysis was conducted on the diagnosis and treatment data of 104 children with ASD complicated with sleep disorder admitted to Xi’an Traditional Chinese Medicine(TCM)Encephalopathy Hospital from January 2022 to December 2024.Cross-pattern main acupoints were screened via frequency statistics,chi-square test,and factor analysis;pattern-specific auxiliary acupoints were extracted by combining multiple correspondence analysis,cluster analysis,and association rule mining.Results:Ten cross-pattern main acupoints(Baihui,Sishenzhen,Language Area 1,Language Area 2,Neiguan,Shenmen,Yongquan,Xuanzhong)were identified,and acupoint combination schemes for four major TCM patterns(Hyperactivity of Liver and Heart Fire,Deficiency of Kidney Essence,Deficiency of Both Heart and Spleen,Hyperactivity of Liver with Spleen Deficiency)were established.Conclusion:Acupuncture treatment should follow the principle of“regulating spirit and calming the brain as the root,and dredging collaterals based on pattern differentiation as the branch”.The synergy between main and auxiliary acupoints can accurately regulate the disease,providing a basis for precise clinical treatment.展开更多
A cluster analyzing algorithm based on grids is introduced in this paper,which is applied to data mining in the city emergency system. In the previous applications, data mining was based on the method of analyzing poi...A cluster analyzing algorithm based on grids is introduced in this paper,which is applied to data mining in the city emergency system. In the previous applications, data mining was based on the method of analyzing points and lines, which was not efficient enough in dealing with the geographic information in units of police areas. The proposed algorithm maps an event set stored as a point set to a grid unit set, utilizes the cluster algorithm based on grids to find out all the clusters, and shows the results in the method of visualization. The algorithm performs well when dealing with high dimensional data sets and immense data. It is suitable for the data mining based on geogra-(phic) information system and is supportive to decision-makings in the city emergency system.展开更多
Objective:Prevention and early detection of colorectal cancer(CRC)can increase the chances of successful treatment and reduce burden.Various data mining technologies have been utilized to strengthen the early detectio...Objective:Prevention and early detection of colorectal cancer(CRC)can increase the chances of successful treatment and reduce burden.Various data mining technologies have been utilized to strengthen the early detection of CRC in primary care.Evidence synthesis on the model’s effectiveness is scant.This systematic review synthesizes studies that examine the effect of data mining on improving risk prediction of CRC.Methods:The PRISMA framework guided the conduct of this study.We obtained papers via Pub Med,Cochrane Library,EMBASE and Google Scholar.Quality appraisal was performed using Downs and Black’s quality checklist.To evaluate the performance of included models,the values of specificity and sensitivity were comparted,the values of area under the curve(AUC)were plotted,and the median of overall AUC of included studies was computed.Results:A total of 316 studies were reviewed for full text.Seven articles were included.Included studies implement techniques including artificial neural networks,Bayesian networks and decision trees.Six articles reported the overall model accuracy.Overall,the median AUC is 0.8243[interquartile range(IQR):0.8050-0.8886].In the two articles that reported comparison results with traditional models,the data mining method performed better than the traditional models,with the best AUC improvement of 10.7%.Conclusions:The adoption of data mining technologies for CRC detection is at an early stage.Limited numbers of included articles and heterogeneity of those studies implied that more rigorous research is expected to further investigate the techniques’effects.展开更多
Aiming at the shortcomings in intrusion detection systems (IDSs) used incommercial and research fields, we propose the MA-IDS system, a distributed intrusion detectionsystem based on data mining. In this model, misuse...Aiming at the shortcomings in intrusion detection systems (IDSs) used incommercial and research fields, we propose the MA-IDS system, a distributed intrusion detectionsystem based on data mining. In this model, misuse intrusion detection system CM1DS) and anomalyintrusion de-lection system (AIDS) are combined. Data mining is applied to raise detectionperformance, and distributed mechanism is employed to increase the scalability and efficiency. Host-and network-based mining algorithms employ an improved. Bayes-ian decision theorem that suits forreal security environment to minimize the risks incurred by false decisions. We describe the overallarchitecture of the MA-IDS system, and discuss specific design and implementation issue.展开更多
Background:Erzhu Erchen decoction(EZECD),which is based on Erchen decoction and enhanced with Atractylodes lancea and Atractylodes macrocephala,is widely used for the treatment of dampness and heat(The clinical manife...Background:Erzhu Erchen decoction(EZECD),which is based on Erchen decoction and enhanced with Atractylodes lancea and Atractylodes macrocephala,is widely used for the treatment of dampness and heat(The clinical manifestations of Western medicine include thirst,inability to drink more,diarrhea,yellow urine,red tongue,et al.)internalized disease.Nevertheless,the mechanism of EZECD on damp-heat internalized Type 2 diabetes(T2D)remains unknown.We employed data mining,pharmacology databases and experimental verification to study how EZECD treats damp-heat internalized T2D.Methods:The main compounds or genes of EZECD and damp-heat internalized T2D were obtained from the pharmacology databases.Succeeding,the overlapped targets of EZECD and damp-heat internalized T2D were performed by the Gene Ontology,kyoto encyclopedia of genes and genomes analysis.And the compound-disease targets-pathway network were constructed to obtain the hub compound.Moreover,the hub genes and core related pathways were mined with weighted gene co-expression network analysis based on Gene Expression Omnibus database,the capability of hub compound and genes was valid in AutoDock 1.5.7.Furthermore,and violin plot and gene set enrichment analysis were performed to explore the role of hub genes in damp-heat internalized T2D.Finally,the interactions of hub compound and genes were explored using Comparative Toxicogenomics Database and quantitative polymerase chain reaction.Results:First,herb-compounds-genes-disease network illustrated that the hub compound of EZECD for damp-heat internalized T2D could be quercetin.Consistently,the hub genes were CASP8,CCL2,and AHR according to weighted gene co-expression network analysis.Molecular docking showed that quercetin could bind with the hub genes.Further,gene set enrichment analysis and Gene Ontology represented that CASP8,or CCL2,is negatively involved in insulin secretion response to the TNF or lipopolysaccharide process,and AHR or CCL2 positively regulated lipid and atherosclerosis,and/or including NOD-like receptor signaling pathway,and TNF signaling pathway.Ultimately,the quantitative polymerase chain reaction and western blotting analysis showed that quercetin could down-regulated the mRNA and protein experssion of CASP8,CCL2,and AHR.It was consistent with the results in Comparative Toxicogenomics Database databases.Conclusion:These results demonstrated quercetin could inhibit the expression of CASP8,CCL2,AHR in damp-heat internalized T2D,which improves insulin secretion and inhibits lipid and atherosclerosis,as well as/or including NOD-like receptor signaling pathway,and TNF signaling pathway,suggesting that EZECD may be more effective to treat damp-heat internalized T2D.展开更多
文摘Distributed Data Mining is expected to discover preciously unknown, implicit and valuable information from massive data set inherently distributed over a network. In recent years several approaches to distributed data mining have been developed, but only a few of them make use of intelligent agents. This paper provides the reason for applying Multi-Agent Technology in Distributed Data Mining and presents a Distributed Data Mining System based on Multi-Agent Technology that deals with heterogeneity in such environment. Based on the advantages of both the CS model and agent-based model, the system is being able to address the specific concern of increasing scalability and enhancing performance.
文摘The authors designed the spatial data mining system for ore-forming prediction based on the theory and methods of data mining as well as the technique of spatial database,in combination with the characteristics of geological information data.The system consists of data management,data mining and knowledge discovery,knowledge representation.It can syncretize multi-source geosciences data effectively,such as geology,geochemistry,geophysics,RS.The system digitized geological information data as data layer files which consist of the two numerical values,to store these files in the system database.According to the combination of the characters of geological information,metallogenic prognosis was realized,as an example from some area in Heilongjiang Province.The prospect area of hydrothermal copper deposit was determined.
文摘The existing data mining methods are mostly focused on relational databases and structured data, but not on complex structured data (like in extensible markup language(XML)). By converting XML document type description to the relational semantic recording XML data relations, and using an XML data mining language, the XML data mining system presents a strategy to mine information on XML.
文摘In order to improve the effect of financial data mining,this paper analyzes the financial data mining system combined with fuzzy clustering multimedia tool algorithm,builds an intelligent model to intelligently process financial data,and studies a multi-layer financial data protection structure model.According to the propagation characteristics of the financial data signal at the uniform cutoff and the theoretical knowledge of the electron miroscopic motion,this paper analyzes the calculation formulas of the reflected power and the absorbed power of the financial data to the cutoff.Moreover,this paper improves the effect of fuzzy analysis of financialdata and builds a financial data mining system based on fuzzy clustering multimedia tool.The simulation study shows that the financial data mining system based on fuzzy clustering multimedia tool proposed in this paper can effectively improve the effect of financial data mining.
文摘With the gradual acceleration of information construction in colleges and universities,digital campus and smart campus have gradually become important means for colleges and universities to scientifically manage the campus.They have been applied to teaching,scientific research,student management,and other fields,improving the quality and efficiency of management.This paper mainly studies the intelligent educational administration management system based on data mining technology.Firstly,this paper introduces the application process of data mining technology,and builds an intelligent educational administration management system based on data mining technology.Then,this paper optimizes the application of the Apriori algorithm in educational administration management through transaction compression and frequent sampling.Compared with the traditional Apriori algorithm,the optimized Apriori algorithm in this paper has a shorter execution time under the same minimum support.
文摘Background Traditional Chinese medicine(TCM)is becoming a popular complementary approach in pediatric oncology.However,few or no meta-analyses have focused on clinical studies of the use of TCM in pediatric oncology.Objective We explored the patterns of TCM use and its efficacy in children with cancer,using a systematic review,meta-analysis and data mining study.Search strategy We conducted a search of five English(Allied and Complementary Medicine Database,Embase,PubMed,Cochrane Central Register of Controlled Trials,and ClinicalTrials.gov)and four Chinese databases(Wanfang Data,China National Knowledge Infrastructure,Chinese Biomedical Literature Database,and VIP Chinese Science and Technology Periodicals Database)for clinical studies published before October 2021,using keywords related to“pediatric,”“cancer,”and“TCM.”Inclusion criteria We included studies which were randomized controlled trials(RCTs)or observational clinical studies,focused on patients aged<19 years old who had been diagnosed with cancer,and included at least one group of subjects receiving TCM treatment.Data extraction and analysis The methodological quality of RCTs and observational studies was assessed using the six-item Jadad scale and the Effective Public Healthcare Panacea Project Quality Assessment Tool,respectively.Meta-analysis was used to evaluate the efficacy of combining TCM with chemotherapy.Study outcomes included the treatment response rate and occurrence of cancer-related symptoms.Association rule mining(ARM)was used to investigate the associations among medicinal herbs and patient symptoms.Results The fifty-four studies included in this analysis were comprised of RCTs(63.0%)and observational studies(37.0%).Most RCTs focused on hematological malignancies(41.2%).The study outcomes included chemotherapy-induced toxicities(76.5%),infection rate(35.3%),and response,survival or relapse rate(23.5%).The methodological quality of most of the RCTs(82.4%)and observational studies(80.0%)was rated as“moderate.”In studies of leukemia patients,adding TCM to conventional treatment significantly improved the clinical response rate(odds ratio[OR]=2.55;95%confidence interval[CI]=1.49-4.36),lowered infection rate(OR=0.23;95%CI=0.13-0.40),and reduced nausea and vomiting(OR=0.13;95%CI=0.08-0.23).ARM showed that Radix Astragali,the most commonly used medicinal herb(58.0%),was associated with treating myelosuppression,gastrointestinal complications,and infection.Conclusion There is growing evidence that TCM is an effective adjuvant therapy for children with cancer.We proposed a checklist to improve the quality of TCM trials in pediatric oncology.Future work will examine the use of ARM techniques on real-world data to evaluate the efficacy of medicinal herbs and drug-herb interactions in children receiving TCM as a part of integrated cancer therapy.
基金supported by a Lee Kong Chian School of Medicine Dean’s Postdoctoral Fellowship(021207-00001)from Nanyang Technological University(NTU)Singapore and a Mistletoe Research Fellowship(022522-00001)from the Momental Foundation USA.Jialiu Zeng is supported by a Presidential Postdoctoral Fellowship(021229-00001)from NTU Singapore and an Open Fund Young Investigator Research Grant(OF-YIRG)(MOH-001147)from the National Medical Research Council(NMRC)SingaporeSu Bin Lim is supported by the National Research Foundation(NRF)of Korea(Grant Nos.:2020R1A6A1A03043539,2020M3A9D8037604,2022R1C1C1004756)a grant of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute(KHIDI),funded by the Ministry of Health&Welfare,Republic of Korea(Grant No.:HR22C1734).
文摘Bioinformatic analysis of large and complex omics datasets has become increasingly useful in modern day biology by providing a great depth of information,with its application to neuroscience termed neuroinformatics.Data mining of omics datasets has enabled the generation of new hypotheses based on differentially regulated biological molecules associated with disease mechanisms,which can be tested experimentally for improved diagnostic and therapeutic targeting of neurodegenerative diseases.Importantly,integrating multi-omics data using a systems bioinformatics approach will advance the understanding of the layered and interactive network of biological regulation that exchanges systemic knowledge to facilitate the development of a comprehensive human brain profile.In this review,we first summarize data mining studies utilizing datasets from the individual type of omics analysis,including epigenetics/epigenomics,transcriptomics,proteomics,metabolomics,lipidomics,and spatial omics,pertaining to Alzheimer's disease,Parkinson's disease,and multiple sclerosis.We then discuss multi-omics integration approaches,including independent biological integration and unsupervised integration methods,for more intuitive and informative interpretation of the biological data obtained across different omics layers.We further assess studies that integrate multi-omics in data mining which provide convoluted biological insights and offer proof-of-concept proposition towards systems bioinformatics in the reconstruction of brain networks.Finally,we recommend a combination of high dimensional bioinformatics analysis with experimental validation to achieve translational neuroscience applications including biomarker discovery,therapeutic development,and elucidation of disease mechanisms.We conclude by providing future perspectives and opportunities in applying integrative multi-omics and systems bioinformatics to achieve precision phenotyping of neurodegenerative diseases and towards personalized medicine.
基金National Natural Science Foundations of China(No.71501103)Natural Science Foundation of Inner Mongolia,China(No.2015BS0705)the Program of Higher-Level Talents of Inner Mongolia University,China(No.20700-5145131)
文摘Complex repairable system is composed of thousands of components.Some maintenance management and decision problems in maintenance management and decision need to classify a set of components into several classes based on data mining.Furthermore,with the complexity of industrial equipment increasing,the managers should pay more attention to the key components and carry out the lean management is very important.Therefore,the idea"customer segmentation"of"precise marketing"can be used in the maintenance management of the multi-component system.Following the idea of segmentation,the components of multicomponent systems should be subdivied into groups based on specific attributes relevant to maintenance,such as maintenance cost,mean time between failures,and failure frequency.For the target specific groups of parts,the optimal maintenance policy,health assessment and maintenance scheduling can be determined.The proposed analysis framework will be given out.In order to illustrate the effectiveness of this method,a numerical example is given out.
基金Supported by the National Science of China(6 0 0 75 0 15 ) and Key Project of Scientific and Technological Departmentin Anhui
文摘This paper first puts forward a case based system framework based on data mining techniques. Then the paper examines the possibility of using neural networks as a method of retrieval in such a case based system. In this system we propose data mining algorithms to discover case knowledge and other algorithms.
文摘This paper tries to characterize volcanic rocks through the development and application of an empirical geomechanical system. Geotechnical information was collected from the samples from several Atlantic Ocean islands including Madeira, Azores and Canarias archipelagos. An empirical rock classification system termed as the volcanic rock system(VRS) is developed and presented in detail. Results using the VRS are compared with those obtained using the traditional rock mass rating(RMR) system. Data mining(DM) techniques are applied to a database of volcanic rock geomechanical information from the islands.Different algorithms were developed and consequently approaches were followed for predicting rock mass classes using the VRS and RMR classification systems. Finally, some conclusions are drawn with emphasis on the fact that a better performance was achieved using attributes from VRS.
文摘With the economic development and the popularity of application of electronic computer, electronic commerce has rapid development. More and more commerce and key business has been carried on the lnternet because Internet has the features of interaction, openness, sharing and so on. However, during the daily commerce, people worry about the security of the network system. So a new technology which can detect the unusual behavior in time has been invented in order to protect the security of network system. The system of intrusion detection needs a lot of new technology to protect the data of the network system. The application of data mining technology in the system of intrusion detection can provide a better assistant to the users to analyze the data and improve the accuracy of the checking system.
基金Henan Province Special Research Project of Tra ditional Chinese Medicine(Grant No.2022ZY1090).
文摘In the present study,data mining and network pharmacology were utilized to explore the principles and mechanisms of traditional Chinese medicine(TCM)in treating acute appendicitis.The goal was to provide a scientific basis for clinical treatment and further research on this disease.First,we searched the National Patent Database for Chinese herbal compound prescriptions used to treat acute appendicitis.We then applied frequency analysis,character and taste meridian analysis,association rule analysis,and hierarchical cluster analysis to identify the patterns of TCM treatment for acute appendicitis,selecting key combinations of Chinese medicines.Next,we screened the main active components of these key TCM based on quality markers.Using databases such as SwissTargetPrediction,SymMap,ETCM,and STRING,we analyzed the pharmacological mechanisms of these key TCM in treating acute appendicitis.Key active components and targets were further verified through molecular docking.We identified a total of 129 patents involving 316 Chinese medicines,with 24 being frequently used.The results indicated that most Chinese herbs used for acute appendicitis were heat-clearing drugs,blood-activating and stasis-removing drugs,and purging drugs.The primary active ingredients of the Rhubarb-cortex moutan-flos lonicerae combination for treating acute appendicitis included Emodin,Paeonol,Physcion,Chlorogenic acid,Chrysophanol,Rhein acid,and Aloe-emodin.These ingredients targeted key proteins such as ALB,TP53,BCL2,STAT3,IL-6,and TNF,and were involved in cellular responses to lipopolysaccharides,cell composition,and various cytokine-mediated biological processes.They also interacted with signaling pathways like AGE-RAGE,TNF,IL-17,and FoxO.Based on patent data,this study analyzed medication patterns in the treatment of acute appendicitis,discussed the possible mechanisms of key TCM combinations,and provided a scientific basis and new perspectives for the diagnosis and treatment of the disease.
文摘Objective To identify core acupoint patterns and elucidate the molecular mechanisms of acupuncture for primary depressive disorder(PDD)through data mining and network analysis.Methods A comprehensive literature search was conducted across PubMed,Embase,Ovid Technologies(OVID),Web of Science,Cochrane Library,China National Knowledge Infrastructure(CNKI),China National Knowledge Infrastructure Database(VIP),Wanfang Data,and SinoMed Database from database foundation to January 31,2025,for clinical studies on acupuncture treatment of PDD.Descriptive statistics,high-frequency acupoint analysis,degree and betweenness centrality evaluation,and core acupoint prescription mining identified predominant therapeutic combinations for PDD.Network acupuncture was used to predict therapeutic target for the core acupoint prescription.Subsequent protein-protein interaction(PPI)network and molecular complex detection(MCODE)analyses were conducted to identify the key targets and functional modules.Gene Ontology(GO)and Kyoto Encyclopedia of Genes and Genomes(KEGG)analyses explored the underlying biological mechanisms of the core acupoint prescription in treating PDD.Results A total of 57 acupoint prescriptions underwent systematic analysis.The core therapeutic combinations comprised Baihui(GV20),Yintang(GV29),Neiguan(PC6),Hegu(LI4),and Shenmen(HT7).Network acupuncture analysis identified 88 potential therapeutic targets(79 overlapping with PDD),while PPI network analysis revealed central regulatory nodes,including interleukin(IL)-6,IL-1β,tumor necrosis factor(TNF)-α,toll-like receptor 4(TLR4),IL-10,brain-derived neurotrophic factor(BDNF),transforming growth factor(TGF)-β1,C-XC motif chemokine ligand 10(CXCL10),mitogen-activated protein kinase 3(MAPK3),and nitric oxide synthase 1(NOS1).MCODE-based modular analysis further elucidated three functionally coherent clusters:inflammation-homeostasis(score=6.571),plasticity-neurotransmission(score=3.143),and oxidative stress(score=3.000).GO and KEGG analyses demonstrated significant enrichment of the MAPK,phosphoinositide 3-kinase/protein kinase B(PI3K/Akt),and hypoxia-inducible factor(HIF)-1 signaling pathways.These mechanistic insights suggested that the antidepressant effects mediated through mechanisms of neuroinflammatory regulation,neuroplasticity restoration,and immune-oxidative stress homeostasis.Conclusion This study reveals that acupuncture alleviates depression through a multi-level mechanism,primarily involving the neuroinflammation suppression,neuroplasticity enhancement,and oxidative stress regulation.These findings systematically clarify the underlying mechanisms of acupuncture’s antidepressant effects and identify novel therapeutic targets for further mechanistic research.
文摘Introduction Neurosurgical emergencies such as spontaneous intracerebral hemorrhage(ICH),traumatic brain injury(TBI),and acute brain herniation are among the most time-sensitive and high-stakes conditions in modern medicine.Clinical decisions often must be made within minutes,yet these decisions are traditionally guided by limited information,heuristic reasoning,and past experience.In this context,the rise of medical data mining and real-time analytics offers a transformative opportunity:to extract actionable intelligence from the flood of clinical,imaging,and physiological data already being collected,and to use this intelligence to guide care in real time[1–3](Figure 1).
基金Supported by Shanghai College Students Innovation and Entrepreneurship Training Program Project:202310268066The 16th Batch of Science And Technology Innovation Projects of Shanghai University of Traditional Chinese Medicine:SHUTCM2023010+1 种基金2024 Shanghai Oriental Talent Program Youth Project2021 High-level Local University Innovation Team Project of Shanghai University of Traditional Chinese Medicine:No.3 Shanghai Education Commission Personnel [2022]。
文摘Objective To explore the optimization and principles of acupoint selection and coordination in the treatment of adult abdominal obesity using acupuncture and moxibustion over the past decade using data mining.Methods Clinical studies of abdominal obesity treated with acupuncture and moxibustion,collected in the past 10 years,were searched from China Biology Medicine disc(CBMdisc),China National knowledge infrastructure(CNKI),Wanfang,China Science and Technology Journal Database(VIP),Pubmed,Embase,Google Scholar,Web of Science,(The Cumulative Index to Nursing and Allied Health Literature)CINAHL,Psyclnfo and Scopus,dated from March 1,2013 to March 31,2023.Using IBM SPSS Modeler 18.0 and other software,the frequency analysis,association-rules analysis and cluster analysis were conducted on interventions,traditional Chinese medicine(TCM)patterns,use frequency of acupoint,meridian attribution of acupoint,acupoint location,etc.Results A total of 55 articles were included,with 102 prescriptions and 71 acupoints involved.The top 3 interventions were acupoint embedding method,simple electroacupuncture and simple filiform needling.Seventeen patterns/syndromes of TCM differentiation were collected,dominated by spleen deficiency and damp blockage,spleen and kidney yang deficiency and heat accumulation in stomach and intestines.The acupoints in clinical practice were mostly at the foot-yangming stomach meridian,the conception vessel and the foot-taiyin spleen meridian,and located at the abdominal region.The top 5 acupoints of high frequency were Tianshu(ST25),Zhongwan(CV12),Daheng(SP15),Zusanli(ST36),Huaroumen(ST24)and Daimai(GB26).The specific points of the high frequency were the crossing points and front-mu points,of which,ST25 and CV12 were the most prominent.After association-rules analysis on the high-frequency acupoints,20 groups of associated acupoints were obtained,in which,the core acupoints included ST25,CV12,SP15 and ST36.Conclusion In recent 10 years,abdominal obesity is treated by the acupoints of foot-yangming stomach meridian,the conception vessel and the foot-taiyin spleen meridian.Compared with the regimen for simple obesity,the acupoints at the abdominal region are specially selected in treatment of abdominal obesity,such as ST25,CV12,SP15 and ST36.Supplementary acupoints are selected based on syndrome differentiation to simultaneously address both the disease manifestations and root causes.
基金Song Hujie’s Inheritance Studio of National Renowned Traditional Chinese Medicine Experts.
文摘Objective:To explore the core acupuncture acupoints and pattern-adapted acupoint combination rules for autism spectrum disorder(ASD)complicated with sleep disorder using clinical data mining technology.Methods:A retrospective analysis was conducted on the diagnosis and treatment data of 104 children with ASD complicated with sleep disorder admitted to Xi’an Traditional Chinese Medicine(TCM)Encephalopathy Hospital from January 2022 to December 2024.Cross-pattern main acupoints were screened via frequency statistics,chi-square test,and factor analysis;pattern-specific auxiliary acupoints were extracted by combining multiple correspondence analysis,cluster analysis,and association rule mining.Results:Ten cross-pattern main acupoints(Baihui,Sishenzhen,Language Area 1,Language Area 2,Neiguan,Shenmen,Yongquan,Xuanzhong)were identified,and acupoint combination schemes for four major TCM patterns(Hyperactivity of Liver and Heart Fire,Deficiency of Kidney Essence,Deficiency of Both Heart and Spleen,Hyperactivity of Liver with Spleen Deficiency)were established.Conclusion:Acupuncture treatment should follow the principle of“regulating spirit and calming the brain as the root,and dredging collaterals based on pattern differentiation as the branch”.The synergy between main and auxiliary acupoints can accurately regulate the disease,providing a basis for precise clinical treatment.
文摘A cluster analyzing algorithm based on grids is introduced in this paper,which is applied to data mining in the city emergency system. In the previous applications, data mining was based on the method of analyzing points and lines, which was not efficient enough in dealing with the geographic information in units of police areas. The proposed algorithm maps an event set stored as a point set to a grid unit set, utilizes the cluster algorithm based on grids to find out all the clusters, and shows the results in the method of visualization. The algorithm performs well when dealing with high dimensional data sets and immense data. It is suitable for the data mining based on geogra-(phic) information system and is supportive to decision-makings in the city emergency system.
基金supported by the National Natural Science Foundation of China(No.71804183)。
文摘Objective:Prevention and early detection of colorectal cancer(CRC)can increase the chances of successful treatment and reduce burden.Various data mining technologies have been utilized to strengthen the early detection of CRC in primary care.Evidence synthesis on the model’s effectiveness is scant.This systematic review synthesizes studies that examine the effect of data mining on improving risk prediction of CRC.Methods:The PRISMA framework guided the conduct of this study.We obtained papers via Pub Med,Cochrane Library,EMBASE and Google Scholar.Quality appraisal was performed using Downs and Black’s quality checklist.To evaluate the performance of included models,the values of specificity and sensitivity were comparted,the values of area under the curve(AUC)were plotted,and the median of overall AUC of included studies was computed.Results:A total of 316 studies were reviewed for full text.Seven articles were included.Included studies implement techniques including artificial neural networks,Bayesian networks and decision trees.Six articles reported the overall model accuracy.Overall,the median AUC is 0.8243[interquartile range(IQR):0.8050-0.8886].In the two articles that reported comparison results with traditional models,the data mining method performed better than the traditional models,with the best AUC improvement of 10.7%.Conclusions:The adoption of data mining technologies for CRC detection is at an early stage.Limited numbers of included articles and heterogeneity of those studies implied that more rigorous research is expected to further investigate the techniques’effects.
文摘Aiming at the shortcomings in intrusion detection systems (IDSs) used incommercial and research fields, we propose the MA-IDS system, a distributed intrusion detectionsystem based on data mining. In this model, misuse intrusion detection system CM1DS) and anomalyintrusion de-lection system (AIDS) are combined. Data mining is applied to raise detectionperformance, and distributed mechanism is employed to increase the scalability and efficiency. Host-and network-based mining algorithms employ an improved. Bayes-ian decision theorem that suits forreal security environment to minimize the risks incurred by false decisions. We describe the overallarchitecture of the MA-IDS system, and discuss specific design and implementation issue.
基金supported by a grant from Hubei Key Laboratory of Diabetes and Angiopathy Program of Hubei University of Science and Technology(2020XZ10)Project of Education Commission of Hubei Province(B2022192).
文摘Background:Erzhu Erchen decoction(EZECD),which is based on Erchen decoction and enhanced with Atractylodes lancea and Atractylodes macrocephala,is widely used for the treatment of dampness and heat(The clinical manifestations of Western medicine include thirst,inability to drink more,diarrhea,yellow urine,red tongue,et al.)internalized disease.Nevertheless,the mechanism of EZECD on damp-heat internalized Type 2 diabetes(T2D)remains unknown.We employed data mining,pharmacology databases and experimental verification to study how EZECD treats damp-heat internalized T2D.Methods:The main compounds or genes of EZECD and damp-heat internalized T2D were obtained from the pharmacology databases.Succeeding,the overlapped targets of EZECD and damp-heat internalized T2D were performed by the Gene Ontology,kyoto encyclopedia of genes and genomes analysis.And the compound-disease targets-pathway network were constructed to obtain the hub compound.Moreover,the hub genes and core related pathways were mined with weighted gene co-expression network analysis based on Gene Expression Omnibus database,the capability of hub compound and genes was valid in AutoDock 1.5.7.Furthermore,and violin plot and gene set enrichment analysis were performed to explore the role of hub genes in damp-heat internalized T2D.Finally,the interactions of hub compound and genes were explored using Comparative Toxicogenomics Database and quantitative polymerase chain reaction.Results:First,herb-compounds-genes-disease network illustrated that the hub compound of EZECD for damp-heat internalized T2D could be quercetin.Consistently,the hub genes were CASP8,CCL2,and AHR according to weighted gene co-expression network analysis.Molecular docking showed that quercetin could bind with the hub genes.Further,gene set enrichment analysis and Gene Ontology represented that CASP8,or CCL2,is negatively involved in insulin secretion response to the TNF or lipopolysaccharide process,and AHR or CCL2 positively regulated lipid and atherosclerosis,and/or including NOD-like receptor signaling pathway,and TNF signaling pathway.Ultimately,the quantitative polymerase chain reaction and western blotting analysis showed that quercetin could down-regulated the mRNA and protein experssion of CASP8,CCL2,and AHR.It was consistent with the results in Comparative Toxicogenomics Database databases.Conclusion:These results demonstrated quercetin could inhibit the expression of CASP8,CCL2,AHR in damp-heat internalized T2D,which improves insulin secretion and inhibits lipid and atherosclerosis,as well as/or including NOD-like receptor signaling pathway,and TNF signaling pathway,suggesting that EZECD may be more effective to treat damp-heat internalized T2D.