In the present study,data mining and network pharmacology were utilized to explore the principles and mechanisms of traditional Chinese medicine(TCM)in treating acute appendicitis.The goal was to provide a scientific ...In the present study,data mining and network pharmacology were utilized to explore the principles and mechanisms of traditional Chinese medicine(TCM)in treating acute appendicitis.The goal was to provide a scientific basis for clinical treatment and further research on this disease.First,we searched the National Patent Database for Chinese herbal compound prescriptions used to treat acute appendicitis.We then applied frequency analysis,character and taste meridian analysis,association rule analysis,and hierarchical cluster analysis to identify the patterns of TCM treatment for acute appendicitis,selecting key combinations of Chinese medicines.Next,we screened the main active components of these key TCM based on quality markers.Using databases such as SwissTargetPrediction,SymMap,ETCM,and STRING,we analyzed the pharmacological mechanisms of these key TCM in treating acute appendicitis.Key active components and targets were further verified through molecular docking.We identified a total of 129 patents involving 316 Chinese medicines,with 24 being frequently used.The results indicated that most Chinese herbs used for acute appendicitis were heat-clearing drugs,blood-activating and stasis-removing drugs,and purging drugs.The primary active ingredients of the Rhubarb-cortex moutan-flos lonicerae combination for treating acute appendicitis included Emodin,Paeonol,Physcion,Chlorogenic acid,Chrysophanol,Rhein acid,and Aloe-emodin.These ingredients targeted key proteins such as ALB,TP53,BCL2,STAT3,IL-6,and TNF,and were involved in cellular responses to lipopolysaccharides,cell composition,and various cytokine-mediated biological processes.They also interacted with signaling pathways like AGE-RAGE,TNF,IL-17,and FoxO.Based on patent data,this study analyzed medication patterns in the treatment of acute appendicitis,discussed the possible mechanisms of key TCM combinations,and provided a scientific basis and new perspectives for the diagnosis and treatment of the disease.展开更多
Introduction Neurosurgical emergencies such as spontaneous intracerebral hemorrhage(ICH),traumatic brain injury(TBI),and acute brain herniation are among the most time-sensitive and high-stakes conditions in modern me...Introduction Neurosurgical emergencies such as spontaneous intracerebral hemorrhage(ICH),traumatic brain injury(TBI),and acute brain herniation are among the most time-sensitive and high-stakes conditions in modern medicine.Clinical decisions often must be made within minutes,yet these decisions are traditionally guided by limited information,heuristic reasoning,and past experience.In this context,the rise of medical data mining and real-time analytics offers a transformative opportunity:to extract actionable intelligence from the flood of clinical,imaging,and physiological data already being collected,and to use this intelligence to guide care in real time[1–3](Figure 1).展开更多
With the gradual acceleration of information construction in colleges and universities,digital campus and smart campus have gradually become important means for colleges and universities to scientifically manage the c...With the gradual acceleration of information construction in colleges and universities,digital campus and smart campus have gradually become important means for colleges and universities to scientifically manage the campus.They have been applied to teaching,scientific research,student management,and other fields,improving the quality and efficiency of management.This paper mainly studies the intelligent educational administration management system based on data mining technology.Firstly,this paper introduces the application process of data mining technology,and builds an intelligent educational administration management system based on data mining technology.Then,this paper optimizes the application of the Apriori algorithm in educational administration management through transaction compression and frequent sampling.Compared with the traditional Apriori algorithm,the optimized Apriori algorithm in this paper has a shorter execution time under the same minimum support.展开更多
Objective To explore the optimization and principles of acupoint selection and coordination in the treatment of adult abdominal obesity using acupuncture and moxibustion over the past decade using data mining.Methods ...Objective To explore the optimization and principles of acupoint selection and coordination in the treatment of adult abdominal obesity using acupuncture and moxibustion over the past decade using data mining.Methods Clinical studies of abdominal obesity treated with acupuncture and moxibustion,collected in the past 10 years,were searched from China Biology Medicine disc(CBMdisc),China National knowledge infrastructure(CNKI),Wanfang,China Science and Technology Journal Database(VIP),Pubmed,Embase,Google Scholar,Web of Science,(The Cumulative Index to Nursing and Allied Health Literature)CINAHL,Psyclnfo and Scopus,dated from March 1,2013 to March 31,2023.Using IBM SPSS Modeler 18.0 and other software,the frequency analysis,association-rules analysis and cluster analysis were conducted on interventions,traditional Chinese medicine(TCM)patterns,use frequency of acupoint,meridian attribution of acupoint,acupoint location,etc.Results A total of 55 articles were included,with 102 prescriptions and 71 acupoints involved.The top 3 interventions were acupoint embedding method,simple electroacupuncture and simple filiform needling.Seventeen patterns/syndromes of TCM differentiation were collected,dominated by spleen deficiency and damp blockage,spleen and kidney yang deficiency and heat accumulation in stomach and intestines.The acupoints in clinical practice were mostly at the foot-yangming stomach meridian,the conception vessel and the foot-taiyin spleen meridian,and located at the abdominal region.The top 5 acupoints of high frequency were Tianshu(ST25),Zhongwan(CV12),Daheng(SP15),Zusanli(ST36),Huaroumen(ST24)and Daimai(GB26).The specific points of the high frequency were the crossing points and front-mu points,of which,ST25 and CV12 were the most prominent.After association-rules analysis on the high-frequency acupoints,20 groups of associated acupoints were obtained,in which,the core acupoints included ST25,CV12,SP15 and ST36.Conclusion In recent 10 years,abdominal obesity is treated by the acupoints of foot-yangming stomach meridian,the conception vessel and the foot-taiyin spleen meridian.Compared with the regimen for simple obesity,the acupoints at the abdominal region are specially selected in treatment of abdominal obesity,such as ST25,CV12,SP15 and ST36.Supplementary acupoints are selected based on syndrome differentiation to simultaneously address both the disease manifestations and root causes.展开更多
Objective:To explore the core acupuncture acupoints and pattern-adapted acupoint combination rules for autism spectrum disorder(ASD)complicated with sleep disorder using clinical data mining technology.Methods:A retro...Objective:To explore the core acupuncture acupoints and pattern-adapted acupoint combination rules for autism spectrum disorder(ASD)complicated with sleep disorder using clinical data mining technology.Methods:A retrospective analysis was conducted on the diagnosis and treatment data of 104 children with ASD complicated with sleep disorder admitted to Xi’an Traditional Chinese Medicine(TCM)Encephalopathy Hospital from January 2022 to December 2024.Cross-pattern main acupoints were screened via frequency statistics,chi-square test,and factor analysis;pattern-specific auxiliary acupoints were extracted by combining multiple correspondence analysis,cluster analysis,and association rule mining.Results:Ten cross-pattern main acupoints(Baihui,Sishenzhen,Language Area 1,Language Area 2,Neiguan,Shenmen,Yongquan,Xuanzhong)were identified,and acupoint combination schemes for four major TCM patterns(Hyperactivity of Liver and Heart Fire,Deficiency of Kidney Essence,Deficiency of Both Heart and Spleen,Hyperactivity of Liver with Spleen Deficiency)were established.Conclusion:Acupuncture treatment should follow the principle of“regulating spirit and calming the brain as the root,and dredging collaterals based on pattern differentiation as the branch”.The synergy between main and auxiliary acupoints can accurately regulate the disease,providing a basis for precise clinical treatment.展开更多
The rapid growth of biomedical data,particularly multi-omics data including genomes,transcriptomics,proteomics,metabolomics,and epigenomics,medical research and clinical decision-making confront both new opportunities...The rapid growth of biomedical data,particularly multi-omics data including genomes,transcriptomics,proteomics,metabolomics,and epigenomics,medical research and clinical decision-making confront both new opportunities and obstacles.The huge and diversified nature of these datasets cannot always be managed using traditional data analysis methods.As a consequence,deep learning has emerged as a strong tool for analysing numerous omics data due to its ability to handle complex and non-linear relationships.This paper explores the fundamental concepts of deep learning and how they are used in multi-omics medical data mining.We demonstrate how autoencoders,variational autoencoders,multimodal models,attention mechanisms,transformers,and graph neural networks enable pattern analysis and recognition across all omics data.Deep learning has been found to be effective in illness classification,biomarker identification,gene network learning,and therapeutic efficacy prediction.We also consider critical problems like as data quality,model explainability,whether findings can be repeated,and computational power requirements.We now consider future elements of combining omics with clinical and imaging data,explainable AI,federated learning,and real-time diagnostics.Overall,this study emphasises the need of collaborating across disciplines to advance deep learning-based multi-omics research for precision medicine and comprehending complicated disorders.展开更多
Data mining is the process of extracting implicit but potentially useful information from incomplete, noisy, and fuzzy data. Data mining offers excellent nonlinear modeling and self-organized learning, and it can play...Data mining is the process of extracting implicit but potentially useful information from incomplete, noisy, and fuzzy data. Data mining offers excellent nonlinear modeling and self-organized learning, and it can play a vital role in the interpretation of well logging data of complex reservoirs. We used data mining to identify the lithologies in a complex reservoir. The reservoir lithologies served as the classification task target and were identified using feature extraction, feature selection, and modeling of data streams. We used independent component analysis to extract information from well curves. We then used the branch-and- bound algorithm to look for the optimal feature subsets and eliminate redundant information. Finally, we used the C5.0 decision-tree algorithm to set up disaggregated models of the well logging curves. The modeling and actual logging data were in good agreement, showing the usefulness of data mining methods in complex reservoirs.展开更多
Different acupuncture-moxibustion therapies can produce different clinical effects, that is, the effect has specificity, which is significantly important in obtaining acupuncture-moxibustion efficacy. In this study, t...Different acupuncture-moxibustion therapies can produce different clinical effects, that is, the effect has specificity, which is significantly important in obtaining acupuncture-moxibustion efficacy. In this study, the clinical application laws of fire needle, acupoint injection, catgut embedment, acupoint application, moxibustion therapy and filiform needle acupuncture were summarized in the aspects of category of disease, efficacy and related prescriptions (such as medication and acupoint selection) based on the result of data mining, and the general applicable categories of disease of acupuncture-moxibustion treatment methods were further screened, so as to guide the clinical application and give play to the best efficacy.展开更多
An intrusion detection (ID) model is proposed based on the fuzzy data mining method. A major difficulty of anomaly ID is that patterns of the normal behavior change with time. In addition, an actual intrusion with a...An intrusion detection (ID) model is proposed based on the fuzzy data mining method. A major difficulty of anomaly ID is that patterns of the normal behavior change with time. In addition, an actual intrusion with a small deviation may match normal patterns. So the intrusion behavior cannot be detected by the detection system.To solve the problem, fuzzy data mining technique is utilized to extract patterns representing the normal behavior of a network. A set of fuzzy association rules mined from the network data are shown as a model of “normal behaviors”. To detect anomalous behaviors, fuzzy association rules are generated from new audit data and the similarity with sets mined from “normal” data is computed. If the similarity values are lower than a threshold value,an alarm is given. Furthermore, genetic algorithms are used to adjust the fuzzy membership functions and to select an appropriate set of features.展开更多
In order to improve the accuracy and integrality of mining data records from the web, the concepts of isomorphic page and directory page and three algorithms are proposed. An isomorphic web page is a set of web pages ...In order to improve the accuracy and integrality of mining data records from the web, the concepts of isomorphic page and directory page and three algorithms are proposed. An isomorphic web page is a set of web pages that have uniform structure, only differing in main information. A web page which contains many links that link to isomorphic web pages is called a directory page. Algorithm 1 can find directory web pages in a web using adjacent links similar analysis method. It first sorts the link, and then counts the links in each directory. If the count is greater than a given valve then finds the similar sub-page links in the directory and gives the results. A function for an isomorphic web page judgment is also proposed. Algorithm 2 can mine data records from an isomorphic page using a noise information filter. It is based on the fact that the noise information is the same in two isomorphic pages, only the main information is different. Algorithm 3 can mine data records from an entire website using the technology of spider. The experiment shows that the proposed algorithms can mine data records more intactly than the existing algorithms. Mining data records from isomorphic pages is an efficient method.展开更多
Nowadays, the scale of data normally stored in a database collected by Data Acquisition System (DAS) or Distributed Control System (DCS) in a power plant is becoming larger and larger. However there are abundant valua...Nowadays, the scale of data normally stored in a database collected by Data Acquisition System (DAS) or Distributed Control System (DCS) in a power plant is becoming larger and larger. However there are abundant valuable knowledge hidden behind them. It will be beyond people's capacity to analyze and understand these data stored in such a scale database. Fortunately data mining techniques are arising at the historic moment. In this paper, we explain the basic concept and general knowledge of data mining; analyze the characteristics and research method of data mining; give some typical applications of data mining system based on power plant real time database on intranet.展开更多
Many high quality studies have emerged from public databases,such as Surveillance,Epidemiology,and End Results(SEER),National Health and Nutrition Examination Survey(NHANES),The Cancer Genome Atlas(TCGA),and Medical I...Many high quality studies have emerged from public databases,such as Surveillance,Epidemiology,and End Results(SEER),National Health and Nutrition Examination Survey(NHANES),The Cancer Genome Atlas(TCGA),and Medical Information Mart for Intensive Care(MIMIC);however,these data are often characterized by a high degree of dimensional heterogeneity,timeliness,scarcity,irregularity,and other characteristics,resulting in the value of these data not being fully utilized.Data-mining technology has been a frontier field in medical research,as it demonstrates excellent performance in evaluating patient risks and assisting clinical decision-making in building disease-prediction models.Therefore,data mining has unique advantages in clinical big-data research,especially in large-scale medical public databases.This article introduced the main medical public database and described the steps,tasks,and models of data mining in simple language.Additionally,we described data-mining methods along with their practical applications.The goal of this work was to aid clinical researchers in gaining a clear and intuitive understanding of the application of data-mining technology on clinical big-data in order to promote the production of research results that are beneficial to doctors and patients.展开更多
Knowledge Discovery in Databases is gaining attention and raising new hopes for traditional Chinese medicine (TCM) researchers. It is a useful tool in understanding and deciphering TCM theories. Aiming for a better ...Knowledge Discovery in Databases is gaining attention and raising new hopes for traditional Chinese medicine (TCM) researchers. It is a useful tool in understanding and deciphering TCM theories. Aiming for a better understanding of Chinese herbal property theory (CHPT), this paper performed an improved association rule learning to analyze semistructured text in the book entitled Shennong's Classic of Materia Medica. The text was firstly annotated and transformed to well-structured multidimensional data. Subsequently, an Apriori algorithm was employed for producing association rules after the sensitivity analysis of parameters. From the confirmed 120 resulting rules that described the intrinsic relationships between herbal property (qi, flavor and their combinations) and herbal efficacy, two novel fundamental principles underlying CHPT were acquired and further elucidated: (1) the many-to-one mapping of herbal efficacy to herbal property; (2) the nonrandom overlap between the related efficacy of qi and flavor. This work provided an innovative knowledge about CHPT, which would be helpful for its modern research.展开更多
OBJECTIVE: To help researchers selecting appropriate data mining models to provide better evidence for the clinical practice of Traditional Chinese Medicine(TCM) diagnosis and therapy.METHODS: Clinical issues based on...OBJECTIVE: To help researchers selecting appropriate data mining models to provide better evidence for the clinical practice of Traditional Chinese Medicine(TCM) diagnosis and therapy.METHODS: Clinical issues based on data mining models were comprehensively summarized from four significant elements of the clinical studies:symptoms, symptom patterns, herbs, and efficacy.Existing problems were further generalized to determine the relevant factors of the performance of data mining models, e.g. data type, samples, parameters, variable labels. Combining these relevant factors, the TCM clinical data features were compared with regards to statistical characters and informatics properties. Data models were compared simultaneously from the view of applied conditions and suitable scopes.RESULTS: The main application problems were the inconsistent data type and the small samples for the used data mining models, which caused the inappropriate results, even the mistake results. These features, i.e. advantages, disadvantages, satisfied data types, tasks of data mining, and the TCM issues, were summarized and compared.CONCLUSION: By aiming at the special features of different data mining models, the clinical doctors could select the suitable data mining models to resolve the TCM problem.展开更多
OBJECTIVE:We applied data mining techniques to the study of acupuncture as a treatment for juvenile myopia,with the aim of identifying hidden patterns in the data.METHODS:Fifty patients with juvenile myopia were selec...OBJECTIVE:We applied data mining techniques to the study of acupuncture as a treatment for juvenile myopia,with the aim of identifying hidden patterns in the data.METHODS:Fifty patients with juvenile myopia were selected and treated with acupuncture,and data mining was used to analyze the effects of treatment and the influence of behavioral variables.Clustering analysis was used to divide myopia patients into two classifications before acupuncture treatment.Artificial neural network BP algorithm was adopted to analyze the roles of different factors in changes in diopters.An association algorithm was used to analyze factors associated with the subjective experience of acupuncture and average diopter.RESULTS:The two classification results were fully consistent with the understandings of the ophthalmic circles.The duration of using the Internet and watching TV every day was the main factor that affected vision.Acupuncture feelings and therapeutic effect have a strong correlativity.A good or above experience's score of acupuncture could slow the progression of juvenile myopia.CONCLUSION:Collecting data from patients with juvenile myopia by using data mining can extract hidden potential rules and knowledge from the research evidence.The decision support can be provided to improve the doctor's clinical acupuncture treatment effects.展开更多
Objective:To analyze the component law of Chinese patent medicines for anti-influenza and develop new prescriptions for anti-influenza by unsupervised data mining methods. Methods: Chinese patent medicine recipes for ...Objective:To analyze the component law of Chinese patent medicines for anti-influenza and develop new prescriptions for anti-influenza by unsupervised data mining methods. Methods: Chinese patent medicine recipes for anti-influenza were collected and recorded in the database, and then the correlation coefficient between herbs, core combinations of herbs and new prescriptions were analyzed by using modified mutual information, complex system entropy cluster and unsupervised hierarchical clustering, respectively. Results: Based on analysis of 126 Chinese patent medicine recipes, the frequency of each herb occurrence in these recipes, 54 frequently-used herb pairs, 34 core combinations were determined, and 4 new recipes for influenza were developed. Conclusion: Unsupervised data mining methods are able to mine the component law quickly and develop new prescriptions.展开更多
Unstable angina(UA) is the most dangerous type of Coronary Heart Disease(CHD) to cause more and more mortal and morbid world wide. Identification of biomarkers for UA at the level of proteomics and metabolomics is...Unstable angina(UA) is the most dangerous type of Coronary Heart Disease(CHD) to cause more and more mortal and morbid world wide. Identification of biomarkers for UA at the level of proteomics and metabolomics is a better avenue to understand the inner mechanism of it. Feature selection based data mining method is better suited to identify biomarkers of UA. In this study, we carried out clinical epidemiology to collect plasmas of UA in-patients and controls. Proteomics and metabolomics data were obtained via two-dimensional difference gel electrophoresis and gas chromatography techniques. We presented a novel computational strategy to select biomarkers as few as possible for UA in the two groups of data. Firstly, decision tree was used to select biomarkers for UA and 3-fold cross validation was used to evaluate computational performanees for the three methods. Alternatively, we combined inde- pendent t test and classification based data mining method as well as backward elimination technique to select, as few as possible, protein and metabolite biomarkers with best classification performances. By the method, we selected 6 proteins and 5 metabolites for UA. The novel method presented here provides a better insight into the pathology of a disease.展开更多
For the multi-mode radar working in the modern electronicbattlefield, different working states of one single radar areprone to being classified as multiple emitters when adoptingtraditional classification methods to p...For the multi-mode radar working in the modern electronicbattlefield, different working states of one single radar areprone to being classified as multiple emitters when adoptingtraditional classification methods to process intercepted signals,which has a negative effect on signal classification. A classificationmethod based on spatial data mining is presented to address theabove challenge. Inspired by the idea of spatial data mining, theclassification method applies nuclear field to depicting the distributioninformation of pulse samples in feature space, and digs out thehidden cluster information by analyzing distribution characteristics.In addition, a membership-degree criterion to quantify the correlationamong all classes is established, which ensures classificationaccuracy of signal samples. Numerical experiments show that thepresented method can effectively prevent different working statesof multi-mode emitter from being classified as several emitters,and achieve higher classification accuracy.展开更多
Background Traditional Chinese medicine(TCM)is becoming a popular complementary approach in pediatric oncology.However,few or no meta-analyses have focused on clinical studies of the use of TCM in pediatric oncology.O...Background Traditional Chinese medicine(TCM)is becoming a popular complementary approach in pediatric oncology.However,few or no meta-analyses have focused on clinical studies of the use of TCM in pediatric oncology.Objective We explored the patterns of TCM use and its efficacy in children with cancer,using a systematic review,meta-analysis and data mining study.Search strategy We conducted a search of five English(Allied and Complementary Medicine Database,Embase,PubMed,Cochrane Central Register of Controlled Trials,and ClinicalTrials.gov)and four Chinese databases(Wanfang Data,China National Knowledge Infrastructure,Chinese Biomedical Literature Database,and VIP Chinese Science and Technology Periodicals Database)for clinical studies published before October 2021,using keywords related to“pediatric,”“cancer,”and“TCM.”Inclusion criteria We included studies which were randomized controlled trials(RCTs)or observational clinical studies,focused on patients aged<19 years old who had been diagnosed with cancer,and included at least one group of subjects receiving TCM treatment.Data extraction and analysis The methodological quality of RCTs and observational studies was assessed using the six-item Jadad scale and the Effective Public Healthcare Panacea Project Quality Assessment Tool,respectively.Meta-analysis was used to evaluate the efficacy of combining TCM with chemotherapy.Study outcomes included the treatment response rate and occurrence of cancer-related symptoms.Association rule mining(ARM)was used to investigate the associations among medicinal herbs and patient symptoms.Results The fifty-four studies included in this analysis were comprised of RCTs(63.0%)and observational studies(37.0%).Most RCTs focused on hematological malignancies(41.2%).The study outcomes included chemotherapy-induced toxicities(76.5%),infection rate(35.3%),and response,survival or relapse rate(23.5%).The methodological quality of most of the RCTs(82.4%)and observational studies(80.0%)was rated as“moderate.”In studies of leukemia patients,adding TCM to conventional treatment significantly improved the clinical response rate(odds ratio[OR]=2.55;95%confidence interval[CI]=1.49-4.36),lowered infection rate(OR=0.23;95%CI=0.13-0.40),and reduced nausea and vomiting(OR=0.13;95%CI=0.08-0.23).ARM showed that Radix Astragali,the most commonly used medicinal herb(58.0%),was associated with treating myelosuppression,gastrointestinal complications,and infection.Conclusion There is growing evidence that TCM is an effective adjuvant therapy for children with cancer.We proposed a checklist to improve the quality of TCM trials in pediatric oncology.Future work will examine the use of ARM techniques on real-world data to evaluate the efficacy of medicinal herbs and drug-herb interactions in children receiving TCM as a part of integrated cancer therapy.展开更多
According to the chaotic and non-linear characters of power load data,the time series matrix is established with the theory of phase-space reconstruction,and then Lyapunov exponents with chaotic time series are comput...According to the chaotic and non-linear characters of power load data,the time series matrix is established with the theory of phase-space reconstruction,and then Lyapunov exponents with chaotic time series are computed to determine the time delay and the embedding dimension.Due to different features of the data,data mining algorithm is conducted to classify the data into different groups.Redundant information is eliminated by the advantage of data mining technology,and the historical loads that have highly similar features with the forecasting day are searched by the system.As a result,the training data can be decreased and the computing speed can also be improved when constructing support vector machine(SVM) model.Then,SVM algorithm is used to predict power load with parameters that get in pretreatment.In order to prove the effectiveness of the new model,the calculation with data mining SVM algorithm is compared with that of single SVM and back propagation network.It can be seen that the new DSVM algorithm effectively improves the forecast accuracy by 0.75%,1.10% and 1.73% compared with SVM for two random dimensions of 11-dimension,14-dimension and BP network,respectively.This indicates that the DSVM gains perfect improvement effect in the short-term power load forecasting.展开更多
基金Henan Province Special Research Project of Tra ditional Chinese Medicine(Grant No.2022ZY1090).
文摘In the present study,data mining and network pharmacology were utilized to explore the principles and mechanisms of traditional Chinese medicine(TCM)in treating acute appendicitis.The goal was to provide a scientific basis for clinical treatment and further research on this disease.First,we searched the National Patent Database for Chinese herbal compound prescriptions used to treat acute appendicitis.We then applied frequency analysis,character and taste meridian analysis,association rule analysis,and hierarchical cluster analysis to identify the patterns of TCM treatment for acute appendicitis,selecting key combinations of Chinese medicines.Next,we screened the main active components of these key TCM based on quality markers.Using databases such as SwissTargetPrediction,SymMap,ETCM,and STRING,we analyzed the pharmacological mechanisms of these key TCM in treating acute appendicitis.Key active components and targets were further verified through molecular docking.We identified a total of 129 patents involving 316 Chinese medicines,with 24 being frequently used.The results indicated that most Chinese herbs used for acute appendicitis were heat-clearing drugs,blood-activating and stasis-removing drugs,and purging drugs.The primary active ingredients of the Rhubarb-cortex moutan-flos lonicerae combination for treating acute appendicitis included Emodin,Paeonol,Physcion,Chlorogenic acid,Chrysophanol,Rhein acid,and Aloe-emodin.These ingredients targeted key proteins such as ALB,TP53,BCL2,STAT3,IL-6,and TNF,and were involved in cellular responses to lipopolysaccharides,cell composition,and various cytokine-mediated biological processes.They also interacted with signaling pathways like AGE-RAGE,TNF,IL-17,and FoxO.Based on patent data,this study analyzed medication patterns in the treatment of acute appendicitis,discussed the possible mechanisms of key TCM combinations,and provided a scientific basis and new perspectives for the diagnosis and treatment of the disease.
文摘Introduction Neurosurgical emergencies such as spontaneous intracerebral hemorrhage(ICH),traumatic brain injury(TBI),and acute brain herniation are among the most time-sensitive and high-stakes conditions in modern medicine.Clinical decisions often must be made within minutes,yet these decisions are traditionally guided by limited information,heuristic reasoning,and past experience.In this context,the rise of medical data mining and real-time analytics offers a transformative opportunity:to extract actionable intelligence from the flood of clinical,imaging,and physiological data already being collected,and to use this intelligence to guide care in real time[1–3](Figure 1).
文摘With the gradual acceleration of information construction in colleges and universities,digital campus and smart campus have gradually become important means for colleges and universities to scientifically manage the campus.They have been applied to teaching,scientific research,student management,and other fields,improving the quality and efficiency of management.This paper mainly studies the intelligent educational administration management system based on data mining technology.Firstly,this paper introduces the application process of data mining technology,and builds an intelligent educational administration management system based on data mining technology.Then,this paper optimizes the application of the Apriori algorithm in educational administration management through transaction compression and frequent sampling.Compared with the traditional Apriori algorithm,the optimized Apriori algorithm in this paper has a shorter execution time under the same minimum support.
基金Supported by Shanghai College Students Innovation and Entrepreneurship Training Program Project:202310268066The 16th Batch of Science And Technology Innovation Projects of Shanghai University of Traditional Chinese Medicine:SHUTCM2023010+1 种基金2024 Shanghai Oriental Talent Program Youth Project2021 High-level Local University Innovation Team Project of Shanghai University of Traditional Chinese Medicine:No.3 Shanghai Education Commission Personnel [2022]。
文摘Objective To explore the optimization and principles of acupoint selection and coordination in the treatment of adult abdominal obesity using acupuncture and moxibustion over the past decade using data mining.Methods Clinical studies of abdominal obesity treated with acupuncture and moxibustion,collected in the past 10 years,were searched from China Biology Medicine disc(CBMdisc),China National knowledge infrastructure(CNKI),Wanfang,China Science and Technology Journal Database(VIP),Pubmed,Embase,Google Scholar,Web of Science,(The Cumulative Index to Nursing and Allied Health Literature)CINAHL,Psyclnfo and Scopus,dated from March 1,2013 to March 31,2023.Using IBM SPSS Modeler 18.0 and other software,the frequency analysis,association-rules analysis and cluster analysis were conducted on interventions,traditional Chinese medicine(TCM)patterns,use frequency of acupoint,meridian attribution of acupoint,acupoint location,etc.Results A total of 55 articles were included,with 102 prescriptions and 71 acupoints involved.The top 3 interventions were acupoint embedding method,simple electroacupuncture and simple filiform needling.Seventeen patterns/syndromes of TCM differentiation were collected,dominated by spleen deficiency and damp blockage,spleen and kidney yang deficiency and heat accumulation in stomach and intestines.The acupoints in clinical practice were mostly at the foot-yangming stomach meridian,the conception vessel and the foot-taiyin spleen meridian,and located at the abdominal region.The top 5 acupoints of high frequency were Tianshu(ST25),Zhongwan(CV12),Daheng(SP15),Zusanli(ST36),Huaroumen(ST24)and Daimai(GB26).The specific points of the high frequency were the crossing points and front-mu points,of which,ST25 and CV12 were the most prominent.After association-rules analysis on the high-frequency acupoints,20 groups of associated acupoints were obtained,in which,the core acupoints included ST25,CV12,SP15 and ST36.Conclusion In recent 10 years,abdominal obesity is treated by the acupoints of foot-yangming stomach meridian,the conception vessel and the foot-taiyin spleen meridian.Compared with the regimen for simple obesity,the acupoints at the abdominal region are specially selected in treatment of abdominal obesity,such as ST25,CV12,SP15 and ST36.Supplementary acupoints are selected based on syndrome differentiation to simultaneously address both the disease manifestations and root causes.
基金Song Hujie’s Inheritance Studio of National Renowned Traditional Chinese Medicine Experts.
文摘Objective:To explore the core acupuncture acupoints and pattern-adapted acupoint combination rules for autism spectrum disorder(ASD)complicated with sleep disorder using clinical data mining technology.Methods:A retrospective analysis was conducted on the diagnosis and treatment data of 104 children with ASD complicated with sleep disorder admitted to Xi’an Traditional Chinese Medicine(TCM)Encephalopathy Hospital from January 2022 to December 2024.Cross-pattern main acupoints were screened via frequency statistics,chi-square test,and factor analysis;pattern-specific auxiliary acupoints were extracted by combining multiple correspondence analysis,cluster analysis,and association rule mining.Results:Ten cross-pattern main acupoints(Baihui,Sishenzhen,Language Area 1,Language Area 2,Neiguan,Shenmen,Yongquan,Xuanzhong)were identified,and acupoint combination schemes for four major TCM patterns(Hyperactivity of Liver and Heart Fire,Deficiency of Kidney Essence,Deficiency of Both Heart and Spleen,Hyperactivity of Liver with Spleen Deficiency)were established.Conclusion:Acupuncture treatment should follow the principle of“regulating spirit and calming the brain as the root,and dredging collaterals based on pattern differentiation as the branch”.The synergy between main and auxiliary acupoints can accurately regulate the disease,providing a basis for precise clinical treatment.
文摘The rapid growth of biomedical data,particularly multi-omics data including genomes,transcriptomics,proteomics,metabolomics,and epigenomics,medical research and clinical decision-making confront both new opportunities and obstacles.The huge and diversified nature of these datasets cannot always be managed using traditional data analysis methods.As a consequence,deep learning has emerged as a strong tool for analysing numerous omics data due to its ability to handle complex and non-linear relationships.This paper explores the fundamental concepts of deep learning and how they are used in multi-omics medical data mining.We demonstrate how autoencoders,variational autoencoders,multimodal models,attention mechanisms,transformers,and graph neural networks enable pattern analysis and recognition across all omics data.Deep learning has been found to be effective in illness classification,biomarker identification,gene network learning,and therapeutic efficacy prediction.We also consider critical problems like as data quality,model explainability,whether findings can be repeated,and computational power requirements.We now consider future elements of combining omics with clinical and imaging data,explainable AI,federated learning,and real-time diagnostics.Overall,this study emphasises the need of collaborating across disciplines to advance deep learning-based multi-omics research for precision medicine and comprehending complicated disorders.
基金sponsored by the National Science and Technology Major Project(No.2011ZX05023-005-006)
文摘Data mining is the process of extracting implicit but potentially useful information from incomplete, noisy, and fuzzy data. Data mining offers excellent nonlinear modeling and self-organized learning, and it can play a vital role in the interpretation of well logging data of complex reservoirs. We used data mining to identify the lithologies in a complex reservoir. The reservoir lithologies served as the classification task target and were identified using feature extraction, feature selection, and modeling of data streams. We used independent component analysis to extract information from well curves. We then used the branch-and- bound algorithm to look for the optimal feature subsets and eliminate redundant information. Finally, we used the C5.0 decision-tree algorithm to set up disaggregated models of the well logging curves. The modeling and actual logging data were in good agreement, showing the usefulness of data mining methods in complex reservoirs.
基金National Natural Science Foundation of China:81072883,81173342,81473773Scientific Research Project of Hebei Education Department:Z 2014145Planned Project of Young Talents in Colleges and Universities in Hebei Province:BJ 2014047
文摘Different acupuncture-moxibustion therapies can produce different clinical effects, that is, the effect has specificity, which is significantly important in obtaining acupuncture-moxibustion efficacy. In this study, the clinical application laws of fire needle, acupoint injection, catgut embedment, acupoint application, moxibustion therapy and filiform needle acupuncture were summarized in the aspects of category of disease, efficacy and related prescriptions (such as medication and acupoint selection) based on the result of data mining, and the general applicable categories of disease of acupuncture-moxibustion treatment methods were further screened, so as to guide the clinical application and give play to the best efficacy.
文摘An intrusion detection (ID) model is proposed based on the fuzzy data mining method. A major difficulty of anomaly ID is that patterns of the normal behavior change with time. In addition, an actual intrusion with a small deviation may match normal patterns. So the intrusion behavior cannot be detected by the detection system.To solve the problem, fuzzy data mining technique is utilized to extract patterns representing the normal behavior of a network. A set of fuzzy association rules mined from the network data are shown as a model of “normal behaviors”. To detect anomalous behaviors, fuzzy association rules are generated from new audit data and the similarity with sets mined from “normal” data is computed. If the similarity values are lower than a threshold value,an alarm is given. Furthermore, genetic algorithms are used to adjust the fuzzy membership functions and to select an appropriate set of features.
文摘In order to improve the accuracy and integrality of mining data records from the web, the concepts of isomorphic page and directory page and three algorithms are proposed. An isomorphic web page is a set of web pages that have uniform structure, only differing in main information. A web page which contains many links that link to isomorphic web pages is called a directory page. Algorithm 1 can find directory web pages in a web using adjacent links similar analysis method. It first sorts the link, and then counts the links in each directory. If the count is greater than a given valve then finds the similar sub-page links in the directory and gives the results. A function for an isomorphic web page judgment is also proposed. Algorithm 2 can mine data records from an isomorphic page using a noise information filter. It is based on the fact that the noise information is the same in two isomorphic pages, only the main information is different. Algorithm 3 can mine data records from an entire website using the technology of spider. The experiment shows that the proposed algorithms can mine data records more intactly than the existing algorithms. Mining data records from isomorphic pages is an efficient method.
文摘Nowadays, the scale of data normally stored in a database collected by Data Acquisition System (DAS) or Distributed Control System (DCS) in a power plant is becoming larger and larger. However there are abundant valuable knowledge hidden behind them. It will be beyond people's capacity to analyze and understand these data stored in such a scale database. Fortunately data mining techniques are arising at the historic moment. In this paper, we explain the basic concept and general knowledge of data mining; analyze the characteristics and research method of data mining; give some typical applications of data mining system based on power plant real time database on intranet.
基金the National Social Science Foundation of China(No.16BGL183).
文摘Many high quality studies have emerged from public databases,such as Surveillance,Epidemiology,and End Results(SEER),National Health and Nutrition Examination Survey(NHANES),The Cancer Genome Atlas(TCGA),and Medical Information Mart for Intensive Care(MIMIC);however,these data are often characterized by a high degree of dimensional heterogeneity,timeliness,scarcity,irregularity,and other characteristics,resulting in the value of these data not being fully utilized.Data-mining technology has been a frontier field in medical research,as it demonstrates excellent performance in evaluating patient risks and assisting clinical decision-making in building disease-prediction models.Therefore,data mining has unique advantages in clinical big-data research,especially in large-scale medical public databases.This article introduced the main medical public database and described the steps,tasks,and models of data mining in simple language.Additionally,we described data-mining methods along with their practical applications.The goal of this work was to aid clinical researchers in gaining a clear and intuitive understanding of the application of data-mining technology on clinical big-data in order to promote the production of research results that are beneficial to doctors and patients.
文摘Knowledge Discovery in Databases is gaining attention and raising new hopes for traditional Chinese medicine (TCM) researchers. It is a useful tool in understanding and deciphering TCM theories. Aiming for a better understanding of Chinese herbal property theory (CHPT), this paper performed an improved association rule learning to analyze semistructured text in the book entitled Shennong's Classic of Materia Medica. The text was firstly annotated and transformed to well-structured multidimensional data. Subsequently, an Apriori algorithm was employed for producing association rules after the sensitivity analysis of parameters. From the confirmed 120 resulting rules that described the intrinsic relationships between herbal property (qi, flavor and their combinations) and herbal efficacy, two novel fundamental principles underlying CHPT were acquired and further elucidated: (1) the many-to-one mapping of herbal efficacy to herbal property; (2) the nonrandom overlap between the related efficacy of qi and flavor. This work provided an innovative knowledge about CHPT, which would be helpful for its modern research.
基金Supported by Research on Pattern differentiation of AIDS based on Graph Theroy of National Natural Science Foundation of China(No.81202858)Research on Intervention Evaluation of TCM Health Differentiation of National Key Technology Support Program(No.2012BAI25B02)+3 种基金Research and Development in Digital Information System of Traditional Chinese Medicine of National 863 Program of China(No.2012AA02A609)Acupuncture Efficacy of Gastrointestinal Dysfunction(No.ZZ05003)Acupuncture-point Specialty Analysis based on Image Processing Technology(No.ZZ03090)of Self-selected subject of China Academy of Chinese Medical SciencesSemantic Recognition of Tongue and Pulse based on Image Content of the Beijing Key Laboratory of Advanced Information Science and Network Technology(No.XDXX1306)
文摘OBJECTIVE: To help researchers selecting appropriate data mining models to provide better evidence for the clinical practice of Traditional Chinese Medicine(TCM) diagnosis and therapy.METHODS: Clinical issues based on data mining models were comprehensively summarized from four significant elements of the clinical studies:symptoms, symptom patterns, herbs, and efficacy.Existing problems were further generalized to determine the relevant factors of the performance of data mining models, e.g. data type, samples, parameters, variable labels. Combining these relevant factors, the TCM clinical data features were compared with regards to statistical characters and informatics properties. Data models were compared simultaneously from the view of applied conditions and suitable scopes.RESULTS: The main application problems were the inconsistent data type and the small samples for the used data mining models, which caused the inappropriate results, even the mistake results. These features, i.e. advantages, disadvantages, satisfied data types, tasks of data mining, and the TCM issues, were summarized and compared.CONCLUSION: By aiming at the special features of different data mining models, the clinical doctors could select the suitable data mining models to resolve the TCM problem.
基金Supported by National Natural Science Foundation grant NO.40976108Public Projects of Science and Technology Ministry grant NO.201105033
文摘OBJECTIVE:We applied data mining techniques to the study of acupuncture as a treatment for juvenile myopia,with the aim of identifying hidden patterns in the data.METHODS:Fifty patients with juvenile myopia were selected and treated with acupuncture,and data mining was used to analyze the effects of treatment and the influence of behavioral variables.Clustering analysis was used to divide myopia patients into two classifications before acupuncture treatment.Artificial neural network BP algorithm was adopted to analyze the roles of different factors in changes in diopters.An association algorithm was used to analyze factors associated with the subjective experience of acupuncture and average diopter.RESULTS:The two classification results were fully consistent with the understandings of the ophthalmic circles.The duration of using the Internet and watching TV every day was the main factor that affected vision.Acupuncture feelings and therapeutic effect have a strong correlativity.A good or above experience's score of acupuncture could slow the progression of juvenile myopia.CONCLUSION:Collecting data from patients with juvenile myopia by using data mining can extract hidden potential rules and knowledge from the research evidence.The decision support can be provided to improve the doctor's clinical acupuncture treatment effects.
基金supported by Scientific Research Special Project of TCM Profession (200907001E)Science and Technology Special Major Project for "Significant New Drugs Formulation" (2009ZX09301-005-02)
文摘Objective:To analyze the component law of Chinese patent medicines for anti-influenza and develop new prescriptions for anti-influenza by unsupervised data mining methods. Methods: Chinese patent medicine recipes for anti-influenza were collected and recorded in the database, and then the correlation coefficient between herbs, core combinations of herbs and new prescriptions were analyzed by using modified mutual information, complex system entropy cluster and unsupervised hierarchical clustering, respectively. Results: Based on analysis of 126 Chinese patent medicine recipes, the frequency of each herb occurrence in these recipes, 54 frequently-used herb pairs, 34 core combinations were determined, and 4 new recipes for influenza were developed. Conclusion: Unsupervised data mining methods are able to mine the component law quickly and develop new prescriptions.
基金Supported by the National Basic Research Program of China(No2011CB505106)the National Natural Science Foundation of China(No30902020)+2 种基金the Foundation of National Department of Public Benefit Research of China(No200807007)the Creation Fund for Significant New Drugs of China(No2009ZX09502-018)the Foundation of International Science and Technology Cooperation of China(No2008DFA30610)
文摘Unstable angina(UA) is the most dangerous type of Coronary Heart Disease(CHD) to cause more and more mortal and morbid world wide. Identification of biomarkers for UA at the level of proteomics and metabolomics is a better avenue to understand the inner mechanism of it. Feature selection based data mining method is better suited to identify biomarkers of UA. In this study, we carried out clinical epidemiology to collect plasmas of UA in-patients and controls. Proteomics and metabolomics data were obtained via two-dimensional difference gel electrophoresis and gas chromatography techniques. We presented a novel computational strategy to select biomarkers as few as possible for UA in the two groups of data. Firstly, decision tree was used to select biomarkers for UA and 3-fold cross validation was used to evaluate computational performanees for the three methods. Alternatively, we combined inde- pendent t test and classification based data mining method as well as backward elimination technique to select, as few as possible, protein and metabolite biomarkers with best classification performances. By the method, we selected 6 proteins and 5 metabolites for UA. The novel method presented here provides a better insight into the pathology of a disease.
基金supported by the National Natural Science Foundation of China(61371172)the International S&T Cooperation Program of China(2015DFR10220)+1 种基金the Ocean Engineering Project of National Key Laboratory Foundation(1213)the Fundamental Research Funds for the Central Universities(HEUCF1608)
文摘For the multi-mode radar working in the modern electronicbattlefield, different working states of one single radar areprone to being classified as multiple emitters when adoptingtraditional classification methods to process intercepted signals,which has a negative effect on signal classification. A classificationmethod based on spatial data mining is presented to address theabove challenge. Inspired by the idea of spatial data mining, theclassification method applies nuclear field to depicting the distributioninformation of pulse samples in feature space, and digs out thehidden cluster information by analyzing distribution characteristics.In addition, a membership-degree criterion to quantify the correlationamong all classes is established, which ensures classificationaccuracy of signal samples. Numerical experiments show that thepresented method can effectively prevent different working statesof multi-mode emitter from being classified as several emitters,and achieve higher classification accuracy.
文摘Background Traditional Chinese medicine(TCM)is becoming a popular complementary approach in pediatric oncology.However,few or no meta-analyses have focused on clinical studies of the use of TCM in pediatric oncology.Objective We explored the patterns of TCM use and its efficacy in children with cancer,using a systematic review,meta-analysis and data mining study.Search strategy We conducted a search of five English(Allied and Complementary Medicine Database,Embase,PubMed,Cochrane Central Register of Controlled Trials,and ClinicalTrials.gov)and four Chinese databases(Wanfang Data,China National Knowledge Infrastructure,Chinese Biomedical Literature Database,and VIP Chinese Science and Technology Periodicals Database)for clinical studies published before October 2021,using keywords related to“pediatric,”“cancer,”and“TCM.”Inclusion criteria We included studies which were randomized controlled trials(RCTs)or observational clinical studies,focused on patients aged<19 years old who had been diagnosed with cancer,and included at least one group of subjects receiving TCM treatment.Data extraction and analysis The methodological quality of RCTs and observational studies was assessed using the six-item Jadad scale and the Effective Public Healthcare Panacea Project Quality Assessment Tool,respectively.Meta-analysis was used to evaluate the efficacy of combining TCM with chemotherapy.Study outcomes included the treatment response rate and occurrence of cancer-related symptoms.Association rule mining(ARM)was used to investigate the associations among medicinal herbs and patient symptoms.Results The fifty-four studies included in this analysis were comprised of RCTs(63.0%)and observational studies(37.0%).Most RCTs focused on hematological malignancies(41.2%).The study outcomes included chemotherapy-induced toxicities(76.5%),infection rate(35.3%),and response,survival or relapse rate(23.5%).The methodological quality of most of the RCTs(82.4%)and observational studies(80.0%)was rated as“moderate.”In studies of leukemia patients,adding TCM to conventional treatment significantly improved the clinical response rate(odds ratio[OR]=2.55;95%confidence interval[CI]=1.49-4.36),lowered infection rate(OR=0.23;95%CI=0.13-0.40),and reduced nausea and vomiting(OR=0.13;95%CI=0.08-0.23).ARM showed that Radix Astragali,the most commonly used medicinal herb(58.0%),was associated with treating myelosuppression,gastrointestinal complications,and infection.Conclusion There is growing evidence that TCM is an effective adjuvant therapy for children with cancer.We proposed a checklist to improve the quality of TCM trials in pediatric oncology.Future work will examine the use of ARM techniques on real-world data to evaluate the efficacy of medicinal herbs and drug-herb interactions in children receiving TCM as a part of integrated cancer therapy.
基金Project(70671039) supported by the National Natural Science Foundation of China
文摘According to the chaotic and non-linear characters of power load data,the time series matrix is established with the theory of phase-space reconstruction,and then Lyapunov exponents with chaotic time series are computed to determine the time delay and the embedding dimension.Due to different features of the data,data mining algorithm is conducted to classify the data into different groups.Redundant information is eliminated by the advantage of data mining technology,and the historical loads that have highly similar features with the forecasting day are searched by the system.As a result,the training data can be decreased and the computing speed can also be improved when constructing support vector machine(SVM) model.Then,SVM algorithm is used to predict power load with parameters that get in pretreatment.In order to prove the effectiveness of the new model,the calculation with data mining SVM algorithm is compared with that of single SVM and back propagation network.It can be seen that the new DSVM algorithm effectively improves the forecast accuracy by 0.75%,1.10% and 1.73% compared with SVM for two random dimensions of 11-dimension,14-dimension and BP network,respectively.This indicates that the DSVM gains perfect improvement effect in the short-term power load forecasting.