In the present study,data mining and network pharmacology were utilized to explore the principles and mechanisms of traditional Chinese medicine(TCM)in treating acute appendicitis.The goal was to provide a scientific ...In the present study,data mining and network pharmacology were utilized to explore the principles and mechanisms of traditional Chinese medicine(TCM)in treating acute appendicitis.The goal was to provide a scientific basis for clinical treatment and further research on this disease.First,we searched the National Patent Database for Chinese herbal compound prescriptions used to treat acute appendicitis.We then applied frequency analysis,character and taste meridian analysis,association rule analysis,and hierarchical cluster analysis to identify the patterns of TCM treatment for acute appendicitis,selecting key combinations of Chinese medicines.Next,we screened the main active components of these key TCM based on quality markers.Using databases such as SwissTargetPrediction,SymMap,ETCM,and STRING,we analyzed the pharmacological mechanisms of these key TCM in treating acute appendicitis.Key active components and targets were further verified through molecular docking.We identified a total of 129 patents involving 316 Chinese medicines,with 24 being frequently used.The results indicated that most Chinese herbs used for acute appendicitis were heat-clearing drugs,blood-activating and stasis-removing drugs,and purging drugs.The primary active ingredients of the Rhubarb-cortex moutan-flos lonicerae combination for treating acute appendicitis included Emodin,Paeonol,Physcion,Chlorogenic acid,Chrysophanol,Rhein acid,and Aloe-emodin.These ingredients targeted key proteins such as ALB,TP53,BCL2,STAT3,IL-6,and TNF,and were involved in cellular responses to lipopolysaccharides,cell composition,and various cytokine-mediated biological processes.They also interacted with signaling pathways like AGE-RAGE,TNF,IL-17,and FoxO.Based on patent data,this study analyzed medication patterns in the treatment of acute appendicitis,discussed the possible mechanisms of key TCM combinations,and provided a scientific basis and new perspectives for the diagnosis and treatment of the disease.展开更多
Objective To identify core acupoint patterns and elucidate the molecular mechanisms of acupuncture for primary depressive disorder(PDD)through data mining and network analysis.Methods A comprehensive literature search...Objective To identify core acupoint patterns and elucidate the molecular mechanisms of acupuncture for primary depressive disorder(PDD)through data mining and network analysis.Methods A comprehensive literature search was conducted across PubMed,Embase,Ovid Technologies(OVID),Web of Science,Cochrane Library,China National Knowledge Infrastructure(CNKI),China National Knowledge Infrastructure Database(VIP),Wanfang Data,and SinoMed Database from database foundation to January 31,2025,for clinical studies on acupuncture treatment of PDD.Descriptive statistics,high-frequency acupoint analysis,degree and betweenness centrality evaluation,and core acupoint prescription mining identified predominant therapeutic combinations for PDD.Network acupuncture was used to predict therapeutic target for the core acupoint prescription.Subsequent protein-protein interaction(PPI)network and molecular complex detection(MCODE)analyses were conducted to identify the key targets and functional modules.Gene Ontology(GO)and Kyoto Encyclopedia of Genes and Genomes(KEGG)analyses explored the underlying biological mechanisms of the core acupoint prescription in treating PDD.Results A total of 57 acupoint prescriptions underwent systematic analysis.The core therapeutic combinations comprised Baihui(GV20),Yintang(GV29),Neiguan(PC6),Hegu(LI4),and Shenmen(HT7).Network acupuncture analysis identified 88 potential therapeutic targets(79 overlapping with PDD),while PPI network analysis revealed central regulatory nodes,including interleukin(IL)-6,IL-1β,tumor necrosis factor(TNF)-α,toll-like receptor 4(TLR4),IL-10,brain-derived neurotrophic factor(BDNF),transforming growth factor(TGF)-β1,C-XC motif chemokine ligand 10(CXCL10),mitogen-activated protein kinase 3(MAPK3),and nitric oxide synthase 1(NOS1).MCODE-based modular analysis further elucidated three functionally coherent clusters:inflammation-homeostasis(score=6.571),plasticity-neurotransmission(score=3.143),and oxidative stress(score=3.000).GO and KEGG analyses demonstrated significant enrichment of the MAPK,phosphoinositide 3-kinase/protein kinase B(PI3K/Akt),and hypoxia-inducible factor(HIF)-1 signaling pathways.These mechanistic insights suggested that the antidepressant effects mediated through mechanisms of neuroinflammatory regulation,neuroplasticity restoration,and immune-oxidative stress homeostasis.Conclusion This study reveals that acupuncture alleviates depression through a multi-level mechanism,primarily involving the neuroinflammation suppression,neuroplasticity enhancement,and oxidative stress regulation.These findings systematically clarify the underlying mechanisms of acupuncture’s antidepressant effects and identify novel therapeutic targets for further mechanistic research.展开更多
Introduction Neurosurgical emergencies such as spontaneous intracerebral hemorrhage(ICH),traumatic brain injury(TBI),and acute brain herniation are among the most time-sensitive and high-stakes conditions in modern me...Introduction Neurosurgical emergencies such as spontaneous intracerebral hemorrhage(ICH),traumatic brain injury(TBI),and acute brain herniation are among the most time-sensitive and high-stakes conditions in modern medicine.Clinical decisions often must be made within minutes,yet these decisions are traditionally guided by limited information,heuristic reasoning,and past experience.In this context,the rise of medical data mining and real-time analytics offers a transformative opportunity:to extract actionable intelligence from the flood of clinical,imaging,and physiological data already being collected,and to use this intelligence to guide care in real time[1–3](Figure 1).展开更多
With the gradual acceleration of information construction in colleges and universities,digital campus and smart campus have gradually become important means for colleges and universities to scientifically manage the c...With the gradual acceleration of information construction in colleges and universities,digital campus and smart campus have gradually become important means for colleges and universities to scientifically manage the campus.They have been applied to teaching,scientific research,student management,and other fields,improving the quality and efficiency of management.This paper mainly studies the intelligent educational administration management system based on data mining technology.Firstly,this paper introduces the application process of data mining technology,and builds an intelligent educational administration management system based on data mining technology.Then,this paper optimizes the application of the Apriori algorithm in educational administration management through transaction compression and frequent sampling.Compared with the traditional Apriori algorithm,the optimized Apriori algorithm in this paper has a shorter execution time under the same minimum support.展开更多
Objective To explore the optimization and principles of acupoint selection and coordination in the treatment of adult abdominal obesity using acupuncture and moxibustion over the past decade using data mining.Methods ...Objective To explore the optimization and principles of acupoint selection and coordination in the treatment of adult abdominal obesity using acupuncture and moxibustion over the past decade using data mining.Methods Clinical studies of abdominal obesity treated with acupuncture and moxibustion,collected in the past 10 years,were searched from China Biology Medicine disc(CBMdisc),China National knowledge infrastructure(CNKI),Wanfang,China Science and Technology Journal Database(VIP),Pubmed,Embase,Google Scholar,Web of Science,(The Cumulative Index to Nursing and Allied Health Literature)CINAHL,Psyclnfo and Scopus,dated from March 1,2013 to March 31,2023.Using IBM SPSS Modeler 18.0 and other software,the frequency analysis,association-rules analysis and cluster analysis were conducted on interventions,traditional Chinese medicine(TCM)patterns,use frequency of acupoint,meridian attribution of acupoint,acupoint location,etc.Results A total of 55 articles were included,with 102 prescriptions and 71 acupoints involved.The top 3 interventions were acupoint embedding method,simple electroacupuncture and simple filiform needling.Seventeen patterns/syndromes of TCM differentiation were collected,dominated by spleen deficiency and damp blockage,spleen and kidney yang deficiency and heat accumulation in stomach and intestines.The acupoints in clinical practice were mostly at the foot-yangming stomach meridian,the conception vessel and the foot-taiyin spleen meridian,and located at the abdominal region.The top 5 acupoints of high frequency were Tianshu(ST25),Zhongwan(CV12),Daheng(SP15),Zusanli(ST36),Huaroumen(ST24)and Daimai(GB26).The specific points of the high frequency were the crossing points and front-mu points,of which,ST25 and CV12 were the most prominent.After association-rules analysis on the high-frequency acupoints,20 groups of associated acupoints were obtained,in which,the core acupoints included ST25,CV12,SP15 and ST36.Conclusion In recent 10 years,abdominal obesity is treated by the acupoints of foot-yangming stomach meridian,the conception vessel and the foot-taiyin spleen meridian.Compared with the regimen for simple obesity,the acupoints at the abdominal region are specially selected in treatment of abdominal obesity,such as ST25,CV12,SP15 and ST36.Supplementary acupoints are selected based on syndrome differentiation to simultaneously address both the disease manifestations and root causes.展开更多
Objective:To explore the core acupuncture acupoints and pattern-adapted acupoint combination rules for autism spectrum disorder(ASD)complicated with sleep disorder using clinical data mining technology.Methods:A retro...Objective:To explore the core acupuncture acupoints and pattern-adapted acupoint combination rules for autism spectrum disorder(ASD)complicated with sleep disorder using clinical data mining technology.Methods:A retrospective analysis was conducted on the diagnosis and treatment data of 104 children with ASD complicated with sleep disorder admitted to Xi’an Traditional Chinese Medicine(TCM)Encephalopathy Hospital from January 2022 to December 2024.Cross-pattern main acupoints were screened via frequency statistics,chi-square test,and factor analysis;pattern-specific auxiliary acupoints were extracted by combining multiple correspondence analysis,cluster analysis,and association rule mining.Results:Ten cross-pattern main acupoints(Baihui,Sishenzhen,Language Area 1,Language Area 2,Neiguan,Shenmen,Yongquan,Xuanzhong)were identified,and acupoint combination schemes for four major TCM patterns(Hyperactivity of Liver and Heart Fire,Deficiency of Kidney Essence,Deficiency of Both Heart and Spleen,Hyperactivity of Liver with Spleen Deficiency)were established.Conclusion:Acupuncture treatment should follow the principle of“regulating spirit and calming the brain as the root,and dredging collaterals based on pattern differentiation as the branch”.The synergy between main and auxiliary acupoints can accurately regulate the disease,providing a basis for precise clinical treatment.展开更多
Population growth leads to increased utilization of water resources.One of these resources is groundwater,which has steadily declined each year.The depletion of these resources brings about various environmental chall...Population growth leads to increased utilization of water resources.One of these resources is groundwater,which has steadily declined each year.The depletion of these resources brings about various environmental challenges.The present study aimed to explore the relationship between groundwater fluctuations and land subsidence in the Malayer Plain,Iran,focusing on quantifying subsidence resulting from groundwater extraction.Using Sentinel-1 satellite data(2014–2019)and monthly piezometric measurements(1996–2018),the analysis revealed an average deformation velocity of–6.3 cm yr–1,with accumulated subsidence of–32 cm over the 2014–2019 period.The maximum subsidence rate reached 10.3 cm yr–1 in areas of intensive agricultural activity.A wavelet-PCA spatiotemporal analysis of groundwater fluctuations identified critical multi-scale patterns strongly correlated with subsidence trends.Regression analysis between subsidence rates and groundwater fluctuations at various wavelet decomposition levels explained 75%of the variance(R2=0.75),indicating that intermediate-scale groundwater declines were the primary drivers of subsidence.Furthermore,land use analysis using Landsat data(1999–2021)revealed a 6230-ha increase in irrigated farmland,contributing to heightened groundwater extraction and subsidence rates.These findings highlight the critical need for sustainable groundwater management to mitigate the risks of continued subsidence in the region.展开更多
A deep-sea riser is a crucial component of the mining system used to lift seafloor mineral resources to the vessel.Even minor damage to the riser can lead to substantial financial losses,environmental impacts,and safe...A deep-sea riser is a crucial component of the mining system used to lift seafloor mineral resources to the vessel.Even minor damage to the riser can lead to substantial financial losses,environmental impacts,and safety hazards.However,identifying modal parameters for structural health monitoring remains a major challenge due to its large deformations and flexibility.Vibration signal-based methods are essential for detecting damage and enabling timely maintenance to minimize losses.However,accurately extracting features from one-dimensional(1D)signals is often hindered by various environmental factors and measurement noises.To address this challenge,a novel approach based on a residual convolutional auto-encoder(RCAE)is proposed for detecting damage in deep-sea mining risers,incorporating a data fusion strategy.First,principal component analysis(PCA)is applied to reduce environmental fluctuations and fuse multisensor strain readings.Subsequently,a 1D-RCAE is used to extract damage-sensitive features(DSFs)from the fused dataset.A Mahalanobis distance indicator is established to compare the DSFs of the testing and healthy risers.The specific threshold for these distances is determined using the 3σcriterion,which is employed to assess whether damage has occurred in the testing riser.The effectiveness and robustness of the proposed approach are verified through numerical simulations of a 500-m riser and experimental tests on a 6-m riser.Moreover,the impact of contaminated noise and environmental fluctuations is examined.Results show that the proposed PCA-1D-RCAE approach can effectively detect damage and is resilient to measurement noise and environmental fluctuations.The accuracy exceeds 98%under noise-free conditions and remains above 90%even with 10 dB noise.This novel approach has the potential to establish a new standard for evaluating the health and integrity of risers during mining operations,thereby reducing the high costs and risks associated with failures.Maintenance activities can be scheduled more efficiently by enabling early and accurate detection of riser damage,minimizing downtime and avoiding catastrophic failures.展开更多
The rapid growth of biomedical data,particularly multi-omics data including genomes,transcriptomics,proteomics,metabolomics,and epigenomics,medical research and clinical decision-making confront both new opportunities...The rapid growth of biomedical data,particularly multi-omics data including genomes,transcriptomics,proteomics,metabolomics,and epigenomics,medical research and clinical decision-making confront both new opportunities and obstacles.The huge and diversified nature of these datasets cannot always be managed using traditional data analysis methods.As a consequence,deep learning has emerged as a strong tool for analysing numerous omics data due to its ability to handle complex and non-linear relationships.This paper explores the fundamental concepts of deep learning and how they are used in multi-omics medical data mining.We demonstrate how autoencoders,variational autoencoders,multimodal models,attention mechanisms,transformers,and graph neural networks enable pattern analysis and recognition across all omics data.Deep learning has been found to be effective in illness classification,biomarker identification,gene network learning,and therapeutic efficacy prediction.We also consider critical problems like as data quality,model explainability,whether findings can be repeated,and computational power requirements.We now consider future elements of combining omics with clinical and imaging data,explainable AI,federated learning,and real-time diagnostics.Overall,this study emphasises the need of collaborating across disciplines to advance deep learning-based multi-omics research for precision medicine and comprehending complicated disorders.展开更多
In today's connected world,the generation of massive streaming data across diverse domains has become commonplace.In the presence of concept drift,class imbalance,label scarcity,and new class emergence,these chall...In today's connected world,the generation of massive streaming data across diverse domains has become commonplace.In the presence of concept drift,class imbalance,label scarcity,and new class emergence,these challenges jointly degrade representation stability,bias learning toward outdated distributions,and reduce the resilience and reliability of detection in dynamic environments.This paper proposes a streaming classincremental learning(SCIL)framework to address these issues.The SCIL framework integrates an autoencoder(AE)with a multi-layer perceptron for multi-class prediction,employs a dual-loss strategy(classification and reconstruction)for prediction and new class detection,uses corrected pseudo-labels for online training,manages classes with queues,and applies oversampling to handle imbalance.The rationale behind the method's structure is elucidated through ablation studies,and a comprehensive experimental evaluation is performed using both real-world and synthetic datasets that feature class imbalance,incremental classes,and concept drifts.Our results demonstrate that SCIL outperforms strong baselines and state-of-the-art methods.In line with our commitment to Open Science,we make our code and datasets available to the community.展开更多
Reducing raw materials consumption(RMC)in electric arc furnace(EAF)steelmaking process is beneficial to the reduction in resource and energy consumption.The conventional indicator of evaluating RMC only focuses on EAF...Reducing raw materials consumption(RMC)in electric arc furnace(EAF)steelmaking process is beneficial to the reduction in resource and energy consumption.The conventional indicator of evaluating RMC only focuses on EAF inputs and outputs,neglecting the associations between smelting operations and RMC.Traditional methods of reducing RMC rely on manual experience and lack a standard operation guidance.A method based on association rules mining and metallurgical mechanism(ARM-MM)was proposed.ARM-MM proposed an improved evaluation indicator of RMC and the indicator independently showed the associations between smelting operations and RMC.On the basis,1265 heats of real EAF data were used to obtain the operation guidance for RMC reduction.According to the ratio of hot metal(HM)in charge metals,data were divided into all dataset,low HM ratio dataset,medium HM ratio dataset,and high HM ratio dataset.ARM algorithm was used in each dataset to obtain specific operation guidance.The real average RMC under all dataset,medium HM ratio dataset,and high HM ratio dataset was reduced by 279,486,and 252 kg/heat,respectively,when obtained operation guidance was applied.展开更多
39 soil samples surrounding a lead-zinc mining area in Guangxi were collected,and the contents of Pb,Hg,Cd,Cr,As,Cu,Zn,and Ni were determined to investigate the pollution characteristics and sources of heavy metals.Ar...39 soil samples surrounding a lead-zinc mining area in Guangxi were collected,and the contents of Pb,Hg,Cd,Cr,As,Cu,Zn,and Ni were determined to investigate the pollution characteristics and sources of heavy metals.ArcGIS inverse distance weight difference method was used to analyze the characteristics of pollution distribution,and single-factor pollution index,Nemerow comprehensive pollution index,ground accumulation index,and potential ecological risk index were selected to evaluate the characteristics of heavy metal pollution.Based on correlation analysis,the absolute principal component-multiple linear regression(APCS-MLR)and positive definite matrix factorization(PMF)models were used to analyze the sources of soil heavy metals.The results showed that the average concentrations of all eight heavy metals exceeded both national and Guangxi soil background values.Hg,Cd,and Zn exhibited high variation(greater than 0.5),indicating significant external disturbances,and their spatial distribution was closely related to mining activity locations.The single-factor pollution index evaluation indicated varying degrees of pollution risk for Cd,Zn,and As,with Cd and Zn being the most severe pollutants,as 69.23%and 30.77%of the samples fell into the moderate pollution or higher category.The geoaccumulation index analysis ranked the mean pollution levels of the eight elements as follows:Zn>Cd>Ni>Pb>Cu>Cr>Hg>As,with Cd and Zn showing the most severe contamination,and 51.28%of the samples exhibiting moderate or higher pollution levels.The Nemerow comprehensive pollution index evaluation showed that 74.35%of soil samples were classified as moderate to heavy pollution.The potential ecological risk index assessment indicated significant ecological risks posed by Cd and Zn,with 82.05%and 5.12%of the samples classified as causing strong to extreme ecological risks,respectively.The source apportionment analysis revealed minor differences between the two models.The APCS-MLR model identified three pollution sources and their contribution rates:anthropogenic mining sources(31.13%),parent material sources(40.38%),and unidentified sources(28.49%).The PMF model identified three pollution sources with contribution rates of anthropogenic mining sources(26.10%),parent material sources(46.96%),and a combined traffic and agricultural source(26.61%).Pb,Hg,Cd,and Zn mainly originated from mining activities;Cr,As,and Ni were primarily derived from the parent material,while Cu was predominantly attributed to traffic and agricultural sources.These findings provide a scientific basis for the prevention and control of heavy metal pollution in mining areas.展开更多
Earthquakes are highly destructive spatio-temporal phenomena whose analysis is essential for disaster preparedness and risk mitigation.Modern seismological research produces vast volumes of heterogeneous data from sei...Earthquakes are highly destructive spatio-temporal phenomena whose analysis is essential for disaster preparedness and risk mitigation.Modern seismological research produces vast volumes of heterogeneous data from seismic networks,satellite observations,and geospatial repositories,creating the need for scalable infrastructures capable of integrating and analyzing such data to support intelligent decision-making.Data warehousing technologies provide a robust foundation for this purpose;however,existing earthquake-oriented data warehouses remain limited,often relying on simplified schemas,domain-specific analytics,or cataloguing efforts.This paper presents the design and implementation of a spatio-temporal data warehouse for seismic activity.The framework integrates spatial and temporal dimensions in a unified schema and introduces a novel array-based approach for managing many-to-many relationships between facts and dimensions without intermediate bridge tables.A comparative evaluation against a conventional bridge-table schema demonstrates that the array-based design improves fact-centric query performance,while the bridge-table schema remains advantageous for dimension-centric queries.To reconcile these trade-offs,a hybrid schema is proposed that retains both representations,ensuring balanced efficiency across heterogeneous workloads.The proposed framework demonstrates how spatio-temporal data warehousing can address schema complexity,improve query performance,and support multidimensional visualization.In doing so,it provides a foundation for integrating seismic analysis into broader big data-driven intelligent decision systems for disaster resilience,risk mitigation,and emergency management.展开更多
This work contributes to the theoretical foundation for pricing in data markets and offers practical insights for managing digital data exchanges in the era of big data.We propose a structured pricing model for data e...This work contributes to the theoretical foundation for pricing in data markets and offers practical insights for managing digital data exchanges in the era of big data.We propose a structured pricing model for data exchanges transitioning from quasi-public to marketoriented operations.To address the complex dynamics among data exchanges,suppliers,and consumers,the authors develop a threestage Stackelberg game framework.In this model,the data exchange acts as a leader setting transaction commission rates,suppliers are intermediate leaders determining unit prices,and consumers are followers making purchasing decisions.Two pricing strategies are examined:the Independent Pricing Approach(IPA)and the novel Perfectly Competitive Pricing Approach(PCPA),which accounts for competition among data providers.Using backward induction,the study derives subgame-perfect equilibria and proves the existence and uniqueness of Stackelberg equilibria under both approaches.Extensive numerical simulations are carried out in the model,demonstrating that PCPA enhances data demander utility,encourages supplier competition,increases transaction volume,and improves the overall profitability and sustainability of data exchanges.Social welfare analysis further confirms PCPA’s superiority in promoting efficient and fair data markets.展开更多
Mine surveying is an indispensable and crucial basic technical work in the process of mineral resource development.It plays an important role throughout the entire life cycle of a mine,from exploration,design,construc...Mine surveying is an indispensable and crucial basic technical work in the process of mineral resource development.It plays an important role throughout the entire life cycle of a mine,from exploration,design,construction,and production to closure,and is known as the“eyes of the mine”.With the rapid development of satellite technology,computer science,artificial intelligence,robotics,and spatiotemporal big data,mine surveying science and technology supported by spatial information technology is increasingly playing the role of the“brain of the mine”.This paper systematically summarizes the characteristics of mining surveying science and technology in contemporary and future mining development.First,based on the requirements of safe,efficient,and green development in modern mining,an analysis is conducted on the innovative practices of intelligent mining methods;secondly,it explains the transformation of regional economic and mining economic integration towards lengthening the industrial chain and scientific and technological innovation.Regarding intelligent mining,this paper discusses three technical dimensions:(1)By establishing a spatiotemporal data model of the mine,real-time perception and remote intelligent control of the production system are realized;(2)Based on the transparent mine three-dimensional geological modelling technology,the accuracy of geological condition prediction and the scientific nature of mining decisions are significantly improved;(3)By integrating multi-source remote sensing data and deep learning algorithms,a high-precision coal and rock identification system is constructed.The study further revealed the innovative application value of mine surveying in the post-mining era,including:diversified utilization of underground space in mining areas(tourism development,geothermal energy storage,pumped storage,etc.),multi-platform remote sensing coordinated ecological restoration monitoring,and optimized land space planning in mining areas.Practice has proved that mine surveying technology is an important technical engine for promoting green transformation and high-quality development in resource-based regions,and has irreplaceable strategic significance for achieving coordinated development of energy,economy,and environment.展开更多
Ovarian cancer(OC)is one of the leading causes of death related to gynecological cancer,with the main difficulty of its early diagnosis and a heterogeneous nature of tumor biomarkers.Machine learning(ML)has the potent...Ovarian cancer(OC)is one of the leading causes of death related to gynecological cancer,with the main difficulty of its early diagnosis and a heterogeneous nature of tumor biomarkers.Machine learning(ML)has the potential to process complex datasets and support decision-making in OC diagnosis.Nevertheless,traditional ML models tend to be biased,overfitting,noisy,and less generalized.Moreover,their black-box nature reduces interpretability and limits their practical clinical applicability.In this study,we introduce an explainable ensemble learning(EL)model,TreeX-Stack,based on a stacking architecture that employs tree-based learners such as Decision Tree(DT),Random Forest(RF),Gradient Boosting(GB),and Extreme Gradient Boosting(XGBoost)as base learners,and Logistic Regression(LR)as the meta-learner to enhance ovarian cancer(OC)diagnosis.Local Interpretable ModelAgnostic Explanations(LIME)are used to explain individual predictions,making the model outputs more clinically interpretable and applicable.The model is trained on the dataset that includes demographic information,blood test,general chemistry,and tumor markers.Extensive preprocessing includes handling missing data using iterative imputation with Bayesian Ridge and addressing multicollinearity by removing features with correlation coefficients above 0.7.Relevant features are then selected using the Boruta feature selection method.To obtain robust and unbiased performance estimates during hyperparameter tuning,nested cross-validation(CV)with grid search is employed,and all experiments are repeated five times to ensure statistical reliability.TreeX-Stack demonstrates excellent diagnostic performance,achieving an accuracy of 0.9027,a precision of 0.8673,a recall of 0.9391,and an F1-score of 0.9012.Feature-importance analyses using LIME and permutation importance highlight Human Epididymis Protein 4(HE4)as the most significant biomarker for OC.The combination of high predictive performance and interpretability makes TreeX-Stack a reliable tool for clinical decision support in OC diagnosis.展开更多
Accurately assessing the relationship between tree growth and climatic factors is of great importance in dendrochronology.This study evaluated the consistency between alternative climate datasets(including station and...Accurately assessing the relationship between tree growth and climatic factors is of great importance in dendrochronology.This study evaluated the consistency between alternative climate datasets(including station and gridded data)and actual climate data(fixed-point observations near the sampling sites),in northeastern China’s warm temperate zone and analyzed differences in their correlations with tree-ring width index.The results were:(1)Gridded temperature data,as well as precipitation and relative humidity data from the Huailai meteorological station,was more consistent with the actual climate data;in contrast,gridded soil moisture content data showed significant discrepancies.(2)Horizontal distance had a greater impact on the representativeness of actual climate conditions than vertical elevation differences.(3)Differences in consistency between alternative and actual climate data also affected their correlations with tree-ring width indices.In some growing season months,correlation coefficients,both in magnitude and sign,differed significantly from those based on actual data.The selection of different alternative climate datasets can lead to biased results in assessing forest responses to climate change,which is detrimental to the management of forest ecosystems in harsh environments.Therefore,the scientific and rational selection of alternative climate data is essential for dendroecological and climatological research.展开更多
To address the severe challenges of PM_(2.5) and ozone co-control during the"14^(th) Five-Year Plan"period and to enhance the precision and intelligence level of air environment governance,it is imperative t...To address the severe challenges of PM_(2.5) and ozone co-control during the"14^(th) Five-Year Plan"period and to enhance the precision and intelligence level of air environment governance,it is imperative to build an efficient comprehensive management platform for regional air quality.In this paper,the specific practice in Zibo City,Shandong Province is as an example to systematically analyze the top-level design,technical implementation,and innovative application of a comprehensive management platform for regional air quality integrating"perception monitoring,data fusion,research judgment of early warnings,analysis of sources,collaborative dispatching,and evaluation assessment".Through the construction of an"sky-air-ground"integrated three-dimensional monitoring network,the platform integrates multi-source heterogeneous environmental data,and employs big data,cloud computing,artificial intelligence,CALPUFF/CMAQ,and other numerical model technologies to achieve comprehensive perception,precise prediction,intelligent source tracing,and closed-loop management of air pollution.The platform innovatively establishes a full-process closed-loop management mechanism of"data-early warning-disposition-evaluation",and achieves a fundamental transformation from passive response to active anticipation and from experience-based judgment to data driving in environmental supervision.The application results show that this platform significantly improves the scientific decision-making ability and collaborative execution efficiency of air pollution governance in Zibo City,providing a replicable and scalable comprehensive solution for similar industrial cities to achieve the continuous improvement of air quality.展开更多
Stability of base-exposed backfill roof in underhand drift-and-fill mining is crucial for the safety of those working beneath.Given the commonly used primary-and-secondary mining sequence,interfaces are formed between...Stability of base-exposed backfill roof in underhand drift-and-fill mining is crucial for the safety of those working beneath.Given the commonly used primary-and-secondary mining sequence,interfaces are formed between adjacent filled drifts,which can weaken the integrity of the backfill roof.These interfaces also lead to two common drift layouts:aligned drifts and staggered drifts.However,less attention has been paid to the interfaces and the two drift layouts were not adequately distinguished in previous studies.In this paper,the interfaces between filled drifts were firstly considered to investigate the stability of backfill roof.Failure modes and strength requirements of backfill roof in aligned and staggered drifts are comprehensively investigated by FLAC3D,with a focus on considerations of varied shear parameters of the interfaces.Results show that failure modes in aligned drifts transition from block sliding to top caving,bottom caving or sloughing as the interface cohesion increases from zero to at least half of the backfill cohesion.Further increases in interface cohesion allow aligned drifts to behave as if there are no interfaces between them.The critical stability conditions of backfill roof in aligned drifts were mostly determined by the interface strength instead of the backfill strength.However,the stability of backfill roof in staggered drifts is barely affected by the interface strength.The outcomes are expected to provide references for mining engineers to optimize drift layouts and perform cost-effective backfill roof strength design at mines using underhand drift-and-fill mining method.展开更多
tRNA-derived small RNAs(tsRNAs),as a class of regulatory small noncoding RNA,have been implicated in a wide variety of human diseases.Large amounts of tsRNA–disease associations have been identified in recent years f...tRNA-derived small RNAs(tsRNAs),as a class of regulatory small noncoding RNA,have been implicated in a wide variety of human diseases.Large amounts of tsRNA–disease associations have been identified in recent years from accumulating studies.However,repositories for cataloging the detailed information on tsRNA–disease associations are scarce.In this study,we provide a tsRNADisease database by integrating experimentally and computationally supported tsRNA–disease associations from manual curation of literatures and other related resources.tsRNADisease contains 5571 manually curated associations between 4759 tsRNAs and 166 diseases with experimental evidence from 346 studies.In addition,it also contains 5013 predicted associations between 1297 tsRNAs and 111 diseases.tsRNADisease provides a user-friendly interface to browse,retrieve,and download data conveniently.This database can improve our understanding of tsRNA deregulation in diseases and serve as a valuable resource for investigating the mechanism of disease-related tsRNAs.tsRNADisease is freely available at http://www.compgenelab.info/tsRNADisease.展开更多
基金Henan Province Special Research Project of Tra ditional Chinese Medicine(Grant No.2022ZY1090).
文摘In the present study,data mining and network pharmacology were utilized to explore the principles and mechanisms of traditional Chinese medicine(TCM)in treating acute appendicitis.The goal was to provide a scientific basis for clinical treatment and further research on this disease.First,we searched the National Patent Database for Chinese herbal compound prescriptions used to treat acute appendicitis.We then applied frequency analysis,character and taste meridian analysis,association rule analysis,and hierarchical cluster analysis to identify the patterns of TCM treatment for acute appendicitis,selecting key combinations of Chinese medicines.Next,we screened the main active components of these key TCM based on quality markers.Using databases such as SwissTargetPrediction,SymMap,ETCM,and STRING,we analyzed the pharmacological mechanisms of these key TCM in treating acute appendicitis.Key active components and targets were further verified through molecular docking.We identified a total of 129 patents involving 316 Chinese medicines,with 24 being frequently used.The results indicated that most Chinese herbs used for acute appendicitis were heat-clearing drugs,blood-activating and stasis-removing drugs,and purging drugs.The primary active ingredients of the Rhubarb-cortex moutan-flos lonicerae combination for treating acute appendicitis included Emodin,Paeonol,Physcion,Chlorogenic acid,Chrysophanol,Rhein acid,and Aloe-emodin.These ingredients targeted key proteins such as ALB,TP53,BCL2,STAT3,IL-6,and TNF,and were involved in cellular responses to lipopolysaccharides,cell composition,and various cytokine-mediated biological processes.They also interacted with signaling pathways like AGE-RAGE,TNF,IL-17,and FoxO.Based on patent data,this study analyzed medication patterns in the treatment of acute appendicitis,discussed the possible mechanisms of key TCM combinations,and provided a scientific basis and new perspectives for the diagnosis and treatment of the disease.
文摘Objective To identify core acupoint patterns and elucidate the molecular mechanisms of acupuncture for primary depressive disorder(PDD)through data mining and network analysis.Methods A comprehensive literature search was conducted across PubMed,Embase,Ovid Technologies(OVID),Web of Science,Cochrane Library,China National Knowledge Infrastructure(CNKI),China National Knowledge Infrastructure Database(VIP),Wanfang Data,and SinoMed Database from database foundation to January 31,2025,for clinical studies on acupuncture treatment of PDD.Descriptive statistics,high-frequency acupoint analysis,degree and betweenness centrality evaluation,and core acupoint prescription mining identified predominant therapeutic combinations for PDD.Network acupuncture was used to predict therapeutic target for the core acupoint prescription.Subsequent protein-protein interaction(PPI)network and molecular complex detection(MCODE)analyses were conducted to identify the key targets and functional modules.Gene Ontology(GO)and Kyoto Encyclopedia of Genes and Genomes(KEGG)analyses explored the underlying biological mechanisms of the core acupoint prescription in treating PDD.Results A total of 57 acupoint prescriptions underwent systematic analysis.The core therapeutic combinations comprised Baihui(GV20),Yintang(GV29),Neiguan(PC6),Hegu(LI4),and Shenmen(HT7).Network acupuncture analysis identified 88 potential therapeutic targets(79 overlapping with PDD),while PPI network analysis revealed central regulatory nodes,including interleukin(IL)-6,IL-1β,tumor necrosis factor(TNF)-α,toll-like receptor 4(TLR4),IL-10,brain-derived neurotrophic factor(BDNF),transforming growth factor(TGF)-β1,C-XC motif chemokine ligand 10(CXCL10),mitogen-activated protein kinase 3(MAPK3),and nitric oxide synthase 1(NOS1).MCODE-based modular analysis further elucidated three functionally coherent clusters:inflammation-homeostasis(score=6.571),plasticity-neurotransmission(score=3.143),and oxidative stress(score=3.000).GO and KEGG analyses demonstrated significant enrichment of the MAPK,phosphoinositide 3-kinase/protein kinase B(PI3K/Akt),and hypoxia-inducible factor(HIF)-1 signaling pathways.These mechanistic insights suggested that the antidepressant effects mediated through mechanisms of neuroinflammatory regulation,neuroplasticity restoration,and immune-oxidative stress homeostasis.Conclusion This study reveals that acupuncture alleviates depression through a multi-level mechanism,primarily involving the neuroinflammation suppression,neuroplasticity enhancement,and oxidative stress regulation.These findings systematically clarify the underlying mechanisms of acupuncture’s antidepressant effects and identify novel therapeutic targets for further mechanistic research.
文摘Introduction Neurosurgical emergencies such as spontaneous intracerebral hemorrhage(ICH),traumatic brain injury(TBI),and acute brain herniation are among the most time-sensitive and high-stakes conditions in modern medicine.Clinical decisions often must be made within minutes,yet these decisions are traditionally guided by limited information,heuristic reasoning,and past experience.In this context,the rise of medical data mining and real-time analytics offers a transformative opportunity:to extract actionable intelligence from the flood of clinical,imaging,and physiological data already being collected,and to use this intelligence to guide care in real time[1–3](Figure 1).
文摘With the gradual acceleration of information construction in colleges and universities,digital campus and smart campus have gradually become important means for colleges and universities to scientifically manage the campus.They have been applied to teaching,scientific research,student management,and other fields,improving the quality and efficiency of management.This paper mainly studies the intelligent educational administration management system based on data mining technology.Firstly,this paper introduces the application process of data mining technology,and builds an intelligent educational administration management system based on data mining technology.Then,this paper optimizes the application of the Apriori algorithm in educational administration management through transaction compression and frequent sampling.Compared with the traditional Apriori algorithm,the optimized Apriori algorithm in this paper has a shorter execution time under the same minimum support.
基金Supported by Shanghai College Students Innovation and Entrepreneurship Training Program Project:202310268066The 16th Batch of Science And Technology Innovation Projects of Shanghai University of Traditional Chinese Medicine:SHUTCM2023010+1 种基金2024 Shanghai Oriental Talent Program Youth Project2021 High-level Local University Innovation Team Project of Shanghai University of Traditional Chinese Medicine:No.3 Shanghai Education Commission Personnel [2022]。
文摘Objective To explore the optimization and principles of acupoint selection and coordination in the treatment of adult abdominal obesity using acupuncture and moxibustion over the past decade using data mining.Methods Clinical studies of abdominal obesity treated with acupuncture and moxibustion,collected in the past 10 years,were searched from China Biology Medicine disc(CBMdisc),China National knowledge infrastructure(CNKI),Wanfang,China Science and Technology Journal Database(VIP),Pubmed,Embase,Google Scholar,Web of Science,(The Cumulative Index to Nursing and Allied Health Literature)CINAHL,Psyclnfo and Scopus,dated from March 1,2013 to March 31,2023.Using IBM SPSS Modeler 18.0 and other software,the frequency analysis,association-rules analysis and cluster analysis were conducted on interventions,traditional Chinese medicine(TCM)patterns,use frequency of acupoint,meridian attribution of acupoint,acupoint location,etc.Results A total of 55 articles were included,with 102 prescriptions and 71 acupoints involved.The top 3 interventions were acupoint embedding method,simple electroacupuncture and simple filiform needling.Seventeen patterns/syndromes of TCM differentiation were collected,dominated by spleen deficiency and damp blockage,spleen and kidney yang deficiency and heat accumulation in stomach and intestines.The acupoints in clinical practice were mostly at the foot-yangming stomach meridian,the conception vessel and the foot-taiyin spleen meridian,and located at the abdominal region.The top 5 acupoints of high frequency were Tianshu(ST25),Zhongwan(CV12),Daheng(SP15),Zusanli(ST36),Huaroumen(ST24)and Daimai(GB26).The specific points of the high frequency were the crossing points and front-mu points,of which,ST25 and CV12 were the most prominent.After association-rules analysis on the high-frequency acupoints,20 groups of associated acupoints were obtained,in which,the core acupoints included ST25,CV12,SP15 and ST36.Conclusion In recent 10 years,abdominal obesity is treated by the acupoints of foot-yangming stomach meridian,the conception vessel and the foot-taiyin spleen meridian.Compared with the regimen for simple obesity,the acupoints at the abdominal region are specially selected in treatment of abdominal obesity,such as ST25,CV12,SP15 and ST36.Supplementary acupoints are selected based on syndrome differentiation to simultaneously address both the disease manifestations and root causes.
基金Song Hujie’s Inheritance Studio of National Renowned Traditional Chinese Medicine Experts.
文摘Objective:To explore the core acupuncture acupoints and pattern-adapted acupoint combination rules for autism spectrum disorder(ASD)complicated with sleep disorder using clinical data mining technology.Methods:A retrospective analysis was conducted on the diagnosis and treatment data of 104 children with ASD complicated with sleep disorder admitted to Xi’an Traditional Chinese Medicine(TCM)Encephalopathy Hospital from January 2022 to December 2024.Cross-pattern main acupoints were screened via frequency statistics,chi-square test,and factor analysis;pattern-specific auxiliary acupoints were extracted by combining multiple correspondence analysis,cluster analysis,and association rule mining.Results:Ten cross-pattern main acupoints(Baihui,Sishenzhen,Language Area 1,Language Area 2,Neiguan,Shenmen,Yongquan,Xuanzhong)were identified,and acupoint combination schemes for four major TCM patterns(Hyperactivity of Liver and Heart Fire,Deficiency of Kidney Essence,Deficiency of Both Heart and Spleen,Hyperactivity of Liver with Spleen Deficiency)were established.Conclusion:Acupuncture treatment should follow the principle of“regulating spirit and calming the brain as the root,and dredging collaterals based on pattern differentiation as the branch”.The synergy between main and auxiliary acupoints can accurately regulate the disease,providing a basis for precise clinical treatment.
文摘Population growth leads to increased utilization of water resources.One of these resources is groundwater,which has steadily declined each year.The depletion of these resources brings about various environmental challenges.The present study aimed to explore the relationship between groundwater fluctuations and land subsidence in the Malayer Plain,Iran,focusing on quantifying subsidence resulting from groundwater extraction.Using Sentinel-1 satellite data(2014–2019)and monthly piezometric measurements(1996–2018),the analysis revealed an average deformation velocity of–6.3 cm yr–1,with accumulated subsidence of–32 cm over the 2014–2019 period.The maximum subsidence rate reached 10.3 cm yr–1 in areas of intensive agricultural activity.A wavelet-PCA spatiotemporal analysis of groundwater fluctuations identified critical multi-scale patterns strongly correlated with subsidence trends.Regression analysis between subsidence rates and groundwater fluctuations at various wavelet decomposition levels explained 75%of the variance(R2=0.75),indicating that intermediate-scale groundwater declines were the primary drivers of subsidence.Furthermore,land use analysis using Landsat data(1999–2021)revealed a 6230-ha increase in irrigated farmland,contributing to heightened groundwater extraction and subsidence rates.These findings highlight the critical need for sustainable groundwater management to mitigate the risks of continued subsidence in the region.
基金the National Key Research and Development Program of China(No.2023 YFC2811600)the National Natural Science Foundation of China(Nos.52301349,52088102)+1 种基金the Major Science and Technology Innovation Program of Qingdao(No.223-3-hygg-10-hy)the Qingdao Science Foundation for Post-doctoral Scientists(Nos.QDBSH20220202070,QDBSH20220201015)。
文摘A deep-sea riser is a crucial component of the mining system used to lift seafloor mineral resources to the vessel.Even minor damage to the riser can lead to substantial financial losses,environmental impacts,and safety hazards.However,identifying modal parameters for structural health monitoring remains a major challenge due to its large deformations and flexibility.Vibration signal-based methods are essential for detecting damage and enabling timely maintenance to minimize losses.However,accurately extracting features from one-dimensional(1D)signals is often hindered by various environmental factors and measurement noises.To address this challenge,a novel approach based on a residual convolutional auto-encoder(RCAE)is proposed for detecting damage in deep-sea mining risers,incorporating a data fusion strategy.First,principal component analysis(PCA)is applied to reduce environmental fluctuations and fuse multisensor strain readings.Subsequently,a 1D-RCAE is used to extract damage-sensitive features(DSFs)from the fused dataset.A Mahalanobis distance indicator is established to compare the DSFs of the testing and healthy risers.The specific threshold for these distances is determined using the 3σcriterion,which is employed to assess whether damage has occurred in the testing riser.The effectiveness and robustness of the proposed approach are verified through numerical simulations of a 500-m riser and experimental tests on a 6-m riser.Moreover,the impact of contaminated noise and environmental fluctuations is examined.Results show that the proposed PCA-1D-RCAE approach can effectively detect damage and is resilient to measurement noise and environmental fluctuations.The accuracy exceeds 98%under noise-free conditions and remains above 90%even with 10 dB noise.This novel approach has the potential to establish a new standard for evaluating the health and integrity of risers during mining operations,thereby reducing the high costs and risks associated with failures.Maintenance activities can be scheduled more efficiently by enabling early and accurate detection of riser damage,minimizing downtime and avoiding catastrophic failures.
文摘The rapid growth of biomedical data,particularly multi-omics data including genomes,transcriptomics,proteomics,metabolomics,and epigenomics,medical research and clinical decision-making confront both new opportunities and obstacles.The huge and diversified nature of these datasets cannot always be managed using traditional data analysis methods.As a consequence,deep learning has emerged as a strong tool for analysing numerous omics data due to its ability to handle complex and non-linear relationships.This paper explores the fundamental concepts of deep learning and how they are used in multi-omics medical data mining.We demonstrate how autoencoders,variational autoencoders,multimodal models,attention mechanisms,transformers,and graph neural networks enable pattern analysis and recognition across all omics data.Deep learning has been found to be effective in illness classification,biomarker identification,gene network learning,and therapeutic efficacy prediction.We also consider critical problems like as data quality,model explainability,whether findings can be repeated,and computational power requirements.We now consider future elements of combining omics with clinical and imaging data,explainable AI,federated learning,and real-time diagnostics.Overall,this study emphasises the need of collaborating across disciplines to advance deep learning-based multi-omics research for precision medicine and comprehending complicated disorders.
基金supported by the European Research Council(ERC)under Grant Agreement No.951424(Water-Futures)by the Republic of Cyprus through the Deputy Ministry of Research,Innovation and Digital Policy.
文摘In today's connected world,the generation of massive streaming data across diverse domains has become commonplace.In the presence of concept drift,class imbalance,label scarcity,and new class emergence,these challenges jointly degrade representation stability,bias learning toward outdated distributions,and reduce the resilience and reliability of detection in dynamic environments.This paper proposes a streaming classincremental learning(SCIL)framework to address these issues.The SCIL framework integrates an autoencoder(AE)with a multi-layer perceptron for multi-class prediction,employs a dual-loss strategy(classification and reconstruction)for prediction and new class detection,uses corrected pseudo-labels for online training,manages classes with queues,and applies oversampling to handle imbalance.The rationale behind the method's structure is elucidated through ablation studies,and a comprehensive experimental evaluation is performed using both real-world and synthetic datasets that feature class imbalance,incremental classes,and concept drifts.Our results demonstrate that SCIL outperforms strong baselines and state-of-the-art methods.In line with our commitment to Open Science,we make our code and datasets available to the community.
基金supported by National Natural Science Foundation of China(Nos.52174328 and 52474368)Fundamental Research Funds for Central Universities of Central South University(Nos.2022ZZTS0084 and 2024ZZTS0062).
文摘Reducing raw materials consumption(RMC)in electric arc furnace(EAF)steelmaking process is beneficial to the reduction in resource and energy consumption.The conventional indicator of evaluating RMC only focuses on EAF inputs and outputs,neglecting the associations between smelting operations and RMC.Traditional methods of reducing RMC rely on manual experience and lack a standard operation guidance.A method based on association rules mining and metallurgical mechanism(ARM-MM)was proposed.ARM-MM proposed an improved evaluation indicator of RMC and the indicator independently showed the associations between smelting operations and RMC.On the basis,1265 heats of real EAF data were used to obtain the operation guidance for RMC reduction.According to the ratio of hot metal(HM)in charge metals,data were divided into all dataset,low HM ratio dataset,medium HM ratio dataset,and high HM ratio dataset.ARM algorithm was used in each dataset to obtain specific operation guidance.The real average RMC under all dataset,medium HM ratio dataset,and high HM ratio dataset was reduced by 279,486,and 252 kg/heat,respectively,when obtained operation guidance was applied.
文摘39 soil samples surrounding a lead-zinc mining area in Guangxi were collected,and the contents of Pb,Hg,Cd,Cr,As,Cu,Zn,and Ni were determined to investigate the pollution characteristics and sources of heavy metals.ArcGIS inverse distance weight difference method was used to analyze the characteristics of pollution distribution,and single-factor pollution index,Nemerow comprehensive pollution index,ground accumulation index,and potential ecological risk index were selected to evaluate the characteristics of heavy metal pollution.Based on correlation analysis,the absolute principal component-multiple linear regression(APCS-MLR)and positive definite matrix factorization(PMF)models were used to analyze the sources of soil heavy metals.The results showed that the average concentrations of all eight heavy metals exceeded both national and Guangxi soil background values.Hg,Cd,and Zn exhibited high variation(greater than 0.5),indicating significant external disturbances,and their spatial distribution was closely related to mining activity locations.The single-factor pollution index evaluation indicated varying degrees of pollution risk for Cd,Zn,and As,with Cd and Zn being the most severe pollutants,as 69.23%and 30.77%of the samples fell into the moderate pollution or higher category.The geoaccumulation index analysis ranked the mean pollution levels of the eight elements as follows:Zn>Cd>Ni>Pb>Cu>Cr>Hg>As,with Cd and Zn showing the most severe contamination,and 51.28%of the samples exhibiting moderate or higher pollution levels.The Nemerow comprehensive pollution index evaluation showed that 74.35%of soil samples were classified as moderate to heavy pollution.The potential ecological risk index assessment indicated significant ecological risks posed by Cd and Zn,with 82.05%and 5.12%of the samples classified as causing strong to extreme ecological risks,respectively.The source apportionment analysis revealed minor differences between the two models.The APCS-MLR model identified three pollution sources and their contribution rates:anthropogenic mining sources(31.13%),parent material sources(40.38%),and unidentified sources(28.49%).The PMF model identified three pollution sources with contribution rates of anthropogenic mining sources(26.10%),parent material sources(46.96%),and a combined traffic and agricultural source(26.61%).Pb,Hg,Cd,and Zn mainly originated from mining activities;Cr,As,and Ni were primarily derived from the parent material,while Cu was predominantly attributed to traffic and agricultural sources.These findings provide a scientific basis for the prevention and control of heavy metal pollution in mining areas.
文摘Earthquakes are highly destructive spatio-temporal phenomena whose analysis is essential for disaster preparedness and risk mitigation.Modern seismological research produces vast volumes of heterogeneous data from seismic networks,satellite observations,and geospatial repositories,creating the need for scalable infrastructures capable of integrating and analyzing such data to support intelligent decision-making.Data warehousing technologies provide a robust foundation for this purpose;however,existing earthquake-oriented data warehouses remain limited,often relying on simplified schemas,domain-specific analytics,or cataloguing efforts.This paper presents the design and implementation of a spatio-temporal data warehouse for seismic activity.The framework integrates spatial and temporal dimensions in a unified schema and introduces a novel array-based approach for managing many-to-many relationships between facts and dimensions without intermediate bridge tables.A comparative evaluation against a conventional bridge-table schema demonstrates that the array-based design improves fact-centric query performance,while the bridge-table schema remains advantageous for dimension-centric queries.To reconcile these trade-offs,a hybrid schema is proposed that retains both representations,ensuring balanced efficiency across heterogeneous workloads.The proposed framework demonstrates how spatio-temporal data warehousing can address schema complexity,improve query performance,and support multidimensional visualization.In doing so,it provides a foundation for integrating seismic analysis into broader big data-driven intelligent decision systems for disaster resilience,risk mitigation,and emergency management.
基金supported by the National Natural Science Foundation of China[grant numbers 12171158,12371474 and 12571510]Fundamental Research Funds for the Central Universities[grant number 2025ECNU-WLJC006].
文摘This work contributes to the theoretical foundation for pricing in data markets and offers practical insights for managing digital data exchanges in the era of big data.We propose a structured pricing model for data exchanges transitioning from quasi-public to marketoriented operations.To address the complex dynamics among data exchanges,suppliers,and consumers,the authors develop a threestage Stackelberg game framework.In this model,the data exchange acts as a leader setting transaction commission rates,suppliers are intermediate leaders determining unit prices,and consumers are followers making purchasing decisions.Two pricing strategies are examined:the Independent Pricing Approach(IPA)and the novel Perfectly Competitive Pricing Approach(PCPA),which accounts for competition among data providers.Using backward induction,the study derives subgame-perfect equilibria and proves the existence and uniqueness of Stackelberg equilibria under both approaches.Extensive numerical simulations are carried out in the model,demonstrating that PCPA enhances data demander utility,encourages supplier competition,increases transaction volume,and improves the overall profitability and sustainability of data exchanges.Social welfare analysis further confirms PCPA’s superiority in promoting efficient and fair data markets.
基金supported by the National Natural Science Foundation of China(Nos.52394193 and U22A20569)the National Key R&D Program Projects(Nos.2023YFC3804200 and 2023YFC3804205).
文摘Mine surveying is an indispensable and crucial basic technical work in the process of mineral resource development.It plays an important role throughout the entire life cycle of a mine,from exploration,design,construction,and production to closure,and is known as the“eyes of the mine”.With the rapid development of satellite technology,computer science,artificial intelligence,robotics,and spatiotemporal big data,mine surveying science and technology supported by spatial information technology is increasingly playing the role of the“brain of the mine”.This paper systematically summarizes the characteristics of mining surveying science and technology in contemporary and future mining development.First,based on the requirements of safe,efficient,and green development in modern mining,an analysis is conducted on the innovative practices of intelligent mining methods;secondly,it explains the transformation of regional economic and mining economic integration towards lengthening the industrial chain and scientific and technological innovation.Regarding intelligent mining,this paper discusses three technical dimensions:(1)By establishing a spatiotemporal data model of the mine,real-time perception and remote intelligent control of the production system are realized;(2)Based on the transparent mine three-dimensional geological modelling technology,the accuracy of geological condition prediction and the scientific nature of mining decisions are significantly improved;(3)By integrating multi-source remote sensing data and deep learning algorithms,a high-precision coal and rock identification system is constructed.The study further revealed the innovative application value of mine surveying in the post-mining era,including:diversified utilization of underground space in mining areas(tourism development,geothermal energy storage,pumped storage,etc.),multi-platform remote sensing coordinated ecological restoration monitoring,and optimized land space planning in mining areas.Practice has proved that mine surveying technology is an important technical engine for promoting green transformation and high-quality development in resource-based regions,and has irreplaceable strategic significance for achieving coordinated development of energy,economy,and environment.
基金supported and funded by the Deanship of Scientific Research at Imam Mohammad Ibn Saud Islamic University(IMSIU)under the grant number IMSIU-DDRSP2601.
文摘Ovarian cancer(OC)is one of the leading causes of death related to gynecological cancer,with the main difficulty of its early diagnosis and a heterogeneous nature of tumor biomarkers.Machine learning(ML)has the potential to process complex datasets and support decision-making in OC diagnosis.Nevertheless,traditional ML models tend to be biased,overfitting,noisy,and less generalized.Moreover,their black-box nature reduces interpretability and limits their practical clinical applicability.In this study,we introduce an explainable ensemble learning(EL)model,TreeX-Stack,based on a stacking architecture that employs tree-based learners such as Decision Tree(DT),Random Forest(RF),Gradient Boosting(GB),and Extreme Gradient Boosting(XGBoost)as base learners,and Logistic Regression(LR)as the meta-learner to enhance ovarian cancer(OC)diagnosis.Local Interpretable ModelAgnostic Explanations(LIME)are used to explain individual predictions,making the model outputs more clinically interpretable and applicable.The model is trained on the dataset that includes demographic information,blood test,general chemistry,and tumor markers.Extensive preprocessing includes handling missing data using iterative imputation with Bayesian Ridge and addressing multicollinearity by removing features with correlation coefficients above 0.7.Relevant features are then selected using the Boruta feature selection method.To obtain robust and unbiased performance estimates during hyperparameter tuning,nested cross-validation(CV)with grid search is employed,and all experiments are repeated five times to ensure statistical reliability.TreeX-Stack demonstrates excellent diagnostic performance,achieving an accuracy of 0.9027,a precision of 0.8673,a recall of 0.9391,and an F1-score of 0.9012.Feature-importance analyses using LIME and permutation importance highlight Human Epididymis Protein 4(HE4)as the most significant biomarker for OC.The combination of high predictive performance and interpretability makes TreeX-Stack a reliable tool for clinical decision support in OC diagnosis.
基金supported by the International Partnership program of the Chinese Academy of Sciences(170GJHZ2023074GC)National Natural Science Foundation of China(42425706 and 42488201)+1 种基金National Key Research and Development Program of China(2024YFF0807902)Beijing Natural Science Foundation(8242041),and China Postdoctoral Science Foundation(2025M770353).
文摘Accurately assessing the relationship between tree growth and climatic factors is of great importance in dendrochronology.This study evaluated the consistency between alternative climate datasets(including station and gridded data)and actual climate data(fixed-point observations near the sampling sites),in northeastern China’s warm temperate zone and analyzed differences in their correlations with tree-ring width index.The results were:(1)Gridded temperature data,as well as precipitation and relative humidity data from the Huailai meteorological station,was more consistent with the actual climate data;in contrast,gridded soil moisture content data showed significant discrepancies.(2)Horizontal distance had a greater impact on the representativeness of actual climate conditions than vertical elevation differences.(3)Differences in consistency between alternative and actual climate data also affected their correlations with tree-ring width indices.In some growing season months,correlation coefficients,both in magnitude and sign,differed significantly from those based on actual data.The selection of different alternative climate datasets can lead to biased results in assessing forest responses to climate change,which is detrimental to the management of forest ecosystems in harsh environments.Therefore,the scientific and rational selection of alternative climate data is essential for dendroecological and climatological research.
文摘To address the severe challenges of PM_(2.5) and ozone co-control during the"14^(th) Five-Year Plan"period and to enhance the precision and intelligence level of air environment governance,it is imperative to build an efficient comprehensive management platform for regional air quality.In this paper,the specific practice in Zibo City,Shandong Province is as an example to systematically analyze the top-level design,technical implementation,and innovative application of a comprehensive management platform for regional air quality integrating"perception monitoring,data fusion,research judgment of early warnings,analysis of sources,collaborative dispatching,and evaluation assessment".Through the construction of an"sky-air-ground"integrated three-dimensional monitoring network,the platform integrates multi-source heterogeneous environmental data,and employs big data,cloud computing,artificial intelligence,CALPUFF/CMAQ,and other numerical model technologies to achieve comprehensive perception,precise prediction,intelligent source tracing,and closed-loop management of air pollution.The platform innovatively establishes a full-process closed-loop management mechanism of"data-early warning-disposition-evaluation",and achieves a fundamental transformation from passive response to active anticipation and from experience-based judgment to data driving in environmental supervision.The application results show that this platform significantly improves the scientific decision-making ability and collaborative execution efficiency of air pollution governance in Zibo City,providing a replicable and scalable comprehensive solution for similar industrial cities to achieve the continuous improvement of air quality.
基金supported by Deep Earth Probe and Mineral Resources Exploration-National Science and Technology Major Project(Grant No.2024ZD1003705)the Beijing Nova Program(Grant No.20220484057)support from China Scholarship Council under Grant CSC No.202110300001.
文摘Stability of base-exposed backfill roof in underhand drift-and-fill mining is crucial for the safety of those working beneath.Given the commonly used primary-and-secondary mining sequence,interfaces are formed between adjacent filled drifts,which can weaken the integrity of the backfill roof.These interfaces also lead to two common drift layouts:aligned drifts and staggered drifts.However,less attention has been paid to the interfaces and the two drift layouts were not adequately distinguished in previous studies.In this paper,the interfaces between filled drifts were firstly considered to investigate the stability of backfill roof.Failure modes and strength requirements of backfill roof in aligned and staggered drifts are comprehensively investigated by FLAC3D,with a focus on considerations of varied shear parameters of the interfaces.Results show that failure modes in aligned drifts transition from block sliding to top caving,bottom caving or sloughing as the interface cohesion increases from zero to at least half of the backfill cohesion.Further increases in interface cohesion allow aligned drifts to behave as if there are no interfaces between them.The critical stability conditions of backfill roof in aligned drifts were mostly determined by the interface strength instead of the backfill strength.However,the stability of backfill roof in staggered drifts is barely affected by the interface strength.The outcomes are expected to provide references for mining engineers to optimize drift layouts and perform cost-effective backfill roof strength design at mines using underhand drift-and-fill mining method.
基金supported by the National Natural Science Foundation of China(91959106)the Foundation of the Shanghai Municipal Education Commission(24RGZNC02)+4 种基金Shanghai Key Laboratory of Intelligent Information Processing,Fudan University(IIPL-2025-RD3-02)Key University Science Research Project of Anhui Province(2023AH030108)Climbing Peak Training Program for Innovative Technology team of Yijishan Hospital,Wannan Medical College(PF201904)Peak Training Program for Scientific Research of Yijishan Hospital,Wannan Medical College(GF2019G15)the talent project of the First Affiliated Hospital of Wannan Medical College(Yijishan Hospital of Wannan Medical College)(YR202422).
文摘tRNA-derived small RNAs(tsRNAs),as a class of regulatory small noncoding RNA,have been implicated in a wide variety of human diseases.Large amounts of tsRNA–disease associations have been identified in recent years from accumulating studies.However,repositories for cataloging the detailed information on tsRNA–disease associations are scarce.In this study,we provide a tsRNADisease database by integrating experimentally and computationally supported tsRNA–disease associations from manual curation of literatures and other related resources.tsRNADisease contains 5571 manually curated associations between 4759 tsRNAs and 166 diseases with experimental evidence from 346 studies.In addition,it also contains 5013 predicted associations between 1297 tsRNAs and 111 diseases.tsRNADisease provides a user-friendly interface to browse,retrieve,and download data conveniently.This database can improve our understanding of tsRNA deregulation in diseases and serve as a valuable resource for investigating the mechanism of disease-related tsRNAs.tsRNADisease is freely available at http://www.compgenelab.info/tsRNADisease.