A distinctive feature of scholarly communities today is exploring topics and concepts in interdisciplinary and international contexts. This observation is increasingly apparent and visible in advancing our thinking an...A distinctive feature of scholarly communities today is exploring topics and concepts in interdisciplinary and international contexts. This observation is increasingly apparent and visible in advancing our thinking and policies related to human/environmental worlds at local, regional, and global scales. Maps are an important part of these innovative and ongoing research approaches. In this context, we consider urban forests a topic meriting more attention of scholars studying the geographic and environmental intersections of the natural sciences with the social sciences and humanities. We construct two innovative knowledge bases, one a conceptual framework based on major themes and concepts related to mapping urban forests using key words of the first 100 results of a Google Scholar query and a second using the number of Google Scholar hyperlinks about mapping urban forests in 244 capital cities. We discovered that the constructed world maps reveal vast global unevenness in our knowledge about urban forests in hyperlink numbers and ratios, results that merit further attention by disciplinary, international and interdisciplinary scholarly communities.展开更多
Objectives:Electronic health records(EHRs)offer valuable real-world data(RWD)for Chinese medicine research.However,significant methodological challenges remain in developing integrative Chinese-Western medicine(ICWM)d...Objectives:Electronic health records(EHRs)offer valuable real-world data(RWD)for Chinese medicine research.However,significant methodological challenges remain in developing integrative Chinese-Western medicine(ICWM)databases.This study aims to establish a best-practice methodological framework,referred to as BRIDGE,to guide the construction of ICWM databases using EHRs.Methods:We developed the methodological framework through a comprehensive process,including systematic literature review,synthesis of empirical experiences,thematic expert discussions,and consultation with an external panel to reach consensus.Results:The BRIDGE framework outlines 6 core components for ICWM-EHR database development:Overall design,database architecture,data extraction and linkage,data governance,data verification,and data quality evaluation.Key data elements include variables related to study population,treatment or exposure,outcomes,and confounders.These databases support various research applications,particularly in evaluating the effectiveness and safety of integrative therapies.To demonstrate its practical value,we developed an ICWM-EHR database on women’s reproductive lifespan,encompassing 2,064,482 patients.This database captures women’s health conditions across the life course,from reproductive age to older adulthood.Conclusions:The BRIDGE methodological framework provides a standardized approach to building high-quality ICWM-EHR databases.It offers a unique opportunity to strengthen the methodological rigor and real-world relevance of Chinese medicine research in integrated healthcare settings.展开更多
The journal of Meteorological and Environmental Research[ISSN:2152-3940]has been included and stored by the following famous databases:CA,CABI,CSA,EBSCO,UPD,AGRIS,EA,Chinese Science and Technology Periodical Database,...The journal of Meteorological and Environmental Research[ISSN:2152-3940]has been included and stored by the following famous databases:CA,CABI,CSA,EBSCO,UPD,AGRIS,EA,Chinese Science and Technology Periodical Database,and CNKI,as well as Library of Congress,United States.展开更多
The journal of Meteorological and Environmental Research[ISSN:2152-3940]has been included and stored by the following famous databases:CA,CABI,CSA,EBSCO,UPD,AGRIS,EA,Chinese Science and Technology Periodical Database,...The journal of Meteorological and Environmental Research[ISSN:2152-3940]has been included and stored by the following famous databases:CA,CABI,CSA,EBSCO,UPD,AGRIS,EA,Chinese Science and Technology Periodical Database,and CNKI,as well as Library of Congress,United States.展开更多
AI-driven materials databases are transforming research by integrating experimental and computational data to enhance discovery and optimization.Platforms such as Digital Catalysis Platform(DigCat)and Dynamic Database...AI-driven materials databases are transforming research by integrating experimental and computational data to enhance discovery and optimization.Platforms such as Digital Catalysis Platform(DigCat)and Dynamic Database of Solid-State Electrolyte(DDSE)demonstrate how machine learning and predictive modeling can improve catalyst and solid-state electrolyte development.These databases facilitate data standardization,high-throughput screening,and cross-disciplinary collaboration,addressing key challenges in materials informatics.As AI techniques advance,materials databases are expected to play an increasingly vital role in accelerating research and innovation.展开更多
The authors regret that the original publication of this paper did not include Jawad Fayaz as a co-author.After further discussions and a thorough review of the research contributions,it was agreed that his significan...The authors regret that the original publication of this paper did not include Jawad Fayaz as a co-author.After further discussions and a thorough review of the research contributions,it was agreed that his significant contributions to the foundational aspects of the research warranted recognition,and he has now been added as a co-author.展开更多
The future usage of heterogeneous databases will consist of the WWW and CORBA environments. The integration of the WWW databases and CORBA standards are discussed. These two techniques need to merge together to make d...The future usage of heterogeneous databases will consist of the WWW and CORBA environments. The integration of the WWW databases and CORBA standards are discussed. These two techniques need to merge together to make distributed usage of heterogeneous databases user friendly. In an environment integrating WWW databases and CORBA technologies, CORBA can be used to access heterogeneous data sources in the internet. This kind of applications can achieve distributed transactions to assure data consistency and integrity. The application of this technology is with a good prospect.展开更多
The EU’s Artificial Intelligence Act(AI Act)imposes requirements for the privacy compliance of AI systems.AI systems must comply with privacy laws such as the GDPR when providing services.These laws provide users wit...The EU’s Artificial Intelligence Act(AI Act)imposes requirements for the privacy compliance of AI systems.AI systems must comply with privacy laws such as the GDPR when providing services.These laws provide users with the right to issue a Data Subject Access Request(DSAR).Responding to such requests requires database administrators to identify information related to an individual accurately.However,manual compliance poses significant challenges and is error-prone.Database administrators need to write queries through time-consuming labor.The demand for large amounts of data by AI systems has driven the development of NoSQL databases.Due to the flexible schema of NoSQL databases,identifying personal information becomes even more challenging.This paper develops an automated tool to identify personal information that can help organizations respond to DSAR.Our tool employs a combination of various technologies,including schema extraction of NoSQL databases and relationship identification from query logs.We describe the algorithm used by our tool,detailing how it discovers and extracts implicit relationships from NoSQL databases and generates relationship graphs to help developers accurately identify personal data.We evaluate our tool on three datasets,covering different database designs,achieving an F1 score of 0.77 to 1.Experimental results demonstrate that our tool successfully identifies information relevant to the data subject.Our tool reduces manual effort and simplifies GDPR compliance,showing practical application value in enhancing the privacy performance of NOSQL databases and AI systems.展开更多
Traditionally,nonlinear time history analysis(NLTHA)is used to assess the performance of structures under fu-ture hazards which is necessary to develop effective disaster risk management strategies.However,this method...Traditionally,nonlinear time history analysis(NLTHA)is used to assess the performance of structures under fu-ture hazards which is necessary to develop effective disaster risk management strategies.However,this method is computationally intensive and not suitable for analyzing a large number of structures on a city-wide scale.Surrogate models offer an efficient and reliable alternative and facilitate evaluating the performance of multiple structures under different hazard scenarios.However,creating a comprehensive database for surrogate mod-elling at the city level presents challenges.To overcome this,the present study proposes meta databases and a general framework for surrogate modelling of steel structures.The dataset includes 30,000 steel moment-resisting frame buildings,representing low-rise,mid-rise and high-rise buildings,with criteria for connections,beams,and columns.Pushover analysis is performed and structural parameters are extracted,and finally,incorporating two different machine learning algorithms,random forest and Shapley additive explanations,sensitivity and explain-ability analyses of the structural parameters are performed to identify the most significant factors in designing steel moment resisting frames.The framework and databases can be used as a validated source of surrogate modelling of steel frame structures in order for disaster risk management.展开更多
Discovery of materials using“bottom-up”or“top-down”approach is of great interest in materials science.Layered materials consisting of two-dimensional(2D)building blocks provide a good platform to explore new mater...Discovery of materials using“bottom-up”or“top-down”approach is of great interest in materials science.Layered materials consisting of two-dimensional(2D)building blocks provide a good platform to explore new materials in this respect.In van der Waals(vdW)layered materials,these building blocks are charge neutral and can be isolated from their bulk phase(top-down),but usually grow on substrate.In ionic layered materials,they are charged and usually cannot exist independently but can serve as motifs to construct new materials(bottom-up).In this paper,we introduce our recently constructed databases for 2D material-substrate interface(2DMSI),and 2D charged building blocks.For 2DMSI database,we systematically build a workflow to predict appropriate substrates and their geometries at substrates,and construct the 2DMSI database.For the 2D charged building block database,1208 entries from bulk material database are identified.Information of crystal structure,valence state,source,dimension and so on is provided for each entry with a json format.We also show its application in designing and searching for new functional layered materials.The 2DMSI database,building block database,and designed layered materials are available in Science Data Bank at https://doi.org/10.57760/sciencedb.j00113.00188.展开更多
BACKGROUND Ampullary adenocarcinoma is a rare malignant tumor of the gastrointestinal tract.Currently,only a few cases have been reported,resulting in limited information on survival.AIM To develop a dynamic nomogram ...BACKGROUND Ampullary adenocarcinoma is a rare malignant tumor of the gastrointestinal tract.Currently,only a few cases have been reported,resulting in limited information on survival.AIM To develop a dynamic nomogram using internal and external validation to predict survival in patients with ampullary adenocarcinoma.METHODS Data were sourced from the surveillance,epidemiology,and end results stat database.The patients in the database were randomized in a 7:3 ratio into training and validation groups.Using Cox regression univariate and multivariate analyses in the training group,we identified independent risk factors for overall survival and cancer-specific survival to develop the nomogram.The nomogram was validated with a cohort of patients from the First Affiliated Hospital of the Army Medical University.RESULTS For overall and cancer-specific survival,12(sex,age,race,lymph node ratio,tumor size,chemotherapy,surgical modality,T stage,tumor differentiation,brain metastasis,lung metastasis,and extension)and 6(age;surveillance,epidemiology,and end results stage;lymph node ratio;chemotherapy;surgical modality;and tumor differentiation)independent risk factors,respectively,were incorporated into the nomogram.The area under the curve values at 1,3,and 5 years,respectively,were 0.807,0.842,and 0.826 for overall survival and 0.816,0.835,and 0.841 for cancer-specific survival.The internal and external validation cohorts indicated good consistency of the nomogram.CONCLUSION The dynamic nomogram offers robust predictive efficacy for the overall and cancer-specific survival of ampullary adenocarcinoma.展开更多
Natural products(NPs)have long held a significant position in various fields such as medicine,food,agriculture,and materials.The chemical space covered by NPs is extensive but often underexplored.Therefore,high-throug...Natural products(NPs)have long held a significant position in various fields such as medicine,food,agriculture,and materials.The chemical space covered by NPs is extensive but often underexplored.Therefore,high-throughput and efficient methodologies for the annotation and discovery of NPs are desired to address the complexity and diversity of NP-based systems.Mass spectrometry(MS)has emerged as a powerful platform for the annotation and discovery of NPs.MS databases provide vital support for the structural characterization of NPs by integrating extensive mass spectral data and sample information.Additionally,the released annotation methodologies,based on a variety of informatics tools,continuously improve the ability to annotate the structure and properties of compounds.This review examines the current mainstream databases and annotation methodologies,focusing on their advantages and limitations.Prospects for future technological advancements are then discussed in terms of novel applications and research objectives.Through a systematic overview,this review aims to provide valuable insights and a reference for MS-based NPs annotation,thereby promoting the discovery of novel natural entities.展开更多
The unique long-range disordered atomic arrangement inherent in amorphous materials endows them with a range of superior properties,rendering them highly promising for applications in catalysis,medicine,and battery te...The unique long-range disordered atomic arrangement inherent in amorphous materials endows them with a range of superior properties,rendering them highly promising for applications in catalysis,medicine,and battery technology,among other fields.Since not all materials can be synthesized into an amorphous structure,the composition design of amorphous materials holds significant importance.Machine learning offers a valuable alternative to traditional“trial-anderror”methods by predicting properties through experimental data,thus providing efficient guidance in material design.In this study,we develop a machine learning workflow to predict the critical casting diameter,glass transition temperature,and Young's modulus for 45 ternary reported amorphous alloy systems.The predicted results have been organized into a database,enabling direct retrieval of predicted values based on compositional information.Furthermore,the applications of high glass forming ability region screening for specified system,multi-property target system screening and high glass forming ability region search through iteration are also demonstrated.By utilizing machine learning predictions,researchers can effectively narrow the experimental scope and expedite the exploration of compositions.展开更多
Research into metamorphism plays a pivotal role in reconstructing the evolution of continent,particularly through the study of ancient rocks that are highly susceptible to metamorphic alterations due to multiple tecto...Research into metamorphism plays a pivotal role in reconstructing the evolution of continent,particularly through the study of ancient rocks that are highly susceptible to metamorphic alterations due to multiple tectonic activities.In the big data era,the establishment of new data platforms and the application of big data methods have become a focus for metamorphic rocks.Significant progress has been made in creating specialized databases,compiling comprehensive datasets,and utilizing data analytics to address complex scientific questions.However,many existing databases are inadequate in meeting the specific requirements of metamorphic research,resulting from a substantial amount of valuable data remaining uncollected.Therefore,constructing new databases that can cope with the development of the data era is necessary.This article provides an extensive review of existing databases related to metamorphic rocks and discusses data-driven studies in this.Accordingly,several crucial factors that need to be taken into consideration in the establishment of specialized metamorphic databases are identified,aiming to leverage data-driven applications to achieve broader scientific objectives in metamorphic research.展开更多
Objective:This study aimed to investigate the changes in gene expression profiles of multiple myeloma(MM)cells after bortezomib treatment by analyzing the GEO database,thereby providing a theoretical foundation for su...Objective:This study aimed to investigate the changes in gene expression profiles of multiple myeloma(MM)cells after bortezomib treatment by analyzing the GEO database,thereby providing a theoretical foundation for subsequent research on HSP70.Methods:The GSE41929 dataset was selected from the GEO database.Screening and analysis were performed to identify differentially expressed genes between bortezomib-treated and non-treated MM cells.Results:After bortezomib treatment,126 genes in MM cells showed the most significant changes in expression(P<0.05,absolute value of logFC≥1.5).Based on the fold change and the most significant gene module,HSPA1B exhibited the most notable upregulation after HMOX1,followed by HSPA6 and DNAJB1.HSPA1B and HSPA6 are members of the HSP70 protein family,while DNAJB1 primarily interacts with HSP70 to stimulate its ATPase activity and negatively regulates the transcriptional activity of HSF1 induced by heat shock.Conclusion:HSP70 was the most significantly upregulated molecule in MM cells following bortezomib stimulation.展开更多
BACKGROUND For locally advanced gallbladder cancer,previous clinical studies have demon-strated that chemotherapy results in significant survival benefits when compared to surgery alone.However,data demonstrating a si...BACKGROUND For locally advanced gallbladder cancer,previous clinical studies have demon-strated that chemotherapy results in significant survival benefits when compared to surgery alone.However,data demonstrating a similar survival benefit with early-stage gallbladder cancer is limited.This study seeks to evaluate the impact chemotherapy has on survival in patients with early-stage gallbladder cancer using a large,multi-institution database.AIM To investigate the survival benefit of chemotherapy in patients with stage II gallbladder cancer.METHODS We performed a retrospective multivariable analysis of the National Cancer Database from 2010 to 2017 to evaluate the effect that chemotherapy has on the survival of patients with stage II gallbladder cancer.Our objective was to de-termine if there were any statistically significant survival differences between those who received surgery and chemotherapy vs those who only underwent surgery.RESULTS Of the 899 patients with stage II gallbladder cancer,328 patients had undergone chemotherapy and surgery.The average overall survival for those who had surgery and chemotherapy vs only surgery was 52.6 months and 51.1 months,respectively.This difference was not statistically significant(P=0.2).In the secondary analysis,the surgical group who had a liver resection had better overall survival(P<0.0001).CONCLUSION Practitioners should carefully consider chemotherapy for early-stage gallbladder cancer,as risks may outweigh survival benefits,and surgeons should also consider liver resections as part of their surgical management.展开更多
In-depth study of the components of polymyxins is the key to controlling the quality of this class of antibiotics.Similarities and variations of components present significant analytical challenges.A two-dimensional(2...In-depth study of the components of polymyxins is the key to controlling the quality of this class of antibiotics.Similarities and variations of components present significant analytical challenges.A two-dimensional(2D)liquid chromatography-mass spectrometry(LC-MS)method was established for screening and comprehensive profiling of compositions of the antibiotic colistimethate sodium(CMS).A high concentration of phosphate buffer mobile phase was used in the first-dimensional LC system to get the components well separated.For efficient and high-accuracy screening of CMS,a targeted method based on a self-constructed high resolution(HR)mass spectrum database of CMS components was established.The database was built based on the commercial MassHunter Personal Compound Database and Library(PCDL)software and its accuracy of the compound matching result was verified with six known components before being applied to genuine sample screening.On this basis,the unknown peaks in the CMS chromatograms were deduced and assigned.The molecular formula,group composition,and origins of a total of 99 compounds,of which the combined area percentage accounted for more than 95%of CMS components,were deduced by this 2D-LC-MS method combined with the MassHunter PCDL.This profiling method was highly efficient and could distinguish hundreds of components within 3 h,providing reliable results for quality control of this kind of complex drugs.展开更多
Background:Exercise induces molecular changes that involve multiple organs and tissues.Moreover,these changes are modulated by various exercise parameters—such as intensity,frequency,mode,and duration—as well as by ...Background:Exercise induces molecular changes that involve multiple organs and tissues.Moreover,these changes are modulated by various exercise parameters—such as intensity,frequency,mode,and duration—as well as by clinical features like gender,age,and body mass index(BMI),each eliciting distinct biological effects.To assist exercise researchers in understanding these changes from a comprehensive perspective that includes multiple organs,diverse exercise regimens,and a range of clinical features,we developed Exercise Regulated Genes Database(ExerGeneDB),a database of exercise-regulated differential genes.Methods:ExerGeneDB aggregated publicly available exercise-related sequencing datasets and subjected them to uniform quality control and preprocessing.The data,encompassing a variety of types,were organized into a specialized database of exercise-regulated genes.Notably,Exer-GeneDB conducted differential analyses on this collected data,leveraging curated clinical information and accounting for important factors such as gender,age,and BMI.Results:ExerGeneDB has assembled 1692 samples from rats and mice as well as 4492 human samples.It contains data from various tissues and organs,such as skeletal muscle,blood,adipose tissue,intestine,heart,liver,spleen,lungs,kidneys,brain,spinal cord,bone marrow,and bones.ExerGeneDB features bulk ribonucleic acid sequencing(RNA-seq)(including non-coding RNA(ncRNA)and protein-coding RNA),microarray(including ncRNA and protein-coding RNA),and single cell RNA-seq data.Conclusion:ExerGeneDB compiles and re-analyzes exercise-related data with a focus on clinical information.This has culminated in the crea-tion of an interactive database for exercise regulation genes.The website for ExerGeneDB can be found at:https://exergenedb.com.展开更多
This paper analyzes the text of 3261 clauses of 20 RTAs signed by China,classifies them into 52 policy areas according to the international mainstream HMS method,and assigns them through coding.The clause depth of Ch...This paper analyzes the text of 3261 clauses of 20 RTAs signed by China,classifies them into 52 policy areas according to the international mainstream HMS method,and assigns them through coding.The clause depth of China’s RTAs is measured across three-dimensional systems(policy areas,clauses,and core clauses)and two generations of trade policy areas(WTO+,WTO-X,and all policy areas).It is observed that China’s RTAs exhibit greater depth in Industrial Products,Agricultural Products,TBT,Antidumping,Countervailing,and Investment,while showing comparatively less depth in Fiscal Policy,Innovation Policies,and related areas.展开更多
文摘A distinctive feature of scholarly communities today is exploring topics and concepts in interdisciplinary and international contexts. This observation is increasingly apparent and visible in advancing our thinking and policies related to human/environmental worlds at local, regional, and global scales. Maps are an important part of these innovative and ongoing research approaches. In this context, we consider urban forests a topic meriting more attention of scholars studying the geographic and environmental intersections of the natural sciences with the social sciences and humanities. We construct two innovative knowledge bases, one a conceptual framework based on major themes and concepts related to mapping urban forests using key words of the first 100 results of a Google Scholar query and a second using the number of Google Scholar hyperlinks about mapping urban forests in 244 capital cities. We discovered that the constructed world maps reveal vast global unevenness in our knowledge about urban forests in hyperlink numbers and ratios, results that merit further attention by disciplinary, international and interdisciplinary scholarly communities.
基金supported by the National Key Research&Development Program of China(No.2024YFC3505800)the National Natural Science Foundation of China(Nos.82474334,82474335 and 72174132)+3 种基金National Science Fund for Distinguished Young Scholars(No.82225049)the Key Research&Development Projects of Sichuan Provincial Department of Science and Technology(Nos.2024YFFK0174 and 2024YFFK0152)1.3.5 Project for Disciplines of Excellence,West China Hospital,Sichuan University(Nos.ZYYC24010 and ZYGD23004)the Special Fund for Traditional Chinese Medicine of Sichuan Provincial Administration of Traditional Chinese Medicine(No.2024zd023).
文摘Objectives:Electronic health records(EHRs)offer valuable real-world data(RWD)for Chinese medicine research.However,significant methodological challenges remain in developing integrative Chinese-Western medicine(ICWM)databases.This study aims to establish a best-practice methodological framework,referred to as BRIDGE,to guide the construction of ICWM databases using EHRs.Methods:We developed the methodological framework through a comprehensive process,including systematic literature review,synthesis of empirical experiences,thematic expert discussions,and consultation with an external panel to reach consensus.Results:The BRIDGE framework outlines 6 core components for ICWM-EHR database development:Overall design,database architecture,data extraction and linkage,data governance,data verification,and data quality evaluation.Key data elements include variables related to study population,treatment or exposure,outcomes,and confounders.These databases support various research applications,particularly in evaluating the effectiveness and safety of integrative therapies.To demonstrate its practical value,we developed an ICWM-EHR database on women’s reproductive lifespan,encompassing 2,064,482 patients.This database captures women’s health conditions across the life course,from reproductive age to older adulthood.Conclusions:The BRIDGE methodological framework provides a standardized approach to building high-quality ICWM-EHR databases.It offers a unique opportunity to strengthen the methodological rigor and real-world relevance of Chinese medicine research in integrated healthcare settings.
文摘The journal of Meteorological and Environmental Research[ISSN:2152-3940]has been included and stored by the following famous databases:CA,CABI,CSA,EBSCO,UPD,AGRIS,EA,Chinese Science and Technology Periodical Database,and CNKI,as well as Library of Congress,United States.
文摘The journal of Meteorological and Environmental Research[ISSN:2152-3940]has been included and stored by the following famous databases:CA,CABI,CSA,EBSCO,UPD,AGRIS,EA,Chinese Science and Technology Periodical Database,and CNKI,as well as Library of Congress,United States.
文摘AI-driven materials databases are transforming research by integrating experimental and computational data to enhance discovery and optimization.Platforms such as Digital Catalysis Platform(DigCat)and Dynamic Database of Solid-State Electrolyte(DDSE)demonstrate how machine learning and predictive modeling can improve catalyst and solid-state electrolyte development.These databases facilitate data standardization,high-throughput screening,and cross-disciplinary collaboration,addressing key challenges in materials informatics.As AI techniques advance,materials databases are expected to play an increasingly vital role in accelerating research and innovation.
文摘The authors regret that the original publication of this paper did not include Jawad Fayaz as a co-author.After further discussions and a thorough review of the research contributions,it was agreed that his significant contributions to the foundational aspects of the research warranted recognition,and he has now been added as a co-author.
文摘The future usage of heterogeneous databases will consist of the WWW and CORBA environments. The integration of the WWW databases and CORBA standards are discussed. These two techniques need to merge together to make distributed usage of heterogeneous databases user friendly. In an environment integrating WWW databases and CORBA technologies, CORBA can be used to access heterogeneous data sources in the internet. This kind of applications can achieve distributed transactions to assure data consistency and integrity. The application of this technology is with a good prospect.
基金supported by the National Natural Science Foundation of China(No.62302242)the China Postdoctoral Science Foundation(No.2023M731802).
文摘The EU’s Artificial Intelligence Act(AI Act)imposes requirements for the privacy compliance of AI systems.AI systems must comply with privacy laws such as the GDPR when providing services.These laws provide users with the right to issue a Data Subject Access Request(DSAR).Responding to such requests requires database administrators to identify information related to an individual accurately.However,manual compliance poses significant challenges and is error-prone.Database administrators need to write queries through time-consuming labor.The demand for large amounts of data by AI systems has driven the development of NoSQL databases.Due to the flexible schema of NoSQL databases,identifying personal information becomes even more challenging.This paper develops an automated tool to identify personal information that can help organizations respond to DSAR.Our tool employs a combination of various technologies,including schema extraction of NoSQL databases and relationship identification from query logs.We describe the algorithm used by our tool,detailing how it discovers and extracts implicit relationships from NoSQL databases and generates relationship graphs to help developers accurately identify personal data.We evaluate our tool on three datasets,covering different database designs,achieving an F1 score of 0.77 to 1.Experimental results demonstrate that our tool successfully identifies information relevant to the data subject.Our tool reduces manual effort and simplifies GDPR compliance,showing practical application value in enhancing the privacy performance of NOSQL databases and AI systems.
基金financial support from Teesside University to support the Ph.D.programme of the first author.
文摘Traditionally,nonlinear time history analysis(NLTHA)is used to assess the performance of structures under fu-ture hazards which is necessary to develop effective disaster risk management strategies.However,this method is computationally intensive and not suitable for analyzing a large number of structures on a city-wide scale.Surrogate models offer an efficient and reliable alternative and facilitate evaluating the performance of multiple structures under different hazard scenarios.However,creating a comprehensive database for surrogate mod-elling at the city level presents challenges.To overcome this,the present study proposes meta databases and a general framework for surrogate modelling of steel structures.The dataset includes 30,000 steel moment-resisting frame buildings,representing low-rise,mid-rise and high-rise buildings,with criteria for connections,beams,and columns.Pushover analysis is performed and structural parameters are extracted,and finally,incorporating two different machine learning algorithms,random forest and Shapley additive explanations,sensitivity and explain-ability analyses of the structural parameters are performed to identify the most significant factors in designing steel moment resisting frames.The framework and databases can be used as a validated source of surrogate modelling of steel frame structures in order for disaster risk management.
基金Project supported by the National Natural Science Foundation of China(Grant Nos.61888102,52272172,and 52102193)the Major Program of the National Natural Science Foundation of China(Grant No.92163206)+2 种基金the National Key Research and Development Program of China(Grant Nos.2021YFA1201501 and 2022YFA1204100)the Strategic Priority Research Program of the Chinese Academy of Sciences(Grant No.XDB30000000)the Fundamental Research Funds for the Central Universities.
文摘Discovery of materials using“bottom-up”or“top-down”approach is of great interest in materials science.Layered materials consisting of two-dimensional(2D)building blocks provide a good platform to explore new materials in this respect.In van der Waals(vdW)layered materials,these building blocks are charge neutral and can be isolated from their bulk phase(top-down),but usually grow on substrate.In ionic layered materials,they are charged and usually cannot exist independently but can serve as motifs to construct new materials(bottom-up).In this paper,we introduce our recently constructed databases for 2D material-substrate interface(2DMSI),and 2D charged building blocks.For 2DMSI database,we systematically build a workflow to predict appropriate substrates and their geometries at substrates,and construct the 2DMSI database.For the 2D charged building block database,1208 entries from bulk material database are identified.Information of crystal structure,valence state,source,dimension and so on is provided for each entry with a json format.We also show its application in designing and searching for new functional layered materials.The 2DMSI database,building block database,and designed layered materials are available in Science Data Bank at https://doi.org/10.57760/sciencedb.j00113.00188.
基金Supported by the Appropriate Technology Promotion Program in Chongqing,No.2023jstg005.
文摘BACKGROUND Ampullary adenocarcinoma is a rare malignant tumor of the gastrointestinal tract.Currently,only a few cases have been reported,resulting in limited information on survival.AIM To develop a dynamic nomogram using internal and external validation to predict survival in patients with ampullary adenocarcinoma.METHODS Data were sourced from the surveillance,epidemiology,and end results stat database.The patients in the database were randomized in a 7:3 ratio into training and validation groups.Using Cox regression univariate and multivariate analyses in the training group,we identified independent risk factors for overall survival and cancer-specific survival to develop the nomogram.The nomogram was validated with a cohort of patients from the First Affiliated Hospital of the Army Medical University.RESULTS For overall and cancer-specific survival,12(sex,age,race,lymph node ratio,tumor size,chemotherapy,surgical modality,T stage,tumor differentiation,brain metastasis,lung metastasis,and extension)and 6(age;surveillance,epidemiology,and end results stage;lymph node ratio;chemotherapy;surgical modality;and tumor differentiation)independent risk factors,respectively,were incorporated into the nomogram.The area under the curve values at 1,3,and 5 years,respectively,were 0.807,0.842,and 0.826 for overall survival and 0.816,0.835,and 0.841 for cancer-specific survival.The internal and external validation cohorts indicated good consistency of the nomogram.CONCLUSION The dynamic nomogram offers robust predictive efficacy for the overall and cancer-specific survival of ampullary adenocarcinoma.
基金supported by the National Natural Science Foundation of China(Nos.82274064,82374026,and 82204591)。
文摘Natural products(NPs)have long held a significant position in various fields such as medicine,food,agriculture,and materials.The chemical space covered by NPs is extensive but often underexplored.Therefore,high-throughput and efficient methodologies for the annotation and discovery of NPs are desired to address the complexity and diversity of NP-based systems.Mass spectrometry(MS)has emerged as a powerful platform for the annotation and discovery of NPs.MS databases provide vital support for the structural characterization of NPs by integrating extensive mass spectral data and sample information.Additionally,the released annotation methodologies,based on a variety of informatics tools,continuously improve the ability to annotate the structure and properties of compounds.This review examines the current mainstream databases and annotation methodologies,focusing on their advantages and limitations.Prospects for future technological advancements are then discussed in terms of novel applications and research objectives.Through a systematic overview,this review aims to provide valuable insights and a reference for MS-based NPs annotation,thereby promoting the discovery of novel natural entities.
基金Project supported by funding from the National Natural Science Foundation of China(Grant Nos.52172258,52473227 and 52171150)the Strategic Priority Research Program of Chinese Academy of Sciences(Grant No.XDB0500200)。
文摘The unique long-range disordered atomic arrangement inherent in amorphous materials endows them with a range of superior properties,rendering them highly promising for applications in catalysis,medicine,and battery technology,among other fields.Since not all materials can be synthesized into an amorphous structure,the composition design of amorphous materials holds significant importance.Machine learning offers a valuable alternative to traditional“trial-anderror”methods by predicting properties through experimental data,thus providing efficient guidance in material design.In this study,we develop a machine learning workflow to predict the critical casting diameter,glass transition temperature,and Young's modulus for 45 ternary reported amorphous alloy systems.The predicted results have been organized into a database,enabling direct retrieval of predicted values based on compositional information.Furthermore,the applications of high glass forming ability region screening for specified system,multi-property target system screening and high glass forming ability region search through iteration are also demonstrated.By utilizing machine learning predictions,researchers can effectively narrow the experimental scope and expedite the exploration of compositions.
基金funded by the National Natural Science Foundation of China(No.42220104008)。
文摘Research into metamorphism plays a pivotal role in reconstructing the evolution of continent,particularly through the study of ancient rocks that are highly susceptible to metamorphic alterations due to multiple tectonic activities.In the big data era,the establishment of new data platforms and the application of big data methods have become a focus for metamorphic rocks.Significant progress has been made in creating specialized databases,compiling comprehensive datasets,and utilizing data analytics to address complex scientific questions.However,many existing databases are inadequate in meeting the specific requirements of metamorphic research,resulting from a substantial amount of valuable data remaining uncollected.Therefore,constructing new databases that can cope with the development of the data era is necessary.This article provides an extensive review of existing databases related to metamorphic rocks and discusses data-driven studies in this.Accordingly,several crucial factors that need to be taken into consideration in the establishment of specialized metamorphic databases are identified,aiming to leverage data-driven applications to achieve broader scientific objectives in metamorphic research.
基金The Innovation Capability Support Program for Medical Research Projects of Xi’an Science and Technology Bureau(23YXYJ0123)The Hospital Level Fund of the First Affiliated Hospital of Xi’an Medical University(XYYFY-2023-08)。
文摘Objective:This study aimed to investigate the changes in gene expression profiles of multiple myeloma(MM)cells after bortezomib treatment by analyzing the GEO database,thereby providing a theoretical foundation for subsequent research on HSP70.Methods:The GSE41929 dataset was selected from the GEO database.Screening and analysis were performed to identify differentially expressed genes between bortezomib-treated and non-treated MM cells.Results:After bortezomib treatment,126 genes in MM cells showed the most significant changes in expression(P<0.05,absolute value of logFC≥1.5).Based on the fold change and the most significant gene module,HSPA1B exhibited the most notable upregulation after HMOX1,followed by HSPA6 and DNAJB1.HSPA1B and HSPA6 are members of the HSP70 protein family,while DNAJB1 primarily interacts with HSP70 to stimulate its ATPase activity and negatively regulates the transcriptional activity of HSF1 induced by heat shock.Conclusion:HSP70 was the most significantly upregulated molecule in MM cells following bortezomib stimulation.
文摘BACKGROUND For locally advanced gallbladder cancer,previous clinical studies have demon-strated that chemotherapy results in significant survival benefits when compared to surgery alone.However,data demonstrating a similar survival benefit with early-stage gallbladder cancer is limited.This study seeks to evaluate the impact chemotherapy has on survival in patients with early-stage gallbladder cancer using a large,multi-institution database.AIM To investigate the survival benefit of chemotherapy in patients with stage II gallbladder cancer.METHODS We performed a retrospective multivariable analysis of the National Cancer Database from 2010 to 2017 to evaluate the effect that chemotherapy has on the survival of patients with stage II gallbladder cancer.Our objective was to de-termine if there were any statistically significant survival differences between those who received surgery and chemotherapy vs those who only underwent surgery.RESULTS Of the 899 patients with stage II gallbladder cancer,328 patients had undergone chemotherapy and surgery.The average overall survival for those who had surgery and chemotherapy vs only surgery was 52.6 months and 51.1 months,respectively.This difference was not statistically significant(P=0.2).In the secondary analysis,the surgical group who had a liver resection had better overall survival(P<0.0001).CONCLUSION Practitioners should carefully consider chemotherapy for early-stage gallbladder cancer,as risks may outweigh survival benefits,and surgeons should also consider liver resections as part of their surgical management.
基金support from the Science Research Program Project for Drug Regulation,Jiangsu Medical Products Administration,China(Grant No.:202207)the National Drug Standards Revision Project,China(Grant No.:2023Y41)+1 种基金the National Natural Science Foundation of China(Grant No.:22276080)the Foreign Expert Project,China(Grant No.:G2022014096L).
文摘In-depth study of the components of polymyxins is the key to controlling the quality of this class of antibiotics.Similarities and variations of components present significant analytical challenges.A two-dimensional(2D)liquid chromatography-mass spectrometry(LC-MS)method was established for screening and comprehensive profiling of compositions of the antibiotic colistimethate sodium(CMS).A high concentration of phosphate buffer mobile phase was used in the first-dimensional LC system to get the components well separated.For efficient and high-accuracy screening of CMS,a targeted method based on a self-constructed high resolution(HR)mass spectrum database of CMS components was established.The database was built based on the commercial MassHunter Personal Compound Database and Library(PCDL)software and its accuracy of the compound matching result was verified with six known components before being applied to genuine sample screening.On this basis,the unknown peaks in the CMS chromatograms were deduced and assigned.The molecular formula,group composition,and origins of a total of 99 compounds,of which the combined area percentage accounted for more than 95%of CMS components,were deduced by this 2D-LC-MS method combined with the MassHunter PCDL.This profiling method was highly efficient and could distinguish hundreds of components within 3 h,providing reliable results for quality control of this kind of complex drugs.
基金supported by grants from the National Natural Science Foundation of China(82225005, 82020108002 to JX,82200321 to QZ)Science and Technology Commission of ShanghaiMunicipality(23410750100,20DZ2255400,, 21XD1421300 to JX)+1 种基金the“Dawn”Program of Shanghai Educa-tion Commission(19SG34 to JX)Shanghai Sailing Program(21YF1413200 to QZ).
文摘Background:Exercise induces molecular changes that involve multiple organs and tissues.Moreover,these changes are modulated by various exercise parameters—such as intensity,frequency,mode,and duration—as well as by clinical features like gender,age,and body mass index(BMI),each eliciting distinct biological effects.To assist exercise researchers in understanding these changes from a comprehensive perspective that includes multiple organs,diverse exercise regimens,and a range of clinical features,we developed Exercise Regulated Genes Database(ExerGeneDB),a database of exercise-regulated differential genes.Methods:ExerGeneDB aggregated publicly available exercise-related sequencing datasets and subjected them to uniform quality control and preprocessing.The data,encompassing a variety of types,were organized into a specialized database of exercise-regulated genes.Notably,Exer-GeneDB conducted differential analyses on this collected data,leveraging curated clinical information and accounting for important factors such as gender,age,and BMI.Results:ExerGeneDB has assembled 1692 samples from rats and mice as well as 4492 human samples.It contains data from various tissues and organs,such as skeletal muscle,blood,adipose tissue,intestine,heart,liver,spleen,lungs,kidneys,brain,spinal cord,bone marrow,and bones.ExerGeneDB features bulk ribonucleic acid sequencing(RNA-seq)(including non-coding RNA(ncRNA)and protein-coding RNA),microarray(including ncRNA and protein-coding RNA),and single cell RNA-seq data.Conclusion:ExerGeneDB compiles and re-analyzes exercise-related data with a focus on clinical information.This has culminated in the crea-tion of an interactive database for exercise regulation genes.The website for ExerGeneDB can be found at:https://exergenedb.com.
基金General Project of Beijing Social Science Foundation,“Research on the Internal and External Strategic Alignment of Regional Trade Agreements and the High-Quality Construction of China(Beijing)Pilot Free Trade Zone”(Project No.:21GLB021)。
文摘This paper analyzes the text of 3261 clauses of 20 RTAs signed by China,classifies them into 52 policy areas according to the international mainstream HMS method,and assigns them through coding.The clause depth of China’s RTAs is measured across three-dimensional systems(policy areas,clauses,and core clauses)and two generations of trade policy areas(WTO+,WTO-X,and all policy areas).It is observed that China’s RTAs exhibit greater depth in Industrial Products,Agricultural Products,TBT,Antidumping,Countervailing,and Investment,while showing comparatively less depth in Fiscal Policy,Innovation Policies,and related areas.