Severe acute respiratory syndrome coronavirus 2(SARS-CoV-2)is the pathogen of the ongoing coronavirus disease 2019(COVID-19)global pandemic.Here,by centralizing published cell-based experiments,clinical trials,and vir...Severe acute respiratory syndrome coronavirus 2(SARS-CoV-2)is the pathogen of the ongoing coronavirus disease 2019(COVID-19)global pandemic.Here,by centralizing published cell-based experiments,clinical trials,and virtual drug screening data from the NCBI PubMed database,we developed a database of SARS-CoV-2 inhibitors for COVID-19,dbSCI,which includes 234 SARS-CoV-2 inhibitors collected from publications based on cell-based experiments,81 drugs of COVID-19 in clinical trials and 1305 potential SARS-CoV-2 inhibitors from bioinformatics analyses.dbSCI provides four major functions:(1)search the drug target or its inhibitor for SARS-CoV-2,(2)browse target/inhibitor information collected from cell experiments,clinical trials,and virtual drug screenings,(3)download,and(4)submit data.Each entry in dbSCI contains 18 types of information,including inhibitor/drug name,targeting protein,mechanism of inhibition,experimental technique,experimental sample type,and reference information.In summary,dbSCI provides a relatively comprehensive,credible repository for inhibitors/drugs against SARS-CoV-2 and their potential targeting mechanisms and it will be valuable for further studies to control COVID-19.展开更多
The differences between the data integration of a dynamic database grid (DBG) and that of a distributed database system are analyzed, and three kinds of data integration strategies are given on the background of DBG...The differences between the data integration of a dynamic database grid (DBG) and that of a distributed database system are analyzed, and three kinds of data integration strategies are given on the background of DBG based on Peer to Peer (P2P) framework, including the centralized data integration (CDI) strategy, the distributed data integration (DDI) strategy and the filter-based data integration (FDDI) strategy. CDI calls all the database grid services (DGSs) at a single node, DDI disperses the DGSs to multiple nodes, while FDDI schedules the data integration nodes based on filtering the keywords returned from DGSs. The performance of these three integration strategies are compared with and analyzed by simulation experiments. FDDI is more evident for filtering the keywords with data redundancy increasing. Through the reduction of large amount of data transportation, it effectively shortens the executing time for the task and improves its efficiency.展开更多
In-depth study of the components of polymyxins is the key to controlling the quality of this class of antibiotics.Similarities and variations of components present significant analytical challenges.A two-dimensional(2...In-depth study of the components of polymyxins is the key to controlling the quality of this class of antibiotics.Similarities and variations of components present significant analytical challenges.A two-dimensional(2D)liquid chromatography-mass spectrometry(LC-MS)method was established for screening and comprehensive profiling of compositions of the antibiotic colistimethate sodium(CMS).A high concentration of phosphate buffer mobile phase was used in the first-dimensional LC system to get the components well separated.For efficient and high-accuracy screening of CMS,a targeted method based on a self-constructed high resolution(HR)mass spectrum database of CMS components was established.The database was built based on the commercial MassHunter Personal Compound Database and Library(PCDL)software and its accuracy of the compound matching result was verified with six known components before being applied to genuine sample screening.On this basis,the unknown peaks in the CMS chromatograms were deduced and assigned.The molecular formula,group composition,and origins of a total of 99 compounds,of which the combined area percentage accounted for more than 95%of CMS components,were deduced by this 2D-LC-MS method combined with the MassHunter PCDL.This profiling method was highly efficient and could distinguish hundreds of components within 3 h,providing reliable results for quality control of this kind of complex drugs.展开更多
This paper brings the comparison of performances of CO_(2)conversion by plasma and plasma-assisted catalysis based on the data collected from literature in this field,organised in an open access online database.This t...This paper brings the comparison of performances of CO_(2)conversion by plasma and plasma-assisted catalysis based on the data collected from literature in this field,organised in an open access online database.This tool is open to all users to carry out their own analyses,but also to contributors who wish to add their data to the database in order to improve the relevance of the comparisons made,and ultimately to improve the efficiency of CO_(2)conversion by plasma-catalysis.The creation of this database and database user interface is motivated by the fact that plasma-catalysis is a fast-growing field for all CO_(2)conversion processes,be it methanation,dry reforming of methane,methanolisation,or others.As a result of this rapid increase,there is a need for a set of standard procedures to rigorously compare performances of different systems.However,this is currently not possible because the fundamental mechanisms of plasma-catalysis are still too poorly understood to define these standard procedures.Fortunately however,the accumulated data within the CO_(2)plasma-catalysis community has become large enough to warrant so-called“big data”studies more familiar in the fields of medicine and the social sciences.To enable comparisons between multiple data sets and make future research more effective,this work proposes the first database on CO_(2)conversion performances by plasma-catalysis open to the whole community.This database has been initiated in the framework of a H_(2)0_(2)0 European project and is called the“PIONEER Data Base”.The database gathers a large amount of CO_(2)conversion performance data such as conversion rate,energy efficiency,and selectivity for numerous plasma sources coupled with or without a catalyst.Each data set is associated with metadata describing the gas mixture,the plasma source,the nature of the catalyst,and the form of coupling with the plasma.Beyond the database itself,a data extraction tool with direct visualisation features or advanced filtering functionalities has been developed and is available online to the public.The simple and fast visualisation of the state of the art puts new results into context,identifies literal gaps in data,and consequently points towards promising research routes.More advanced data extraction illustrates the impact that the database can have in the understanding of plasma-catalyst coupling.Lessons learned from the review of a large amount of literature during the setup of the database lead to best practice advice to increase comparability between future CO_(2)plasma-catalytic studies.Finally,the community is strongly encouraged to contribute to the database not only to increase the visibility of their data but also the relevance of the comparisons allowed by this tool.展开更多
Electronic patient data gives many advantages,but also new difficulties.Deadlocks may delay procedures like acquiring patient information.Distributed deadlock resolution solutions introduce uncertainty due to inaccura...Electronic patient data gives many advantages,but also new difficulties.Deadlocks may delay procedures like acquiring patient information.Distributed deadlock resolution solutions introduce uncertainty due to inaccurate transaction properties.Soft computing-based solutions have been developed to solve this challenge.In a single framework,ambiguous,vague,incomplete,and inconsistent transaction attribute information has received minimal attention.The work presented in this paper employed type-2 neutrosophic logic,an extension of type-1 neutrosophic logic,to handle uncertainty in real-time deadlock-resolving systems.The proposed method is structured to reflect multiple types of knowledge and relations among transactions’features that include validation factor degree,slackness degree,degree of deadline-missed transaction based on the degree of membership of truthiness,degree ofmembership of indeterminacy,and degree ofmembership of falsity.Here,the footprint of uncertainty(FOU)for truth,indeterminacy,and falsity represents the level of uncertainty that exists in the value of a grade of membership.We employed a distributed real-time transaction processing simulator(DRTTPS)to conduct the simulations and conducted experiments using the benchmark Pima Indians diabetes dataset(PIDD).As the results showed,there is an increase in detection rate and a large drop in rollback rate when this new strategy is used.The performance of Type-2 neutrosophicbased resolution is better than the Type-1 neutrosophic-based approach on the execution ratio scale.The improvement rate has reached 10%to 20%,depending on the number of arrived transactions.展开更多
Discovery of materials using“bottom-up”or“top-down”approach is of great interest in materials science.Layered materials consisting of two-dimensional(2D)building blocks provide a good platform to explore new mater...Discovery of materials using“bottom-up”or“top-down”approach is of great interest in materials science.Layered materials consisting of two-dimensional(2D)building blocks provide a good platform to explore new materials in this respect.In van der Waals(vdW)layered materials,these building blocks are charge neutral and can be isolated from their bulk phase(top-down),but usually grow on substrate.In ionic layered materials,they are charged and usually cannot exist independently but can serve as motifs to construct new materials(bottom-up).In this paper,we introduce our recently constructed databases for 2D material-substrate interface(2DMSI),and 2D charged building blocks.For 2DMSI database,we systematically build a workflow to predict appropriate substrates and their geometries at substrates,and construct the 2DMSI database.For the 2D charged building block database,1208 entries from bulk material database are identified.Information of crystal structure,valence state,source,dimension and so on is provided for each entry with a json format.We also show its application in designing and searching for new functional layered materials.The 2DMSI database,building block database,and designed layered materials are available in Science Data Bank at https://doi.org/10.57760/sciencedb.j00113.00188.展开更多
Objective:To investigate the variation,expression and clinical significance of E2F3 gene in melanoma.Methods:Firstly,cBioportal database,Oncomine database and GEO database were used to analyze the variation and expres...Objective:To investigate the variation,expression and clinical significance of E2F3 gene in melanoma.Methods:Firstly,cBioportal database,Oncomine database and GEO database were used to analyze the variation and expression level of E2F3 gene in melanoma.OSskcm database and TISIDB database were used to analyze the relationship between E2F3 and melanoma prognosis and tumor immune infiltrating cells.Then,the LinkedOmics database was used to identify the differential genes related to E2F3 expression in melanoma and analyze their biological functions.Finally,small molecule compounds for the treatment of melanoma were screened through the CMap database.Results:The mutation rate of E2F3 gene in melanoma is about 4%,and there are 21 mutation sites.Compared with normal skin tissues,the expression of E2F3 gene in melanoma was significantly increased(P<0.01).The mutation and increased expression of E2F3 gene were associated with the shortened overall survival(OS)of melanoma patients(P<0.05).The CNA level of E2F3 was negatively correlated with the expression levels of lymphocytes such as pDC,Neutrophil,Act DC and Th17,and negatively correlated with the expression levels of chemokines such as CXCL5,CCL13 and CCR1.The methylation level of E2F3 was positively correlated with the expression levels of Th1,Neutrophil,Act DC and other lymphocytes,and positively correlated with the expression levels of CXCL16,CXCL12,CCR1 and other chemokines.The expression level of E2F3 was negatively correlated with the expression levels of lymphocytes such as Th17,Tcm CD4 and Th1,and negatively correlated with the expression levels of chemokines such as CXCL 16,CCL 22 and CCL 2.The expression of 96 genes such as UHRF1BP1 in melanoma was significantly correlated with the expression of E2F3(|cor|0.5,P<0.05).The above genes were mainly related to RNA transport,eukaryotic ribosome biogenesis,cell cycle and other pathways.Among them,WDR12,WDR43,RBM28,UTP18,DKC1,PAK1IP1,DDX31,TEX10,TRUB1 and TRMT61B were the top 10 hub genes.YC-1,simvastatin,cytochalasin-d,Deforolimus and cytochalasin-b may be five small molecule compounds for the treatment of melanoma.Conclusion:The mutation and increased expression level of E2F3 gene are related to the poor prognosis of melanoma and participate in the occurrence and development of melanoma by affecting the expression of different tumor immune infiltrating cell subtypes,which may be a potential diagnostic marker and therapeutic target for melanoma.展开更多
The CALPHAD thermodynamic databases are very useful to analyze the complex chemical reactions happening in high temperature material process.The FactSage thermodynamic database can be used to calculate complex phase d...The CALPHAD thermodynamic databases are very useful to analyze the complex chemical reactions happening in high temperature material process.The FactSage thermodynamic database can be used to calculate complex phase diagrams and equilibrium phases involving refractories in industrial process.In this study,the FactSage thermodynamic database relevant to ZrO_(2)-based refractories was reviewed and the application of the database to understanding the corrosion of continuous casting nozzle refractories in steelmaking was presented.展开更多
For integrating relational databases (RDBs) into semantic web applications, the W3C RDB2RDF Working Group recommended two approaches, Direct Mapping (DM) and R2RML. The DM provides a set of mapping rules according to ...For integrating relational databases (RDBs) into semantic web applications, the W3C RDB2RDF Working Group recommended two approaches, Direct Mapping (DM) and R2RML. The DM provides a set of mapping rules according to RDB schema, while the R2RML allows users to manually define mappings according to existing target ontology. The major problem to use R2RML is the effort for creating R2RML mapping documents manually. This may lead to appearance of many mistakes in the R2RML documents and requires domain experts. In this paper, we propose and implement an approach to generate an R2RML mapping documents automatically from RDB schema. The R2RML mapping reflects the behavior of the DM specification and allows any R2RML parser to generate a set of RDF triples from relational data.The input of generating approach is DBsInfo class that automatically generated from relational schema. An experimental prototype is developed and shows the effectiveness of our approach algorithms.展开更多
A large number of experimental and theoretical investigations of carbon dioxide (CO 2 ) spectra have been conducted since the most recent update of the High-Resolution Transmission Molecular Absorption (HITRAN) da...A large number of experimental and theoretical investigations of carbon dioxide (CO 2 ) spectra have been conducted since the most recent update of the High-Resolution Transmission Molecular Absorption (HITRAN) database. To maintain optimal parameters, the HITRAN 2004 CO 2 line list has been completely replaced by HITRAN 2008 data in the near-infrared region from 4300 cm-1 to 7000 cm-1 . To examine the effect of this change on the retrieval of CO 2 vertical column data from reflected sunlight spectra in the 1.61-μm spectral window, synthetic measurements for a given atmospheric state and instrument setup were generated and compared using radiative transfer model with the line-transition parameters from the HITRAN 2004 and 2008 databases. Simulated retrievals were then performed based on the optimal estimation retrieval theory. The results show that large systematic errors in atmospheric CO 2 column retrievals were induced by the differences in the HITRAN laboratory line parameters in the 1.61-μm region. The retrieved CO 2 columns were underestimated by 10 ppm using the HITRAN 2004 data, and improvements resulting from the use of the improved HITRAN database were more pronounced at a higher spectral resolution.展开更多
An excellent cardinality estimation can make the query optimiser produce a good execution plan.Although there are some studies on cardinality estimation,the prediction results of existing cardinality estimators are in...An excellent cardinality estimation can make the query optimiser produce a good execution plan.Although there are some studies on cardinality estimation,the prediction results of existing cardinality estimators are inaccurate and the query efficiency cannot be guaranteed as well.In particular,they are difficult to accurately obtain the complex relationships between multiple tables in complex database systems.When dealing with complex queries,the existing cardinality estimators cannot achieve good results.In this study,a novel cardinality estimator is proposed.It uses the core techniques with the BiLSTM network structure and adds the attention mechanism.First,the columns involved in the query statements in the training set are sampled and compressed into bitmaps.Then,the Word2vec model is used to embed the word vectors about the query statements.Finally,the BiLSTM network and attention mechanism are employed to deal with word vectors.The proposed model takes into consideration not only the correlation between tables but also the processing of complex predicates.Extensive experiments and the evaluation of BiLSTM-Attention Cardinality Estimator(BACE)on the IMDB datasets are conducted.The results show that the deep learning model can significantly improve the quality of cardinality estimation,which is a vital role in query optimisation for complex databases.展开更多
Stochastic modeling of ground motion is a simple tool to predict ground shaking level for future earthquake and less time consuming than physics-based deterministic modeling.In this paper,a record-based stochastic met...Stochastic modeling of ground motion is a simple tool to predict ground shaking level for future earthquake and less time consuming than physics-based deterministic modeling.In this paper,a record-based stochastic method that considers the time-and frequency-evolution of ground motion is used to estimate ground motion for scenario earthquakes in tectonic active region.The stochastic method employs a time-domain modulation function to describe the temporal nonstationarity and a filter impulse response function that describe the evolution of frequency content.For characterizing the modulation function and the filter impulse function,six parameters(Ia,D5-95,tmid,ωmid,ω',ξf)are defined,and 2,571 pairs of ground motion recording in the NGA-west2 database are selected to identify the six parameters.Probabilistic density function is assigned to each of the parameter by fitting the frequency distribution histogram.The parameters are then transformed into standard normal space where regression analysis is performed by considering each parameter as function of moment magnitude,rupture distance,vS30(The time-averaged shear wave velocity of the top 30 m of soil).The prediction equations are used to generate ground motions for several scenario earthquakes and compared to NGA-West2 GMPEs.展开更多
基金supported by the Innovation and Entrepreneurship Training Program for College Students in Henan University(20221022009)the Key Scientific Research Project of Henan Province(22A310012)+1 种基金supporting program for Central Plain Young Top Talents(ZYQR201912176)the program from Academy for Advanced Interdisciplinary Studies of Henan University(Y21008L).
文摘Severe acute respiratory syndrome coronavirus 2(SARS-CoV-2)is the pathogen of the ongoing coronavirus disease 2019(COVID-19)global pandemic.Here,by centralizing published cell-based experiments,clinical trials,and virtual drug screening data from the NCBI PubMed database,we developed a database of SARS-CoV-2 inhibitors for COVID-19,dbSCI,which includes 234 SARS-CoV-2 inhibitors collected from publications based on cell-based experiments,81 drugs of COVID-19 in clinical trials and 1305 potential SARS-CoV-2 inhibitors from bioinformatics analyses.dbSCI provides four major functions:(1)search the drug target or its inhibitor for SARS-CoV-2,(2)browse target/inhibitor information collected from cell experiments,clinical trials,and virtual drug screenings,(3)download,and(4)submit data.Each entry in dbSCI contains 18 types of information,including inhibitor/drug name,targeting protein,mechanism of inhibition,experimental technique,experimental sample type,and reference information.In summary,dbSCI provides a relatively comprehensive,credible repository for inhibitors/drugs against SARS-CoV-2 and their potential targeting mechanisms and it will be valuable for further studies to control COVID-19.
基金Supported by the National High-Technology Re-search and Development Program of China(2003AA414210) the National Natural Science Foundation of China (60573090)
文摘The differences between the data integration of a dynamic database grid (DBG) and that of a distributed database system are analyzed, and three kinds of data integration strategies are given on the background of DBG based on Peer to Peer (P2P) framework, including the centralized data integration (CDI) strategy, the distributed data integration (DDI) strategy and the filter-based data integration (FDDI) strategy. CDI calls all the database grid services (DGSs) at a single node, DDI disperses the DGSs to multiple nodes, while FDDI schedules the data integration nodes based on filtering the keywords returned from DGSs. The performance of these three integration strategies are compared with and analyzed by simulation experiments. FDDI is more evident for filtering the keywords with data redundancy increasing. Through the reduction of large amount of data transportation, it effectively shortens the executing time for the task and improves its efficiency.
基金support from the Science Research Program Project for Drug Regulation,Jiangsu Medical Products Administration,China(Grant No.:202207)the National Drug Standards Revision Project,China(Grant No.:2023Y41)+1 种基金the National Natural Science Foundation of China(Grant No.:22276080)the Foreign Expert Project,China(Grant No.:G2022014096L).
文摘In-depth study of the components of polymyxins is the key to controlling the quality of this class of antibiotics.Similarities and variations of components present significant analytical challenges.A two-dimensional(2D)liquid chromatography-mass spectrometry(LC-MS)method was established for screening and comprehensive profiling of compositions of the antibiotic colistimethate sodium(CMS).A high concentration of phosphate buffer mobile phase was used in the first-dimensional LC system to get the components well separated.For efficient and high-accuracy screening of CMS,a targeted method based on a self-constructed high resolution(HR)mass spectrum database of CMS components was established.The database was built based on the commercial MassHunter Personal Compound Database and Library(PCDL)software and its accuracy of the compound matching result was verified with six known components before being applied to genuine sample screening.On this basis,the unknown peaks in the CMS chromatograms were deduced and assigned.The molecular formula,group composition,and origins of a total of 99 compounds,of which the combined area percentage accounted for more than 95%of CMS components,were deduced by this 2D-LC-MS method combined with the MassHunter PCDL.This profiling method was highly efficient and could distinguish hundreds of components within 3 h,providing reliable results for quality control of this kind of complex drugs.
基金funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No.813393partially funded by the Portuguese FCT-Funda??o para a Ciência e a Tecnologia,under projects UIDB/50010/2020,UIDP/50010/2020 and PTDC/FIS-PLA/1616/2021。
文摘This paper brings the comparison of performances of CO_(2)conversion by plasma and plasma-assisted catalysis based on the data collected from literature in this field,organised in an open access online database.This tool is open to all users to carry out their own analyses,but also to contributors who wish to add their data to the database in order to improve the relevance of the comparisons made,and ultimately to improve the efficiency of CO_(2)conversion by plasma-catalysis.The creation of this database and database user interface is motivated by the fact that plasma-catalysis is a fast-growing field for all CO_(2)conversion processes,be it methanation,dry reforming of methane,methanolisation,or others.As a result of this rapid increase,there is a need for a set of standard procedures to rigorously compare performances of different systems.However,this is currently not possible because the fundamental mechanisms of plasma-catalysis are still too poorly understood to define these standard procedures.Fortunately however,the accumulated data within the CO_(2)plasma-catalysis community has become large enough to warrant so-called“big data”studies more familiar in the fields of medicine and the social sciences.To enable comparisons between multiple data sets and make future research more effective,this work proposes the first database on CO_(2)conversion performances by plasma-catalysis open to the whole community.This database has been initiated in the framework of a H_(2)0_(2)0 European project and is called the“PIONEER Data Base”.The database gathers a large amount of CO_(2)conversion performance data such as conversion rate,energy efficiency,and selectivity for numerous plasma sources coupled with or without a catalyst.Each data set is associated with metadata describing the gas mixture,the plasma source,the nature of the catalyst,and the form of coupling with the plasma.Beyond the database itself,a data extraction tool with direct visualisation features or advanced filtering functionalities has been developed and is available online to the public.The simple and fast visualisation of the state of the art puts new results into context,identifies literal gaps in data,and consequently points towards promising research routes.More advanced data extraction illustrates the impact that the database can have in the understanding of plasma-catalyst coupling.Lessons learned from the review of a large amount of literature during the setup of the database lead to best practice advice to increase comparability between future CO_(2)plasma-catalytic studies.Finally,the community is strongly encouraged to contribute to the database not only to increase the visibility of their data but also the relevance of the comparisons allowed by this tool.
文摘Electronic patient data gives many advantages,but also new difficulties.Deadlocks may delay procedures like acquiring patient information.Distributed deadlock resolution solutions introduce uncertainty due to inaccurate transaction properties.Soft computing-based solutions have been developed to solve this challenge.In a single framework,ambiguous,vague,incomplete,and inconsistent transaction attribute information has received minimal attention.The work presented in this paper employed type-2 neutrosophic logic,an extension of type-1 neutrosophic logic,to handle uncertainty in real-time deadlock-resolving systems.The proposed method is structured to reflect multiple types of knowledge and relations among transactions’features that include validation factor degree,slackness degree,degree of deadline-missed transaction based on the degree of membership of truthiness,degree ofmembership of indeterminacy,and degree ofmembership of falsity.Here,the footprint of uncertainty(FOU)for truth,indeterminacy,and falsity represents the level of uncertainty that exists in the value of a grade of membership.We employed a distributed real-time transaction processing simulator(DRTTPS)to conduct the simulations and conducted experiments using the benchmark Pima Indians diabetes dataset(PIDD).As the results showed,there is an increase in detection rate and a large drop in rollback rate when this new strategy is used.The performance of Type-2 neutrosophicbased resolution is better than the Type-1 neutrosophic-based approach on the execution ratio scale.The improvement rate has reached 10%to 20%,depending on the number of arrived transactions.
基金Project supported by the National Natural Science Foundation of China(Grant Nos.61888102,52272172,and 52102193)the Major Program of the National Natural Science Foundation of China(Grant No.92163206)+2 种基金the National Key Research and Development Program of China(Grant Nos.2021YFA1201501 and 2022YFA1204100)the Strategic Priority Research Program of the Chinese Academy of Sciences(Grant No.XDB30000000)the Fundamental Research Funds for the Central Universities.
文摘Discovery of materials using“bottom-up”or“top-down”approach is of great interest in materials science.Layered materials consisting of two-dimensional(2D)building blocks provide a good platform to explore new materials in this respect.In van der Waals(vdW)layered materials,these building blocks are charge neutral and can be isolated from their bulk phase(top-down),but usually grow on substrate.In ionic layered materials,they are charged and usually cannot exist independently but can serve as motifs to construct new materials(bottom-up).In this paper,we introduce our recently constructed databases for 2D material-substrate interface(2DMSI),and 2D charged building blocks.For 2DMSI database,we systematically build a workflow to predict appropriate substrates and their geometries at substrates,and construct the 2DMSI database.For the 2D charged building block database,1208 entries from bulk material database are identified.Information of crystal structure,valence state,source,dimension and so on is provided for each entry with a json format.We also show its application in designing and searching for new functional layered materials.The 2DMSI database,building block database,and designed layered materials are available in Science Data Bank at https://doi.org/10.57760/sciencedb.j00113.00188.
基金National Natural Science Foundation of China (No.82060503)。
文摘Objective:To investigate the variation,expression and clinical significance of E2F3 gene in melanoma.Methods:Firstly,cBioportal database,Oncomine database and GEO database were used to analyze the variation and expression level of E2F3 gene in melanoma.OSskcm database and TISIDB database were used to analyze the relationship between E2F3 and melanoma prognosis and tumor immune infiltrating cells.Then,the LinkedOmics database was used to identify the differential genes related to E2F3 expression in melanoma and analyze their biological functions.Finally,small molecule compounds for the treatment of melanoma were screened through the CMap database.Results:The mutation rate of E2F3 gene in melanoma is about 4%,and there are 21 mutation sites.Compared with normal skin tissues,the expression of E2F3 gene in melanoma was significantly increased(P<0.01).The mutation and increased expression of E2F3 gene were associated with the shortened overall survival(OS)of melanoma patients(P<0.05).The CNA level of E2F3 was negatively correlated with the expression levels of lymphocytes such as pDC,Neutrophil,Act DC and Th17,and negatively correlated with the expression levels of chemokines such as CXCL5,CCL13 and CCR1.The methylation level of E2F3 was positively correlated with the expression levels of Th1,Neutrophil,Act DC and other lymphocytes,and positively correlated with the expression levels of CXCL16,CXCL12,CCR1 and other chemokines.The expression level of E2F3 was negatively correlated with the expression levels of lymphocytes such as Th17,Tcm CD4 and Th1,and negatively correlated with the expression levels of chemokines such as CXCL 16,CCL 22 and CCL 2.The expression of 96 genes such as UHRF1BP1 in melanoma was significantly correlated with the expression of E2F3(|cor|0.5,P<0.05).The above genes were mainly related to RNA transport,eukaryotic ribosome biogenesis,cell cycle and other pathways.Among them,WDR12,WDR43,RBM28,UTP18,DKC1,PAK1IP1,DDX31,TEX10,TRUB1 and TRMT61B were the top 10 hub genes.YC-1,simvastatin,cytochalasin-d,Deforolimus and cytochalasin-b may be five small molecule compounds for the treatment of melanoma.Conclusion:The mutation and increased expression level of E2F3 gene are related to the poor prognosis of melanoma and participate in the occurrence and development of melanoma by affecting the expression of different tumor immune infiltrating cell subtypes,which may be a potential diagnostic marker and therapeutic target for melanoma.
基金Tata Steel Netherlands,Posco,Hyundai Steel,Nucor Steel,RioTinto,Nippon Steel Corp.,JFE Steel,Voestalpine,RHi-Magnesita,Doosan Enerbility,Seah Besteel,Umicore,Vesuvius and Schott AG are gratefully acknowledged.
文摘The CALPHAD thermodynamic databases are very useful to analyze the complex chemical reactions happening in high temperature material process.The FactSage thermodynamic database can be used to calculate complex phase diagrams and equilibrium phases involving refractories in industrial process.In this study,the FactSage thermodynamic database relevant to ZrO_(2)-based refractories was reviewed and the application of the database to understanding the corrosion of continuous casting nozzle refractories in steelmaking was presented.
基金This work is supported by National Natural Science Foundation of China under grants 61572221, 61173170, 61300222, 61370230, 61433006 and U1401258, Innovation Fund of Huazhong University of Science and Technology under grants 2015TS069 and 2015TS071, Science and Technology Support Program of Hubei Province under grant 2014BCH270 and 2015AAA013, and Science and Technology Program of Guangdong Province under grant 2014B010111007.
文摘For integrating relational databases (RDBs) into semantic web applications, the W3C RDB2RDF Working Group recommended two approaches, Direct Mapping (DM) and R2RML. The DM provides a set of mapping rules according to RDB schema, while the R2RML allows users to manually define mappings according to existing target ontology. The major problem to use R2RML is the effort for creating R2RML mapping documents manually. This may lead to appearance of many mistakes in the R2RML documents and requires domain experts. In this paper, we propose and implement an approach to generate an R2RML mapping documents automatically from RDB schema. The R2RML mapping reflects the behavior of the DM specification and allows any R2RML parser to generate a set of RDF triples from relational data.The input of generating approach is DBsInfo class that automatically generated from relational schema. An experimental prototype is developed and shows the effectiveness of our approach algorithms.
基金supported by the National Natural Science Foundation of China (Grant No 41130104)Ministry of Science and Technology of China (Grant No 2010DFA22770)+1 种基金the key projects fromthe 11th Five-Year Plan of national scientific and technological (Grant No 2008BAC34B04-2)the National Basic Research Program of China (also called 973 Program,Grant Nos 2005CB422200x and 2006CB403702)
文摘A large number of experimental and theoretical investigations of carbon dioxide (CO 2 ) spectra have been conducted since the most recent update of the High-Resolution Transmission Molecular Absorption (HITRAN) database. To maintain optimal parameters, the HITRAN 2004 CO 2 line list has been completely replaced by HITRAN 2008 data in the near-infrared region from 4300 cm-1 to 7000 cm-1 . To examine the effect of this change on the retrieval of CO 2 vertical column data from reflected sunlight spectra in the 1.61-μm spectral window, synthetic measurements for a given atmospheric state and instrument setup were generated and compared using radiative transfer model with the line-transition parameters from the HITRAN 2004 and 2008 databases. Simulated retrievals were then performed based on the optimal estimation retrieval theory. The results show that large systematic errors in atmospheric CO 2 column retrievals were induced by the differences in the HITRAN laboratory line parameters in the 1.61-μm region. The retrieved CO 2 columns were underestimated by 10 ppm using the HITRAN 2004 data, and improvements resulting from the use of the improved HITRAN database were more pronounced at a higher spectral resolution.
基金supported by the National Natural Science Foundation of China under grant nos.61772091,61802035,61962006,61962038,U1802271,U2001212,and 62072311the Sichuan Science and Technology Program under grant nos.2021JDJQ0021 and 22ZDYF2680+7 种基金the CCF‐Huawei Database System Innovation Research Plan under grant no.CCF‐HuaweiDBIR2020004ADigital Media Art,Key Laboratory of Sichuan Province,Sichuan Conservatory of Music,Chengdu,China under grant no.21DMAKL02the Chengdu Major Science and Technology Innovation Project under grant no.2021‐YF08‐00156‐GXthe Chengdu Technology Innovation and Research and Development Project under grant no.2021‐YF05‐00491‐SNthe Natural Science Foundation of Guangxi under grant no.2018GXNSFDA138005the Guangdong Basic and Applied Basic Research Foundation under grant no.2020B1515120028the Science and Technology Innovation Seedling Project of Sichuan Province under grant no 2021006the College Student Innovation and Entrepreneurship Training Program of Chengdu University of Information Technology under grant nos.202110621179 and 202110621186.
文摘An excellent cardinality estimation can make the query optimiser produce a good execution plan.Although there are some studies on cardinality estimation,the prediction results of existing cardinality estimators are inaccurate and the query efficiency cannot be guaranteed as well.In particular,they are difficult to accurately obtain the complex relationships between multiple tables in complex database systems.When dealing with complex queries,the existing cardinality estimators cannot achieve good results.In this study,a novel cardinality estimator is proposed.It uses the core techniques with the BiLSTM network structure and adds the attention mechanism.First,the columns involved in the query statements in the training set are sampled and compressed into bitmaps.Then,the Word2vec model is used to embed the word vectors about the query statements.Finally,the BiLSTM network and attention mechanism are employed to deal with word vectors.The proposed model takes into consideration not only the correlation between tables but also the processing of complex predicates.Extensive experiments and the evaluation of BiLSTM-Attention Cardinality Estimator(BACE)on the IMDB datasets are conducted.The results show that the deep learning model can significantly improve the quality of cardinality estimation,which is a vital role in query optimisation for complex databases.
基金supported by the National Natural Science Foundation of China(No.51878578).
文摘Stochastic modeling of ground motion is a simple tool to predict ground shaking level for future earthquake and less time consuming than physics-based deterministic modeling.In this paper,a record-based stochastic method that considers the time-and frequency-evolution of ground motion is used to estimate ground motion for scenario earthquakes in tectonic active region.The stochastic method employs a time-domain modulation function to describe the temporal nonstationarity and a filter impulse response function that describe the evolution of frequency content.For characterizing the modulation function and the filter impulse function,six parameters(Ia,D5-95,tmid,ωmid,ω',ξf)are defined,and 2,571 pairs of ground motion recording in the NGA-west2 database are selected to identify the six parameters.Probabilistic density function is assigned to each of the parameter by fitting the frequency distribution histogram.The parameters are then transformed into standard normal space where regression analysis is performed by considering each parameter as function of moment magnitude,rupture distance,vS30(The time-averaged shear wave velocity of the top 30 m of soil).The prediction equations are used to generate ground motions for several scenario earthquakes and compared to NGA-West2 GMPEs.