Data organization requires high efficiency for large amount of data applied in the digital mine system. A new method of storing massive data of block model is proposed to meet the characteristics of the database, incl...Data organization requires high efficiency for large amount of data applied in the digital mine system. A new method of storing massive data of block model is proposed to meet the characteristics of the database, including ACID-compliant, concurrency support, data sharing, and efficient access. Each block model is organized by linear octree, stored in LMDB(lightning memory-mapped database). Geological attribute can be queried at any point of 3D space by comparison algorithm of location code and conversion algorithm from address code of geometry space to location code of storage. The performance and robustness of querying geological attribute at 3D spatial region are enhanced greatly by the transformation from 3D to 2D and the method of 2D grid scanning to screen the inner and outer points. Experimental results showed that this method can access the massive data of block model, meeting the database characteristics. The method with LMDB is at least 3 times faster than that with etree, especially when it is used to read. In addition, the larger the amount of data is processed, the more efficient the method would be.展开更多
Using spatial data integration and database technology,analyzing and integrating the assessment results in all the development zones at different time in Hunan Province,the paper is intended to construct the database ...Using spatial data integration and database technology,analyzing and integrating the assessment results in all the development zones at different time in Hunan Province,the paper is intended to construct the database and managerial system for the assessment results of land use intensity in development zones,thus formulating"one map"of Hunan Development zones and realizing the integrated management and application of the assessment results in all the development zones at any time of Hunan above the provincial level.It has been proved that the system has good application effect and promising development in land management for land management departments and development zones.展开更多
Schema incompatibility is a major challenge to a federated database systemfor data sharing among heterogeneous,multiple and autonomous databases.This paperpresents a mapping approach based on import schema,export sche...Schema incompatibility is a major challenge to a federated database systemfor data sharing among heterogeneous,multiple and autonomous databases.This paperpresents a mapping approach based on import schema,export schema and domain conver-sion function,through which schema incompatibility problems such as naming conflict,domain incompatibility and entity definition incompatibility can be resolved effectively.The implementation techniques are also discussed.展开更多
A product data management system for a manufacturing enterprise is to make sure that the proper product data can be communicated to the right people at the right time.This paper describes a system analysis paradigm fo...A product data management system for a manufacturing enterprise is to make sure that the proper product data can be communicated to the right people at the right time.This paper describes a system analysis paradigm for data analysis in a product data management(PDM)development.Three aspects of the paradigm,i.e.,function,structure and behavior are rep- resented.The use of the paradigm explains why so many kinds of objects are necessary in a commercial database matrix and what models are available for developing a PDM application.As another result,a lot of models are derived from the analysis of product data system paradigm to model product data and PDM database definitions.展开更多
A new web product data management architecture is presented. The three-tier web architecture and Simple Object Access Protocol (SOAP) are combined to build the web-based product data management (PDM) system which incl...A new web product data management architecture is presented. The three-tier web architecture and Simple Object Access Protocol (SOAP) are combined to build the web-based product data management (PDM) system which includes three tiers: the user services tier, the business services tier, and the data services tier. The client service component uses the server-side technology, and Extensible Markup Language (XML) web service which uses SOAP as the communication protocol is chosen as the business service component. To illustrate how to build a web-based PDM system using the proposed architecture, a case PDM system which included three logical tires was built. To use the security and central management features of the database, a stored procedure was recommended in the data services tier. The business object was implemented as an XML web service so that client could use standard internet protocols to communicate with the business object from any platform. In order to satisfy users using all sorts of browser, the server-side technology and Microsoft ASP.NET was used to create the dynamic user interface.展开更多
In this paper, we research on the research on the mass structured data storage and sorting algorithm and methodology for SQL database under the big data environment. With the data storage market development and center...In this paper, we research on the research on the mass structured data storage and sorting algorithm and methodology for SQL database under the big data environment. With the data storage market development and centering on the server, the data will store model to data- centric data storage model. Storage is considered from the start, just keep a series of data, for the management system and storage device rarely consider the intrinsic value of the stored data. The prosperity of the Internet has changed the world data storage, and with the emergence of many new applications. Theoretically, the proposed algorithm has the ability of dealing with massive data and numerically, the algorithm could enhance the processing accuracy and speed which will be meaningful.展开更多
Forensic investigations,especially those related to missing persons and unidentified remains,produce different types of data that must be managed and understood.The data collected and produced are extensive and origin...Forensic investigations,especially those related to missing persons and unidentified remains,produce different types of data that must be managed and understood.The data collected and produced are extensive and originate from various sources:the police,non-governmental organizations(NGOs),medical examiner offices,specialised forensic teams,family members,and others.Some examples of information include,but are not limited to,the investigative background information,excavation data of burial sites,antemortem data on missing persons,and postmortem data on the remains of unidentified individuals.These complex data must be stored in a secured place,analysed,compared,shared,and then reported to the investigative actors and the public,especially the families of missing persons,who should be kept informed of the investigation.Therefore,a data management system with the capability of performing the tasks relevant to the goals of the investigation and the identification of an individual,while respecting the deceased and their families,is critical for standardising investigations.Data management is crucial to assure the quality of investigative processes,and it must be recognised as a holistic integrated system.The aim of this article is to discuss some of the most important components of an effective forensic data management system.The discussion is enriched by examples,challenges,and lessons learned from the erratic development and launching of databases for missing and unidentified persons in Brazil.The main objective of this article is to bring attention to the urgent need for an effective and integrated system in Brazil.展开更多
Product family(PF) is the most important part of product platform. A new method is proposed to mine PF based on multi-space product data in PLM database. Product structure tree(PST) and bill of material(BOM) are used ...Product family(PF) is the most important part of product platform. A new method is proposed to mine PF based on multi-space product data in PLM database. Product structure tree(PST) and bill of material(BOM) are used as the data source. A PF can be obtained by mining physics space, logic space and attribute space of product data. In this work, firstly, a PLM database is described, consisting of data organization form, data structure, and data characteristics. Then the PF mining method introduces the sequence alignment techniques used in bio-informatics, which mainly includes data pre-processing, regularization, mining algorithm and cluster analysis. Finally, the feasibility and effectiveness of the proposed method are verified by a case study of high and middle pressure valve, demonstrating a feasible method to obtain PF from PLM database.展开更多
Since the late of previous decade, hypertext technique has been applied in many areas. A hypertext data model with version control which is applied to a digital delivery for engineering documents named Optical Disk ba...Since the late of previous decade, hypertext technique has been applied in many areas. A hypertext data model with version control which is applied to a digital delivery for engineering documents named Optical Disk based Electronic Archives Management System(ODEAMS) is presented first and it has successfully solved some problems in engineering data management. Then, this paper describes some details to implement the hypertext network in ODEAMS after introducing the requirements and characters of engineering data management.展开更多
In this paper, an open architecture and its implementation for product data management are presented. The system architecture, product data definition model, and a set of components of the architecture are discussed i...In this paper, an open architecture and its implementation for product data management are presented. The system architecture, product data definition model, and a set of components of the architecture are discussed in detail. Especially, the principle and some mechanism of one of the components, an object oriented database management system GH-EODB, are discussed in more detail. The architecture is extensible and adaptable.展开更多
For storing and modeling three-dimensional(3D)topographic objects(e.g.buildings,roads,dykes,and the terrain),tetrahedralizations have been proposed as an alternative to boundary representations.While in theory they ha...For storing and modeling three-dimensional(3D)topographic objects(e.g.buildings,roads,dykes,and the terrain),tetrahedralizations have been proposed as an alternative to boundary representations.While in theory they have several advantages,current implementations are either not space efficient or do not store topological relationships(which makes spatial analysis and updating slow,or require the use of an expensive 3D spatial index).We discuss in this paper an alternative data structure for storing tetrahedralizations in a database management system(DBMS).It is based on the idea of storing only the vertices and stars of edges;triangles and tetrahedra are represented implicitly.It has been used previously in main memory,but not in a DBMS.We describe how to modify it to obtain an efficient implementation in a DBMS,and we describe how it can be used for modeling 3D topography.As we demonstrate with different real-world examples,the structure is compacter than known alternatives,it permits us to store attributes for any primitives,and has the added benefit of being topological,which permits us to query it efficiently.The structure can be easily implemented in most DBMS(we describe our implementation in PostgreSQL),and we present some of the engineering choices we made for the implementation.展开更多
Acomputerized platform for multi-channel physiological signals is developed in our lab to highly improve the recording and review for the output of The Polygraph System. The platform mainly consists of a Pentium III P...Acomputerized platform for multi-channel physiological signals is developed in our lab to highly improve the recording and review for the output of The Polygraph System. The platform mainly consists of a Pentium III PC and a high speed A/D converter and is supported by Visual Basic 6.0 and Microsoft Access 2 000. The platform has powerful functions for data acquisition, real-time waveform display and review. It has proved its reliability and flexibility through practical animal experiments. Besides, its modulized program design provides interfaces for further data processing and analysis.展开更多
Widely used in clinical research, the database is a new type of data management automation technology and the most efficient tool for data management. In this article, we first explain some basic concepts, such as the...Widely used in clinical research, the database is a new type of data management automation technology and the most efficient tool for data management. In this article, we first explain some basic concepts, such as the definition, classification, and establishment of databases. Afterward, the workflow for establishing databases, inputting data, verifying data, and managing databases is presented. Meanwhile, by discussing the application of databases in clinical research, we illuminate the important role of databases in clinical research practice. Lastly, we introduce the reanalysis of randomized controlled trials(RCTs) and cloud computing techniques, showing the most recent advancements of databases in clinical research.展开更多
Database system is the infrastructure of the modern information system. The R&D in the database system and its technologies is one of the important research topics in the field. The database R&D in China took off la...Database system is the infrastructure of the modern information system. The R&D in the database system and its technologies is one of the important research topics in the field. The database R&D in China took off later but it moves along by giant steps. This report presents the achievements Renmin University of China (RUC) has made in the past 25 years and at the same time addresses some of the research projects we, RUC, are currently working on. The National Natural Science Foundation of China supports and initiates most of our research projects and these successfully conducted projects have produced fruitful results.展开更多
Accurate characterization of the chemical composition of complex traditional Chinese medicine(TCM)is an essential foundation for the modern scientific interpretation of TCM principles.Mass spectrometry is the most dom...Accurate characterization of the chemical composition of complex traditional Chinese medicine(TCM)is an essential foundation for the modern scientific interpretation of TCM principles.Mass spectrometry is the most dominant technique in current research on the material basis of TCM,offering the highest sensitivity and the richest information provision.Establishing mass spectrometry databases represents the most effective approach to facilitating the structural analysis of TCM chemical components.This paper systematically searches and reviews literature published from January 2005 to January 2025 through online databases such as China National Knowledge Infrastructure,PubMed,and Web of Science,using“mass spectrometry database”and“traditional Chinese medicine”as keywords.It reviews the current status of seven TCM chemical component mass spectrometry databases and seven natural product mass spectrometry databases.The key advancements of these mass spectrometry databases for natural products are summarized,detailing their characteristics,search methodologies,included information,and data sources.Additionally,challenges related to data quality,standardization,timely updates,database interaction,retrieval functionality,and data sharing and security are discussed in depth.Furthermore,the paper explores prospective development directions for TCM mass spectrometry databases,emphasizing the importance of open data sharing,technological innovation,and data security.Through this analysis,the paper aims to offer theoretical guidance and practical recommendations for the precise identification of TCM components,as well as for the construction and application of these databases.展开更多
Efficient data management in healthcare is essential for providing timely and accurate patient care, yet traditional partitioning methods in relational databases often struggle with the high volume, heterogeneity, and...Efficient data management in healthcare is essential for providing timely and accurate patient care, yet traditional partitioning methods in relational databases often struggle with the high volume, heterogeneity, and regulatory complexity of healthcare data. This research introduces a tailored partitioning strategy leveraging the MD5 hashing algorithm to enhance data insertion, query performance, and load balancing in healthcare systems. By applying a consistent hash function to patient IDs, our approach achieves uniform distribution of records across partitions, optimizing retrieval paths and reducing access latency while ensuring data integrity and compliance. We evaluated the method through experiments focusing on partitioning efficiency, scalability, and fault tolerance. The partitioning efficiency analysis compared our MD5-based approach with standard round-robin methods, measuring insertion times, query latency, and data distribution balance. Scalability tests assessed system performance across increasing dataset sizes and varying partition counts, while fault tolerance experiments examined data integrity and retrieval performance under simulated partition failures. The experimental results demonstrate that the MD5-based partitioning strategy significantly reduces query retrieval times by optimizing data access patterns, achieving up to X% better performance compared to round-robin methods. It also scales effectively with larger datasets, maintaining low latency and ensuring robust resilience under failure scenarios. This novel approach offers a scalable, efficient, and fault-tolerant solution for healthcare systems, facilitating faster clinical decision-making and improved patient care in complex data environments.展开更多
Accurate and reliable nuclear decay databases are essential for fundamental and applied nuclear research studies.However,decay data are not usually as accurate as expected and need improvement.Hence,a new Chinese nucl...Accurate and reliable nuclear decay databases are essential for fundamental and applied nuclear research studies.However,decay data are not usually as accurate as expected and need improvement.Hence,a new Chinese nuclear decay database in the fission product mass region(A=66−172)based on several major national evaluated data libraries has been developed under joint efforts in the CNDC working group.A total of 2358 nuclides have been included in this decay database.Two main data formats,namely ENSDF and ENDF,have been adopted.For the total meanβandγenergies,available data from total absorption gamma ray spectroscopy measurements have been adopted.For some nuclides without experimental measurements,theoretically calculated values have been added.展开更多
Combined with the current status of Antarctic data management and the characteristics of polar science data resulted from Chinese Antarctic and Arctic Research Expeditions, the Chinese Polar Science Database System(CP...Combined with the current status of Antarctic data management and the characteristics of polar science data resulted from Chinese Antarctic and Arctic Research Expeditions, the Chinese Polar Science Database System(CPSDS) has been designed and established in 2002. The infrastructure, technical standard, mechanism of sharing data of this system are reviewed in this article. Meanwhile, the development of Chinese polar data management is summarized. As the metadata is the powerful and useful tool for managing and disseminating scientific data, the metadata is also used as “search engine” of CPSDS. Besides, the trend of data management and sharing is also discussed.展开更多
基金Projects(41572317,51374242)supported by the National Natural Science Foundation of ChinaProject(2015CX005)supported by the Innovation Driven Plan of Central South University,China
文摘Data organization requires high efficiency for large amount of data applied in the digital mine system. A new method of storing massive data of block model is proposed to meet the characteristics of the database, including ACID-compliant, concurrency support, data sharing, and efficient access. Each block model is organized by linear octree, stored in LMDB(lightning memory-mapped database). Geological attribute can be queried at any point of 3D space by comparison algorithm of location code and conversion algorithm from address code of geometry space to location code of storage. The performance and robustness of querying geological attribute at 3D spatial region are enhanced greatly by the transformation from 3D to 2D and the method of 2D grid scanning to screen the inner and outer points. Experimental results showed that this method can access the massive data of block model, meeting the database characteristics. The method with LMDB is at least 3 times faster than that with etree, especially when it is used to read. In addition, the larger the amount of data is processed, the more efficient the method would be.
文摘Using spatial data integration and database technology,analyzing and integrating the assessment results in all the development zones at different time in Hunan Province,the paper is intended to construct the database and managerial system for the assessment results of land use intensity in development zones,thus formulating"one map"of Hunan Development zones and realizing the integrated management and application of the assessment results in all the development zones at any time of Hunan above the provincial level.It has been proved that the system has good application effect and promising development in land management for land management departments and development zones.
文摘Schema incompatibility is a major challenge to a federated database systemfor data sharing among heterogeneous,multiple and autonomous databases.This paperpresents a mapping approach based on import schema,export schema and domain conver-sion function,through which schema incompatibility problems such as naming conflict,domain incompatibility and entity definition incompatibility can be resolved effectively.The implementation techniques are also discussed.
文摘A product data management system for a manufacturing enterprise is to make sure that the proper product data can be communicated to the right people at the right time.This paper describes a system analysis paradigm for data analysis in a product data management(PDM)development.Three aspects of the paradigm,i.e.,function,structure and behavior are rep- resented.The use of the paradigm explains why so many kinds of objects are necessary in a commercial database matrix and what models are available for developing a PDM application.As another result,a lot of models are derived from the analysis of product data system paradigm to model product data and PDM database definitions.
基金the National Key Project Foundation of China (No. 2001BA201A0605) and partially supported by the State Key Lab for Mechanical Transmission..
文摘A new web product data management architecture is presented. The three-tier web architecture and Simple Object Access Protocol (SOAP) are combined to build the web-based product data management (PDM) system which includes three tiers: the user services tier, the business services tier, and the data services tier. The client service component uses the server-side technology, and Extensible Markup Language (XML) web service which uses SOAP as the communication protocol is chosen as the business service component. To illustrate how to build a web-based PDM system using the proposed architecture, a case PDM system which included three logical tires was built. To use the security and central management features of the database, a stored procedure was recommended in the data services tier. The business object was implemented as an XML web service so that client could use standard internet protocols to communicate with the business object from any platform. In order to satisfy users using all sorts of browser, the server-side technology and Microsoft ASP.NET was used to create the dynamic user interface.
文摘In this paper, we research on the research on the mass structured data storage and sorting algorithm and methodology for SQL database under the big data environment. With the data storage market development and centering on the server, the data will store model to data- centric data storage model. Storage is considered from the start, just keep a series of data, for the management system and storage device rarely consider the intrinsic value of the stored data. The prosperity of the Internet has changed the world data storage, and with the emergence of many new applications. Theoretically, the proposed algorithm has the ability of dealing with massive data and numerically, the algorithm could enhance the processing accuracy and speed which will be meaningful.
基金This work was partially supported by the CAPES-Science without Borders Scholarship[grant number 99999.013091/2013-01].
文摘Forensic investigations,especially those related to missing persons and unidentified remains,produce different types of data that must be managed and understood.The data collected and produced are extensive and originate from various sources:the police,non-governmental organizations(NGOs),medical examiner offices,specialised forensic teams,family members,and others.Some examples of information include,but are not limited to,the investigative background information,excavation data of burial sites,antemortem data on missing persons,and postmortem data on the remains of unidentified individuals.These complex data must be stored in a secured place,analysed,compared,shared,and then reported to the investigative actors and the public,especially the families of missing persons,who should be kept informed of the investigation.Therefore,a data management system with the capability of performing the tasks relevant to the goals of the investigation and the identification of an individual,while respecting the deceased and their families,is critical for standardising investigations.Data management is crucial to assure the quality of investigative processes,and it must be recognised as a holistic integrated system.The aim of this article is to discuss some of the most important components of an effective forensic data management system.The discussion is enriched by examples,challenges,and lessons learned from the erratic development and launching of databases for missing and unidentified persons in Brazil.The main objective of this article is to bring attention to the urgent need for an effective and integrated system in Brazil.
基金Project(51275362)supported by the National Natural Science Foundation of ChinaProject(2014ZX04015021)supported by National Science and Technology Major Project,China
文摘Product family(PF) is the most important part of product platform. A new method is proposed to mine PF based on multi-space product data in PLM database. Product structure tree(PST) and bill of material(BOM) are used as the data source. A PF can be obtained by mining physics space, logic space and attribute space of product data. In this work, firstly, a PLM database is described, consisting of data organization form, data structure, and data characteristics. Then the PF mining method introduces the sequence alignment techniques used in bio-informatics, which mainly includes data pre-processing, regularization, mining algorithm and cluster analysis. Finally, the feasibility and effectiveness of the proposed method are verified by a case study of high and middle pressure valve, demonstrating a feasible method to obtain PF from PLM database.
文摘Since the late of previous decade, hypertext technique has been applied in many areas. A hypertext data model with version control which is applied to a digital delivery for engineering documents named Optical Disk based Electronic Archives Management System(ODEAMS) is presented first and it has successfully solved some problems in engineering data management. Then, this paper describes some details to implement the hypertext network in ODEAMS after introducing the requirements and characters of engineering data management.
基金the High Technoloey Research and Development Programme of China
文摘In this paper, an open architecture and its implementation for product data management are presented. The system architecture, product data definition model, and a set of components of the architecture are discussed in detail. Especially, the principle and some mechanism of one of the components, an object oriented database management system GH-EODB, are discussed in more detail. The architecture is extensible and adaptable.
基金This research is supported by the Dutch Technology Foundation STW,which is part of the Netherlands Organization for Scientific Research(NWO),and which is partly funded by the Ministry of Economic Affairs(project codes:11300 and 11185).
文摘For storing and modeling three-dimensional(3D)topographic objects(e.g.buildings,roads,dykes,and the terrain),tetrahedralizations have been proposed as an alternative to boundary representations.While in theory they have several advantages,current implementations are either not space efficient or do not store topological relationships(which makes spatial analysis and updating slow,or require the use of an expensive 3D spatial index).We discuss in this paper an alternative data structure for storing tetrahedralizations in a database management system(DBMS).It is based on the idea of storing only the vertices and stars of edges;triangles and tetrahedra are represented implicitly.It has been used previously in main memory,but not in a DBMS.We describe how to modify it to obtain an efficient implementation in a DBMS,and we describe how it can be used for modeling 3D topography.As we demonstrate with different real-world examples,the structure is compacter than known alternatives,it permits us to store attributes for any primitives,and has the added benefit of being topological,which permits us to query it efficiently.The structure can be easily implemented in most DBMS(we describe our implementation in PostgreSQL),and we present some of the engineering choices we made for the implementation.
基金Sponsored by China Ministry of Science and Technology:Joint Chinese - Israel Research Grant (99M- 0 0 4 14 15 )
文摘Acomputerized platform for multi-channel physiological signals is developed in our lab to highly improve the recording and review for the output of The Polygraph System. The platform mainly consists of a Pentium III PC and a high speed A/D converter and is supported by Visual Basic 6.0 and Microsoft Access 2 000. The platform has powerful functions for data acquisition, real-time waveform display and review. It has proved its reliability and flexibility through practical animal experiments. Besides, its modulized program design provides interfaces for further data processing and analysis.
基金supported by Fundamental Research Funds of State Key Laboratory of Ophthalmology (Grant No.2015QN01)Young Teacher Top-Support project of Sun Yat-sen University(Grant No.2015ykzd11)+4 种基金the Cultivation Projects for Young Teaching Staff of Sun Yat-sen University(Grant No.12ykpy61) from the Fundamental Research Funds for the Central Universitiesthe Pearl River Science and Technology New Star(Grant No.2014J2200060)Project of Guangzhou City,the Guangdong Provincial Natural Science Foundation for Distinguished Young Scholars of China(Grant No. 2014A030306030)Youth Science and Technology Innovation Talents Funds in Special Support Plan for High Level Talents in Guangdong Province(Grant No. 2014TQ01R573)Key Research Plan for National Natural Science Foundation of China in Cultivation Project (No.91546101)
文摘Widely used in clinical research, the database is a new type of data management automation technology and the most efficient tool for data management. In this article, we first explain some basic concepts, such as the definition, classification, and establishment of databases. Afterward, the workflow for establishing databases, inputting data, verifying data, and managing databases is presented. Meanwhile, by discussing the application of databases in clinical research, we illuminate the important role of databases in clinical research practice. Lastly, we introduce the reanalysis of randomized controlled trials(RCTs) and cloud computing techniques, showing the most recent advancements of databases in clinical research.
基金Supported by the National Natural Science Foundation of China. Acknowledgements The National Science Foundation of China supported these works. Thanks to NSFC and all the members of the research groups in Renmin University of China.
文摘Database system is the infrastructure of the modern information system. The R&D in the database system and its technologies is one of the important research topics in the field. The database R&D in China took off later but it moves along by giant steps. This report presents the achievements Renmin University of China (RUC) has made in the past 25 years and at the same time addresses some of the research projects we, RUC, are currently working on. The National Natural Science Foundation of China supports and initiates most of our research projects and these successfully conducted projects have produced fruitful results.
基金the Beijing Natural Science Foundation(No.7252249)the National Natural Science Foundation of China(No.82104380)+1 种基金the Scientific and Technological Innovation Project of the China Academy of Chinese Medical Sciences(Nos.CI2023E002,CI2023C071YLL, CI2023C039YGL)the Fundamental Research Funds for the Central Public Welfare Research Institutes(No.ZZ14-YQ-047, ZZ15-WT-04)。
文摘Accurate characterization of the chemical composition of complex traditional Chinese medicine(TCM)is an essential foundation for the modern scientific interpretation of TCM principles.Mass spectrometry is the most dominant technique in current research on the material basis of TCM,offering the highest sensitivity and the richest information provision.Establishing mass spectrometry databases represents the most effective approach to facilitating the structural analysis of TCM chemical components.This paper systematically searches and reviews literature published from January 2005 to January 2025 through online databases such as China National Knowledge Infrastructure,PubMed,and Web of Science,using“mass spectrometry database”and“traditional Chinese medicine”as keywords.It reviews the current status of seven TCM chemical component mass spectrometry databases and seven natural product mass spectrometry databases.The key advancements of these mass spectrometry databases for natural products are summarized,detailing their characteristics,search methodologies,included information,and data sources.Additionally,challenges related to data quality,standardization,timely updates,database interaction,retrieval functionality,and data sharing and security are discussed in depth.Furthermore,the paper explores prospective development directions for TCM mass spectrometry databases,emphasizing the importance of open data sharing,technological innovation,and data security.Through this analysis,the paper aims to offer theoretical guidance and practical recommendations for the precise identification of TCM components,as well as for the construction and application of these databases.
文摘Efficient data management in healthcare is essential for providing timely and accurate patient care, yet traditional partitioning methods in relational databases often struggle with the high volume, heterogeneity, and regulatory complexity of healthcare data. This research introduces a tailored partitioning strategy leveraging the MD5 hashing algorithm to enhance data insertion, query performance, and load balancing in healthcare systems. By applying a consistent hash function to patient IDs, our approach achieves uniform distribution of records across partitions, optimizing retrieval paths and reducing access latency while ensuring data integrity and compliance. We evaluated the method through experiments focusing on partitioning efficiency, scalability, and fault tolerance. The partitioning efficiency analysis compared our MD5-based approach with standard round-robin methods, measuring insertion times, query latency, and data distribution balance. Scalability tests assessed system performance across increasing dataset sizes and varying partition counts, while fault tolerance experiments examined data integrity and retrieval performance under simulated partition failures. The experimental results demonstrate that the MD5-based partitioning strategy significantly reduces query retrieval times by optimizing data access patterns, achieving up to X% better performance compared to round-robin methods. It also scales effectively with larger datasets, maintaining low latency and ensuring robust resilience under failure scenarios. This novel approach offers a scalable, efficient, and fault-tolerant solution for healthcare systems, facilitating faster clinical decision-making and improved patient care in complex data environments.
基金Supported by the National Key R&D Program of China(2022YFA1602000)。
文摘Accurate and reliable nuclear decay databases are essential for fundamental and applied nuclear research studies.However,decay data are not usually as accurate as expected and need improvement.Hence,a new Chinese nuclear decay database in the fission product mass region(A=66−172)based on several major national evaluated data libraries has been developed under joint efforts in the CNDC working group.A total of 2358 nuclides have been included in this decay database.Two main data formats,namely ENSDF and ENDF,have been adopted.For the total meanβandγenergies,available data from total absorption gamma ray spectroscopy measurements have been adopted.For some nuclides without experimental measurements,theoretically calculated values have been added.
文摘Combined with the current status of Antarctic data management and the characteristics of polar science data resulted from Chinese Antarctic and Arctic Research Expeditions, the Chinese Polar Science Database System(CPSDS) has been designed and established in 2002. The infrastructure, technical standard, mechanism of sharing data of this system are reviewed in this article. Meanwhile, the development of Chinese polar data management is summarized. As the metadata is the powerful and useful tool for managing and disseminating scientific data, the metadata is also used as “search engine” of CPSDS. Besides, the trend of data management and sharing is also discussed.