Deep-time Earth research plays a pivotal role in deciphering the rates,patterns,and mechanisms of Earth's evolutionary processes throughout geological history,providing essential scientific foundations for climate...Deep-time Earth research plays a pivotal role in deciphering the rates,patterns,and mechanisms of Earth's evolutionary processes throughout geological history,providing essential scientific foundations for climate prediction,natural resource exploration,and sustainable planetary stewardship.To advance Deep-time Earth research in the era of big data and artificial intelligence,the International Union of Geological Sciences initiated the“Deeptime Digital Earth International Big Science Program”(DDE)in 2019.At the core of this ambitious program lies the development of geoscience knowledge graphs,serving as a transformative knowledge infrastructure that enables the integration,sharing,mining,and analysis of heterogeneous geoscience big data.The DDE knowledge graph initiative has made significant strides in three critical dimensions:(1)establishing a unified knowledge structure across geoscience disciplines that ensures consistent representation of geological entities and their interrelationships through standardized ontologies and semantic frameworks;(2)developing a robust and scalable software infrastructure capable of supporting both expert-driven and machine-assisted knowledge engineering for large-scale graph construction and management;(3)implementing a comprehensive three-tiered architecture encompassing basic,discipline-specific,and application-oriented knowledge graphs,spanning approximately 20 geoscience disciplines.Through its open knowledge framework and international collaborative network,this initiative has fostered multinational research collaborations,establishing a robust foundation for next-generation geoscience research while propelling the discipline toward FAIR(Findable,Accessible,Interoperable,Reusable)data practices in deep-time Earth systems research.展开更多
Since the beginning of the 21 st century,the geoscience research has been entering a significant transitional period with the establishment of a new knowledge system as the core and with the drive of big data as the m...Since the beginning of the 21 st century,the geoscience research has been entering a significant transitional period with the establishment of a new knowledge system as the core and with the drive of big data as the means.It is a revolutionary leap in the research of geoscience knowledge discovery from the traditional encyclopedic discipline knowledge system to the computer-understandable and operable knowledge graph.Based on adopting the graph pattern of general knowledge representation,the geoscience knowledge graph expands the unique spatiotemporal features to the Geoscience knowledge,and integrates geoscience knowledge elements,such as map,text,and number,to establish an all-domain geoscience knowledge representation model.A federated,crowd intelligence-based collaborative method of constructing the geoscience knowledge graph is developed here,which realizes the construction of high-quality professional knowledge graph in collaboration with global geo-scientists.We also develop a method for constructing a dynamic knowledge graph of multi-modal geoscience data based on in-depth text analysis,which extracts geoscience knowledge from massive geoscience literature to construct the latest and most complete dynamic geoscience knowledge graph.A comprehensive and systematic geoscience knowledge graph can not only deepen the existing geoscience big data analysis,but also advance the construction of the high-precision geological time scale driven by big data,the compilation of intelligent maps driven by rules and data,and the geoscience knowledge evolution and reasoning analysis,among others.It will further expand the new directions of geoscience research driven by both data and knowledge,break new ground where geoscience,information science,and data science converge,realize the original innovation of the geoscience research and achieve major theoretical breakthroughs in the spatiotemporal big data research.展开更多
Time is an essential reference system for recording objects,events,and processes in the field of geosciences.There are currently various time references,such as solar calendar,geological time,and regional calendar,to ...Time is an essential reference system for recording objects,events,and processes in the field of geosciences.There are currently various time references,such as solar calendar,geological time,and regional calendar,to represent the knowledge in different domains and regions,which subsequently entails a time conversion process required to interpret temporal information under different time references.However,the current time conversion method is limited by the application scope of existing time ontologies(e.g.,“Jurassic”is a period in geological ontology,but a point value in calendar ontology)and the reliance on experience in conversion processes.These issues restrict accurate and efficient calculation of temporal information across different time references.To address these issues,this paper proposes a Unified Time Framework(UTF)in the geosciences knowledge system.According to a systematic time element parsing from massive time references,the proposed UTF designs an independent time root node to get rid of irrelevant nodes when accessing different time types and to adapt to the time expression of different geoscience disciplines.Furthermore,this UTF carries out several designs:to ensure the accuracy of time expressions by designing quantitative relationship definitions;to enable time calculations across different time elements by designing unified time nodes and structures,and to link to the required external ontologies by designing adequate interfaces.By comparing the time conversion methods,the experiment proves the UTF greatly supports accurate and efficient calculation of temporal information across different time references in SPARQL queries.Moreover,it shows a higher and more stable performance of temporal information queries than the time conversion method.With the advent of the Big Data era in the geosciences,the UTF can be used more widely to discover new geosciences knowledge across different time references.展开更多
基金Strategic Priority Research Program of the Chinese Academy of Sciences,No.XDB0740000National Key Research and Development Program of China,No.2022YFB3904200,No.2022YFF0711601+1 种基金Key Project of Innovation LREIS,No.PI009National Natural Science Foundation of China,No.42471503。
文摘Deep-time Earth research plays a pivotal role in deciphering the rates,patterns,and mechanisms of Earth's evolutionary processes throughout geological history,providing essential scientific foundations for climate prediction,natural resource exploration,and sustainable planetary stewardship.To advance Deep-time Earth research in the era of big data and artificial intelligence,the International Union of Geological Sciences initiated the“Deeptime Digital Earth International Big Science Program”(DDE)in 2019.At the core of this ambitious program lies the development of geoscience knowledge graphs,serving as a transformative knowledge infrastructure that enables the integration,sharing,mining,and analysis of heterogeneous geoscience big data.The DDE knowledge graph initiative has made significant strides in three critical dimensions:(1)establishing a unified knowledge structure across geoscience disciplines that ensures consistent representation of geological entities and their interrelationships through standardized ontologies and semantic frameworks;(2)developing a robust and scalable software infrastructure capable of supporting both expert-driven and machine-assisted knowledge engineering for large-scale graph construction and management;(3)implementing a comprehensive three-tiered architecture encompassing basic,discipline-specific,and application-oriented knowledge graphs,spanning approximately 20 geoscience disciplines.Through its open knowledge framework and international collaborative network,this initiative has fostered multinational research collaborations,establishing a robust foundation for next-generation geoscience research while propelling the discipline toward FAIR(Findable,Accessible,Interoperable,Reusable)data practices in deep-time Earth systems research.
基金supported by the National Natural Science Foundation of China(Grant Nos.41421001,42050101,and 42050105)。
文摘Since the beginning of the 21 st century,the geoscience research has been entering a significant transitional period with the establishment of a new knowledge system as the core and with the drive of big data as the means.It is a revolutionary leap in the research of geoscience knowledge discovery from the traditional encyclopedic discipline knowledge system to the computer-understandable and operable knowledge graph.Based on adopting the graph pattern of general knowledge representation,the geoscience knowledge graph expands the unique spatiotemporal features to the Geoscience knowledge,and integrates geoscience knowledge elements,such as map,text,and number,to establish an all-domain geoscience knowledge representation model.A federated,crowd intelligence-based collaborative method of constructing the geoscience knowledge graph is developed here,which realizes the construction of high-quality professional knowledge graph in collaboration with global geo-scientists.We also develop a method for constructing a dynamic knowledge graph of multi-modal geoscience data based on in-depth text analysis,which extracts geoscience knowledge from massive geoscience literature to construct the latest and most complete dynamic geoscience knowledge graph.A comprehensive and systematic geoscience knowledge graph can not only deepen the existing geoscience big data analysis,but also advance the construction of the high-precision geological time scale driven by big data,the compilation of intelligent maps driven by rules and data,and the geoscience knowledge evolution and reasoning analysis,among others.It will further expand the new directions of geoscience research driven by both data and knowledge,break new ground where geoscience,information science,and data science converge,realize the original innovation of the geoscience research and achieve major theoretical breakthroughs in the spatiotemporal big data research.
基金funded by the National Natural Science Foundation of China(Grant Nos.42050101 and 42101467)the Strategic Priority Research Program of the Chinese Academy of Sciences(Grant No.XDA23100101).
文摘Time is an essential reference system for recording objects,events,and processes in the field of geosciences.There are currently various time references,such as solar calendar,geological time,and regional calendar,to represent the knowledge in different domains and regions,which subsequently entails a time conversion process required to interpret temporal information under different time references.However,the current time conversion method is limited by the application scope of existing time ontologies(e.g.,“Jurassic”is a period in geological ontology,but a point value in calendar ontology)and the reliance on experience in conversion processes.These issues restrict accurate and efficient calculation of temporal information across different time references.To address these issues,this paper proposes a Unified Time Framework(UTF)in the geosciences knowledge system.According to a systematic time element parsing from massive time references,the proposed UTF designs an independent time root node to get rid of irrelevant nodes when accessing different time types and to adapt to the time expression of different geoscience disciplines.Furthermore,this UTF carries out several designs:to ensure the accuracy of time expressions by designing quantitative relationship definitions;to enable time calculations across different time elements by designing unified time nodes and structures,and to link to the required external ontologies by designing adequate interfaces.By comparing the time conversion methods,the experiment proves the UTF greatly supports accurate and efficient calculation of temporal information across different time references in SPARQL queries.Moreover,it shows a higher and more stable performance of temporal information queries than the time conversion method.With the advent of the Big Data era in the geosciences,the UTF can be used more widely to discover new geosciences knowledge across different time references.