GeoLink has leveraged linked data principles to create a dataset that allows users to seamlessly query and reason over some of the most prominent geoscience metadata repositories in the United States.The GeoLink datas...GeoLink has leveraged linked data principles to create a dataset that allows users to seamlessly query and reason over some of the most prominent geoscience metadata repositories in the United States.The GeoLink dataset includes such diverse information as port calls made by oceanographic cruises,physical sample meta-data,research project funding and staffing,and authorship of technical reports.The data has been published according to best practices for linked data and is publicly available via a SPARQL Protocol and RDF Query Language(SPARQL)end point that at present contains more than 45 million Resource Description Framework(RDF)triples together with a collection of ontologies and geo-visualization tools.This article describes the geoscience datasets,the modeling and publication process,and current uses of the dataset.The focus is on providing enough detail to enable researchers,application developers and others who wish to lever-age the GeoLink data in their own work to do so.展开更多
Semantic technologies have emerged as a prominent research area within Big Earth Data.These technologies have provided significant benefits for data discovery and integration.Yet,the formality of the Semantic Web,in l...Semantic technologies have emerged as a prominent research area within Big Earth Data.These technologies have provided significant benefits for data discovery and integration.Yet,the formality of the Semantic Web,in languages such as the Web Ontology Language(OWL),does not always integrate well with the numerical,statistical,and geometric methods of the geosciences.Two prominent challenges in this area are how to semantically model individual measurements and what to do when geoscience needs are not addressed by languages such as OWL.This has led to a fragmented Big Earth Data community with either no solution or incompatible semantic solutions.We use an oceanographic example to highlight the limitations and challenges surrounding the semantic encoding of observations and the use of semantics during analysis.We then present potential solutions to each challenge showing that a full end-to-end application of semantic technologies is not only feasible,but beneficial to Big Earth Data.展开更多
基金This work was supported by the National Science Foundation[1440202].
文摘GeoLink has leveraged linked data principles to create a dataset that allows users to seamlessly query and reason over some of the most prominent geoscience metadata repositories in the United States.The GeoLink dataset includes such diverse information as port calls made by oceanographic cruises,physical sample meta-data,research project funding and staffing,and authorship of technical reports.The data has been published according to best practices for linked data and is publicly available via a SPARQL Protocol and RDF Query Language(SPARQL)end point that at present contains more than 45 million Resource Description Framework(RDF)triples together with a collection of ontologies and geo-visualization tools.This article describes the geoscience datasets,the modeling and publication process,and current uses of the dataset.The focus is on providing enough detail to enable researchers,application developers and others who wish to lever-age the GeoLink data in their own work to do so.
基金This work was supported from the National Science Foundation grant Earth Cube Building Blocks:Collaborative Proposal:GeoLink-Leveraging Semantics and Linked Data for Data Sharing and Discovery in the Geosciences.The authors would like to acknowledge their respective collaborative grants numbered 1435578 and 1440139.
文摘Semantic technologies have emerged as a prominent research area within Big Earth Data.These technologies have provided significant benefits for data discovery and integration.Yet,the formality of the Semantic Web,in languages such as the Web Ontology Language(OWL),does not always integrate well with the numerical,statistical,and geometric methods of the geosciences.Two prominent challenges in this area are how to semantically model individual measurements and what to do when geoscience needs are not addressed by languages such as OWL.This has led to a fragmented Big Earth Data community with either no solution or incompatible semantic solutions.We use an oceanographic example to highlight the limitations and challenges surrounding the semantic encoding of observations and the use of semantics during analysis.We then present potential solutions to each challenge showing that a full end-to-end application of semantic technologies is not only feasible,but beneficial to Big Earth Data.