Background: A task assigned to space exploration satellites involves detecting the physical environment within a certain space. However, space detection data are complex and abstract, and they are not conducive to researchers' visual perception of the evolution and interaction of events in the space environment. Methods: A time-series dynamic data sampling method for large-scale space was proposed to sample detection data in space and time, and the corresponding relationships between data location features and other attribute features were established. A tone-mapping method based on statistical histogram equalization was proposed and applied to the final attribute feature data. The visualization process was optimized for rendering by merging materials, reducing the number of patches, and performing other operations. Results: The results of sampling, feature extraction, and uniform visualization of detection data with complex types, long duration spans, and uneven spatial distributions were obtained. The real-time visualization of large-scale spatial structures using augmented reality devices, particularly low-performance devices, was also investigated. Conclusions: The proposed visualization system can reconstruct the three-dimensional structure of a large-scale space, express the structure and changes in the spatial environment using augmented reality, and assist in intuitively discovering spatial environmental events and evolutionary rules.
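The tone-mapping step can be illustrated with a minimal sketch. It assumes that "statistical histogram equalization" means mapping each attribute value through the empirical cumulative distribution of the samples; the function name `equalize` and the 256-level output range are hypothetical choices for illustration, not details taken from the paper.

```python
from bisect import bisect_right


def equalize(values, levels=256):
    """Map raw attribute values to display levels 0..levels-1 (hypothetical helper).

    Each value is replaced by its empirical CDF rank, so heavily skewed
    data are spread evenly over the available display levels.
    """
    ordered = sorted(values)
    n = len(ordered)
    mapped = []
    for v in values:
        # Fraction of samples less than or equal to v, in (0, 1].
        cdf = bisect_right(ordered, v) / n
        mapped.append(min(levels - 1, int(cdf * levels)))
    return mapped
```

With four samples spanning three orders of magnitude, such as `[1, 10, 100, 1000]`, the mapped levels are evenly spaced even though the raw values are not, which is the property that makes unevenly distributed detection data readable in a single colour ramp.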
To improve the performance of traditional map matching algorithms in freeway traffic state monitoring systems that use low-logging-frequency GPS (global positioning system) probe data, a map matching algorithm based on the Oracle spatial data model is proposed. The algorithm uses the Oracle road network data model to analyze the spatial relationships between massive GPS positioning points and freeway networks, builds an N-shortest path algorithm to find reasonable candidate routes between GPS positioning points efficiently, and uses a fuzzy logic inference system to determine the final matched traveling route. According to the implementation with field data from Los Angeles, the computation speed of the algorithm is about 135 GPS positioning points per second and the accuracy is 98.9%. The results demonstrate the effectiveness and accuracy of the proposed algorithm for mapping massive GPS positioning data onto freeway networks with complex geometric characteristics.
Spatial seismic vulnerability assessments are primarily conducted at the community and grid level, using heuristic and empirical approaches. Building-based spatial statistical vulnerability models are rare because of data limitations. Generating open-access spatial inventories that document seismic damage and building attributes, and testing their effectiveness in assessing damage, would promote the advancement of spatial vulnerability assessment. The 2022 Mw 6.7 Luding earthquake in the western Sichuan Province of China provides an opportunity to validate this approach. The local government urgently dispatched experts to survey building damage, marking all buildings with damage class stickers. In this work, we sampled 2889 buildings as GPS points and documented the damage classes and building attributes, including structure type, number of floors, and age. A polygon-based digital inventory was generated by digitizing the rooftops of the sampled buildings and importing the attributes. Statistical regressions were created by plotting damage against shaking intensity and PGA, and Random Forest modeling was carried out considering not only building and seismic parameters but also environmental factors. The results indicate that the statistical regressions have notable uncertainties, and the Random Forest model shows an accuracy of ≥79%. Topographical factors showed notable importance in the Random Forest modeling. This work provides an open-access seismic building damage inventory and demonstrates its potential for damage prediction and vulnerability assessment.
The current global cybersecurity landscape, characterized by the increasing scale and sophistication of cyberattacks, underscores the importance of integrating Cyber Threat Intelligence (CTI) into Land Administration Systems (LAS). LAS services involve requests and responses concerning public and private cadastral data, including credentials of parties, ownership, and spatial parcels. This study explores the integration of CTI in LAS to enhance cyber resilience, focusing on the unique vulnerabilities of LAS, such as sensitive data management and interconnection with other critical systems related to spatial data uses and changes. The approach employs a case study of a typical country-specific LAS to analyse structured vulnerabilities and their attributes to determine the degree of vulnerability of LAS through a quantitative inductive approach. The analysis results indicate significant improvements in identifying and mitigating potential threats through CTI integration, thus enhancing cyber resilience. These findings are crucial for policymakers and practitioners to develop robust cybersecurity strategies for LAS.
This paper proposes a new method for the compression of vector data maps. The proposed method encompasses three key steps: the simplification of the vector data map via the elimination of vertices, the compression of the removed vertices based on a clustering model, and the decoding of the compressed vector data map. The proposed compression method was implemented and applied to vector data maps to investigate its performance in terms of the compression ratio and the distortion of geometric shapes. The results show that the proposed method provides a feasible and efficient solution for the compression of vector data maps, achieving a promising compression ratio while maintaining the main shape characteristics of the spatial objects within the compressed map.
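The first of the three steps, simplification by vertex elimination, can be sketched as follows. The abstract does not say which elimination rule the authors use, so this sketch substitutes the well-known Douglas-Peucker algorithm as a stand-in; the function names and the `tolerance` parameter are assumptions made for illustration only.

```python
def point_segment_distance(p, a, b):
    """Euclidean distance from point p to the segment a-b."""
    (px, py), (ax, ay), (bx, by) = p, a, b
    dx, dy = bx - ax, by - ay
    if dx == 0 and dy == 0:
        return ((px - ax) ** 2 + (py - ay) ** 2) ** 0.5
    # Parameter of the projection of p onto the segment, clamped to [0, 1].
    t = max(0.0, min(1.0, ((px - ax) * dx + (py - ay) * dy) / (dx * dx + dy * dy)))
    cx, cy = ax + t * dx, ay + t * dy
    return ((px - cx) ** 2 + (py - cy) ** 2) ** 0.5


def simplify(points, tolerance):
    """Douglas-Peucker simplification (stand-in for the paper's elimination rule)."""
    if len(points) < 3:
        return list(points)
    # Find the interior vertex farthest from the chord joining the endpoints.
    index, dmax = 0, 0.0
    for i in range(1, len(points) - 1):
        d = point_segment_distance(points[i], points[0], points[-1])
        if d > dmax:
            index, dmax = i, d
    if dmax <= tolerance:
        # Every interior vertex is within tolerance: eliminate them all.
        return [points[0], points[-1]]
    left = simplify(points[: index + 1], tolerance)
    right = simplify(points[index:], tolerance)
    return left[:-1] + right
```

The vertices dropped by `simplify` are the ones a scheme like the one described would then encode separately (in the paper, with a clustering model) so that the decoder can restore them.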
A novel Hilbert curve is introduced for parallel spatial data partitioning, taking into account the large volume of spatial information and the variable length of vector data items. Based on the improved Hilbert curve, an algorithm can be designed to achieve almost-uniform spatial data partitioning among multiple disks in parallel spatial databases. Thus, data imbalance can be largely avoided, and search and query efficiency can be enhanced.
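As a rough illustration of the idea, the sketch below uses the standard Hilbert curve (not the paper's improved variant, whose details are not given in the abstract) to order grid cells, and then cuts the ordered sequence into contiguous runs of roughly equal byte size so that variable-length items are balanced across disks. The item layout `(x, y, size_bytes)` and both function names are assumptions.

```python
def hilbert_index(order, x, y):
    """Position of cell (x, y) along a Hilbert curve on a 2**order x 2**order grid."""
    n = 1 << order
    d = 0
    s = n >> 1
    while s > 0:
        rx = 1 if (x & s) else 0
        ry = 1 if (y & s) else 0
        d += s * s * ((3 * rx) ^ ry)
        # Rotate or flip the quadrant so the sub-curve has canonical orientation.
        if ry == 0:
            if rx == 1:
                x = n - 1 - x
                y = n - 1 - y
            x, y = y, x
        s >>= 1
    return d


def partition_by_size(items, order, disks):
    """Split items (x, y, size_bytes) into `disks` groups of similar total size."""
    ordered = sorted(items, key=lambda it: hilbert_index(order, it[0], it[1]))
    total = sum(it[2] for it in ordered) or 1  # avoid division by zero
    parts = [[] for _ in range(disks)]
    running = 0
    for it in ordered:
        # Assign the item to the disk whose byte range contains its start offset.
        k = min(disks - 1, running * disks // total)
        parts[k].append(it)
        running += it[2]
    return parts
```

Because neighbouring cells on the curve are also neighbours in space, each disk receives a spatially compact region, while cutting by cumulative size rather than by item count keeps the load nearly uniform when item lengths vary.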
The paper presents a concise overview of the current status of the national spatial data infrastructures (SDI) of the European Union (EU) member states, together with the specific peculiarities of Bulgaria. It highlights some major challenges in establishing the EU SDIs, a process regulated by the European Directive INSPIRE (Infrastructure for Spatial Information in Europe), which aims to establish an SDI for environmental policies and activities. Available comparative analyses of the main indicators for metadata, data sets, and data services provided by EU member states are briefly discussed, with special attention given to Bulgarian progress. Recent achievements in accelerating the implementation of the recommendations of the INSPIRE Directive in Bulgaria are outlined.
Because web-based GIS processes large volumes of spatial geographic information over the internet, the efficiency of spatial data query processing and transmission should be improved. This paper presents two efficient methods for this purpose: division transmission and progressive transmission. In the division transmission method, a map is divided into several parts, called "tiles", and only the tiles requested by a client are transmitted. In the progressive transmission method, a map is split into several phase views based on the significance of its vertices, and when a client requests a spatial object, the server produces the target object and transmits it progressively. To implement these methods, the paper proposes the "tile division" and "priority order estimation" algorithms, together with strategies for data transmission. Compared with traditional methods such as "map total transmission" and "layer transmission", the proposed web-based GIS data transmission increases data transmission efficiency by a great margin.
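The division transmission idea can be sketched in a few lines. The abstract does not describe the authors' "tile division" algorithm, so the following assumes the simplest case, a regular square grid over the map extent, and computes which tiles a client's view rectangle needs; the function name and the bounds format `(min_x, min_y, max_x, max_y)` are hypothetical.

```python
import math


def tiles_for_view(map_bounds, tile_size, view):
    """Return (col, row) indices of the grid tiles that intersect the view."""
    min_x, min_y, max_x, max_y = map_bounds
    # Clip the requested view to the map extent.
    vx0, vy0 = max(view[0], min_x), max(view[1], min_y)
    vx1, vy1 = min(view[2], max_x), min(view[3], max_y)
    if vx0 >= vx1 or vy0 >= vy1:
        return []  # the view lies entirely outside the map
    c0 = int((vx0 - min_x) // tile_size)
    c1 = math.ceil((vx1 - min_x) / tile_size) - 1
    r0 = int((vy0 - min_y) // tile_size)
    r1 = math.ceil((vy1 - min_y) / tile_size) - 1
    return [(c, r) for r in range(r0, r1 + 1) for c in range(c0, c1 + 1)]
```

For a 100 x 100 map cut into 25-unit tiles, a view covering `(10, 10, 30, 60)` touches 6 of the 16 tiles, so only those 6 need to be sent instead of the whole map.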
In order to provide a provincial spatial database, this paper presents a scheme for spatial database construction to meet the needs of China. The objective and overall technical route of spatial database construction are described. The logical and physical database models are designed. Key issues are addressed, such as the integration of multi-scale heterogeneous spatial databases, spatial data version management based on metadata, and the integrated management of map cartography and the spatial database.
Advanced data mining technologies and the large quantities of remotely sensed imagery provide a data mining opportunity with high potential for useful results. Extracting interesting patterns and rules from data sets composed of images and associated ground data can be important in object identification, community planning, resource discovery, and other areas. In this paper, a data field is presented to express the observed spatial objects and to conduct behavior mining on them. First, the most important aspects of behavior mining and its implications for the future of data mining are discussed, and an ideal framework for a behavior mining system in a network environment is proposed. Second, a model of behavior mining on the observed spatial objects is given, in which the objects are described by the first feature data field and the main feature data field by means of a potential function. Finally, a case study on object identification in public is presented and analyzed. The experimental results show that the new model is feasible for behavior mining.
OpenStreetMap (OSM) data are widely used, but their reliability is still variable. Many contributors to OSM have not been trained in geography or surveying, and consequently their contributions, including geometry and attribute data inserts, deletions, and updates, can be inaccurate, incomplete, inconsistent, or vague. There are some mechanisms and applications dedicated to discovering bugs and errors in OSM data. Such systems can remove errors through user checks and by applying predefined rules, but they need an extra control process to check the real-world validity of suspected errors and bugs. This paper focuses on finding bugs and errors based on patterns and rules extracted from the tracking data of users. The underlying idea is that certain characteristics of user trajectories are directly linked to the type of feature. Using such rules, sets of potential bugs and errors can be identified and stored for further investigation.
The validity measurement of fuzzy clustering is a key problem. Once a clustering is formed, a mechanism is needed to verify its validity. To make mining more accountable and comprehensible, and to obtain a usable spatial pattern, it is necessary to detect whether the data set has a clustered structure before clustering. This paper discusses a detection method for clustered patterns and a fuzzy clustering algorithm, and studies the validity function of the fuzzy clustering result from two aspects: the uncertainty of classification during fuzzy partition and the spatial location features of spatial data. On this basis, it proposes a new validity function of fuzzy clustering for spatial data. The experimental results indicate that the new validity function can accurately measure the validity of fuzzy clustering results. In particular, for fuzzy clustering of spatial data, it is robust and yields better classification results than other indices.
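To make the notion of "uncertainty of classification during fuzzy partition" concrete, the sketch below computes two classical validity indices from a fuzzy membership matrix: Bezdek's partition coefficient and partition entropy. These are standard baselines of the kind the abstract compares against, not the new validity function proposed in the paper, which also uses spatial location features and is not specified here.

```python
import math


def partition_coefficient(u):
    """Bezdek's partition coefficient: mean of squared memberships.

    `u` is a list of rows, one per data point, each row holding that
    point's membership in every cluster (rows sum to 1).
    Ranges from 1/c (maximally fuzzy) to 1 (crisp).
    """
    return sum(m * m for row in u for m in row) / len(u)


def partition_entropy(u):
    """Partition entropy: 0 for a crisp partition, ln(c) when maximally fuzzy."""
    return -sum(m * math.log(m) for row in u for m in row if m > 0) / len(u)
```

A crisp partition such as `[[1, 0], [0, 1]]` gives a coefficient of 1 and an entropy of 0, whereas `[[0.5, 0.5], [0.5, 0.5]]` gives 0.5 and ln 2. Neither index looks at where the points lie in space, which is the gap a spatially aware validity function is meant to close.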
In this paper we propose a service-oriented architecture for spatial data integration (SOA-SDI) for settings in which a large number of spatial data sources are physically located in different places, and we develop web-based GIS systems based on SOA-SDI that allow client applications to pull in, analyze, and present spatial data from those sources. The proposed architecture logically includes four layers or components: a layer of multiple data provider services, a data integration layer, a layer of backend services, and a front-end graphical user interface (GUI) for spatial data presentation. On the basis of the four-layer SOA-SDI framework, WebGIS applications can be quickly deployed, which shows that SOA-SDI has the potential to reduce software development effort and shorten the development period.
This paper presents a methodology to determine three data quality (DQ) risk characteristics: accuracy, comprehensiveness, and nonmembership. The methodology provides a set of quantitative models to assess the information quality risks for the database of a geographical information system (GIS). Four quantitative measures are introduced to examine how the quality risks of source information affect the quality of information outputs produced using the relational algebra operations Selection, Projection, and Cubic Product. The methodology can be used to determine how quality risks associated with diverse data sources affect the derived data. In the construction business, the GIS is the prime source of information on the location of cables, and detection time strongly depends on whether maps indicate the presence of cables. Poor data quality in the GIS can contribute to increased risk or higher risk avoidance costs. A case study provides a numerical example of the calculation of the trade-offs between risk and detection costs and of the costs of data quality. We conclude that the model contributes valuable new insight.
The mathematical theories underlying uncertainty models of line segments are summarized to arrive at a general conception. The line error band model εσ is a basic uncertainty model that depicts line accuracy and quality efficiently, while the εm model and error entropy can be regarded as supplements to it. The error band model reflects and describes the influence of line uncertainty on polygon uncertainty. Therefore, the statistical characteristics of the line error are studied in depth by analyzing the probability that the line error falls into a certain range. Moreover, theoretical consistency is achieved in selecting the error buffer for line features and the error indicator. The relationship between the accuracy of the area of a polygon and the error loop of the polygon boundary is derived and computed.
Following a review of recent developments in the digitalization and application of seabed data, this paper systematically proposes methods for integrating seabed data by analyzing its features, based on the ORACLE database management system and advanced techniques of spatial data management. We studied the storage structure of seabed data, a distributed-integrated database system, a standardized spatial database, and a seabed metadata management system in order to manage and use seabed information effectively in practical applications. Finally, we applied the proposed methods to build the Bohai Sea engineering geology database, which stores engineering geology data and other seabed information from the Bohai Sea area. As a result, the Bohai Sea engineering geology database can effectively integrate large amounts of distributed and complicated seabed data to meet the practical requirements of Bohai Sea engineering geology environment exploration and exploitation.
Gene selection is an indispensable step for analyzing noisy and high-dimensional single-cell RNA-seq (scRNA-seq) data. In contrast to the commonly used variance-based methods, a new feature selection method called HRG (Highly Regional Genes) is proposed that mimics human marker selection in the 2D visualization of cells to find informative genes, which show regional expression patterns in the cell-cell similarity network. We mathematically find the optimal expression patterns that maximize the proposed scoring function. In comparison with several unsupervised methods, HRG shows high accuracy and robustness, and it can increase the performance of downstream cell clustering and gene correlation analysis. It is also applicable for selecting informative genes from sequencing-based spatial transcriptomic data.
The authors designed a spatial data mining system for ore-forming prediction based on the theory and methods of data mining and on spatial database techniques, in combination with the characteristics of geological information data. The system consists of data management, data mining and knowledge discovery, and knowledge representation. It can effectively integrate multi-source geoscience data, such as geology, geochemistry, geophysics, and remote sensing (RS) data. The system digitizes geological information data as data layer files consisting of two numerical values and stores these files in the system database. Metallogenic prognosis was realized according to the combination of the characteristics of the geological information, with an area in Heilongjiang Province as an example, and the prospect area of a hydrothermal copper deposit was determined.
It was clearly stated at the 19th People's Congress that environmental protection should be made a national policy. Therefore, it is of great importance to study this issue. This article takes 30 provinces of China as cross-sections and uses their data from 2006 to 2015 to formulate a spatial panel data Durbin model to analyze the effect of FDI. Using these data, the article constructs a comprehensive environmental pollution index with the help of entropy. The results indicate that the effect of FDI on the environment is non-linear and has a spatial spillover characteristic. Before reaching the critical value, FDI has a negative effect on the environment; however, with the accumulation of FDI, it creates a significant positive effect on the environment.
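One common way to build a composite index "with the help of entropy" is the entropy weight method, sketched below. The abstract does not state which entropy-based procedure the article uses, so this is an assumption; the function names are hypothetical, and the input is assumed to be a table of non-negative, already normalized indicator values with one row per province-year and one column per pollutant.

```python
import math


def entropy_weights(rows):
    """Entropy weight method: indicators that vary more across rows get more weight."""
    n, m = len(rows), len(rows[0])
    divergence = []
    for j in range(m):
        col = [r[j] for r in rows]
        total = sum(col)
        shares = [v / total for v in col]
        # Normalized Shannon entropy of the column, in [0, 1].
        e = -sum(p * math.log(p) for p in shares if p > 0) / math.log(n)
        divergence.append(max(0.0, 1.0 - e))
    s = sum(divergence)
    if s == 0:
        return [1.0 / m] * m  # all indicators uniform: fall back to equal weights
    return [d / s for d in divergence]


def composite_index(rows):
    """Weighted sum of the indicators for each row, using entropy weights."""
    w = entropy_weights(rows)
    return [sum(wj * v for wj, v in zip(w, r)) for r in rows]
```

An indicator that is identical in every row has maximal entropy and receives (near) zero weight, because it carries no information for distinguishing one province-year from another.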
A gap between SDA (Spatial Data Analysis) and GIS (Geographical Information Systems) has existed for a long time. The problem remains in spite of many theoretical and practical studies that have tried to solve it. The research background and the current state of SDA and GIS integration are introduced first. The main aim of this article is to determine the best scheme for bridging the gap between SDA and GIS and how to design it. Many factors influence the standards for assessing such a scheme, for instance, the attitudes of users and GIS developers and the framework and related functions of the GIS software currently available on the market. However, the two most important factors are the efficiency and flexibility of the scheme itself. Efficiency can be measured by how convenient the scheme is and how long it takes when used to carry out SDA. Flexibility means that users can define their own SDA methods. The best integration scheme should satisfy both standards at the same time. To design such an integration scheme, a group of functions that can be combined to implement any SDA method is defined. The functions are divided into five classes according to their properties.
Funding (building damage inventory for the 2022 Luding earthquake): Supported by Mission No. 9 "Geological Environment and Hazards" (2019QZKK0900) of "The Second Tibetan Plateau Scientific Expedition and Research" project and the National Natural Science Foundation of China (No. 42101087).
Funding (vector data map compression): Supported by the National 863 Program of China (No. 2007AAI2Z241), the Program for New Century Excellent Talents in University (No. NCET-07-0643), the National Natural Science Foundation of China (No. 40571134, No. 40871185), and the National 973 Program of China (No. 108085).
Funding (Hilbert curve spatial data partitioning): Funded by the National 863 Program of China (No. 2005AA113150) and the National Natural Science Foundation of China (No. 40701158).
Funding: Supported by the 863 High Technology Program of China (No. 2007AA12Z214), the National Natural Science Foundation of China (No. 40601083), and the National Key Basic Research and Development Program of China (No. 2004CB318206).
Abstract: To provide a provincial spatial database, this paper presents a scheme for spatial database construction that meets the needs of China. The objective and overall technical route of spatial database construction are described, and the logical and physical database models are designed. Key issues are addressed, such as the integration of multi-scale heterogeneous spatial databases, metadata-based spatial data version management, and the integrated management of map cartography and the spatial database.
Funding: Supported by the National 973 Program of China (No. 2006CB701305, No. 2007CB310804), the National Natural Science Foundation of China (No. 60743001), the Best National Thesis Foundation (No. 2005047), and the National New Century Excellent Talent Foundation (No. NCET-06-0618).
Abstract: Advanced data mining technologies and large quantities of remotely sensed imagery offer a data mining opportunity with high potential for useful results. Extracting interesting patterns and rules from data sets composed of images and associated ground data can be important in object identification, community planning, resource discovery, and other areas. In this paper, a data field is presented to express the observed spatial objects and to conduct behavior mining on them. First, the most important aspects of behavior mining and its implications for the future of data mining are discussed, and an ideal framework for a behavior mining system in a network environment is proposed. Second, a model of behavior mining on the observed spatial objects is given, in which the objects are described by the first feature data field and the main feature data field by means of the potential function. Finally, a case study on object identification in public is given and analyzed. The experimental results show that the new model is feasible for behavior mining.
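The abstract does not state the form of its potential function. A minimal sketch, assuming the Gaussian-like potential commonly used in data field models (each object contributes its mass scaled by exp(-(distance/sigma)^2)), might look like this; the name `potential`, the 2D coordinates, and the parameter `sigma` are illustrative assumptions.

```python
import math


def potential(point, objects, sigma=1.0):
    """Data-field potential at `point` (assumed Gaussian-like form).

    point: (x, y) location where the field is evaluated.
    objects: list of ((x, y), mass) pairs for the observed objects.
    sigma: influence factor controlling how fast contributions decay.
    """
    total = 0.0
    for (ox, oy), mass in objects:
        distance = math.hypot(point[0] - ox, point[1] - oy)
        total += mass * math.exp(-((distance / sigma) ** 2))
    return total
```

Under this assumed form, locations close to many heavy objects have high potential, so peaks of the field indicate where objects concentrate.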
Funding: This research was supported financially by the EU FP7 Marie Curie Initial Training Network MULTI-POS (Multi-technology Positioning Professionals) [grant number 316528].
Abstract: OpenStreetMap (OSM) data are widely used, but their reliability is still variable. Many contributors to OSM have not been trained in geography or surveying, and consequently their contributions, including geometry and attribute data inserts, deletions, and updates, can be inaccurate, incomplete, inconsistent, or vague. There are some mechanisms and applications dedicated to discovering bugs and errors in OSM data. Such systems can remove errors through user checks and by applying predefined rules, but they need an extra control process to check the real-world validity of suspected errors and bugs. This paper focuses on finding bugs and errors based on patterns and rules extracted from the tracking data of users. The underlying idea is that certain characteristics of user trajectories are directly linked to the type of feature. Using such rules, sets of potential bugs and errors can be identified and stored for further investigation.
Abstract: Measuring the validity of fuzzy clustering is a key problem: once clusters have been formed, a mechanism is needed to verify their validity. To make mining more accountable and comprehensible, and to obtain a usable spatial pattern, it is necessary to detect whether the data set has a clustered structure before clustering. This paper discusses a detection method for clustered patterns and a fuzzy clustering algorithm, studies the validity function of the result produced by fuzzy clustering from two aspects, namely the uncertainty of classification during fuzzy partition and the spatial location features of spatial data, and proposes a new validity function of fuzzy clustering for spatial data. The experimental results indicate that the new validity function can accurately measure the validity of fuzzy clustering results. In particular, for fuzzy clustering of spatial data, it is robust and its classification result is better than that of other indices.
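The paper's new validity function is not given in the abstract. For orientation only, the sketch below computes a classical index of the kind it is compared against, Bezdek's partition coefficient, which captures just the classification-uncertainty aspect and ignores spatial location. The function name and the row-per-sample membership layout are assumptions of the example.

```python
def partition_coefficient(memberships):
    """Bezdek's partition coefficient of a fuzzy partition.

    memberships[i][k] is the degree to which sample i belongs to
    cluster k; each row is assumed to sum to 1.
    The value ranges from 1/c (maximally fuzzy, c clusters) to 1 (crisp).
    """
    n_samples = len(memberships)
    return sum(u * u for row in memberships for u in row) / n_samples
```

A value near 1 means most samples belong clearly to one cluster, while a value near 1/c signals that the partition is close to uniform and the clustering may not be meaningful.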
Funding: Supported by the Research Fund of Key GIS Lab of the Education Ministry (No. 200610).
Abstract: In this paper we propose a service-oriented architecture for spatial data integration (SOA-SDI) for settings in which a large number of available spatial data sources are physically located in different places, and we develop web-based GIS systems based on SOA-SDI that allow client applications to pull in, analyze, and present spatial data from those sources. The proposed architecture logically includes four layers or components: a layer of multiple data provider services, a data integration layer, a layer of backend services, and a front-end graphical user interface (GUI) for spatial data presentation. On the basis of the four-layer SOA-SDI framework, WebGIS applications can be deployed quickly, which shows that SOA-SDI has the potential to reduce software development effort and shorten the development period.
Funding: The National Natural Science Foundation of China (No. 70772021, 70372004) and the China Postdoctoral Science Foundation (No. 20060400077).
Abstract: This paper presents a methodology to determine three data quality (DQ) risk characteristics: accuracy, comprehensiveness, and nonmembership. The methodology provides a set of quantitative models to confirm the information quality risks for the database of a geographical information system (GIS). Four quantitative measures are introduced to examine how the quality risks of source information affect the quality of information outputs produced using the relational algebra operations Selection, Projection, and Cubic Product. The methodology can be used to determine how quality risks associated with diverse data sources affect the derived data. In the construction business, the GIS is the prime source of information on the location of cables, and detection time strongly depends on whether maps indicate the presence of cables. Poor data quality in the GIS can contribute to increased risk or higher risk avoidance costs. A case study provides a numerical example of the calculation of the trade-offs between risk and detection costs, and an example of the calculation of the costs of data quality. We conclude that the model contributes valuable new insight.
Funding: Project supported by the National Natural Science Foundation of China (No. 40301043).
Abstract: The mathematical theory of uncertainty models for line segments is summarized to arrive at a general conception. The line error band model εσ is a basic uncertainty model that can depict line accuracy and quality efficiently, while the εm model and error entropy can be regarded as supplements to it. The error band model reflects and describes the influence of line uncertainty on polygon uncertainty. The statistical characteristics of the line error are therefore studied in depth by analyzing the probability that the line error falls into a certain range. Moreover, theoretical consistency is achieved in selecting the error buffer for line features and the error indicator. The relationship between the accuracy of the area of a polygon and the error loop of the polygon boundary is deduced and computed.
Abstract: After reviewing recent developments in the digitalization and application of seabed data, this paper systematically proposes methods for integrating seabed data by analyzing its features, based on the ORACLE database management system and advanced techniques of spatial data management. We studied the storage structure of seabed data, a distributed-integrated database system, a standardized spatial database, and a seabed metadata management system in order to manage and use seabed information effectively in practical applications. Finally, we applied the proposed methods to build the Bohai Sea engineering geology database, which stores engineering geology data and other seabed information from the Bohai Sea area. As a result, the Bohai Sea engineering geology database can effectively integrate large amounts of distributed and complicated seabed data to meet the practical requirements of Bohai Sea engineering geology environment exploration and exploitation.
Funding: Supported by the National Key Research and Development Program (2020YFA0712403, 2020YFA0906900), the National Natural Science Foundation of China (61922047, 81890993, 61721003, 62133006), and the BNRIST Young Innovation Fund (BNR2020RC01009).
Abstract: Gene selection is an indispensable step in analyzing noisy and high-dimensional single-cell RNA-seq (scRNA-seq) data. In contrast to the commonly used variance-based methods, a new feature selection method called HRG (Highly Regional Genes) is proposed that mimics human marker selection in the 2D visualization of cells to find informative genes, which show regional expression patterns in the cell-cell similarity network. We mathematically find the optimal expression patterns that maximize the proposed scoring function. In comparison with several unsupervised methods, HRG shows high accuracy and robustness, and can increase the performance of downstream cell clustering and gene correlation analysis. It is also applicable to selecting informative genes in sequencing-based spatial transcriptomic data.
Abstract: The authors designed a spatial data mining system for ore-forming prediction based on the theory and methods of data mining and on spatial database techniques, in combination with the characteristics of geological information data. The system consists of data management, data mining and knowledge discovery, and knowledge representation. It can effectively integrate multi-source geoscience data, such as geology, geochemistry, geophysics, and remote sensing (RS) data. The system digitizes geological information data as data layer files consisting of two numerical values and stores these files in the system database. By combining the characteristics of the geological information, metallogenic prognosis was carried out for an example area in Heilongjiang Province, and the prospect area of a hydrothermal copper deposit was determined.
Funding: Supported by the Hubei Province Educational Division Social Science Research Project (Grant No. 15G051).
Abstract: It was clearly stated at the 19th People's Congress that environmental protection should be treated as a national policy, so it is of great importance to study this issue. This article takes 30 provinces of China as cross-sections and uses their data from 2006 to 2015 to formulate a spatial panel data Durbin model for analyzing the effect of FDI. Using these data, the article creates a comprehensive environmental pollution index with the help of entropy. The results indicate that the effect of FDI on the environment is non-linear and has spatial spillover characteristics. Before a critical value is reached, FDI has a negative effect on the environment; however, as FDI accumulates, it creates a significant positive effect on the environment.
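The abstract says only that the pollution index is built "with the help of entropy". One common way to do this is the entropy weight method, sketched below as an assumption rather than as the article's actual procedure. The function names, the rows-as-regions layout, and the requirement that indicators are already scaled to comparable non-negative values with positive column sums are all hypothetical.

```python
import math


def entropy_weights(matrix):
    """Entropy-method weights for the columns (indicators) of `matrix`.

    matrix: list of rows, one per region (at least two rows), holding
    non-negative indicator values whose column sums are positive.
    Indicators that vary more across regions receive larger weights.
    """
    n_rows = len(matrix)
    n_cols = len(matrix[0])
    divergences = []
    for j in range(n_cols):
        column = [row[j] for row in matrix]
        total = sum(column)
        shares = [value / total for value in column]
        entropy = -sum(p * math.log(p) for p in shares if p > 0) / math.log(n_rows)
        divergences.append(1.0 - entropy)
    spread = sum(divergences)
    if spread == 0:
        # Every indicator is uniform across regions: fall back to equal weights.
        return [1.0 / n_cols] * n_cols
    return [d / spread for d in divergences]


def composite_index(matrix):
    """Weighted sum of indicators per region, using the entropy weights."""
    weights = entropy_weights(matrix)
    return [sum(w * v for w, v in zip(weights, row)) for row in matrix]
```

In this scheme an indicator that is identical in every province carries no information and gets zero weight, so the composite index is driven by the pollutants that actually differ between provinces.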
Abstract: A gap between SDA (Spatial Data Analysis) and GIS (Geographical Information Systems) has existed for a long time, and the problem remains despite many theoretical and practical studies that have tried to solve it. The research background and the current state of work on integrating SDA and GIS are introduced first. The main aim of this article is to determine the best scheme for bridging the gap between SDA and GIS and how to design it. Many factors influence the standards for assessing such a scheme, for instance the attitudes of users and GIS developers, and the framework and related functions of the GIS software currently available on the market. The two most important factors, however, are the efficiency and flexibility of the scheme itself. Efficiency can be measured by how convenient the scheme is and how much time it takes when used to carry out SDA. Flexibility means that users can define their own SDA methods. The best integration scheme should satisfy both standards at the same time. To design such an integration scheme, a group of functions is defined that can be combined to implement any SDA method. The functions are divided into five classes according to their properties.