By using CiteSpace software to create a knowledge map of authors,institutions and keywords,the literature on the spatio-temporal behavior of Chinese residents based on big data in the architectural planning discipline...By using CiteSpace software to create a knowledge map of authors,institutions and keywords,the literature on the spatio-temporal behavior of Chinese residents based on big data in the architectural planning discipline published in the China Academic Network Publishing Database(CNKI)was analyzed and discussed.It is found that there was a lack of communication and cooperation among research institutions and scholars;the research hotspots involved four main areas,including“application in tourism research”,“application in traffic travel research”,“application in work-housing relationship research”,and“application in personal family life research”.展开更多
Urban resilience assesses a city’s ability to withstand unknown risks.Scholars are not comprehensive in assessing urban resilience,and they lack consideration of population resilience.This study investigated 110 pref...Urban resilience assesses a city’s ability to withstand unknown risks.Scholars are not comprehensive in assessing urban resilience,and they lack consideration of population resilience.This study investigated 110 prefecturelevel cities in the Yangtze River Economic Belt(YREB)as study areas.We calculated the YREB’s level of urban resilience based on the aspects of“economy-society-population-ecology-infrastructure”,which ensured that the comprehensive evaluation of urban resilience is complete and sufficient.The spatio-temporal evolution of urban resilience was analyzed using exploratory spatial data.Geodetectors were used to investigate the impact of several indicators,focusing on economic,social,population,ecological,and infrastructure factors,on urban resilience.The results showed that the urban resilience of the YREB has maintained a slow upward trend from 2005 to 2018,and the average urban resilience of the YREB has risen from 0.2442 to 0.2560.The resilience gap between cities in the study region increased initially and then decreased.The dominant factor in the spatial differentiation of urban resilience was the economic factors,followed by the population factors.Urban resilience has been clarified and an evaluation index system is constructed,which can provide an effective reference for the evaluation of urban resilience among countries around the world.Based on this,factors that optimize urban resilience are configured,and the regional and national sustainable development can be promoted.展开更多
A factor analysis was applied to soil geochemical data to define anomalies related to buried Pb-Zn mineralization.A favorable main factor with a strong association of the elements Zn,Cu and Pb,related to mineralizatio...A factor analysis was applied to soil geochemical data to define anomalies related to buried Pb-Zn mineralization.A favorable main factor with a strong association of the elements Zn,Cu and Pb,related to mineralization,was selected for interpretation.The median+2 MAD(median absolute deviation)method of exploratory data analysis(EDA)and C-A(concentration-area)fractal modeling were then applied to the Mahalanobis distance,as defined by Zn,Cu and Pb from the factor analysis to set the thresholds for defining multi-element anomalies.As a result,the median+2 MAD method more successfully identified the Pb-Zn mineralization than the C-A fractal model.The soil anomaly identified by the median+2 MAD method on the Mahalanobis distances defined by three principal elements(Zn,Cu and Pb)rather than thirteen elements(Co,Zn,Cu,V,Mo,Ni,Cr,Mn,Pb,Ba,Sr,Zr and Ti)was the more favorable reflection of the ore body.The identified soil geochemical anomalies were compared with the in situ economic Pb-Zn ore bodies for validation.The results showed that the median+2 MAD approach is capable of mapping both strong and weak geochemical anomalies related to buried Pb-Zn mineralization,which is therefore useful at the reconnaissance drilling stage.展开更多
Clustering is used to gain an intuition of the struc tures in the data.Most of the current clustering algorithms pro duce a clustering structure even on data that do not possess such structure.In these cases,the algor...Clustering is used to gain an intuition of the struc tures in the data.Most of the current clustering algorithms pro duce a clustering structure even on data that do not possess such structure.In these cases,the algorithms force a structure in the data instead of discovering one.To avoid false structures in the relations of data,a novel clusterability assessment method called density-based clusterability measure is proposed in this paper.I measures the prominence of clustering structure in the data to evaluate whether a cluster analysis could produce a meaningfu insight to the relationships in the data.This is especially useful in time-series data since visualizing the structure in time-series data is hard.The performance of the clusterability measure is evalu ated against several synthetic data sets and time-series data sets which illustrate that the density-based clusterability measure can successfully indicate clustering structure of time-series data.展开更多
Precise information on indoor positioning provides a foundation for position-related customer services.Despite the emergence of several indoor positioning technologies such as ultrawideband,infrared,radio frequency id...Precise information on indoor positioning provides a foundation for position-related customer services.Despite the emergence of several indoor positioning technologies such as ultrawideband,infrared,radio frequency identification,Bluetooth beacons,pedestrian dead reckoning,and magnetic field,Wi-Fi is one of the most widely used technologies.Predominantly,Wi-Fi fingerprinting is the most popular method and has been researched over the past two decades.Wi-Fi positioning faces three core problems:device heterogeneity,robustness to signal changes caused by human mobility,and device attitude,i.e.,varying orientations.The existing methods do not cover these aspects owing to the unavailability of publicly available datasets.This study introduces a dataset that includes the Wi-Fi received signal strength(RSS)gathered using four different devices,namely Samsung Galaxy S8,S9,A8,LG G6,and LG G7,operated by three surveyors,including a female and two males.In addition,three orientations of the smartphones are used for the data collection and include multiple buildings with a multifloor environment.Various levels of human mobility have been considered in dynamic environments.To analyze the time-related impact on Wi-Fi RSS,data over 3 years have been considered.展开更多
It is urgent and important to explore the dynamic evolution in comprehensive transportation green efficiency(CTGE)in the context of green development.We constructed a social development index that reflects the social ...It is urgent and important to explore the dynamic evolution in comprehensive transportation green efficiency(CTGE)in the context of green development.We constructed a social development index that reflects the social benefits of transportation services,and incorporated it into the comprehensive transportation efficiency evaluation framework as an expected output.Based on the panel data of 30 regions in China from 2003-2018,the CTGE in China was measured using the slacks-based measure-data envelopment analysis(SBM-DEA)model.Further,the dynamic evolution trends of CTGE were determined using the spatial Markov model and exploratory spatio-temporal data analysis(ESTDA)technique from a spatio-temporal perspective.The results showed that the CTGE shows a U-shaped change trend but with an overall low level and significant regional differences.The state transition of CTGE has a strong spatial dependence,and there exists the phenomenon of“club convergence”.Neighbourhood background has a significant impact on the CTGE transition types,and the spatial spillover effect is pronounced.The CTGE has an obvious positive correlation and spatial agglomeration characteristics.The geometric characteristics of the LISA time path show that the evolution process of local spatial structure and local spatial dependence of China’s CTGE is stable,but the integration of spatial evolution is weak.The spatio-temporal transition results of LISA indicate that the CTGE has obvious transfer inertness and has certain path-dependence and spatial locking characteristics,which will become the major difficulty in improving the CTGE.展开更多
Quality of life(QOL) is a hotspot issue that has attracted increasing attention from the Chinese Government and scholars, it is also a vital issue that should be addressed during the cause of ′establishing overall we...Quality of life(QOL) is a hotspot issue that has attracted increasing attention from the Chinese Government and scholars, it is also a vital issue that should be addressed during the cause of ′establishing overall well-off society′. Northeast China is one of the most import old industrial bases in China, however, the industrial structure of heavy chemical industry and the development mode of ′production first, living last′ have leaded to series of social problems, which have also become a serious bottleneck to social stability and economic sustainable development. Through applying the methods of BP neural network, exploratory spatial data analysis(ESDA) and spatial regression model, this paper examines the space-time dynamics of QOL of the residents in Northeast China. We first investigate the indexes of QOL of the residents and then use ESDA methods to visualize its space-time relationship. We have found a spatial agglomeration of QOL of the residents in middle-southern Liaoning Province, central Jilin Province and Harbin-Qiqihar-Daqing area of Heilongjiang Province. Two third of the counties are low-low spatial correlation, and the correlative type of about 60% of the prefecture level areas keeps stable, indicating QOL of the residents in Northeast China shows a certain character of path dependence or spatial locked. We have also found that economic strength and development levels of service industry have positive and obvious effect on QOL of the residents, while the effect of such indexes as the social service level and the proportion of the tertiary industries are less.展开更多
Detailed information on the spatio-temporal changes of cropland soil organic carbon(SOC) can significantly contribute to the improvement of soil fertility and mitigate climate change. Nonetheless, information and know...Detailed information on the spatio-temporal changes of cropland soil organic carbon(SOC) can significantly contribute to the improvement of soil fertility and mitigate climate change. Nonetheless, information and knowledge on the national scale spatio-temporal changes and the corresponding uncertainties of SOC in Chinese upland soils remain limited. The CENTURY model was used to estimate the SOC storages and their changes in Chinese uplands from 1980 to 2010. With the Monte Carlo method, the uncertainties of CENTURY-modelled SOC dynamics associated with the spatial heterogeneous model inputs were quantified. Results revealed that the SOC storage in Chinese uplands increased from 3.03(1.59 to 4.78) Pg C in 1980 to 3.40(2.39 to 4.62) Pg C in 2010. Increment of SOC storage during this period was 370 Tg C, with an uncertainty interval of –440 to 1110 Tg C. The regional disparities of SOC changes reached a significant level, with considerable SOC accumulation in the Huang-Huai-Hai Plain of China and SOC loss in the northeastern China. The SOC lost from Meadow soils, Black soils and Chernozems was most severe, whilst SOC accumulation in Fluvo-aquic soils, Cinnamon soils and Purplish soils was most significant. In modelling large-scale SOC dynamics, the initial soil properties were major sources of uncertainty. Hence, more detailed information concerning the soil properties must be collected. The SOC stock of Chinese uplands in 2010 was still relatively low, manifesting that recommended agricultural management practices in conjunction with effectively economic and policy incentives to farmers for soil fertility improvement were indispensable for future carbon sequestration in these regions.展开更多
Exposure to particulate matter with an aerodynamic diameter of less than 2.5 μm (PM2.5) may increase risk of lung cancer. The repetitive and broad-area coverage of satellites may allow atmospheric remote sensing to o...Exposure to particulate matter with an aerodynamic diameter of less than 2.5 μm (PM2.5) may increase risk of lung cancer. The repetitive and broad-area coverage of satellites may allow atmospheric remote sensing to offer a unique opportunity to monitor air quality and help fill air pollution data gaps that hinder efforts to study air pollution and protect public health. This geographical study explores if there is an association between PM2.5 and lung cancer mortality rate in the conterminous USA. Lung cancer (ICD-10 codes C34- C34) death count and population at risk by county were extracted for the period from 2001 to 2010 from the U.S. CDC WONDER online database. The 2001-2010 Global Annual Average PM2.5 Grids from MODIS and MISR Aerosol Optical Depth dataset was used to calculate a 10 year average PM2.5 pollution. Exploratory spatial data analyses, spatial regression (a spatial lag and a spatial error model), and spatially extended Bayesian Monte Carlo Markov Chain simulation found that there is a significant positive association between lung cancer mortality rate and PM2.5. The association would justify the need of further toxicological investigation of the biological mechanism of the adverse effect of the PM2.5 pollution on lung cancer. The Global Annual Average PM2.5 Grids from MODIS and MISR Aerosol Optical Depth dataset provides a continuous surface of concentrations of PM2.5 and is a useful data source for environmental health research.展开更多
In 2007,China surpassed the USA to become the largest carbon emitter in the world.China has promised a 60%–65%reduction in carbon emissions per unit GDP by 2030,compared to the baseline of 2005.Therefore,it is import...In 2007,China surpassed the USA to become the largest carbon emitter in the world.China has promised a 60%–65%reduction in carbon emissions per unit GDP by 2030,compared to the baseline of 2005.Therefore,it is important to obtain accurate dynamic information on the spatial and temporal patterns of carbon emissions and carbon footprints to support formulating effective national carbon emission reduction policies.This study attempts to build a carbon emission panel data model that simulates carbon emissions in China from 2000–2013 using nighttime lighting data and carbon emission statistics data.By applying the Exploratory Spatial-Temporal Data Analysis(ESTDA)framework,this study conducted an analysis on the spatial patterns and dynamic spatial-temporal interactions of carbon footprints from 2001–2013.The improved Tapio decoupling model was adopted to investigate the levels of coupling or decoupling between the carbon emission load and economic growth in 336 prefecture-level units.The results show that,firstly,high accuracy was achieved by the model in simulating carbon emissions.Secondly,the total carbon footprints and carbon deficits across China increased with average annual growth rates of 4.82%and 5.72%,respectively.The overall carbon footprints and carbon deficits were larger in the North than that in the South.There were extremely significant spatial autocorrelation features in the carbon footprints of prefecture-level units.Thirdly,the relative lengths of the Local Indicators of Spatial Association(LISA)time paths were longer in the North than that in the South,and they increased from the coastal to the central and western regions.Lastly,the overall decoupling index was mainly a weak decoupling type,but the number of cities with this weak decoupling continued to decrease.The unsustainable development trend of China’s economic growth and carbon emission load will continue for some time.展开更多
Closely related to the economy,the analysis and management of electricity consumption has been widely studied.Conventional approaches mainly focus on the prediction and anomaly detection of electricity consumption,whi...Closely related to the economy,the analysis and management of electricity consumption has been widely studied.Conventional approaches mainly focus on the prediction and anomaly detection of electricity consumption,which fails to reveal the in-depth relationships between electricity consumption and various factors such as industry,weather etc..In the meantime,the lack of analysis tools has increased the difficulty in analytical tasks such as correlation analysis and comparative analysis.In this paper,we introduce EcoVis,a visual analysis system that supports the industrial-level spatio-temporal correlation analysis in the electricity consumption data.We not only propose a novel approach to model spatio-temporal data into a graph structure for easier correlation analysis,but also introduce a novel visual representation to display the distributions of multiple instances in a single map.We implement the system with the cooperation with domain experts.Experiments are conducted to demonstrate the effectiveness of our method.展开更多
A significant Geographic Information Science(GIS)issue is closely related to spatial autocorrelation,a burning question in the phase of information extraction from the statistical analysis of georeferenced data.At pre...A significant Geographic Information Science(GIS)issue is closely related to spatial autocorrelation,a burning question in the phase of information extraction from the statistical analysis of georeferenced data.At present,spatial autocorrelation presents two types of measures:continuous and discrete.Is it possible to use Moran’s I and the Moran scatterplot with continuous data?Is it possible to use the same methodology with discrete data?A particular and cumbersome problem is the choice of the spatial-neighborhood matrix(W)for points data.This paper addresses these issues by introducing the concept of covariogram contiguity,where each weight is based on the variogram model for that particular dataset:(1)the variogram,whose range equals the distance with the highest Moran I value,defines the weights for points separated by less than the estimated range and(2)weights equal zero for points widely separated from the variogram range considered.After the W matrix is computed,the Moran location scatterplot is created in an iterative process.In accordance with various lag distances,Moran’s I is presented as a good search factor for the optimal neighborhood area.Uncertainty/transition regions are also emphasized.At the same time,a new Exploratory Spatial Data Analysis(ESDA)tool is developed,the Moran variance scatterplot,since the conventional Moran scatterplot is not sensitive to neighbor variance.This computer-mapping framework allows the study of spatial patterns,outliers,changeover areas,and trends in an ESDA process.All these tools were implemented in a free web e-Learning program for quantitative geographers called SAKWeb#(or,in the near future,myGeooffice.org).展开更多
Identifying the subcellular localization of proteins is particularly helpful in the functional annotation of gene products. In this study, we use Machine Learning and Exploratory Data Analysis (EDA) techniques to ex...Identifying the subcellular localization of proteins is particularly helpful in the functional annotation of gene products. In this study, we use Machine Learning and Exploratory Data Analysis (EDA) techniques to examine and characterize amino acid sequences of human proteins localized in nine cellular compartments. A dataset of 3,749 protein sequences representing human proteins was extracted from the SWISS-PROT database. Feature vectors were created to capture specific amino acid sequence characteristics. Relative to a Support Vector Machine, a Multi-layer Perceptron, and a Naive Bayes classifier, the C4.5 Decision Tree algorithm was the most consistent performer across all nine compartments in reliably predicting the subcellular localization of proteins based on their amino acid sequences (average Precision=0.88; average Sensitivity=0.86). Furthermore, EDA graphics characterized essential features of proteins in each compartment. As examples, proteins localized to the plasma membrane had higher proportions of hydrophobic amino acids; cytoplasmic proteins had higher proportions of neutral amino acids; and mitochondrial proteins had higher proportions of neutral amino acids and lower proportions of polar amino acids. These data showed that the C4.5 classifier and EDA tools can be effective for characterizing and predicting the subcellular localization of human proteins based on their amino acid sequences.展开更多
1 Key concepts underpinning geo-data science Geoinformatics and Geomathematics Computers have been used for data collection,management,analysis,and transmission in geoscience for about 70 years since the 1950s (Merria...1 Key concepts underpinning geo-data science Geoinformatics and Geomathematics Computers have been used for data collection,management,analysis,and transmission in geoscience for about 70 years since the 1950s (Merriam,2001;2004).The term geoinformatics is widely used to describe such activities.In real-world practices,researchers in both geography and geoscience are using the term geoinformatics.展开更多
Understanding the tectono-magmatic evolution history of the Tengchong block is crucial for elucidating the formation of the Eastern Tethys tectonic domain.However,the correlation and evolution of the Tengchong block w...Understanding the tectono-magmatic evolution history of the Tengchong block is crucial for elucidating the formation of the Eastern Tethys tectonic domain.However,the correlation and evolution of the Tengchong block with the Sibumasu and Lhasa blocks is controversial during the Permian and Cretaceous.This study explores the information contained within magmatic rocks using big data and spatio-temporal analysis,providing quantitative constraints for the discussion of the tectonomagmatic evolution of the Tengchong block.To more accurately assess true magma activities and reduce errors caused by preservation and sampling processes,we utilized local singularity analysis to obtain the singularity index time-series.Correlation analysis of zircon ages and eHf(t)(correlation coefficient0.5)values indicates that the Tengchong block is more similar to the Sibumasu block.Results from timelagged cross-correlation analysis indicate that the Tengchong block and Sibumasu block exhibit a shorter lag in magmatic activities(3 Myr).Wavelet analysis reveals similar periods of collision-related magmatic activities(57 Myr and 43 Myr).Integrating evidence from paleontology and ophiolite belts,we propose that the Tengchong block co-evolved more closely with the Sibumasu block than with the Lhasa block,suggesting similar tectonic processes during the Early Permian to Early Cretaceous.Approximately 250–236 Ma,in the western Tengchong block,partial melting of the lower crust occurs due to crustal thickening.Around 219–213 Ma and 198–180 Ma,after the Tengchong block collided with the Eurasian continent,the subduction of the Meso-Tethys Ocean commenced.Around 130–111 Ma,the overall tectonic feature was a scissor-like closure of the Meso-Tethys Ocean from north to south.展开更多
Exploratory data analysis plays a major role in obtaining insights from data.Over the last two decades,researchers have proposed several visual data exploration tools that can assist with each step of the analysis pro...Exploratory data analysis plays a major role in obtaining insights from data.Over the last two decades,researchers have proposed several visual data exploration tools that can assist with each step of the analysis process.Nevertheless,in recent years,data analysis requirements have changed significantly.With constantly increasing size and types of data to be analyzed,scalability and analysis duration are now among the primary concerns of researchers.Moreover,in order to minimize the analysis cost,businesses are in need of data analysis tools that can be used with limited analytical knowledge.To address these challenges,traditional data exploration tools have evolved within the last few years.In this paper,with an in-depth analysis of an industrial tabular dataset,we identify a set of additional exploratory requirements for large datasets.Later,we present a comprehensive survey of the recent advancements in the emerging field of exploratory data analysis.We investigate 50 academic and non-academic visual data exploration tools with respect to their utility in the six fundamental steps of the exploratory data analysis process.We also examine the extent to which these modern data exploration tools fulfill the additional requirements for analyzing large datasets.Finally,we identify and present a set of research opportunities in the field of visual exploratory data analysis.展开更多
文摘By using CiteSpace software to create a knowledge map of authors,institutions and keywords,the literature on the spatio-temporal behavior of Chinese residents based on big data in the architectural planning discipline published in the China Academic Network Publishing Database(CNKI)was analyzed and discussed.It is found that there was a lack of communication and cooperation among research institutions and scholars;the research hotspots involved four main areas,including“application in tourism research”,“application in traffic travel research”,“application in work-housing relationship research”,and“application in personal family life research”.
基金I would like to thank the National Natural Science Foundation of China(Grant No.42061041)for the funding.
文摘Urban resilience assesses a city’s ability to withstand unknown risks.Scholars are not comprehensive in assessing urban resilience,and they lack consideration of population resilience.This study investigated 110 prefecturelevel cities in the Yangtze River Economic Belt(YREB)as study areas.We calculated the YREB’s level of urban resilience based on the aspects of“economy-society-population-ecology-infrastructure”,which ensured that the comprehensive evaluation of urban resilience is complete and sufficient.The spatio-temporal evolution of urban resilience was analyzed using exploratory spatial data.Geodetectors were used to investigate the impact of several indicators,focusing on economic,social,population,ecological,and infrastructure factors,on urban resilience.The results showed that the urban resilience of the YREB has maintained a slow upward trend from 2005 to 2018,and the average urban resilience of the YREB has risen from 0.2442 to 0.2560.The resilience gap between cities in the study region increased initially and then decreased.The dominant factor in the spatial differentiation of urban resilience was the economic factors,followed by the population factors.Urban resilience has been clarified and an evaluation index system is constructed,which can provide an effective reference for the evaluation of urban resilience among countries around the world.Based on this,factors that optimize urban resilience are configured,and the regional and national sustainable development can be promoted.
文摘A factor analysis was applied to soil geochemical data to define anomalies related to buried Pb-Zn mineralization.A favorable main factor with a strong association of the elements Zn,Cu and Pb,related to mineralization,was selected for interpretation.The median+2 MAD(median absolute deviation)method of exploratory data analysis(EDA)and C-A(concentration-area)fractal modeling were then applied to the Mahalanobis distance,as defined by Zn,Cu and Pb from the factor analysis to set the thresholds for defining multi-element anomalies.As a result,the median+2 MAD method more successfully identified the Pb-Zn mineralization than the C-A fractal model.The soil anomaly identified by the median+2 MAD method on the Mahalanobis distances defined by three principal elements(Zn,Cu and Pb)rather than thirteen elements(Co,Zn,Cu,V,Mo,Ni,Cr,Mn,Pb,Ba,Sr,Zr and Ti)was the more favorable reflection of the ore body.The identified soil geochemical anomalies were compared with the in situ economic Pb-Zn ore bodies for validation.The results showed that the median+2 MAD approach is capable of mapping both strong and weak geochemical anomalies related to buried Pb-Zn mineralization,which is therefore useful at the reconnaissance drilling stage.
文摘Clustering is used to gain an intuition of the struc tures in the data.Most of the current clustering algorithms pro duce a clustering structure even on data that do not possess such structure.In these cases,the algorithms force a structure in the data instead of discovering one.To avoid false structures in the relations of data,a novel clusterability assessment method called density-based clusterability measure is proposed in this paper.I measures the prominence of clustering structure in the data to evaluate whether a cluster analysis could produce a meaningfu insight to the relationships in the data.This is especially useful in time-series data since visualizing the structure in time-series data is hard.The performance of the clusterability measure is evalu ated against several synthetic data sets and time-series data sets which illustrate that the density-based clusterability measure can successfully indicate clustering structure of time-series data.
基金This research was supported by the Ministry of Science and ICT(MSIT),Korea,under the Information Technology Research Center(ITRC)support program(IITP-2020-2016-0-00313)supervised by the Institute for Information&communications Technology Planning&Evaluation(IITP)This research was supported by Basic Science Research Program through the National Research Foundation of Korea(NRF)funded by the Ministry of Science,ICT and Future Planning(2017R1E1A1A01074345).
文摘Precise information on indoor positioning provides a foundation for position-related customer services.Despite the emergence of several indoor positioning technologies such as ultrawideband,infrared,radio frequency identification,Bluetooth beacons,pedestrian dead reckoning,and magnetic field,Wi-Fi is one of the most widely used technologies.Predominantly,Wi-Fi fingerprinting is the most popular method and has been researched over the past two decades.Wi-Fi positioning faces three core problems:device heterogeneity,robustness to signal changes caused by human mobility,and device attitude,i.e.,varying orientations.The existing methods do not cover these aspects owing to the unavailability of publicly available datasets.This study introduces a dataset that includes the Wi-Fi received signal strength(RSS)gathered using four different devices,namely Samsung Galaxy S8,S9,A8,LG G6,and LG G7,operated by three surveyors,including a female and two males.In addition,three orientations of the smartphones are used for the data collection and include multiple buildings with a multifloor environment.Various levels of human mobility have been considered in dynamic environments.To analyze the time-related impact on Wi-Fi RSS,data over 3 years have been considered.
基金National Key Research and Development Program of China(2019YFB1600400)National Natural Science Foundation of China(72174035)+2 种基金National Natural Science Foundation of China(71774018)Liaoning Revitalization Talents Program(XLYC2008030)Liaoning Provincial Natural Science Foundation Shipping Joint Foundation Program(2020-HYLH-20)。
文摘It is urgent and important to explore the dynamic evolution in comprehensive transportation green efficiency(CTGE)in the context of green development.We constructed a social development index that reflects the social benefits of transportation services,and incorporated it into the comprehensive transportation efficiency evaluation framework as an expected output.Based on the panel data of 30 regions in China from 2003-2018,the CTGE in China was measured using the slacks-based measure-data envelopment analysis(SBM-DEA)model.Further,the dynamic evolution trends of CTGE were determined using the spatial Markov model and exploratory spatio-temporal data analysis(ESTDA)technique from a spatio-temporal perspective.The results showed that the CTGE shows a U-shaped change trend but with an overall low level and significant regional differences.The state transition of CTGE has a strong spatial dependence,and there exists the phenomenon of“club convergence”.Neighbourhood background has a significant impact on the CTGE transition types,and the spatial spillover effect is pronounced.The CTGE has an obvious positive correlation and spatial agglomeration characteristics.The geometric characteristics of the LISA time path show that the evolution process of local spatial structure and local spatial dependence of China’s CTGE is stable,but the integration of spatial evolution is weak.The spatio-temporal transition results of LISA indicate that the CTGE has obvious transfer inertness and has certain path-dependence and spatial locking characteristics,which will become the major difficulty in improving the CTGE.
基金Under the auspices of Key Research Program of Chinese Academic of Science(No.KZZD-EW-06-03,KSZD-EW-Z-021-03)Advantage Discipline Project of Hainan Normal University(No.305010048)+2 种基金Key Discipline Project of Hainan(No.3050107048)National Natural Science Foundation of China(No.41201160,41329001)Natural Science Foundation of Hainan Province(No.414189)
文摘Quality of life(QOL) is a hotspot issue that has attracted increasing attention from the Chinese Government and scholars, it is also a vital issue that should be addressed during the cause of ′establishing overall well-off society′. Northeast China is one of the most import old industrial bases in China, however, the industrial structure of heavy chemical industry and the development mode of ′production first, living last′ have leaded to series of social problems, which have also become a serious bottleneck to social stability and economic sustainable development. Through applying the methods of BP neural network, exploratory spatial data analysis(ESDA) and spatial regression model, this paper examines the space-time dynamics of QOL of the residents in Northeast China. We first investigate the indexes of QOL of the residents and then use ESDA methods to visualize its space-time relationship. We have found a spatial agglomeration of QOL of the residents in middle-southern Liaoning Province, central Jilin Province and Harbin-Qiqihar-Daqing area of Heilongjiang Province. Two third of the counties are low-low spatial correlation, and the correlative type of about 60% of the prefecture level areas keeps stable, indicating QOL of the residents in Northeast China shows a certain character of path dependence or spatial locked. We have also found that economic strength and development levels of service industry have positive and obvious effect on QOL of the residents, while the effect of such indexes as the social service level and the proportion of the tertiary industries are less.
基金Under the auspices of National Key Research and Development Program of China(No.2017YFA0603002)National Natural Science Foundation of China(No.31800358,31700369)+1 种基金Jiangsu Agricultural Science and Technology Innovation Fund(No.CX(19)3099)the Foundation of Jiangsu Vocational College of Agriculture and Forestry(No.2019kj014)。
文摘Detailed information on the spatio-temporal changes of cropland soil organic carbon(SOC) can significantly contribute to the improvement of soil fertility and mitigate climate change. Nonetheless, information and knowledge on the national scale spatio-temporal changes and the corresponding uncertainties of SOC in Chinese upland soils remain limited. The CENTURY model was used to estimate the SOC storages and their changes in Chinese uplands from 1980 to 2010. With the Monte Carlo method, the uncertainties of CENTURY-modelled SOC dynamics associated with the spatial heterogeneous model inputs were quantified. Results revealed that the SOC storage in Chinese uplands increased from 3.03(1.59 to 4.78) Pg C in 1980 to 3.40(2.39 to 4.62) Pg C in 2010. Increment of SOC storage during this period was 370 Tg C, with an uncertainty interval of –440 to 1110 Tg C. The regional disparities of SOC changes reached a significant level, with considerable SOC accumulation in the Huang-Huai-Hai Plain of China and SOC loss in the northeastern China. The SOC lost from Meadow soils, Black soils and Chernozems was most severe, whilst SOC accumulation in Fluvo-aquic soils, Cinnamon soils and Purplish soils was most significant. In modelling large-scale SOC dynamics, the initial soil properties were major sources of uncertainty. Hence, more detailed information concerning the soil properties must be collected. The SOC stock of Chinese uplands in 2010 was still relatively low, manifesting that recommended agricultural management practices in conjunction with effectively economic and policy incentives to farmers for soil fertility improvement were indispensable for future carbon sequestration in these regions.
文摘Exposure to particulate matter with an aerodynamic diameter of less than 2.5 μm (PM2.5) may increase risk of lung cancer. The repetitive and broad-area coverage of satellites may allow atmospheric remote sensing to offer a unique opportunity to monitor air quality and help fill air pollution data gaps that hinder efforts to study air pollution and protect public health. This geographical study explores if there is an association between PM2.5 and lung cancer mortality rate in the conterminous USA. Lung cancer (ICD-10 codes C34- C34) death count and population at risk by county were extracted for the period from 2001 to 2010 from the U.S. CDC WONDER online database. The 2001-2010 Global Annual Average PM2.5 Grids from MODIS and MISR Aerosol Optical Depth dataset was used to calculate a 10 year average PM2.5 pollution. Exploratory spatial data analyses, spatial regression (a spatial lag and a spatial error model), and spatially extended Bayesian Monte Carlo Markov Chain simulation found that there is a significant positive association between lung cancer mortality rate and PM2.5. The association would justify the need of further toxicological investigation of the biological mechanism of the adverse effect of the PM2.5 pollution on lung cancer. The Global Annual Average PM2.5 Grids from MODIS and MISR Aerosol Optical Depth dataset provides a continuous surface of concentrations of PM2.5 and is a useful data source for environmental health research.
基金National Natural Science Foundation of China Youth Science Foundation ProjectNo.41701170+1 种基金National Natural Science Foundation of China,No.41661025,No.42071216Fundamental Research Funds for the Central Universities,No.18LZUJBWZY068。
文摘In 2007,China surpassed the USA to become the largest carbon emitter in the world.China has promised a 60%–65%reduction in carbon emissions per unit GDP by 2030,compared to the baseline of 2005.Therefore,it is important to obtain accurate dynamic information on the spatial and temporal patterns of carbon emissions and carbon footprints to support formulating effective national carbon emission reduction policies.This study attempts to build a carbon emission panel data model that simulates carbon emissions in China from 2000–2013 using nighttime lighting data and carbon emission statistics data.By applying the Exploratory Spatial-Temporal Data Analysis(ESTDA)framework,this study conducted an analysis on the spatial patterns and dynamic spatial-temporal interactions of carbon footprints from 2001–2013.The improved Tapio decoupling model was adopted to investigate the levels of coupling or decoupling between the carbon emission load and economic growth in 336 prefecture-level units.The results show that,firstly,high accuracy was achieved by the model in simulating carbon emissions.Secondly,the total carbon footprints and carbon deficits across China increased with average annual growth rates of 4.82%and 5.72%,respectively.The overall carbon footprints and carbon deficits were larger in the North than that in the South.There were extremely significant spatial autocorrelation features in the carbon footprints of prefecture-level units.Thirdly,the relative lengths of the Local Indicators of Spatial Association(LISA)time paths were longer in the North than that in the South,and they increased from the coastal to the central and western regions.Lastly,the overall decoupling index was mainly a weak decoupling type,but the number of cities with this weak decoupling continued to decrease.The unsustainable development trend of China’s economic growth and carbon emission load will continue for some time.
基金This work was supported by the Science and Technology Project of China Southern Power Grid Corporation(ZBKJXM20180157)the National Natural Science Foundation of China(Grant Nos.61772456,61761136020).
文摘Closely related to the economy,the analysis and management of electricity consumption has been widely studied.Conventional approaches mainly focus on the prediction and anomaly detection of electricity consumption,which fails to reveal the in-depth relationships between electricity consumption and various factors such as industry,weather etc..In the meantime,the lack of analysis tools has increased the difficulty in analytical tasks such as correlation analysis and comparative analysis.In this paper,we introduce EcoVis,a visual analysis system that supports the industrial-level spatio-temporal correlation analysis in the electricity consumption data.We not only propose a novel approach to model spatio-temporal data into a graph structure for easier correlation analysis,but also introduce a novel visual representation to display the distributions of multiple instances in a single map.We implement the system with the cooperation with domain experts.Experiments are conducted to demonstrate the effectiveness of our method.
文摘A significant Geographic Information Science(GIS)issue is closely related to spatial autocorrelation,a burning question in the phase of information extraction from the statistical analysis of georeferenced data.At present,spatial autocorrelation presents two types of measures:continuous and discrete.Is it possible to use Moran’s I and the Moran scatterplot with continuous data?Is it possible to use the same methodology with discrete data?A particular and cumbersome problem is the choice of the spatial-neighborhood matrix(W)for points data.This paper addresses these issues by introducing the concept of covariogram contiguity,where each weight is based on the variogram model for that particular dataset:(1)the variogram,whose range equals the distance with the highest Moran I value,defines the weights for points separated by less than the estimated range and(2)weights equal zero for points widely separated from the variogram range considered.After the W matrix is computed,the Moran location scatterplot is created in an iterative process.In accordance with various lag distances,Moran’s I is presented as a good search factor for the optimal neighborhood area.Uncertainty/transition regions are also emphasized.At the same time,a new Exploratory Spatial Data Analysis(ESDA)tool is developed,the Moran variance scatterplot,since the conventional Moran scatterplot is not sensitive to neighbor variance.This computer-mapping framework allows the study of spatial patterns,outliers,changeover areas,and trends in an ESDA process.All these tools were implemented in a free web e-Learning program for quantitative geographers called SAKWeb#(or,in the near future,myGeooffice.org).
文摘Identifying the subcellular localization of proteins is particularly helpful in the functional annotation of gene products. In this study, we use Machine Learning and Exploratory Data Analysis (EDA) techniques to examine and characterize amino acid sequences of human proteins localized in nine cellular compartments. A dataset of 3,749 protein sequences representing human proteins was extracted from the SWISS-PROT database. Feature vectors were created to capture specific amino acid sequence characteristics. Relative to a Support Vector Machine, a Multi-layer Perceptron, and a Naive Bayes classifier, the C4.5 Decision Tree algorithm was the most consistent performer across all nine compartments in reliably predicting the subcellular localization of proteins based on their amino acid sequences (average Precision=0.88; average Sensitivity=0.86). Furthermore, EDA graphics characterized essential features of proteins in each compartment. As examples, proteins localized to the plasma membrane had higher proportions of hydrophobic amino acids; cytoplasmic proteins had higher proportions of neutral amino acids; and mitochondrial proteins had higher proportions of neutral amino acids and lower proportions of polar amino acids. These data showed that the C4.5 classifier and EDA tools can be effective for characterizing and predicting the subcellular localization of human proteins based on their amino acid sequences.
基金supported by the National Science Foundation (Grant No.1815526).
文摘1 Key concepts underpinning geo-data science Geoinformatics and Geomathematics Computers have been used for data collection,management,analysis,and transmission in geoscience for about 70 years since the 1950s (Merriam,2001;2004).The term geoinformatics is widely used to describe such activities.In real-world practices,researchers in both geography and geoscience are using the term geoinformatics.
文摘Understanding the tectono-magmatic evolution history of the Tengchong block is crucial for elucidating the formation of the Eastern Tethys tectonic domain.However,the correlation and evolution of the Tengchong block with the Sibumasu and Lhasa blocks is controversial during the Permian and Cretaceous.This study explores the information contained within magmatic rocks using big data and spatio-temporal analysis,providing quantitative constraints for the discussion of the tectonomagmatic evolution of the Tengchong block.To more accurately assess true magma activities and reduce errors caused by preservation and sampling processes,we utilized local singularity analysis to obtain the singularity index time-series.Correlation analysis of zircon ages and eHf(t)(correlation coefficient0.5)values indicates that the Tengchong block is more similar to the Sibumasu block.Results from timelagged cross-correlation analysis indicate that the Tengchong block and Sibumasu block exhibit a shorter lag in magmatic activities(3 Myr).Wavelet analysis reveals similar periods of collision-related magmatic activities(57 Myr and 43 Myr).Integrating evidence from paleontology and ophiolite belts,we propose that the Tengchong block co-evolved more closely with the Sibumasu block than with the Lhasa block,suggesting similar tectonic processes during the Early Permian to Early Cretaceous.Approximately 250–236 Ma,in the western Tengchong block,partial melting of the lower crust occurs due to crustal thickening.Around 219–213 Ma and 198–180 Ma,after the Tengchong block collided with the Eurasian continent,the subduction of the Meso-Tethys Ocean commenced.Around 130–111 Ma,the overall tectonic feature was a scissor-like closure of the Meso-Tethys Ocean from north to south.
文摘Exploratory data analysis plays a major role in obtaining insights from data.Over the last two decades,researchers have proposed several visual data exploration tools that can assist with each step of the analysis process.Nevertheless,in recent years,data analysis requirements have changed significantly.With constantly increasing size and types of data to be analyzed,scalability and analysis duration are now among the primary concerns of researchers.Moreover,in order to minimize the analysis cost,businesses are in need of data analysis tools that can be used with limited analytical knowledge.To address these challenges,traditional data exploration tools have evolved within the last few years.In this paper,with an in-depth analysis of an industrial tabular dataset,we identify a set of additional exploratory requirements for large datasets.Later,we present a comprehensive survey of the recent advancements in the emerging field of exploratory data analysis.We investigate 50 academic and non-academic visual data exploration tools with respect to their utility in the six fundamental steps of the exploratory data analysis process.We also examine the extent to which these modern data exploration tools fulfill the additional requirements for analyzing large datasets.Finally,we identify and present a set of research opportunities in the field of visual exploratory data analysis.