Morphological (e.g., shape, size, and height) and functional (e.g., working, living, and shopping) information on buildings is in high demand for urban planning and management, as well as for other applications such as city-scale building energy use modeling. Because socio-economic geospatial data are of limited availability, mapping building functions is more challenging than mapping building morphology, especially over large areas. In this study, we propose an integrated framework to map building functions in 50 U.S. cities by integrating multi-source web-based geospatial data. First, a web crawler was developed to extract Points of Interest (POIs) from Tripadvisor.com, and a map crawler was developed to extract POIs and land use parcels from Google Maps. Second, an unsupervised machine learning algorithm, OneClassSVM, was used to identify residential buildings based on landscape features derived from Microsoft building footprints. Third, the type ratio of POIs and the area ratio of land use parcels were used to identify six non-residential functions (i.e., hospital, hotel, school, shop, restaurant, and office). The accuracy assessment indicates that the proposed framework performed well, with an average overall accuracy of 94% and a kappa coefficient of 0.63. Given the worldwide coverage of Google Maps and Tripadvisor.com, the proposed framework is transferable to other cities around the world. The data products generated in this study are of great use for quantitative city-scale urban studies, such as building energy use modeling at the single-building level over large areas.
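The abstract above reports both an overall accuracy and a kappa coefficient. As a rough illustration of how the two metrics are computed from a confusion matrix, and why kappa can sit well below overall accuracy when one class dominates, here is a minimal sketch. The function name and the confusion-matrix counts are hypothetical and are not taken from the study.

```python
def overall_accuracy_and_kappa(confusion):
    """Return (overall accuracy, Cohen's kappa) for a square confusion matrix.

    confusion[i][j] is the number of samples whose reference class is i
    and whose predicted class is j.
    """
    total = sum(sum(row) for row in confusion)
    # Observed agreement: share of samples on the diagonal.
    observed = sum(confusion[i][i] for i in range(len(confusion))) / total
    # Chance agreement: product of the row and column marginals per class.
    row_totals = [sum(row) for row in confusion]
    col_totals = [sum(col) for col in zip(*confusion)]
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / total ** 2
    kappa = (observed - expected) / (1 - expected)
    return observed, kappa


# Hypothetical counts: rows = reference (residential, non-residential),
# columns = predicted. Residential buildings dominate the sample.
confusion = [[900, 20],
             [40, 40]]
oa, kappa = overall_accuracy_and_kappa(confusion)
print(f"overall accuracy = {oa:.2f}, kappa = {kappa:.2f}")
```

With these made-up counts the overall accuracy is high because the majority class is classified well, while kappa is noticeably lower because much of that agreement would be expected by chance.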
With the accelerating intelligent transformation of energy systems, equipment condition monitoring and production process optimization in thermal power plants face the challenge of integrating multi-source heterogeneous data. To address the heterogeneity of physical sensor data (including temperature, vibration, and pressure) generated by boilers, steam turbines, and other key equipment, together with real-time operating-condition data from SCADA systems, this paper proposes a multi-source heterogeneous data fusion and analysis platform for thermal power plants based on edge computing and deep learning. By constructing a multi-level fusion architecture, the platform adopts a dynamic weight allocation strategy and a 5D digital twin model to enable collaborative analysis of physical sensor data, simulation results, and expert knowledge. The data fusion module combines the Kalman filter, the wavelet transform, and Bayesian estimation to solve the problems of time-series alignment and dimensional differences. Simulation results show that the data fusion accuracy can be raised above 98% and the computation delay kept within 500 ms. The data analysis module integrates a Dymola simulation model and the AERMOD pollutant dispersion model and supports cascaded analysis of boiler combustion efficiency prediction and flue gas emission monitoring; the system response time is less than 2 seconds, and the data consistency verification accuracy reaches 99.5%.
Accurate monitoring of track irregularities helps improve vehicle operation quality and formulate appropriate track maintenance strategies. Existing methods rely on complex signal processing algorithms and lack multi-source data analysis. Driven by multi-source measurement data, including axle box, bogie frame, and carbody accelerations, this paper proposes a track irregularities monitoring network (TIMNet) based on deep learning. TIMNet uses the feature extraction capability of convolutional neural networks and the sequence mapping capability of the long short-term memory model to explore the mapping between vehicle accelerations and track irregularities. The particle swarm optimization algorithm is used to optimize the network parameters, so that both vertical and lateral track irregularities can be accurately identified in the time and spatial domains. The effectiveness and superiority of the proposed TIMNet are analyzed under different simulation conditions using a vehicle dynamics model. Field tests are conducted to demonstrate the applicability of TIMNet to quantitative monitoring of vertical and lateral track irregularities. Furthermore, comparative tests show that TIMNet achieves a better fit and lower time cost in monitoring track irregularities (vertical R² of 0.91, lateral R² of 0.84, and a time cost of 10 ms) than other classical regression models. The tests also show that TIMNet has better anti-interference ability than other regression models.
Multi-source data fusion provides the high-precision spatial situational awareness essential for analyzing granular urban social activities. This study used Shanghai's catering industry as a case study, leveraging electronic reviews and consumer data collected in 2021 from third-party restaurant platforms. By performing weighted processing on two-dimensional point-of-interest (POI) data, clustering hotspots of high-dimensional restaurant data were identified. A hierarchical network of restaurant hotspots was constructed following the Central Place Theory (CPT) framework, while the Geo-Informatic Tupu method was employed to resolve the challenges posed by network deformation in multi-scale processes. The findings suggest the need to enhance the spatial balance of Shanghai's urban centers by moderately increasing the number and service capacity of suburban centers at the urban periphery. Such measures would contribute to a more optimized urban structure and facilitate the outward dispersion of comfort-oriented facilities such as restaurants. At a finer spatial scale, the distribution of restaurant hotspots shows a polycentric and symmetric spatial pattern, with a developmental trend radiating outward along the city's ring roads. This trend can be attributed to restaurants' efforts to establish connections with other urban functional spaces, leading to the reconfiguration of urban spaces, the expansion of restaurant-dedicated land use, and the reorganization of associated commercial activities. The results validate the existence of a polycentric urban structure in Shanghai but also highlight the instability of the restaurant hotspot network during cross-scale transitions.
Taking the Ming Tombs Forest Farm in Beijing as the research object, this study applied multi-source data fusion and GIS heat-map overlay analysis. It systematically collected bird observation point data from the Global Biodiversity Information Facility (GBIF) and population distribution data from the Oak Ridge National Laboratory (ORNL) in the United States, together with information on the tree species composition of forest areas suitable for birds and forest geographical information for the Ming Tombs Forest Farm, the latter two drawn from literature research and field investigations. Using GIS technology, spatial processing was carried out on the bird observation points and population distribution data to identify suitable bird-watching areas in different seasons. These areas were then classified into grades (from unsuitable to highly suitable) according to the range of suitability values. The findings indicate significant spatial heterogeneity in the bird-watching suitability of the Ming Tombs Forest Farm. The north side of the reservoir was generally a core area with high suitability in all seasons. The deep-aged broad-leaved mixed forests supported overlapping, co-existing ecological niches of various bird species, such as Zosterops simplex and Urocissa erythrorhyncha. In contrast, the shallow forest-edge pure coniferous forests and mixed forests were more suitable for specialized species such as Carduelis sinica. The southern urban area and the core area of the mausoleums had relatively low suitability owing to ecological fragmentation or human interference. Based on these results, this paper proposes a three-level protection framework of "core area conservation, buffer zone management, and isolation zone construction" and a spatio-temporally coordinated human-bird co-existence strategy. It also suggests that the human-bird co-existence space could be optimized through measures such as constructing sound and light buffer interfaces, restoring ecological corridors, and integrating cultural heritage elements. This research provides an operational technical approach and decision-making support for the scientific planning of bird-watching sites and the coordination of ecological protection and tourism development.
Multi-source seismic technology is an efficient seismic acquisition method that requires a group of blended seismic data to be separated into single-source seismic data for subsequent processing. The separation of blended seismic data is a linear inverse problem. Depending on the relationship between the number of shots and the number of simultaneous sources in the acquisition system, this separation is either a determined or overdetermined linear inverse problem, which is easy to solve, or an underdetermined linear inverse problem, which is difficult to solve. For the latter, this paper presents an optimization method that imposes a sparsity constraint on the wavefields to construct the objective function of the inversion, and the problem is solved using the iterative thresholding method. For the most severely underdetermined separation problem, with a single shot and multiple sources, this paper presents a method of pseudo-deblending with random noise filtering. In this method, approximate common shot gathers are obtained through the pseudo-deblending process, and the random noise that appears when the approximate common shot gathers are sorted into common receiver gathers is eliminated by filtering. The separation methods proposed in this paper are applied to three types of numerical simulation data (noise-free data, data with random noise, and data with linear regular noise) and give satisfactory results. The noise suppression achieved by these methods is sufficient, particularly for single-shot blended seismic data, which verifies the effectiveness of the proposed methods.
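The abstract above mentions solving an underdetermined linear inverse problem with a sparsity constraint by iterative thresholding. The sketch below shows the generic iterative soft-thresholding algorithm (ISTA) on a tiny toy system. It is not the paper's deblending operator: the matrix `A`, the data `y`, the weight `lam`, and all function names are hypothetical, and the sparsity is imposed directly on the unknowns rather than in a transform domain.

```python
def soft_threshold(value, threshold):
    """Shrink a scalar toward zero by `threshold` (proximal step of the L1 norm)."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def objective(A, y, x, lam):
    """0.5 * ||A x - y||^2 + lam * ||x||_1."""
    residual = [sum(a * xi for a, xi in zip(row, x)) - yi for row, yi in zip(A, y)]
    return 0.5 * sum(r * r for r in residual) + lam * sum(abs(xi) for xi in x)


def ista(A, y, lam, iterations=500):
    """Minimise the objective above with iterative soft thresholding."""
    n_rows, n_cols = len(A), len(A[0])
    # The squared Frobenius norm bounds the largest eigenvalue of A^T A,
    # so its reciprocal is a safe (conservative) step size.
    step = 1.0 / sum(a * a for row in A for a in row)
    x = [0.0] * n_cols
    for _ in range(iterations):
        residual = [sum(a * xi for a, xi in zip(row, x)) - yi for row, yi in zip(A, y)]
        gradient = [sum(A[i][j] * residual[i] for i in range(n_rows)) for j in range(n_cols)]
        x = [soft_threshold(xj - step * gj, step * lam) for xj, gj in zip(x, gradient)]
    return x


# Toy underdetermined system: 2 equations, 4 unknowns (hypothetical numbers).
A = [[1.0, 0.5, 0.0, 0.2],
     [0.0, 0.3, 1.0, 0.4]]
y = [2.0, 0.0]
lam = 0.1
x_hat = ista(A, y, lam)
print([round(v, 3) for v in x_hat])
```

The step size is deliberately conservative; a tighter bound on the largest eigenvalue of AᵀA would converge faster, at the cost of an extra computation.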
For reservoirs with complex non-Gaussian geological characteristics, such as carbonate reservoirs or reservoirs with sedimentary facies distributions, it is difficult to perform history matching directly, especially with ensemble-based data assimilation methods. In this paper, we propose a multi-source information fused generative adversarial network (MSIGAN) model for parameterizing complex geologies. In MSIGAN, various kinds of information, such as facies distribution, microseismic data, and inter-well connectivity, can be integrated to learn the geological features. Two major generative models in deep learning, the variational autoencoder (VAE) and the generative adversarial network (GAN), are combined in our model. The proposed MSIGAN model is then integrated into the ensemble smoother with multiple data assimilation (ESMDA) method to conduct history matching. We tested the proposed method on two reservoir models with fluvial facies. The experimental results show that the proposed MSIGAN model can effectively learn complex geological features, which improves the accuracy of history matching.
In traditional medicine and ethnomedicine, medicinal plants have long been recognized worldwide as the basis for materials in therapeutic applications. In particular, the remarkable curative effect of traditional Chinese medicine during the coronavirus disease 2019 (COVID-19) pandemic has attracted extensive attention globally. Medicinal plants have therefore become increasingly popular among the public. However, with the increasing demand for and profit from medicinal plants, commercial fraud such as adulteration or counterfeiting sometimes occurs, which poses a serious threat to clinical outcomes and the interests of consumers. With rapid advances in artificial intelligence, machine learning can be used to mine information on various medicinal plants to establish an ideal resource database. We herein present a review that introduces common machine learning algorithms and discusses their application in multi-source data analysis of medicinal plants. The combination of machine learning algorithms and multi-source data analysis facilitates comprehensive analysis and aids in the effective evaluation of the quality of medicinal plants. The findings of this review provide new possibilities for promoting the development and utilization of medicinal plants.
Multi-source data plays an important role in the evolution of media convergence. Its fusion processing enables further mining of data and utilization of data value, and broadens the path for sharing and disseminating media data. However, it also faces serious problems in protecting user and data privacy. Many privacy protection methods have been proposed to solve the problem of privacy leakage during data sharing, but they suffer from two flaws: 1) the lack of algorithmic frameworks for specific scenarios such as dynamic datasets in the media domain; 2) the inability to solve the problem of the high computational complexity of ciphertext in multi-source data privacy protection, resulting in long encryption and decryption times. In this paper, we propose a multi-source data privacy protection method based on homomorphic encryption and blockchain technology, which solves the privacy protection problem of multi-source heterogeneous data in media dissemination and reduces ciphertext processing time. We deployed the proposed method on the Hyperledger platform for testing and compared it with privacy protection schemes based on k-anonymity and differential privacy. The experimental results show that the key generation, encryption, and decryption times of the proposed method are lower than those of data privacy protection methods based on k-anonymity and differential privacy. This significantly reduces the processing time of multi-source data, which gives the method potential for use in many applications.
In view of the lack of comprehensive evaluation and analysis combining natural and human multi-dimensional factors, the urban surface temperature patterns of Changsha in 2000, 2009, and 2016 are retrieved from multi-source spatial data (Landsat 5 and Landsat 8 satellite image data, POI spatial big data, a digital elevation model, etc.), and 12 natural and human factors closely related to the urban thermal environment are obtained. The standard deviation ellipse and spatial principal component analysis (PCA) methods are used to analyze the urban residential thermal environment and its influencing factors. The results show that the heat island area increased by 547 km² and the maximum surface temperature difference reached 10.1 ℃ during the period 2000–2016. The urban heat island was mainly concentrated in urban built-up areas, such as industrial and commercial agglomerations and densely populated urban centers. The intensity of the heat island decreases gradually from the urban center to the suburbs. There were multiple high-temperature centers, such as the Wuyi Square business circle, the Xingsha Economic and Technological Development Zone in Changsha County, the Wangcheng industrial zone, the Yuelu industrial agglomeration, and the Tianxin industrial zone. From 2000 to 2016, the main axis of spatial development of the heat island remained in the northeast-southwest direction. The center of gravity of the heat island shifted 2.7 km to the southwest with a deflection angle of 54.9° in 2000–2009, and 4.8 km to the northeast with a deflection angle of 60.9° in 2009–2016. On the whole, the change in the spatial pattern of the thermal environment in Changsha was related to the change in urban construction intensity. The PCA results indicate that landscape pattern, urban construction intensity, and topography were the main factors affecting the spatial pattern of the urban thermal environment of Changsha. The promoting effect of human factors on the formation of the heat island was clearly greater than that of natural factors. The temperature would rise by 0.293 ℃ under the combined effect of human and natural factors. Given the complexity of the factors influencing the urban thermal environment of human settlements, the use of multi-source data could help to reveal the spatial pattern and evolution of the urban thermal environment, deepen the understanding of the causes of the urban heat island effect, and clarify the correlation between human and natural factors, thereby providing scientific support for improving the quality of urban human settlements.
The urban functional area (UFA) is a core scientific issue affecting urban sustainability. The current knowledge gap lies mainly in the lack of multi-scale quantitative interpretation methods from the perspective of human-land interaction. In this paper, based on multi-source big data including cell phone data at 250 m × 250 m resolution, 1.81×10⁵ Points of Interest (POI) records, and administrative boundary data, we built a UFA identification method and demonstrated it empirically in Shenyang City, China. We argue that the method can effectively identify multi-scale, multi-type UFAs based on human activity and further reveal the spatial correlation between urban facilities and human activity. The empirical study suggests that the employment functional zones in Shenyang City are more concentrated in the central city than other single-function zones. There are more mixed functional areas in the central city, while the planned industrial new towns in Shenyang need to develop comprehensive functions. UFAs exhibit scale effects and human-land interaction patterns. We suggest that city decision makers apply multi-source big data to measure urban functional services in a more refined manner from a supply-demand perspective.
Due to the complex nature of multi-source geological data, it is difficult to rebuild every geological structure through a single 3D modeling method. The multi-source data interpretation method put forward in this study is based on a database-driven pattern and focuses on the discrete and irregular features of geological data. Geological data from a variety of sources, covering a range of accuracy, resolution, quantity, and quality, are classified and integrated according to their reliability and consistency for 3D modeling. A new interpolation-approximation fitting algorithm for constructing geological surfaces with the non-uniform rational B-spline (NURBS) technique is then presented. The NURBS technique balances the requirements for accuracy, surface continuity, and data storage of geological structures. Finally, four alternative 3D modeling approaches, selected according to data quantity and accuracy specifications, are demonstrated with examples. The proposed approaches offer flexible modeling patterns for different practical engineering demands.
Data fusion can effectively process multi-sensor information to obtain more accurate and reliable results than a single sensor. Water quality data come from different sensors and therefore must be fused. In our research, a self-adaptive weighted data fusion method is used to separately integrate the data for pH, temperature, dissolved oxygen, and NH3 concentration in the water environment. After fusion, the Grubbs method is used to detect abnormal data, so as to provide data support for the estimation, prediction, and early warning of water quality.
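The abstract above names a self-adaptive weighted fusion method without giving its form. A common formulation, assumed here rather than confirmed by the source, weights each sensor by the inverse of its measurement variance so that noisier sensors contribute less. The sketch below follows that assumption; the function name and the pH readings are hypothetical, and the Grubbs outlier test is not included.

```python
from statistics import fmean, variance


def adaptive_weighted_fusion(readings_by_sensor):
    """Fuse repeated readings from several sensors measuring the same quantity.

    Each sensor's weight is proportional to 1 / (sample variance of its
    readings), normalised so the weights sum to 1. Assumes every sensor has
    at least two readings and a non-zero sample variance.
    """
    means = [fmean(readings) for readings in readings_by_sensor]
    inverse_variances = [1.0 / variance(readings) for readings in readings_by_sensor]
    total = sum(inverse_variances)
    weights = [value / total for value in inverse_variances]
    fused = sum(weight * mean for weight, mean in zip(weights, means))
    return fused, weights


# Hypothetical pH readings from two sensors; the second one is noisier.
sensor_a = [7.0, 7.2, 6.8]
sensor_b = [7.4, 7.0, 7.8]
fused, weights = adaptive_weighted_fusion([sensor_a, sensor_b])
print(f"fused pH = {fused:.2f}, weights = {[round(w, 2) for w in weights]}")
```

Under this weighting the fused estimate has a variance no larger than that of the best individual sensor, which is the usual motivation for inverse-variance weights.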
Although big data are widely used in various fields, their application is still rare in the study of mining subsidence prediction (MSP) for underground mining. Traditional MSP research tends to oversimplify geological mining conditions and ignore the spatial variation of rock layers. In the context of geospatial big data, this paper proposes a data-intensive FLAC3D (Fast Lagrangian Analysis of Continua in 3 Dimensions) model based on borehole logs. In the modeling process, we developed a method for handling geospatial big data and were able to make full use of borehole logs. The effectiveness of the proposed method was verified by comparing the results of the traditional method, the proposed method, and field observations. The findings show that the proposed method has clear advantages over the traditional prediction method. The relative error of the maximum surface subsidence predicted by the proposed method decreased by 93.7%, and the standard deviation of the prediction results (which was 70 points) decreased by 39.4% on average. The data-intensive modeling method is of great significance for improving the accuracy of mining subsidence predictions.
To estimate vehicular queue length at signalized intersections accurately and to overcome the shortcomings and restrictions of existing studies, especially those based on shockwave theory, a new methodology is presented for estimating vehicular queue length using data from both point detectors and probe vehicles. The methodology applies shockwave theory to model queue evolution over time and space. Using probe vehicle locations and times as well as traffic states measured by point detectors, analytical formulations for calculating the maximum and minimum (residual) queue lengths are developed. The proposed methodology is verified using ground truth data collected from numerical experiments conducted in Shanghai, China. The methodology has a mean absolute percentage error of 17.09%, which is reasonably effective for estimating queue length at signalized intersections. Limitations of the proposed models and algorithms are also discussed in the paper.
Land cover classification is at the core of converting satellite imagery into usable geographic data. However, spectral signatures do not always provide enough information for classification decisions, so the use of multi-source data becomes necessary. This paper presents an evidential reasoning (ER) approach that incorporates Landsat TM imagery, altitude, and slope data. The results show that multi-source data improve the classification accuracy achieved by the ER method, whereas they reduce the accuracy of the maximum likelihood classifier (MLC). Compared with the results based on TM imagery alone, the overall accuracy of the ER method increases by 7.66% and that of the MLC method decreases by 8.35% when all data sources (TM plus altitude and slope) are available. The ER method is therefore regarded as a better approach for multi-source image classification. In addition, the method produces not only an accurate classification result but also an uncertainty measure that reflects the inherent difficulty of classification decisions. The uncertainty associated with the ER classification image is evaluated and shown to be useful for improving classification accuracy.
Accessibility is a representative indicator for evaluating the supply of a bus system. Traditional studies have evaluated accessibility from different aspects. Considering the interaction among land use, bus timetable arrangement, and individual factors, a more holistic accessibility measure is proposed that combines static and dynamic characteristics from multi-source traffic data. The rationale of the proposed model is verified by a case study of the bus system in Shenzhen, China, which examines the spatial and temporal discrepancies in bus service. It is found that adjusting the bus schedule to time-varying travel demand can affect the accessibility of the bus system, and that land-use development, average bus speed, and bus facilities all have positive effects on accessibility. These findings provide a significant reference for transport planning and policy-making. The proposed model is not limited to measuring the accessibility of bus systems but is also applicable to other travel modes.
Growing attention has been directed to the use of satellite imagery and open geospatial data to understand large-scale sustainable development outcomes. Health and education are critical domains of the United Nations' Sustainable Development Goals (SDGs), yet existing research on the accessibility of the corresponding services has focused mainly on detailed but small-scale studies. As a result, such studies lack accessibility metrics for large-scale quantitative evaluations. To address this deficiency, we evaluated the accessibility of health and education services in China's mainland in 2021 using point-of-interest data, OpenStreetMap road data, land cover data, and WorldPop spatial demographic data. The accessibility metrics used were the least time cost of reaching hospital and school services and the population coverage within a time cost of less than 1 h. On the basis of the road network and land cover information, the overall average time costs of reaching hospitals and schools were 20 and 22 min, respectively. In terms of population coverage, 94.7% and 92.5% of the population in China has a time cost of less than 1 h in reaching hospital and school services, respectively. Counties with low accessibility to hospitals and schools were highly coupled with poor areas and ecological function regions, where the time cost was more than twice that in non-poor and non-ecological areas. Furthermore, the cumulative time cost incurred by the bottom 20% of counties (by GDP) in accessing hospital and school services reached approximately 80% of the national total. Low-GDP counties bore disproportionately higher time costs to obtain health and education services than high-GDP counties. The accessibility metrics proposed in this study are closely related to SDGs 3 and 4 and can serve as auxiliary data to enhance the evaluation of SDG outcomes. The analysis of the uneven distribution of health and education services in China can help identify areas with underdeveloped public services and may contribute to targeted and efficient policy interventions.
With the growing number of sensors, applications, and customers, data providers and users are demanding a new geospatial data service model that is low-cost and highly flexible and that provides a comprehensive service. To meet these requirements, the 21AT TripleSat constellation terminal and data delivery and management system has been developed by a Beijing-based high-tech enterprise, Twenty First Century Aerospace Technology Co., Ltd. (21AT). The company is the first commercial Earth observation satellite operator and service provider in China. This new geospatial data service model allows users to directly access multi-source satellite data, manage data orders, and carry out automatic large-volume data production and delivery. The solution also implements secure, hierarchical user management, statistical data analysis, and automatic information reports. In addition, a mobile application is available for users to easily access system functions. This geospatial solution has already been successfully installed and applied at many customer sites in China and is now available globally for international clients interested in fast geospatial solutions. It supports customers' operational services. Besides providing TripleSat Constellation images, the multi-source data access system also allows users to access other satellite data sources under customized agreements. This paper describes and discusses this new geospatial data service model.
MORPAS is a special GIS (geographic information system) software system, based on the MAPGIS platform whose aim is to prospect and evaluate mineral resources quantificationally by synthesizing geological, geophysical,...MORPAS is a special GIS (geographic information system) software system, based on the MAPGIS platform whose aim is to prospect and evaluate mineral resources quantificationally by synthesizing geological, geophysical, geochemical and remote sensing data. It overlays geological database management, geological background and geological abnormality analysis, image processing of remote sensing and comprehensive abnormality analysis, etc.. It puts forward an integrative solution for the application of GIS in basic-level units and the construction of information engineering in the geological field. As the popularization of computer networks and the request of data sharing, it is necessary to extend its functions in data management so that all its data files can be accessed in the network server. This paper utilizes some MAPGIS functions for the second development and ADO (access data object) technique to access multi-source geological data in SQL Server databases. Then remote visiting and congruous management will be realized in the MORPAS system.展开更多
Funding: Supported by the National Science Foundation [grant numbers 1854502 and 1855902]. Publication was made possible in part by support from the HKU Libraries Open Access Author Fund sponsored by the HKU Libraries. USDA is an equal opportunity provider and employer. Mention of trade names or commercial products in this publication is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the U.S. Department of Agriculture.
Abstract: Morphological (e.g., shape, size, and height) and functional (e.g., working, living, and shopping) information on buildings is highly needed for urban planning and management as well as for other applications such as city-scale building energy use modeling. Due to the limited availability of socio-economic geospatial data, it is more challenging to map building functions than building morphology, especially over large areas. In this study, we proposed an integrated framework to map building functions in 50 U.S. cities by integrating multi-source web-based geospatial data. First, a web crawler was developed to extract Points of Interest (POIs) from Tripadvisor.com, and a map crawler was developed to extract POIs and land use parcels from Google Maps. Second, an unsupervised machine learning algorithm named OneClassSVM was used to identify residential buildings based on landscape features derived from Microsoft building footprints. Third, the type ratio of POIs and the area ratio of land use parcels were used to identify six non-residential functions (i.e., hospital, hotel, school, shop, restaurant, and office). The accuracy assessment indicates that the proposed framework performed well, with an average overall accuracy of 94% and a kappa coefficient of 0.63. Given the worldwide coverage of Google Maps and Tripadvisor.com, the proposed framework is transferable to other cities around the world. The data products generated from this study are of great use for quantitative city-scale urban studies, such as building energy use modeling at the single-building level over large areas.
Abstract: With the accelerating intelligent transformation of the energy system, the monitoring of equipment operation status and the optimization of production processes in thermal power plants face the challenge of multi-source heterogeneous data integration. In view of the heterogeneous characteristics of physical sensor data (including temperature, vibration and pressure) generated by boilers, steam turbines and other key equipment, and of the real-time working condition data of the SCADA system, this paper proposes a multi-source heterogeneous data fusion and analysis platform for thermal power plants based on edge computing and deep learning. By constructing a multi-level fusion architecture, the platform adopts a dynamic weight allocation strategy and a 5D digital twin model to realize the collaborative analysis of physical sensor data, simulation calculation results and expert knowledge. The data fusion module combines the Kalman filter, wavelet transform and Bayesian estimation to solve the problems of time series alignment and dimension difference. Simulation results show that the data fusion accuracy can be improved to more than 98%, and the calculation delay can be controlled within 500 ms. The data analysis module integrates a Dymola simulation model and the AERMOD pollutant diffusion model and supports the cascade analysis of boiler combustion efficiency prediction and flue gas emission monitoring; the system response time is less than 2 seconds, and the data consistency verification accuracy reaches 99.5%.
Funding: Supported by the Sichuan Science and Technology Program (Nos. 2024JDRC0100 and 2023YFQ0091), the National Natural Science Foundation of China (Nos. U21A20167 and 52475138), and the Scientific Research Foundation of the State Key Laboratory of Rail Transit Vehicle System (No. 2024RVL-T08).
Abstract: Accurate monitoring of track irregularities helps improve vehicle operation quality and formulate appropriate track maintenance strategies. Existing methods rely on complex signal processing algorithms and lack multi-source data analysis. Driven by multi-source measurement data, including the axle box, bogie frame and carbody accelerations, this paper proposes a track irregularities monitoring network (TIMNet) based on deep learning methods. TIMNet uses the feature extraction capability of convolutional neural networks and the sequence mapping capability of the long short-term memory model to explore the mapping relationship between vehicle accelerations and track irregularities. The particle swarm optimization algorithm is used to optimize the network parameters, so that both the vertical and lateral track irregularities can be accurately identified in the time and spatial domains. The effectiveness and superiority of the proposed TIMNet are analyzed under different simulation conditions using a vehicle dynamics model. Field tests are conducted to demonstrate the applicability of the proposed TIMNet in quantitatively monitoring vertical and lateral track irregularities. Furthermore, comparative tests show that TIMNet achieves a better fit and timeliness in monitoring track irregularities (vertical R² of 0.91, lateral R² of 0.84 and time cost of 10 ms) than other classical regression models. The tests also show that TIMNet has better anti-interference ability than other regression models.
Funding: Under the auspices of the Key Program of the National Natural Science Foundation of China (No. 42030409).
Abstract: Multi-source data fusion provides high-precision spatial situational awareness essential for analyzing granular urban social activities. This study used Shanghai's catering industry as a case study, leveraging electronic reviews and consumer data sourced from third-party restaurant platforms collected in 2021. By performing weighted processing on two-dimensional point-of-interest (POI) data, clustering hotspots of high-dimensional restaurant data were identified. A hierarchical network of restaurant hotspots was constructed following the Central Place Theory (CPT) framework, while the Geo-Informatic Tupu method was employed to resolve the challenges posed by network deformation in multi-scale processes. These findings suggest the necessity of enhancing the spatial balance of Shanghai's urban centers by moderately increasing the number and service capacity of suburban centers at the urban periphery. Such measures would contribute to a more optimized urban structure and facilitate the outward dispersion of comfort-oriented facilities such as the restaurant industry. At a finer spatial scale, the distribution of restaurant hotspots demonstrates a polycentric and symmetric spatial pattern, with a developmental trend radiating outward along the city's ring roads. This trend can be attributed to the efforts of restaurants to establish connections with other urban functional spaces, leading to the reconfiguration of urban spaces, expansion of restaurant-dedicated land use, and the reorganization of associated commercial activities. The results validate the existence of a polycentric urban structure in Shanghai but also highlight the instability of the restaurant hotspot network during cross-scale transitions.
Funding: Sponsored by the Beijing Youth Innovation Talent Support Program for Urban Greening and Landscaping: The 2024 Special Project for Promoting High-Quality Development of Beijing's Landscaping through Scientific and Technological Innovation (KJCXQT202410).
Abstract: Taking the Ming Tombs Forest Farm in Beijing as the research object, this research applied multi-source data fusion and GIS heat-map overlay analysis. It systematically collected bird observation point data from the Global Biodiversity Information Facility (GBIF) and population distribution data from the Oak Ridge National Laboratory (ORNL) in the United States, together with information on the tree species composition of forest areas suitable for birds and the forest geographical information of the Ming Tombs Forest Farm, which was based on literature research and field investigations. Using GIS technology, spatial processing was carried out on the bird observation points and population distribution data to identify suitable bird-watching areas in different seasons. Then, according to the suitability value range, these areas were classified into different grades (from unsuitable to highly suitable). The findings indicated significant spatial heterogeneity in the bird-watching suitability of the Ming Tombs Forest Farm. The north side of the reservoir was generally a core area with high suitability in all seasons. The deep-aged broad-leaved mixed forests supported the overlapping co-existence of the ecological niches of various bird species, such as Zosterops simplex and Urocissa erythrorhyncha. In contrast, the shallow forest-edge coniferous pure forests and mixed forests were more suitable for specialized species like Carduelis sinica. The southern urban area and the core area of the mausoleums had relatively low suitability due to ecological fragmentation or human interference. Based on these results, this paper proposed a three-level protection framework of "core area conservation, buffer zone management, isolation zone construction" and a spatio-temporally coordinated human-bird co-existence strategy. It was also suggested that the human-bird co-existence space could be optimized through measures such as constructing sound and light buffer interfaces, restoring ecological corridors, and integrating cultural heritage elements. This research provided an operational technical approach and decision-making support for the scientific planning of bird-watching sites and the coordination of ecological protection and tourism development.
Abstract: Multi-source seismic technology is an efficient seismic acquisition method that requires a group of blended seismic data to be separated into single-source seismic data for subsequent processing. The separation of blended seismic data is a linear inverse problem. According to the relationship between the shooting number and the simultaneous source number of the acquisition system, this separation is divided into a determined or overdetermined linear inverse problem, which is easy to solve, and an underdetermined linear inverse problem, which is difficult to solve. For the latter, this paper presents an optimization method that imposes a sparsity constraint on wavefields to construct the objective function of the inversion, and the problem is solved using the iterative thresholding method. For the most extremely underdetermined separation problem, with single shooting and multiple sources, this paper presents a method of pseudo-deblending with random noise filtering. In this method, approximate common shot gathers are obtained through the pseudo-deblending process, and the random noise that appears when the approximate common shot gathers are sorted into common receiver gathers is eliminated through filtering. The separation methods proposed in this paper are applied to three types of numerical simulation data (pure data without noise, data with random noise, and data with linear regular noise) and give satisfactory results. The noise suppression effects of these methods are sufficient, particularly with single-shooting blended seismic data, which verifies the effectiveness of the proposed methods.
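The sparsity-constrained inversion mentioned in this abstract can be illustrated with a generic iterative soft-thresholding (ISTA) loop. The sketch below is only a toy under stated assumptions: the abstract does not give the blending operator, the sparsifying transform, or the threshold schedule, so the small matrix `A`, the weight `lam`, and the function names are all hypothetical, and the step size uses the squared Frobenius norm as a simple upper bound on the Lipschitz constant.

```python
def soft_threshold(v, t):
    """Shrink v toward zero by t (proximal operator of t * |v|)."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def ista(A, b, lam, n_iter=2000):
    """Minimise 0.5 * ||A x - b||^2 + lam * ||x||_1 by iterative thresholding.

    A is a list of rows (m x n), b has length m. Illustrative only: a real
    deblending operator would act on wavefields, not on a tiny dense matrix.
    """
    m, n = len(A), len(A[0])
    # Squared Frobenius norm bounds the largest eigenvalue of A^T A.
    lipschitz = sum(a * a for row in A for a in row)
    step = 1.0 / lipschitz
    x = [0.0] * n
    for _ in range(n_iter):
        residual = [sum(A[i][j] * x[j] for j in range(n)) - b[i] for i in range(m)]
        grad = [sum(A[i][j] * residual[i] for i in range(m)) for j in range(n)]
        x = [soft_threshold(x[j] - step * grad[j], step * lam) for j in range(n)]
    return x


# Underdetermined toy system: 2 equations, 3 unknowns, sparse true model.
A = [[1.0, 0.0, 1.0],
     [0.0, 1.0, 1.0]]
b = [1.0, 1.0]
x_hat = ista(A, b, lam=0.1)
```

In this toy case the L1 penalty selects the single-coefficient model over the equally data-consistent two-coefficient one, which is the behaviour a sparsity constraint is meant to provide in an underdetermined separation.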
Funding: Supported by the National Natural Science Foundation of China under Grants 51722406, 52074340, and 51874335; the Shandong Provincial Natural Science Foundation under Grant JQ201808; the Fundamental Research Funds for the Central Universities under Grant 18CX02097A; the Major Scientific and Technological Projects of CNPC under Grant ZD2019-183-008; the Science and Technology Support Plan for Youth Innovation of University in Shandong Province under Grant 2019KJH002; the National Research Council of Science and Technology Major Project of China under Grant 2016ZX05025001-006; the 111 Project under Grant B08028; and the Sinopec Science and Technology Project under Grant P20050-1.
Abstract: For reservoirs with complex non-Gaussian geological characteristics, such as carbonate reservoirs or reservoirs with sedimentary facies distribution, it is difficult to implement history matching directly, especially with ensemble-based data assimilation methods. In this paper, we propose a multi-source information fused generative adversarial network (MSIGAN) model, which is used for the parameterization of complex geologies. In MSIGAN, various kinds of information, such as facies distribution, microseismic data, and inter-well connectivity, can be integrated to learn the geological features. Two major generative models in deep learning, the variational autoencoder (VAE) and the generative adversarial network (GAN), are combined in our model. The proposed MSIGAN model is then integrated into the ensemble smoother with multiple data assimilation (ESMDA) method to conduct history matching. We tested the proposed method on two reservoir models with fluvial facies. The experimental results show that the proposed MSIGAN model can effectively learn the complex geological features, which can improve the accuracy of history matching.
Funding: Supported by the National Natural Science Foundation of China (Grant No. U2202213) and the Special Program for the Major Science and Technology Projects of Yunnan Province, China (Grant Nos. 202102AE090051-1-01 and 202202AE090001).
Abstract: In traditional medicine and ethnomedicine, medicinal plants have long been recognized as the basis for materials in therapeutic applications worldwide. In particular, the remarkable curative effect of traditional Chinese medicine during the coronavirus disease 2019 (COVID-19) pandemic has attracted extensive attention globally. Medicinal plants have, therefore, become increasingly popular among the public. However, with increasing demand for and profit from medicinal plants, commercial fraud such as adulteration or counterfeiting sometimes occurs, which poses a serious threat to clinical outcomes and the interests of consumers. With rapid advances in artificial intelligence, machine learning can be used to mine information on various medicinal plants to establish an ideal resource database. We herein present a review that mainly introduces common machine learning algorithms and discusses their application in multi-source data analysis of medicinal plants. The combination of machine learning algorithms and multi-source data analysis facilitates a comprehensive analysis and aids in the effective evaluation of the quality of medicinal plants. The findings of this review provide new possibilities for promoting the development and utilization of medicinal plants.
Funding: Funded by the High-Quality and Cutting-Edge Discipline Construction Project for Universities in Beijing (Internet Information, Communication University of China).
Abstract: Multi-source data plays an important role in the evolution of media convergence. Its fusion processing enables the further mining of data and utilization of data value and broadens the path for the sharing and dissemination of media data. However, it also faces serious problems in terms of protecting user and data privacy. Many privacy protection methods have been proposed to solve the problem of privacy leakage during the process of data sharing, but they suffer from two flaws: 1) the lack of algorithmic frameworks for specific scenarios such as dynamic datasets in the media domain; 2) the inability to solve the problem of the high computational complexity of ciphertext in multi-source data privacy protection, resulting in long encryption and decryption times. In this paper, we propose a multi-source data privacy protection method based on homomorphic encryption and blockchain technology, which solves the privacy protection problem of multi-source heterogeneous data in the dissemination of media and reduces ciphertext processing time. We deployed the proposed method on the Hyperledger platform for testing and compared it with privacy protection schemes based on k-anonymity and differential privacy. The experimental results show that the key generation, encryption, and decryption times of the proposed method are lower than those of data privacy protection methods based on k-anonymity and differential privacy. This significantly reduces the processing time of multi-source data, which gives it potential for use in many applications.
Funding: National Social Science Foundation of China, No. 15BJY051; Open Topic of Hunan Key Laboratory of Land Resources Evaluation and Utilization, No. SYS-ZX-202002; Research Project of Appraisement Committee of Social Sciences Research Achievements of Hunan Province, No. XSP18ZDI031.
Abstract: In view of the lack of comprehensive evaluation and analysis combining natural and human multi-dimensional factors, the urban surface temperature patterns of Changsha in 2000, 2009 and 2016 are retrieved based on multi-source spatial data (Landsat 5 and Landsat 8 satellite image data, POI spatial big data, digital elevation model, etc.), and 12 natural and human factors closely related to the urban thermal environment are quickly obtained. The standard deviation ellipse and spatial principal component analysis (PCA) methods are used to analyze the urban human residential thermal environment and its influencing factors. The results showed that the heat island area increased by 547 km² and the maximum surface temperature difference reached 10.1 ℃ during the period 2000–2016. The urban heat island was mainly concentrated in urban built-up areas, such as industrial and commercial agglomerations and densely populated urban centers. The intensity of the heat island gradually decreases from the urban center to the suburbs. There were multiple high-temperature centers, such as the Wuyi Square business circle, the Xingsha economic and technological development zone in Changsha County, the Wangcheng industrial zone, the Yuelu industrial agglomeration, and the Tianxin industrial zone. From 2000 to 2016, the main axis of spatial development of the heat island remained in the northeast-southwest direction. The center of gravity of the heat island shifted 2.7 km to the southwest with a deflection angle of 54.9° in 2000–2009, and shifted 4.8 km to the northeast with a deflection angle of 60.9° in 2009–2016. On the whole, the change in the spatial pattern of the thermal environment in Changsha was related to the change in urban construction intensity. The PCA results indicated that landscape pattern, urban construction intensity and topography were the main factors affecting the spatial pattern of the urban thermal environment of Changsha. The promoting effect of human factors on the formation of the heat island was clearly greater than that of natural factors. The temperature would rise by 0.293 ℃ under the combined effect of human and natural factors. Given the complexity of the factors influencing the urban thermal environment of human settlements, the use of multi-source data can help reveal the spatial pattern and evolution of the urban thermal environment, deepen the understanding of the causes of the urban heat island effect, and clarify the correlation between human and natural factors, so as to provide scientific support for improving the quality of urban human settlements.
Funding: Under the auspices of the Natural Science Foundation of China (No. 41971166).
Abstract: The urban functional area (UFA) is a core scientific issue affecting urban sustainability. The current knowledge gap is mainly reflected in the lack of multi-scale quantitative interpretation methods from the perspective of human-land interaction. In this paper, based on multi-source big data including 250 m × 250 m resolution cell phone data, 1.81×10⁵ Points of Interest (POI) records and administrative boundary data, we built a UFA identification method and demonstrated it empirically in Shenyang City, China. We argue that the method can effectively identify multi-scale, multi-type UFAs based on human activity and further reveal the spatial correlation between urban facilities and human activity. The empirical study suggests that the employment functional zones in Shenyang City are more concentrated in the central city than other single functional zones. There are more mixed functional areas in the central city, while the planned industrial new cities in Shenyang need to develop comprehensive functions. UFAs have scale effects and human-land interaction patterns. We suggest that city decision makers apply multi-source big data to measure urban functional services in a more refined manner from a supply-demand perspective.
Funding: Supported by the National Natural Science Foundation of China (No. 51379006 and No. 51009106), the Program for New Century Excellent Talents in University of the Ministry of Education of China (No. NCET-12-0404), and the National Basic Research Program of China ("973" Program, No. 2013CB035903).
Abstract: Due to the complex nature of multi-source geological data, it is difficult to rebuild every geological structure through a single 3D modeling method. The multi-source data interpretation method put forward in this analysis is based on a database-driven pattern and focuses on the discrete and irregular features of geological data. The geological data, which come from a variety of sources and cover a range of accuracy, resolution, quantity and quality, are classified and integrated according to their reliability and consistency for 3D modeling. A new interpolation-approximation fitting algorithm for constructing geological surfaces with the non-uniform rational B-spline (NURBS) technique is then presented. The NURBS technique can balance the requirements for accuracy, surface continuity and data storage of geological structures. Finally, four alternative 3D modeling approaches, selected according to the data quantity and accuracy specification, are demonstrated with examples. The proposed approaches offer flexible modeling patterns for different practical engineering demands.
Funding: This study was supported by the National Key Research and Development Project (Project No. 2017YFD0301506), the National Social Science Foundation (Project No. 71774052), and the Hunan Education Department Scientific Research Project (Project No. 17K04417A092).
Abstract: Data fusion can effectively process multi-sensor information to obtain more accurate and reliable results than a single sensor. Water quality data in the environment come from different sensors, so the data must be fused. In our research, a self-adaptive weighted data fusion method is used to integrate, parameter by parameter, the data on pH, temperature, dissolved oxygen and NH3 concentration of the water environment. Based on the fusion, the Grubbs method is used to detect abnormal data so as to provide data support for estimation, prediction and early warning of water quality.
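The two steps named in this abstract, adaptive weighting and the Grubbs check, can be sketched as follows. The abstract does not give the exact weighting rule, so this sketch assumes the common inverse-variance form, in which each sensor's weight is proportional to the reciprocal of its sample variance. The function names and readings are hypothetical, and the Grubbs critical value `g_crit` is left as a caller-supplied parameter because it depends on the sample size and significance level.

```python
from statistics import mean, stdev, variance


def adaptive_weights(series):
    """Inverse-variance weights, one per sensor (assumed weighting rule).

    Each element of `series` is a list of repeated readings from one sensor.
    A sensor with zero sample variance would need special handling.
    """
    inverse = [1.0 / variance(s) for s in series]
    total = sum(inverse)
    return [v / total for v in inverse]


def fuse(series):
    """Weighted combination of the per-sensor means."""
    weights = adaptive_weights(series)
    return sum(w * mean(s) for w, s in zip(weights, series))


def grubbs_outlier(values, g_crit):
    """Return (index, G) of the most extreme value if G exceeds g_crit,
    otherwise (None, G). G = max |x_i - mean| / sample standard deviation."""
    m, s = mean(values), stdev(values)
    idx = max(range(len(values)), key=lambda i: abs(values[i] - m))
    g = abs(values[idx] - m) / s
    return (idx, g) if g > g_crit else (None, g)


# Hypothetical pH readings from two probes; probe_a is the steadier one.
probe_a = [20.0, 20.2, 19.8]
probe_b = [21.0, 19.0, 20.0]
weights = adaptive_weights([probe_a, probe_b])
fused = fuse([probe_a, probe_b])

# Hypothetical fused series with one suspicious reading at the end.
readings = [7.1, 7.0, 7.2, 7.1, 6.9, 7.0, 7.1, 7.2, 7.0, 9.5]
suspect, g_stat = grubbs_outlier(readings, g_crit=2.29)
```

The steadier sensor receives the larger weight, and the final reading in `readings` is flagged because its statistic exceeds the supplied threshold. In practice the flagged point would be removed and the test repeated until no further outlier is found.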
Abstract: Although big data are widely used in various fields, their application is still rare in the study of mining subsidence prediction (MSP) for underground mining. Traditional MSP research oversimplifies geological mining conditions and ignores the spatial fluctuation of rock layers. In the context of geospatial big data, a data-intensive FLAC3D (Fast Lagrangian Analysis of Continua in 3 Dimensions) model based on borehole logs is proposed in this paper. In the modeling process, we developed a method to handle geospatial big data and were able to make full use of borehole logs. The effectiveness of the proposed method was verified by comparing the results of the traditional method, the proposed method, and field observation. The findings show that the proposed method has obvious advantages over the traditional method. The relative error of the maximum surface subsidence predicted by the proposed method decreased by 93.7%, and the standard deviation of the prediction results (for 70 points) decreased by 39.4% on average. The data-intensive modeling method is of great significance for improving the accuracy of mining subsidence predictions.
Funding: Sponsored by the National Natural Science Foundation of China (Grant No. 51138003).
Abstract: To estimate vehicular queue length at signalized intersections accurately and to overcome the shortcomings and restrictions of existing studies, especially those based on shockwave theory, a new methodology is presented for estimating vehicular queue length using data from both point detectors and probe vehicles. The methodology applies shockwave theory to model queue evolution over time and space. Using probe vehicle locations and times as well as traffic states measured by point detectors, analytical formulations for calculating the maximum and minimum (residual) queue length are developed. The proposed methodology is verified using ground truth data collected from numerical experiments conducted in Shanghai, China. The methodology has a mean absolute percentage error of 17.09%, which is reasonably effective for estimating queue length at signalized intersections. Limitations of the proposed models and algorithms are also discussed in the paper.
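The shockwave reasoning behind this abstract can be illustrated with the textbook kinematic-wave relation, in which the speed of the boundary between two traffic states is the ratio of the flow difference to the density difference. The sketch below is not the paper's formulation, which also uses probe vehicle and detector data. It assumes constant arrival and discharge states and a single red phase, and every name and number in it is hypothetical.

```python
def shockwave_speed(q_up, k_up, q_down, k_down):
    """Speed of the boundary between an upstream and a downstream traffic
    state: (q_down - q_up) / (k_down - k_up). Flows in veh/h, densities in
    veh/km, result in km/h (negative means the wave travels upstream)."""
    return (q_down - q_up) / (k_down - k_up)


def max_queue_length(q_arrive, k_arrive, k_jam, q_sat, k_sat, red_hours):
    """Distance upstream of the stop line (km) at which the discharge wave
    catches the queuing wave, assuming constant states and one red phase."""
    # Queuing wave: arrival state meets the stopped (jam) state.
    w_queue = abs(shockwave_speed(q_arrive, k_arrive, 0.0, k_jam))
    # Discharge wave: jam state meets the saturation-flow state at green onset.
    w_discharge = abs(shockwave_speed(0.0, k_jam, q_sat, k_sat))
    if w_discharge <= w_queue:
        raise ValueError("discharge wave never catches the queue (oversaturated)")
    return w_queue * w_discharge * red_hours / (w_discharge - w_queue)


# Hypothetical states: 600 veh/h arriving at 20 veh/km, jam density 150 veh/km,
# saturation flow 1800 veh/h at 50 veh/km, red phase of 30 s.
w1 = shockwave_speed(600.0, 20.0, 0.0, 150.0)
length_km = max_queue_length(600.0, 20.0, 150.0, 1800.0, 50.0, 30.0 / 3600.0)
```

With these numbers the back of the queue moves upstream at roughly 4.6 km/h and the queue peaks at roughly 52 m. Detector and probe data of the kind the abstract describes would be what supplies the traffic states that are simply assumed here.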
Funding: Under the auspices of the National Natural Science Foundation of China (No. 40871188) and the Knowledge Innovation Programs of the Chinese Academy of Sciences (No. INFO-115-C01-SDB4-05).
Abstract: Land cover classification is the core of converting satellite imagery into usable geographic data. However, spectral signatures do not always provide enough information for classification decisions, so the application of multi-source data becomes necessary. This paper presents an evidential reasoning (ER) approach to incorporate Landsat TM imagery, altitude and slope data. Results show that multi-source data improve the classification accuracy achieved by the ER method, whereas they reduce the accuracy of the maximum likelihood classifier (MLC). Compared with the results based on TM imagery alone, the overall accuracy of the ER method increases by 7.66% and that of the MLC method decreases by 8.35% when all data sources (TM plus altitude and slope) are used. The ER method is regarded as a better approach for multi-source image classification. In addition, the method produces not only an accurate classification result but also the uncertainty, which reflects the inherent difficulty of classification decisions. The uncertainty associated with the ER classification image is evaluated and shown to be useful for improving classification accuracy.
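Evidential reasoning is commonly built on Dempster's rule of combination, so a minimal sketch of that rule may help show how evidence from two sources is merged and where the reported uncertainty comes from. Whether the paper uses exactly this rule is an assumption, and the class names and mass values below are invented for illustration.

```python
def combine(m1, m2):
    """Dempster's rule of combination for two mass functions.

    Each mass function maps a frozenset of class labels to a mass, with the
    masses summing to 1. Mass on the full frame expresses ignorance.
    """
    combined = {}
    conflict = 0.0
    for set_a, mass_a in m1.items():
        for set_b, mass_b in m2.items():
            common = set_a & set_b
            if common:
                combined[common] = combined.get(common, 0.0) + mass_a * mass_b
            else:
                # Contradictory evidence: the two sources support disjoint classes.
                conflict += mass_a * mass_b
    if conflict >= 1.0:
        raise ValueError("sources are in total conflict")
    return {s: v / (1.0 - conflict) for s, v in combined.items()}


FOREST = frozenset({"forest"})
WATER = frozenset({"water"})
FRAME = frozenset({"forest", "water"})

# Hypothetical per-pixel evidence from a spectral source and a terrain source.
spectral = {FOREST: 0.6, FRAME: 0.4}
terrain = {FOREST: 0.5, WATER: 0.3, FRAME: 0.2}
fused = combine(spectral, terrain)
```

After combination, the mass left on the whole frame (`FRAME`) is the part of the belief that neither source could assign to a single class. That residual is one way to read the per-pixel uncertainty the abstract refers to.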
Funding: This work was jointly supported by the National Key Research and Development Program of China [grant number 2018YFB1600900] and the National Natural Science Foundation of China [grant number 71601045].
Abstract: Accessibility is a representative indicator for evaluating the supply of a bus system. Traditional studies have evaluated accessibility from different aspects. Considering the interaction among land use, bus timetable arrangement and individual factors, a more holistic accessibility measurement is proposed that combines static and dynamic characteristics from multi-source traffic data. The rationale of the proposed model is verified by a case study of the bus system in Shenzhen, China, which is carried out to find the spatial and temporal discrepancies in bus service. It is found that adjusting the bus schedule to time-varying travel demand can affect the accessibility of the bus system, and that land-use development, average bus speed and bus facilities all have positive effects on accessibility. These findings provide a significant reference for transport planning and policy-making. The proposed model is not limited to measuring the accessibility of bus systems but is also applicable to other travel modes.
Funding: This work was supported by the National Natural Science Foundation for Distinguished Young Scholars of China (Grant No. 41725006).
Abstract: Growing attention has been directed to the use of satellite imagery and open geospatial data to understand large-scale sustainable development outcomes. Health and education are critical domains of the United Nations' Sustainable Development Goals (SDGs), yet existing research on the accessibility of the corresponding services has focused mainly on detailed but small-scale studies. This means that such studies lack accessibility metrics for large-scale quantitative evaluations. To address this deficiency, we evaluated the accessibility of health and education services in China's Mainland in 2021 using point-of-interest data, OpenStreetMap road data, land cover data, and WorldPop spatial demographic data. The accessibility metrics used were the least time costs of reaching hospital and school services and the population coverage with a time cost of less than 1 h. On the basis of the road network and land cover information, the overall average time costs of reaching hospitals and schools were 20 and 22 min, respectively. In terms of population coverage, 94.7% and 92.5% of the population in China has a time cost of less than 1 h in obtaining hospital and school services, respectively. Counties with low accessibility to hospitals and schools were highly coupled with poor areas and ecological function regions, with the time cost incurred in these areas being more than twice that experienced in non-poor and non-ecological areas. Furthermore, the cumulative time cost incurred by the bottom 20% of counties (by GDP) from access to hospital and school services reached approximately 80% of the national total. Low-GDP counties bore disproportionately higher time costs to acquire health and education services than high-GDP counties. The accessibility metrics proposed in this study are highly related to SDGs 3 and 4, and they can serve as auxiliary data that can be used to enhance the evaluation of SDG outcomes. The analysis of the uneven distribution of health and education services in China can help identify areas with backward public services and may contribute to targeted and efficient policy interventions.
Funding: Supported by the project of Beijing Municipal Science and Technology Commission and Science and Technology Innovation Base of Cultivating and Developing Engineering [grant number Z161100005016069] and the National High Technology Research and Development Program [grant number 2013AA12A303].
Abstract: With the growing number of sensors, applications and customers, data providers and users are demanding a new geospatial data service model that is low-cost, highly flexible, and comprehensive. Based on these requirements, the 21AT TripleSat constellation terminal and data delivery and management system has been developed by a Beijing-based high-tech enterprise, Twenty First Century Aerospace Technology Co., Ltd. (21AT). The company is the first commercial Earth observation satellite operator and service provider in China. This new geospatial data service model allows the user to directly access multi-source satellite data, manage data orders, and carry out automatic massive data production and delivery. The solution also implements safe and hierarchical user management, statistical data analysis, and automatic information reports. In addition, a mobile application is available for users to easily access system functions. This new geospatial solution has already been successfully installed and applied at many customer sites in China, and is now available globally for international clients interested in fast geospatial solutions. It supports customers' operational services. Besides providing TripleSat Constellation images, the multi-source data access system also allows users to access other satellite data sources, based on customized agreements. This paper describes and discusses this new geospatial data service model.
Abstract: MORPAS is a specialized GIS (geographic information system) software system based on the MAPGIS platform, whose aim is to prospect and quantitatively evaluate mineral resources by synthesizing geological, geophysical, geochemical and remote sensing data. It covers geological database management, geological background and geological abnormality analysis, remote sensing image processing, comprehensive abnormality analysis, etc. It puts forward an integrated solution for the application of GIS in basic-level units and the construction of information engineering in the geological field. With the popularization of computer networks and the demand for data sharing, it is necessary to extend its data management functions so that all its data files can be accessed on the network server. This paper uses some MAPGIS functions for secondary development and the ADO (ActiveX Data Objects) technique to access multi-source geological data in SQL Server databases. Remote access and unified management can then be realized in the MORPAS system.