Based on the 16d-composite MODIS (moderate resolution imaging spectroradiometer)-NDVI(normalized difference vegetation index) time-series data in 2004, vegetation in North Tibet Plateau was classified and seasonal...Based on the 16d-composite MODIS (moderate resolution imaging spectroradiometer)-NDVI(normalized difference vegetation index) time-series data in 2004, vegetation in North Tibet Plateau was classified and seasonal variations on the pixels selected from different vegetation type were analyzed. The Savitzky-Golay filtering algorithm was applied to perform a filtration processing for MODIS-NDVI time-series data. The processed time-series curves can reflect a real variation trend of vegetation growth. The NDVI time-series curves of coniferous forest, high-cold meadow, high-cold meadow steppe and high-cold steppe all appear a mono-peak model during vegetation growth with the maximum peak occurring in August. A decision-tree classification model was established according to either NDVI time-series data or land surface temperature data. And then, both classifying and processing for vegetations were carried out through the model based on NDVI time-series curves. An accuracy test illustrates that classification results are of high accuracy and credibility and the model is conducive for studying a climate variation and estimating a vegetation production at regional even global scale.展开更多
Clustering is used to gain an intuition of the struc tures in the data.Most of the current clustering algorithms pro duce a clustering structure even on data that do not possess such structure.In these cases,the algor...Clustering is used to gain an intuition of the struc tures in the data.Most of the current clustering algorithms pro duce a clustering structure even on data that do not possess such structure.In these cases,the algorithms force a structure in the data instead of discovering one.To avoid false structures in the relations of data,a novel clusterability assessment method called density-based clusterability measure is proposed in this paper.I measures the prominence of clustering structure in the data to evaluate whether a cluster analysis could produce a meaningfu insight to the relationships in the data.This is especially useful in time-series data since visualizing the structure in time-series data is hard.The performance of the clusterability measure is evalu ated against several synthetic data sets and time-series data sets which illustrate that the density-based clusterability measure can successfully indicate clustering structure of time-series data.展开更多
Underground coal fires are one of the most common and serious geohazards in most coal producing countries in the world. Monitoring their spatio-temporal changes plays an important role in controlling and preventing th...Underground coal fires are one of the most common and serious geohazards in most coal producing countries in the world. Monitoring their spatio-temporal changes plays an important role in controlling and preventing the effects of coal fires, and their environmental impact. In this study, the spatio-temporal changes of underground coal fires in Khanh Hoa coal field(North-East of Viet Nam) were analyzed using Landsat time-series data during the 2008-2016 period. Based on land surface temperatures retrieved from Landsat thermal data, underground coal fires related to thermal anomalies were identified using the MEDIAN+1.5×IQR(IQR: Interquartile range) threshold technique. The locations of underground coal fires were validated using a coal fire map produced by the field survey data and cross-validated using the daytime ASTER thermal infrared imagery. Based on the fires extracted from seven Landsat thermal imageries, the spatiotemporal changes of underground coal fire areas were analyzed. The results showed that the thermalanomalous zones have been correlated with known coal fires. Cross-validation of coal fires using ASTER TIR data showed a high consistency of 79.3%. The largest coal fire area of 184.6 hectares was detected in 2010, followed by 2014(181.1 hectares) and 2016(178.5 hectares). The smaller coal fire areas were extracted with areas of 133.6 and 152.5 hectares in 2011 and 2009 respectively. Underground coal fires were mainly detected in the northern and southern part, and tend to spread to north-west of the coal field.展开更多
The increasing penetration rate of electric kickboard vehicles has been popularized and promoted primarily because of its clean and efficient features.Electric kickboards are gradually growing in popularity in tourist...The increasing penetration rate of electric kickboard vehicles has been popularized and promoted primarily because of its clean and efficient features.Electric kickboards are gradually growing in popularity in tourist and education-centric localities.In the upcoming arrival of electric kickboard vehicles,deploying a customer rental service is essential.Due to its freefloating nature,the shared electric kickboard is a common and practical means of transportation.Relocation plans for shared electric kickboards are required to increase the quality of service,and forecasting demand for their use in a specific region is crucial.Predicting demand accurately with small data is troublesome.Extensive data is necessary for training machine learning algorithms for effective prediction.Data generation is a method for expanding the amount of data that will be further accessible for training.In this work,we proposed a model that takes time-series customers’electric kickboard demand data as input,pre-processes it,and generates synthetic data according to the original data distribution using generative adversarial networks(GAN).The electric kickboard mobility demand prediction error was reduced when we combined synthetic data with the original data.We proposed Tabular-GAN-Modified-WGAN-GP for generating synthetic data for better prediction results.We modified The Wasserstein GAN-gradient penalty(GP)with the RMSprop optimizer and then employed Spectral Normalization(SN)to improve training stability and faster convergence.Finally,we applied a regression-based blending ensemble technique that can help us to improve performance of demand prediction.We used various evaluation criteria and visual representations to compare our proposed model’s performance.Synthetic data generated by our suggested GAN model is also evaluated.The TGAN-Modified-WGAN-GP model mitigates the overfitting and mode collapse problem,and it also converges faster than previous GAN models for synthetic data creation.The presented model’s performance is compared to existing ensemble and baseline models.The experimental findings imply that combining synthetic and actual data can significantly reduce prediction error rates in the mean absolute percentage error(MAPE)of 4.476 and increase prediction accuracy.展开更多
In this study,we developed software for vehicle big data analysis to analyze the time-series data of connected vehicles.We designed two software modules:The rst to derive the Pearson correlation coefcients to analyze ...In this study,we developed software for vehicle big data analysis to analyze the time-series data of connected vehicles.We designed two software modules:The rst to derive the Pearson correlation coefcients to analyze the collected data and the second to conduct exploratory data analysis of the collected vehicle data.In particular,we analyzed the dangerous driving patterns of motorists based on the safety standards of the Korea Transportation Safety Authority.We also analyzed seasonal fuel efciency(four seasons)and mileage of vehicles,and identied rapid acceleration,rapid deceleration,sudden stopping(harsh braking),quick starting,sudden left turn,sudden right turn and sudden U-turn driving patterns of vehicles.We implemented the density-based spatial clustering of applications with a noise algorithm for trajectory analysis based on GPS(Global Positioning System)data and designed a long shortterm memory algorithm and an auto-regressive integrated moving average model for time-series data analysis.In this paper,we mainly describe the development environment of the analysis software,the structure and data ow of the overall analysis platform,the conguration of the collected vehicle data,and the various algorithms used in the analysis.Finally,we present illustrative results of our analysis,such as dangerous driving patterns that were detected.展开更多
With the development of the Industrial Internet of Things(IIoT)and cloud computing technologies,intelligent sensors in the field that can generate large volumes of time-series data continuously have emerged.Due to the...With the development of the Industrial Internet of Things(IIoT)and cloud computing technologies,intelligent sensors in the field that can generate large volumes of time-series data continuously have emerged.Due to the lack of equipment and network impacts,highly distributed industrial applications cannot capture and transfer all production data to a distant cloud server in real time.Consequently,a portion of critical production data is lost,which poses the significant challenge of the timely replenishment of missing data.Employing deep learning in the cloud center for data trend prediction based on relevant data can solve this problem.The objective of this study was to develop a time-series prediction model that combines a Transformer model with a sparse Mixture of Experts(MoE).The model is designed specifically for an IIoT system that is used in oil-well operations.The proposed TransMoE prediction model combines the advantages of the MoE and the Transformer model.The MoE can effectively handle multiple subtasks while the Transformer algorithm can reflect the long-range dependency of the input data series.The proposed model was used to predict oil-well yields,and the predicted outcomes were compared with those obtained using a CNN-GRU and CNNLSTM models,as well as the actual recorded data.The experimental results indicated that the proposed TransMoE model can significantly increase the efficiency and accuracy of oil well production sequence data prediction,with an average relative error of 6.26%,which can satisfy the requirements of enterprise data usage.展开更多
A simple data assimilation method for improving estimation of moderate resolution imaging spectroradiometer (MODIS) leaf area index (LAI) time-series data products based on the gradient inverse weighted filter and...A simple data assimilation method for improving estimation of moderate resolution imaging spectroradiometer (MODIS) leaf area index (LAI) time-series data products based on the gradient inverse weighted filter and object analysis is proposed. The properties and quality control (QC) of MODIS LAI data products are introduced. Also, the gradient inverse weighted filter and object analysis are analyzed. An experiment based on the simple data assimilation method is performed using MODIS LAI data sets from 2000 to 2005 of Guizhou Province in China.展开更多
A tremendous amount of data has been generated by global financial markets everyday,and such time-series data needs to be analyzed in real time to explore its potential value.In recent years,we have witnessed the succ...A tremendous amount of data has been generated by global financial markets everyday,and such time-series data needs to be analyzed in real time to explore its potential value.In recent years,we have witnessed the successful adoption of machine learning models on financial data,where the importance of accuracy and timeliness demands highly effective computing frameworks.However,traditional financial time-series data processing frameworks have shown performance degradation and adaptation issues,such as the outlier handling with stock suspension in Pandas and TA-Lib.In this paper,we propose HXPY,a high-performance data processing package with a C++/Python interface for financial time-series data.HXPY supports miscellaneous acceleration techniques such as the streaming algorithm,the vectorization instruction set,and memory optimization,together with various functions such as time window functions,group operations,down-sampling operations,cross-section operations,row-wise or column-wise operations,shape transformations,and alignment functions.The results of benchmark and incremental analysis demonstrate the superior performance of HXPY compared with its counterparts.From MiBs to GiBs data,HXPY significantly outperforms other in-memory dataframe computing rivals even up to hundreds of times.展开更多
By employing the unique phenological feature of winter wheat extracted from peak before winter (PBW) and the advantages of moderate resolution imaging spectroradiometer (MODIS) data with high temporal resolution a...By employing the unique phenological feature of winter wheat extracted from peak before winter (PBW) and the advantages of moderate resolution imaging spectroradiometer (MODIS) data with high temporal resolution and intermediate spatial resolution, a remote sensing-based model for mapping winter wheat on the North China Plain was built through integration with Landsat images and land-use data. First, a phenological window, PBW was drawn from time-series MODIS data. Next, feature extraction was performed for the PBW to reduce feature dimension and enhance its information. Finally, a regression model was built to model the relationship of the phenological feature and the sample data. The amount of information of the PBW was evaluated and compared with that of the main peak (MP). The relative precision of the mapping reached up to 92% in comparison to the Landsat sample data, and ranged between 87 and 96% in comparison to the statistical data. These results were sufficient to satisfy the accuracy requirements for winter wheat mapping at a large scale. Moreover, the proposed method has the ability to obtain the distribution information for winter wheat in an earlier period than previous studies. This study could throw light on the monitoring of winter wheat in China by using unique phenological feature of winter wheat.展开更多
In the age of information sharing, logistics information sharing also faces the risk of privacy leakage. In regard to the privacy leakage of time-series location information in the field of logistics, this paper propo...In the age of information sharing, logistics information sharing also faces the risk of privacy leakage. In regard to the privacy leakage of time-series location information in the field of logistics, this paper proposes a method based on differential privacy for time-series location data publication. Firstly, it constructs public region of interest(PROI) related to time by using clustering optimal algorithm. And it adopts the method of the centroid point to ensure the public interest point(PIP) representing the location of the public interest zone. Secondly, according to the PIP, we can construct location search tree(LST) that is a commonly used index structure of spatial data, in order to ensure the inherent relation among location data. Thirdly, we add Laplace noise to the node of LST, which means fewer times to add Laplace noise on the original data set and ensures the data availability. Finally, experiments show that this method not only ensures the security of sequential location data publishing, but also has better data availability than the general differential privacy method, which achieves a good balance between the security and availability of data.展开更多
Accurate information about phenological stages is essential for canola field management practices such as irrigation, fertilization, and harvesting. Previous studies in canola phenology monitoring focused mainly on th...Accurate information about phenological stages is essential for canola field management practices such as irrigation, fertilization, and harvesting. Previous studies in canola phenology monitoring focused mainly on the flowering stage, using its apparent structure features and colors. Additional phenological stages have been largely overlooked. The objective of this study was to improve a shape-model method(SMM) for extracting winter canola phenological stages from time-series top-of-canopy reflectance images collected by an unmanned aerial vehicle(UAV). The transformation equation of the SMM was refined to account for the multi-peak features of the temporal dynamics of three vegetation indices(VIs)(NDVI, EVI, and CI). An experiment with various seeding scenarios was conducted, including four different seeding dates and three seeding densities. Three mathematical functions: asymmetric Gaussian function(AGF), Fourier function, and double logistic function, were employed to fit timeseries vegetation indices to extract information about phenological stages. The refined SMM effectively estimated the phenological stages of canola, with a minimum root mean square error(RMSE) of 3.7 days for all phenological stages. The AGF function provided the best fitting performance, as it captured multiple peaks in the growth dynamics characteristics for all seeding date scenarios using four scaling parameters. For the three selected VIs, CIred-edgeachieved the greatest accuracy in estimating the phenological stage dates. This study demonstrates the high potential of the refined SMM for estimating winter canola phenology.展开更多
This essay combines the Defense Meteorological Satellite Program Operational Linescan System(DMSP-OLS)nighttime light data and the Visible Infrared Imaging Radiometer Suite(VIIRS)nighttime light data into a“synthetic...This essay combines the Defense Meteorological Satellite Program Operational Linescan System(DMSP-OLS)nighttime light data and the Visible Infrared Imaging Radiometer Suite(VIIRS)nighttime light data into a“synthetic DMSP”dataset,from 1992 to 2020,to retrieve the spatio-temporal variations in energy-related carbon emissions in Xinjiang,China.Then,this paper analyzes several influencing factors for spatial differentiation of carbon emissions in Xinjiang with the application of geographical detector technique.Results reveal that(1)total carbon emissions continued to grow,while the growth rate slowed down in the past five years.(2)Large regional differences exist in total carbon emissions across various regions.Total carbon emissions of these regions in descending order are the northern slope of the Tianshan(Mountains)>the southern slope of the Tianshan>the three prefectures in southern Xinjiang>the northern part of Xinjiang.(3)Economic growth,population size,and energy consumption intensity are the most important factors of spatial differentiation of carbon emissions.The interaction between economic growth and population size as well as between economic growth and energy consumption intensity also enhances the explanatory power of carbon emissions’spatial differentiation.This paper aims to help formulate differentiated carbon reduction targets and strategies for cities in different economic development stages and those with different carbon intensities so as to achieve the carbon peak goals in different steps.展开更多
The rapid integration of Internet of Things(IoT)technologies is reshaping the global energy landscape by deploying smart meters that enable high-resolution consumption monitoring,two-way communication,and advanced met...The rapid integration of Internet of Things(IoT)technologies is reshaping the global energy landscape by deploying smart meters that enable high-resolution consumption monitoring,two-way communication,and advanced metering infrastructure services.However,this digital transformation also exposes power system to evolving threats,ranging from cyber intrusions and electricity theft to device malfunctions,and the unpredictable nature of these anomalies,coupled with the scarcity of labeled fault data,makes realtime detection exceptionally challenging.To address these difficulties,a real-time decision support framework is presented for smart meter anomality detection that leverages rolling time windows and two self-supervised contrastive learning modules.The first module synthesizes diverse negative samples to overcome the lack of labeled anomalies,while the second captures intrinsic temporal patterns for enhanced contextual discrimination.The end-to-end framework continuously updates its model with rolling updated meter data to deliver timely identification of emerging abnormal behaviors in evolving grids.Extensive evaluations on eight publicly available smart meter datasets over seven diverse abnormal patterns testing demonstrate the effectiveness of the proposed full framework,achieving average recall and F1 score of more than 0.85.展开更多
Red tide is an ecological disaster caused by the excessive proliferation of photosynthetic algae in the ocean.The frequent occurrences of red tide have brought serious harms to the marine aquaculture and caused signif...Red tide is an ecological disaster caused by the excessive proliferation of photosynthetic algae in the ocean.The frequent occurrences of red tide have brought serious harms to the marine aquaculture and caused significant economic losses to the marine industry.Red tide prediction can alleviate and even stop the long-term damages to marine ecosystems,which helps maintain the ecological balance of the ocean environment and contributes to the Sustainable Development Goal of“life below water”formulated by the United Nations.Aiming at red tide prediction using remote sensing technology,this study proposed a novel approach of red tide prediction using time-series hyperspectral observations,and examined the proposed method in the Xinghai Bay,China.Three spectral indices,namely the twoband ratio(TBR),the three-band spectral index(TBSI),and the fluorescence baseline height(FLH),were used to reduce the dimensionality of hyperspectral data and extract spectral features.Two machine learning models including the random forest(RF)and the support vector machine(SVM)were employed to predict whether red tide would occur on a target day based on the time-series spectral indices obtained in the previous days.By comparing and analyzing the prediction results of multiple machine learning models trained with different spectral indices and temporal lengths,it is found that both the RF and the SVM models can predict the red tide outbreaks at the accuracies over 0.9 using adequate temporal lengths of input data.When the temporal length of input data is limited,however,it is suggested to use the RF model,which accurately predicts red tide outbreaks using the temporal input of the 2-d TBSI.The proposed method is expected to provide oceanic and maritime agencies with early warnings on red tide outbreaks and ensure the safety of the coastal environment in large spatial scales using optical remote sensing technology.展开更多
Avoiding,reducing,and reversing land degradation and restoring degraded land is an urgent priority to protect the biodiversity and ecosystem services that are vital to life on Earth.To halt and reverse the current tre...Avoiding,reducing,and reversing land degradation and restoring degraded land is an urgent priority to protect the biodiversity and ecosystem services that are vital to life on Earth.To halt and reverse the current trends in land degradation,there is an immediate need to enhance national capacities to undertake quantitative assessments and mapping of their degraded lands,as required by the Sustainable Development Goals(SDGs),in particular,the SDG indicator 15.3.1(“proportion of land that is degraded over total land area”).Earth Observations(EO)can play an important role both for generating this indicator as well as complementing or enhancing national official data sources.Implementations like Trends.Earth to monitor land degradation in accordance with the SDG15.3.1 rely on default datasets of coarse spatial resolution provided by MODIS or AVHRR.Consequently,there is a need to develop methodologies to benefit from medium to high-resolution satellite EO data(e.g.Landsat or Sentinels).In response to this issue,this paper presents an initial overview of an innovative approach to monitor land degradation at the national scale in compliance with the SDG15.3.1 indicator using Landsat observations using a data cube but further work is required to improve the calculation of the three sub-indicators.展开更多
It is crucial to predict future mechanical behaviors for the prevention of structural disasters.Especially for underground construction,the structural mechanical behaviors are affected by multiple internal and externa...It is crucial to predict future mechanical behaviors for the prevention of structural disasters.Especially for underground construction,the structural mechanical behaviors are affected by multiple internal and external factors due to the complex conditions.Given that the existing models fail to take into account all the factors and accurate prediction of the multiple time series simultaneously is difficult using these models,this study proposed an improved prediction model through the autoencoder fused long-and short-term time-series network driven by the mass number of monitoring data.Then,the proposed model was formalized on multiple time series of strain monitoring data.Also,the discussion analysis with a classical baseline and an ablation experiment was conducted to verify the effectiveness of the prediction model.As the results indicate,the proposed model shows obvious superiority in predicting the future mechanical behaviors of structures.As a case study,the presented model was applied to the Nanjing Dinghuaimen tunnel to predict the stain variation on a different time scale in the future.展开更多
Earth observation data are typically compressed using general-purpose single-threaded compression algorithms that operate at a fraction of the bandwidth of modern storage and processing systems.We present evidence tha...Earth observation data are typically compressed using general-purpose single-threaded compression algorithms that operate at a fraction of the bandwidth of modern storage and processing systems.We present evidence that recently developed multi-threaded compression codecs offer substantial benefits over widely used single-threaded codecs in terms of compression efficiency when applied to a selection of moderate resolution imaging spectroradiometer(MODIS)datasets stored in the HDF5 format.Compression codecs from the LZ77 and Rice families are shown to vary in efficacy when applied to different MODIS data products,highlighting the need for compression strategies to be tailored to different classes of data.We also introduce LPC-Rice,a new multi-threaded codec,that performs particularly well when applied to time-series data.展开更多
As financial markets grow increasingly complex and volatile,timeseriesbased stock price forecasting has become a critical research focus in the field of finance.Traditional forecasting methods face significant limitat...As financial markets grow increasingly complex and volatile,timeseriesbased stock price forecasting has become a critical research focus in the field of finance.Traditional forecasting methods face significant limitations in handling nonlinear and high-dimensional data,while neural networks(NNs)have demonstrated great potential due to their powerful feature extraction and pattern recognition capabilities.Although several existing surveys discuss the applications of NNs in stock forecasting,they often lack a detailed examination of models that use time-series data as input and fail to cover the latest research developments.In response,this paper reviews relevant literature from 2015 to 2025 and classifies timeseriesbased stock forecasting methods into four categories:NNs,recurrent NNs(RNNs),convolutional NNs(CNNs),Transformers and other models.We analyze their performance under different market conditions,highlight strengths and limitations,and identify recent trends in model design.Our findings show that hybrid architectures and attention-based models consistently achieve superior forecasting stability and adaptability across volatile market scenarios.This survey offers a systematic reference for researchers and practitioners and outlines promising future research directions.展开更多
Accurate mapping and timely monitoring of urban redevelopment are pivotal for urban studies and decisionmakers to foster sustainable urban development.Traditional mapping methods heavily depend on field surveys and su...Accurate mapping and timely monitoring of urban redevelopment are pivotal for urban studies and decisionmakers to foster sustainable urban development.Traditional mapping methods heavily depend on field surveys and subjective questionnaires,yielding less objective,reliable,and timely data.Recent advancements in Geographic Information Systems(GIS)and remote-sensing technologies have improved the identification and mapping of urban redevelopment through quantitative analysis using satellite-based observations.Nonetheless,challenges persist,particularly concerning accuracy and significant temporal delays.This study introduces a novel approach to modeling urban redevelopment,leveraging machine learning algorithms and remote-sensing data.This methodology can facilitate the accurate and timely identification of urban redevelopment activities.The study’s machine learning model can analyze time-series remote-sensing data to identify spatio-temporal and spectral patterns related to urban redevelopment.The model is thoroughly evaluated,and the results indicate that it can accurately capture the time-series patterns of urban redevelopment.This research’s findings are useful for evaluating urban demographic and economic changes,informing policymaking and urban planning,and contributing to sustainable urban development.The model can also serve as a foundation for future research on early-stage urban redevelopment detection and evaluation of the causes and impacts of urban redevelopment.展开更多
基金the Frontier Program of the Knowledge Innovation Program of Chinese Academy of Sciences
文摘Based on the 16d-composite MODIS (moderate resolution imaging spectroradiometer)-NDVI(normalized difference vegetation index) time-series data in 2004, vegetation in North Tibet Plateau was classified and seasonal variations on the pixels selected from different vegetation type were analyzed. The Savitzky-Golay filtering algorithm was applied to perform a filtration processing for MODIS-NDVI time-series data. The processed time-series curves can reflect a real variation trend of vegetation growth. The NDVI time-series curves of coniferous forest, high-cold meadow, high-cold meadow steppe and high-cold steppe all appear a mono-peak model during vegetation growth with the maximum peak occurring in August. A decision-tree classification model was established according to either NDVI time-series data or land surface temperature data. And then, both classifying and processing for vegetations were carried out through the model based on NDVI time-series curves. An accuracy test illustrates that classification results are of high accuracy and credibility and the model is conducive for studying a climate variation and estimating a vegetation production at regional even global scale.
文摘Clustering is used to gain an intuition of the struc tures in the data.Most of the current clustering algorithms pro duce a clustering structure even on data that do not possess such structure.In these cases,the algorithms force a structure in the data instead of discovering one.To avoid false structures in the relations of data,a novel clusterability assessment method called density-based clusterability measure is proposed in this paper.I measures the prominence of clustering structure in the data to evaluate whether a cluster analysis could produce a meaningfu insight to the relationships in the data.This is especially useful in time-series data since visualizing the structure in time-series data is hard.The performance of the clusterability measure is evalu ated against several synthetic data sets and time-series data sets which illustrate that the density-based clusterability measure can successfully indicate clustering structure of time-series data.
基金funded by the Ministry-level Scientific and Technological Key Programs of Ministry of Natural Resources and Environment of Viet Nam "Application of thermal infrared remote sensing and GIS for mapping underground coal fires in Quang Ninh coal basin" (Grant No. TNMT.2017.08.06)
文摘Underground coal fires are one of the most common and serious geohazards in most coal producing countries in the world. Monitoring their spatio-temporal changes plays an important role in controlling and preventing the effects of coal fires, and their environmental impact. In this study, the spatio-temporal changes of underground coal fires in Khanh Hoa coal field(North-East of Viet Nam) were analyzed using Landsat time-series data during the 2008-2016 period. Based on land surface temperatures retrieved from Landsat thermal data, underground coal fires related to thermal anomalies were identified using the MEDIAN+1.5×IQR(IQR: Interquartile range) threshold technique. The locations of underground coal fires were validated using a coal fire map produced by the field survey data and cross-validated using the daytime ASTER thermal infrared imagery. Based on the fires extracted from seven Landsat thermal imageries, the spatiotemporal changes of underground coal fire areas were analyzed. The results showed that the thermalanomalous zones have been correlated with known coal fires. Cross-validation of coal fires using ASTER TIR data showed a high consistency of 79.3%. The largest coal fire area of 184.6 hectares was detected in 2010, followed by 2014(181.1 hectares) and 2016(178.5 hectares). The smaller coal fire areas were extracted with areas of 133.6 and 152.5 hectares in 2011 and 2009 respectively. Underground coal fires were mainly detected in the northern and southern part, and tend to spread to north-west of the coal field.
基金This work was supported by Korea Institute for Advancement of Technology(KIAT)grant funded by the Korea Government(MOTIE)(P0016977,The Establishment Project of Industry-University Fusion District).
文摘The increasing penetration rate of electric kickboard vehicles has been popularized and promoted primarily because of its clean and efficient features.Electric kickboards are gradually growing in popularity in tourist and education-centric localities.In the upcoming arrival of electric kickboard vehicles,deploying a customer rental service is essential.Due to its freefloating nature,the shared electric kickboard is a common and practical means of transportation.Relocation plans for shared electric kickboards are required to increase the quality of service,and forecasting demand for their use in a specific region is crucial.Predicting demand accurately with small data is troublesome.Extensive data is necessary for training machine learning algorithms for effective prediction.Data generation is a method for expanding the amount of data that will be further accessible for training.In this work,we proposed a model that takes time-series customers’electric kickboard demand data as input,pre-processes it,and generates synthetic data according to the original data distribution using generative adversarial networks(GAN).The electric kickboard mobility demand prediction error was reduced when we combined synthetic data with the original data.We proposed Tabular-GAN-Modified-WGAN-GP for generating synthetic data for better prediction results.We modified The Wasserstein GAN-gradient penalty(GP)with the RMSprop optimizer and then employed Spectral Normalization(SN)to improve training stability and faster convergence.Finally,we applied a regression-based blending ensemble technique that can help us to improve performance of demand prediction.We used various evaluation criteria and visual representations to compare our proposed model’s performance.Synthetic data generated by our suggested GAN model is also evaluated.The TGAN-Modified-WGAN-GP model mitigates the overfitting and mode collapse problem,and it also converges faster than previous GAN models for synthetic data creation.The presented model’s performance is compared to existing ensemble and baseline models.The experimental findings imply that combining synthetic and actual data can significantly reduce prediction error rates in the mean absolute percentage error(MAPE)of 4.476 and increase prediction accuracy.
基金supported by the Technology Innovation Program(10083633,Development on Big Data Analysis Technology and Business Service for Connected Vehicles)funded by the Ministry of Trade,Industry&Energy(MOTIE,Korea)。
文摘In this study,we developed software for vehicle big data analysis to analyze the time-series data of connected vehicles.We designed two software modules:The rst to derive the Pearson correlation coefcients to analyze the collected data and the second to conduct exploratory data analysis of the collected vehicle data.In particular,we analyzed the dangerous driving patterns of motorists based on the safety standards of the Korea Transportation Safety Authority.We also analyzed seasonal fuel efciency(four seasons)and mileage of vehicles,and identied rapid acceleration,rapid deceleration,sudden stopping(harsh braking),quick starting,sudden left turn,sudden right turn and sudden U-turn driving patterns of vehicles.We implemented the density-based spatial clustering of applications with a noise algorithm for trajectory analysis based on GPS(Global Positioning System)data and designed a long shortterm memory algorithm and an auto-regressive integrated moving average model for time-series data analysis.In this paper,we mainly describe the development environment of the analysis software,the structure and data ow of the overall analysis platform,the conguration of the collected vehicle data,and the various algorithms used in the analysis.Finally,we present illustrative results of our analysis,such as dangerous driving patterns that were detected.
文摘With the development of the Industrial Internet of Things(IIoT)and cloud computing technologies,intelligent sensors in the field that can generate large volumes of time-series data continuously have emerged.Due to the lack of equipment and network impacts,highly distributed industrial applications cannot capture and transfer all production data to a distant cloud server in real time.Consequently,a portion of critical production data is lost,which poses the significant challenge of the timely replenishment of missing data.Employing deep learning in the cloud center for data trend prediction based on relevant data can solve this problem.The objective of this study was to develop a time-series prediction model that combines a Transformer model with a sparse Mixture of Experts(MoE).The model is designed specifically for an IIoT system that is used in oil-well operations.The proposed TransMoE prediction model combines the advantages of the MoE and the Transformer model.The MoE can effectively handle multiple subtasks while the Transformer algorithm can reflect the long-range dependency of the input data series.The proposed model was used to predict oil-well yields,and the predicted outcomes were compared with those obtained using a CNN-GRU and CNNLSTM models,as well as the actual recorded data.The experimental results indicated that the proposed TransMoE model can significantly increase the efficiency and accuracy of oil well production sequence data prediction,with an average relative error of 6.26%,which can satisfy the requirements of enterprise data usage.
基金This work was supported by the China Postdoctoral Science Foundation(No.20060390326)the key international S&T cooperation project of China(No.2004DFA06300).
文摘A simple data assimilation method for improving estimation of moderate resolution imaging spectroradiometer (MODIS) leaf area index (LAI) time-series data products based on the gradient inverse weighted filter and object analysis is proposed. The properties and quality control (QC) of MODIS LAI data products are introduced. Also, the gradient inverse weighted filter and object analysis are analyzed. An experiment based on the simple data assimilation method is performed using MODIS LAI data sets from 2000 to 2005 of Guizhou Province in China.
文摘A tremendous amount of data has been generated by global financial markets everyday,and such time-series data needs to be analyzed in real time to explore its potential value.In recent years,we have witnessed the successful adoption of machine learning models on financial data,where the importance of accuracy and timeliness demands highly effective computing frameworks.However,traditional financial time-series data processing frameworks have shown performance degradation and adaptation issues,such as the outlier handling with stock suspension in Pandas and TA-Lib.In this paper,we propose HXPY,a high-performance data processing package with a C++/Python interface for financial time-series data.HXPY supports miscellaneous acceleration techniques such as the streaming algorithm,the vectorization instruction set,and memory optimization,together with various functions such as time window functions,group operations,down-sampling operations,cross-section operations,row-wise or column-wise operations,shape transformations,and alignment functions.The results of benchmark and incremental analysis demonstrate the superior performance of HXPY compared with its counterparts.From MiBs to GiBs data,HXPY significantly outperforms other in-memory dataframe computing rivals even up to hundreds of times.
基金supported by the open research fund of the Key Laboratory of Agri-informatics,Ministry of Agriculture and the fund of Outstanding Agricultural Researcher,Ministry of Agriculture,China
文摘By employing the unique phenological feature of winter wheat extracted from peak before winter (PBW) and the advantages of moderate resolution imaging spectroradiometer (MODIS) data with high temporal resolution and intermediate spatial resolution, a remote sensing-based model for mapping winter wheat on the North China Plain was built through integration with Landsat images and land-use data. First, a phenological window, PBW was drawn from time-series MODIS data. Next, feature extraction was performed for the PBW to reduce feature dimension and enhance its information. Finally, a regression model was built to model the relationship of the phenological feature and the sample data. The amount of information of the PBW was evaluated and compared with that of the main peak (MP). The relative precision of the mapping reached up to 92% in comparison to the Landsat sample data, and ranged between 87 and 96% in comparison to the statistical data. These results were sufficient to satisfy the accuracy requirements for winter wheat mapping at a large scale. Moreover, the proposed method has the ability to obtain the distribution information for winter wheat in an earlier period than previous studies. This study could throw light on the monitoring of winter wheat in China by using unique phenological feature of winter wheat.
基金Supported by the Social Science Foundation of Beijing(15JGB099,15ZHA004)the National Natural Science Foundation of China(61370139)"Information+" Special Fund(5111823610)
文摘In the age of information sharing, logistics information sharing also faces the risk of privacy leakage. In regard to the privacy leakage of time-series location information in the field of logistics, this paper proposes a method based on differential privacy for time-series location data publication. Firstly, it constructs public region of interest(PROI) related to time by using clustering optimal algorithm. And it adopts the method of the centroid point to ensure the public interest point(PIP) representing the location of the public interest zone. Secondly, according to the PIP, we can construct location search tree(LST) that is a commonly used index structure of spatial data, in order to ensure the inherent relation among location data. Thirdly, we add Laplace noise to the node of LST, which means fewer times to add Laplace noise on the original data set and ensures the data availability. Finally, experiments show that this method not only ensures the security of sequential location data publishing, but also has better data availability than the general differential privacy method, which achieves a good balance between the security and availability of data.
基金supported by the National Natural Science Foundation of China (51909228)the Postdoctoral Science Foundation of China (2020M671623)the ‘‘Blue Project” of Yangzhou University。
文摘Accurate information about phenological stages is essential for canola field management practices such as irrigation, fertilization, and harvesting. Previous studies in canola phenology monitoring focused mainly on the flowering stage, using its apparent structure features and colors. Additional phenological stages have been largely overlooked. The objective of this study was to improve a shape-model method(SMM) for extracting winter canola phenological stages from time-series top-of-canopy reflectance images collected by an unmanned aerial vehicle(UAV). The transformation equation of the SMM was refined to account for the multi-peak features of the temporal dynamics of three vegetation indices(VIs)(NDVI, EVI, and CI). An experiment with various seeding scenarios was conducted, including four different seeding dates and three seeding densities. Three mathematical functions: asymmetric Gaussian function(AGF), Fourier function, and double logistic function, were employed to fit timeseries vegetation indices to extract information about phenological stages. The refined SMM effectively estimated the phenological stages of canola, with a minimum root mean square error(RMSE) of 3.7 days for all phenological stages. The AGF function provided the best fitting performance, as it captured multiple peaks in the growth dynamics characteristics for all seeding date scenarios using four scaling parameters. For the three selected VIs, CIred-edgeachieved the greatest accuracy in estimating the phenological stage dates. This study demonstrates the high potential of the refined SMM for estimating winter canola phenology.
基金The Third Xinjiang Scientific Expedition Program(2021xjkk0905)GDAS Special Project of Science and Technology Development(2020GDASYL-20200301003)+2 种基金GDAS Special Project of Science and Technology Development(2020GDASYL-20200102002)National Natural Science Foundation of China(41501144)Project of Department of Natural Resources of Guangdong Province(GDZRZYKJ2022005)。
文摘This essay combines the Defense Meteorological Satellite Program Operational Linescan System(DMSP-OLS)nighttime light data and the Visible Infrared Imaging Radiometer Suite(VIIRS)nighttime light data into a“synthetic DMSP”dataset,from 1992 to 2020,to retrieve the spatio-temporal variations in energy-related carbon emissions in Xinjiang,China.Then,this paper analyzes several influencing factors for spatial differentiation of carbon emissions in Xinjiang with the application of geographical detector technique.Results reveal that(1)total carbon emissions continued to grow,while the growth rate slowed down in the past five years.(2)Large regional differences exist in total carbon emissions across various regions.Total carbon emissions of these regions in descending order are the northern slope of the Tianshan(Mountains)>the southern slope of the Tianshan>the three prefectures in southern Xinjiang>the northern part of Xinjiang.(3)Economic growth,population size,and energy consumption intensity are the most important factors of spatial differentiation of carbon emissions.The interaction between economic growth and population size as well as between economic growth and energy consumption intensity also enhances the explanatory power of carbon emissions’spatial differentiation.This paper aims to help formulate differentiated carbon reduction targets and strategies for cities in different economic development stages and those with different carbon intensities so as to achieve the carbon peak goals in different steps.
文摘The rapid integration of Internet of Things(IoT)technologies is reshaping the global energy landscape by deploying smart meters that enable high-resolution consumption monitoring,two-way communication,and advanced metering infrastructure services.However,this digital transformation also exposes power system to evolving threats,ranging from cyber intrusions and electricity theft to device malfunctions,and the unpredictable nature of these anomalies,coupled with the scarcity of labeled fault data,makes realtime detection exceptionally challenging.To address these difficulties,a real-time decision support framework is presented for smart meter anomality detection that leverages rolling time windows and two self-supervised contrastive learning modules.The first module synthesizes diverse negative samples to overcome the lack of labeled anomalies,while the second captures intrinsic temporal patterns for enhanced contextual discrimination.The end-to-end framework continuously updates its model with rolling updated meter data to deliver timely identification of emerging abnormal behaviors in evolving grids.Extensive evaluations on eight publicly available smart meter datasets over seven diverse abnormal patterns testing demonstrate the effectiveness of the proposed full framework,achieving average recall and F1 score of more than 0.85.
基金The National Natural Science Foundation of China under contract No.42406188the Natural Science Foundation of Liaoning Province under contract No.2024-BS-022+1 种基金the Dalian High-Level Talent Innovation Program under contract No.2022RG02the Fundamental Research Funds for the Central Universities under contract No.3132025107.
文摘Red tide is an ecological disaster caused by the excessive proliferation of photosynthetic algae in the ocean.The frequent occurrences of red tide have brought serious harms to the marine aquaculture and caused significant economic losses to the marine industry.Red tide prediction can alleviate and even stop the long-term damages to marine ecosystems,which helps maintain the ecological balance of the ocean environment and contributes to the Sustainable Development Goal of“life below water”formulated by the United Nations.Aiming at red tide prediction using remote sensing technology,this study proposed a novel approach of red tide prediction using time-series hyperspectral observations,and examined the proposed method in the Xinghai Bay,China.Three spectral indices,namely the twoband ratio(TBR),the three-band spectral index(TBSI),and the fluorescence baseline height(FLH),were used to reduce the dimensionality of hyperspectral data and extract spectral features.Two machine learning models including the random forest(RF)and the support vector machine(SVM)were employed to predict whether red tide would occur on a target day based on the time-series spectral indices obtained in the previous days.By comparing and analyzing the prediction results of multiple machine learning models trained with different spectral indices and temporal lengths,it is found that both the RF and the SVM models can predict the red tide outbreaks at the accuracies over 0.9 using adequate temporal lengths of input data.When the temporal length of input data is limited,however,it is suggested to use the RF model,which accurately predicts red tide outbreaks using the temporal input of the 2-d TBSI.The proposed method is expected to provide oceanic and maritime agencies with early warnings on red tide outbreaks and ensure the safety of the coastal environment in large spatial scales using optical remote sensing technology.
基金This research was funded by the European Commission“Horizon 2020 Program”ERA-PLANET/GEOEssential project,grant number 689443.
文摘Avoiding,reducing,and reversing land degradation and restoring degraded land is an urgent priority to protect the biodiversity and ecosystem services that are vital to life on Earth.To halt and reverse the current trends in land degradation,there is an immediate need to enhance national capacities to undertake quantitative assessments and mapping of their degraded lands,as required by the Sustainable Development Goals(SDGs),in particular,the SDG indicator 15.3.1(“proportion of land that is degraded over total land area”).Earth Observations(EO)can play an important role both for generating this indicator as well as complementing or enhancing national official data sources.Implementations like Trends.Earth to monitor land degradation in accordance with the SDG15.3.1 rely on default datasets of coarse spatial resolution provided by MODIS or AVHRR.Consequently,there is a need to develop methodologies to benefit from medium to high-resolution satellite EO data(e.g.Landsat or Sentinels).In response to this issue,this paper presents an initial overview of an innovative approach to monitor land degradation at the national scale in compliance with the SDG15.3.1 indicator using Landsat observations using a data cube but further work is required to improve the calculation of the three sub-indicators.
基金National Key Research and Development Program of China,Grant/Award Number:2018YFB2101003National Natural Science Foundation of China,Grant/Award Numbers:51991395,U1806226,51778033,51822802,71901011,U1811463,51991391Science and Technology Major Project of Beijing,Grant/Award Number:Z191100002519012。
文摘It is crucial to predict future mechanical behaviors for the prevention of structural disasters.Especially for underground construction,the structural mechanical behaviors are affected by multiple internal and external factors due to the complex conditions.Given that the existing models fail to take into account all the factors and accurate prediction of the multiple time series simultaneously is difficult using these models,this study proposed an improved prediction model through the autoencoder fused long-and short-term time-series network driven by the mass number of monitoring data.Then,the proposed model was formalized on multiple time series of strain monitoring data.Also,the discussion analysis with a classical baseline and an ablation experiment was conducted to verify the effectiveness of the prediction model.As the results indicate,the proposed model shows obvious superiority in predicting the future mechanical behaviors of structures.As a case study,the presented model was applied to the Nanjing Dinghuaimen tunnel to predict the stain variation on a different time scale in the future.
文摘Earth observation data are typically compressed using general-purpose single-threaded compression algorithms that operate at a fraction of the bandwidth of modern storage and processing systems.We present evidence that recently developed multi-threaded compression codecs offer substantial benefits over widely used single-threaded codecs in terms of compression efficiency when applied to a selection of moderate resolution imaging spectroradiometer(MODIS)datasets stored in the HDF5 format.Compression codecs from the LZ77 and Rice families are shown to vary in efficacy when applied to different MODIS data products,highlighting the need for compression strategies to be tailored to different classes of data.We also introduce LPC-Rice,a new multi-threaded codec,that performs particularly well when applied to time-series data.
文摘As financial markets grow increasingly complex and volatile,timeseriesbased stock price forecasting has become a critical research focus in the field of finance.Traditional forecasting methods face significant limitations in handling nonlinear and high-dimensional data,while neural networks(NNs)have demonstrated great potential due to their powerful feature extraction and pattern recognition capabilities.Although several existing surveys discuss the applications of NNs in stock forecasting,they often lack a detailed examination of models that use time-series data as input and fail to cover the latest research developments.In response,this paper reviews relevant literature from 2015 to 2025 and classifies timeseriesbased stock forecasting methods into four categories:NNs,recurrent NNs(RNNs),convolutional NNs(CNNs),Transformers and other models.We analyze their performance under different market conditions,highlight strengths and limitations,and identify recent trends in model design.Our findings show that hybrid architectures and attention-based models consistently achieve superior forecasting stability and adaptability across volatile market scenarios.This survey offers a systematic reference for researchers and practitioners and outlines promising future research directions.
文摘Accurate mapping and timely monitoring of urban redevelopment are pivotal for urban studies and decisionmakers to foster sustainable urban development.Traditional mapping methods heavily depend on field surveys and subjective questionnaires,yielding less objective,reliable,and timely data.Recent advancements in Geographic Information Systems(GIS)and remote-sensing technologies have improved the identification and mapping of urban redevelopment through quantitative analysis using satellite-based observations.Nonetheless,challenges persist,particularly concerning accuracy and significant temporal delays.This study introduces a novel approach to modeling urban redevelopment,leveraging machine learning algorithms and remote-sensing data.This methodology can facilitate the accurate and timely identification of urban redevelopment activities.The study’s machine learning model can analyze time-series remote-sensing data to identify spatio-temporal and spectral patterns related to urban redevelopment.The model is thoroughly evaluated,and the results indicate that it can accurately capture the time-series patterns of urban redevelopment.This research’s findings are useful for evaluating urban demographic and economic changes,informing policymaking and urban planning,and contributing to sustainable urban development.The model can also serve as a foundation for future research on early-stage urban redevelopment detection and evaluation of the causes and impacts of urban redevelopment.