Current spatio-temporal action detection methods lack sufficient capabilities in extracting and comprehending spatio-temporal information. This paper introduces an end-to-end Adaptive Cross-Scale Fusion Encoder-Decode...Current spatio-temporal action detection methods lack sufficient capabilities in extracting and comprehending spatio-temporal information. This paper introduces an end-to-end Adaptive Cross-Scale Fusion Encoder-Decoder (ACSF-ED) network to predict the action and locate the object efficiently. In the Adaptive Cross-Scale Fusion Spatio-Temporal Encoder (ACSF ST-Encoder), the Asymptotic Cross-scale Feature-fusion Module (ACCFM) is designed to address the issue of information degradation caused by the propagation of high-level semantic information, thereby extracting high-quality multi-scale features to provide superior features for subsequent spatio-temporal information modeling. Within the Shared-Head Decoder structure, a shared classification and regression detection head is constructed. A multi-constraint loss function composed of one-to-one, one-to-many, and contrastive denoising losses is designed to address the problem of insufficient constraint force in predicting results with traditional methods. This loss function enhances the accuracy of model classification predictions and improves the proximity of regression position predictions to ground truth objects. The proposed method model is evaluated on the popular dataset UCF101-24 and JHMDB-21. Experimental results demonstrate that the proposed method achieves an accuracy of 81.52% on the Frame-mAP metric, surpassing current existing methods.展开更多
Considering the difficulty of integrating the depth points of nautical charts of the East China Sea into a global high-precision Grid Digital Elevation Model(Grid-DEM),we proposed a“Fusion based on Image Recognition(...Considering the difficulty of integrating the depth points of nautical charts of the East China Sea into a global high-precision Grid Digital Elevation Model(Grid-DEM),we proposed a“Fusion based on Image Recognition(FIR)”method for multi-sourced depth data fusion,and used it to merge the electronic nautical chart dataset(referred to as Chart2014 in this paper)with the global digital elevation dataset(referred to as Globalbath2002 in this paper).Compared to the traditional fusion of two datasets by direct combination and interpolation,the new Grid-DEM formed by FIR can better represent the data characteristics of Chart2014,reduce the calculation difficulty,and be more intuitive,and,the choice of different interpolation methods in FIR and the influence of the“exclusion radius R”parameter were discussed.FIR avoids complex calculations of spatial distances among points from different sources,and instead uses spatial exclusion map to perform one-step screening based on the exclusion radius R,which greatly improved the fusion status of a reliable dataset.The fusion results of different experiments were analyzed statistically with root mean square error and mean relative error,showing that the interpolation methods based on Delaunay triangulation are more suitable for the fusion of nautical chart depth of China,and factors such as the point density distribution of multiple source data,accuracy,interpolation method,and various terrain conditions should be fully considered when selecting the exclusion radius R.展开更多
With the advancement of human-computer interaction,surface electromyography(sEMG)-based gesture recognition has garnered increasing attention.However,effectively utilizing the spatio-temporal dependencies in sEMG sign...With the advancement of human-computer interaction,surface electromyography(sEMG)-based gesture recognition has garnered increasing attention.However,effectively utilizing the spatio-temporal dependencies in sEMG signals and integrating multiple key features remain significant challenges for existing techniques.To address this issue,we propose a model named the Two-Stream Hybrid Spatio-Temporal Fusion Network(TS-HSTFNet).Specifically,we design a dynamic spatio-temporal graph convolution module that employs an adaptive dynamic adjacency matrix to explore the spatial dynamic patterns in the sEMG signals fully.Additionally,a spatio-temporal attention fusion module is designed to fully utilize the potential correlations among multiple features for the final fusion.The results indicate that the proposed TS-HSTFNet model achieves 84.96%and 88.08%accuracy on the Ninapro DB2 and Ninapro DB5 datasets,respectively,demonstrating high precision in gesture recognition.Our work emphasizes the importance of extracting spatio-temporal features in gesture recognition and provides a novel approach for multi-source information fusion.展开更多
False data injection attack(FDIA)can affect the state estimation of the power grid by tampering with the measured value of the power grid data,and then destroying the stable operation of the smart grid.Existing work u...False data injection attack(FDIA)can affect the state estimation of the power grid by tampering with the measured value of the power grid data,and then destroying the stable operation of the smart grid.Existing work usually trains a detection model by fusing the data-driven features from diverse power data streams.Data-driven features,however,cannot effectively capture the differences between noisy data and attack samples.As a result,slight noise disturbances in the power grid may cause a large number of false detections for FDIA attacks.To address this problem,this paper designs a deep collaborative self-attention network to achieve robust FDIA detection,in which the spatio-temporal features of cascaded FDIA attacks are fully integrated.Firstly,a high-order Chebyshev polynomials-based graph convolution module is designed to effectively aggregate the spatio information between grid nodes,and the spatial self-attention mechanism is involved to dynamically assign attention weights to each node,which guides the network to pay more attention to the node information that is conducive to FDIA detection.Furthermore,the bi-directional Long Short-Term Memory(LSTM)network is introduced to conduct time series modeling and long-term dependence analysis for power grid data and utilizes the temporal self-attention mechanism to describe the time correlation of data and assign different weights to different time steps.Our designed deep collaborative network can effectively mine subtle perturbations from spatiotemporal feature information,efficiently distinguish power grid noise from FDIA attacks,and adapt to diverse attack intensities.Extensive experiments demonstrate that our method can obtain an efficient detection performance over actual load data from New York Independent System Operator(NYISO)in IEEE 14,IEEE 39,and IEEE 118 bus systems,and outperforms state-of-the-art FDIA detection schemes in terms of detection accuracy and robustness.展开更多
In order to obtain more accurate precipitation data and better simulate the precipitation on the Tibetan Plateau,the simulation capability of 14 Coupled Model Intercomparison Project Phase 6(CMIP6)models of historical...In order to obtain more accurate precipitation data and better simulate the precipitation on the Tibetan Plateau,the simulation capability of 14 Coupled Model Intercomparison Project Phase 6(CMIP6)models of historical precipitation(1982-2014)on the Qinghai-Tibetan Plateau was evaluated in this study.Results indicate that all models exhibit an overestimation of precipitation through the analysis of the Taylor index,temporal and spatial statistical parameters.To correct the overestimation,a fusion correction method combining the Backpropagation Neural Network Correction(BP)and Quantum Mapping(QM)correction,named BQ method,was proposed.With this method,the historical precipitation of each model was corrected in space and time,respectively.The correction results were then analyzed in time,space,and analysis of variance(ANOVA)with those corrected by the BP and QM methods,respectively.Finally,the fusion correction method results for each model were compared with the Climatic Research Unit(CRU)data for significance analysis to obtain the trends of precipitation increase and decrease for each model.The results show that the IPSL-CM6A-LR model is relatively good in simulating historical precipitation on the Qinghai-Tibetan Plateau(R=0.7,RSME=0.15)among the uncorrected data.In terms of time,the total precipitation corrected by the fusion method has the same interannual trend and the closest precipitation values to the CRU data;In terms of space,the annual average precipitation corrected by the fusion method has the smallest difference with the CRU data,and the total historical annual average precipitation is not significantly different from the CRU data,which is better than BP and QM.Therefore,the correction effect of the fusion method on the historical precipitation of each model is better than that of the QM and BP methods.The precipitation in the central and northeastern parts of the plateau shows a significant increasing trend.The correlation coefficients between monthly precipitation and site-detected precipitation for all models after BQ correction exceed 0.8.展开更多
Refined 3D modeling of mine slopes is pivotal for precise prediction of geological hazards.Aiming at the inadequacy of existing single modeling methods in comprehensively representing the overall and localized charact...Refined 3D modeling of mine slopes is pivotal for precise prediction of geological hazards.Aiming at the inadequacy of existing single modeling methods in comprehensively representing the overall and localized characteristics of mining slopes,this study introduces a new method that fuses model data from Unmanned aerial vehicles(UAV)tilt photogrammetry and 3D laser scanning through a data alignment algorithm based on control points.First,the mini batch K-Medoids algorithm is utilized to cluster the point cloud data from ground 3D laser scanning.Then,the elbow rule is applied to determine the optimal cluster number(K0),and the feature points are extracted.Next,the nearest neighbor point algorithm is employed to match the feature points obtained from UAV tilt photogrammetry,and the internal point coordinates are adjusted through the distanceweighted average to construct a 3D model.Finally,by integrating an engineering case study,the K0 value is determined to be 8,with a matching accuracy between the two model datasets ranging from 0.0669 to 1.0373 mm.Therefore,compared with the modeling method utilizing K-medoids clustering algorithm,the new modeling method significantly enhances the computational efficiency,the accuracy of selecting the optimal number of feature points in 3D laser scanning,and the precision of the 3D model derived from UAV tilt photogrammetry.This method provides a research foundation for constructing mine slope model.展开更多
While automatic image captioning systems have made notable progress in the past few years,generating captions that fully convey sentiment remains a considerable challenge.Although existing models achieve strong perfor...While automatic image captioning systems have made notable progress in the past few years,generating captions that fully convey sentiment remains a considerable challenge.Although existing models achieve strong performance in visual recognition and factual description,they often fail to account for the emotional context that is naturally present in human-generated captions.To address this gap,we propose the Sentiment-Driven Caption Generator(SDCG),which combines transformer-based visual and textual processing withmulti-level fusion.RoBERTa is used for extracting sentiment from textual input,while visual features are handled by the Vision Transformer(ViT).These features are fused using several fusion approaches,including Concatenation,Attention,Visual-Sentiment Co-Attention(VSCA),and Cross-Attention.Our experiments demonstrate that SDCG significantly outperforms baseline models such as the Generalized Image Transformer(GIT),which achieves 82.01%,and Bootstrapping Language-Image Pre-training(BLIP),which achieves 83.07%,in sentiment accuracy.While SDCG achieves 94.52%sentiment accuracy and improves scores in BLEU and ROUGE-L,the model demonstrates clear advantages.More importantly,the captions aremore natural,as they incorporate emotional cues and contextual awareness,making them resemble those written by a human.展开更多
Semi-crystalline polymer laser powder bed fusion(L-PBF)has recently attracted increasing interest due to its potential for fabricating complex geometry.However,a more comprehensive understanding of the underlying phys...Semi-crystalline polymer laser powder bed fusion(L-PBF)has recently attracted increasing interest due to its potential for fabricating complex geometry.However,a more comprehensive understanding of the underlying physics during L-PBF is required to better control the properties of the final part.This work proposed a multi-layer numerical model to study the temperature and phase evolution during the polyamide-12(PA12)L-PBF process.The Descend and Parallel Chord methods were introduced to improve the convergence of the non-linear thermal solver.The level-set-based mesh adaptation strategy,governed by multi-physical fields,was applied to alleviate the calculation and accurately track the phase evolution.The processing simulation on the dog-bone model revealed that preheating temperature significantly influences the crystallization behavior.Finally,the multi-layer simulation demonstrated that such a developed numerical model can be used to study the phase transformation during powder layer updating and the cyclic laser sintering phenomena.Moreover,the numerical study suggested that crystallization occurs slowly during the L-PBF process.展开更多
This paper addresses the accuracy and timeliness limitations of traditional comprehensive prediction methods by proposing an approach of decision-level fusion of multisource data.A risk prediction indicator system was...This paper addresses the accuracy and timeliness limitations of traditional comprehensive prediction methods by proposing an approach of decision-level fusion of multisource data.A risk prediction indicator system was established for water and mud inrush in tunnels by analyzing advanced prediction data for specifi c tunnel segments.Additionally,the indicator weights were determined using the analytic hierarchy process combined with the Huber weighting method.Subsequently,a multisource data decision-layer fusion algorithm was utilized to generate fused imaging results for tunnel water and mud inrush risk predictions.Meanwhile,risk analysis was performed for different tunnel sections to achieve spatial and temporal complementarity within the indicator system and optimize redundant information.Finally,model feasibility was validated using the CZ Project Sejila Mountain Tunnel segment as a case study,yielding favorable risk prediction results and enabling effi cient information fusion and support for construction decision-making.展开更多
Tungsten is considered the most promising plasma-facing material for fusion reactors with exceptional performance.Under certain conditions,activated tungsten dust can be generated through plasma–wall interactions and...Tungsten is considered the most promising plasma-facing material for fusion reactors with exceptional performance.Under certain conditions,activated tungsten dust can be generated through plasma–wall interactions and released into the atmosphere.Activated tungsten migrates downward in the soil after atmospheric deposition.However,effective methods for evaluating the environmental dose of gamma rays emitted by activated tungsten are still lacking.Consequently,a method for evaluating the air-absorbed dose rate of activated tungsten dust was proposed considering soil attenuation.Key parameters including the mass attenuation coefficient and energy absorption build-up factor were determined for the main gamma ray energies of radionuclides within the activated tungsten dust.Additionally,air-absorbed dose rates were calculated by assuming that radioactive sources were located at different soil depths and radii.It was found that a soil depth of 50 cm significantly attenuated the environmental dose by 99.9%,whereas the air-absorbed dose rates within the horizontal distance of 500 cm accounted for 91%of the total dose rate.Therefore,this study underscored the importance of soil attenuation in environmental dose assessments,which must be carefully re-examined for the safety analysis of fusion reactors.展开更多
This study investigates a consistent fusion algorithm for distributed multi-rate multi-sensor systems operating in feedback-memory configurations, where each sensor's sampling period is uniform and an integer mult...This study investigates a consistent fusion algorithm for distributed multi-rate multi-sensor systems operating in feedback-memory configurations, where each sensor's sampling period is uniform and an integer multiple of the state update period. The focus is on scenarios where the correlations among Measurement Noises(MNs) from different sensors are unknown. Firstly, a non-augmented local estimator that applies to sampling cases is designed to provide unbiased Local Estimates(LEs) at the fusion points. Subsequently, a measurement-equivalent approach is then developed to parameterize the correlation structure between LEs and reformulate LEs into a unified form, thereby constraining the correlations arising from MNs to an admissible range. Simultaneously, a family of upper bounds on the joint error covariance matrix of LEs is derived based on the constrained correlations, avoiding the need to calculate the exact error cross-covariance matrix of LEs. Finally, a sequential fusion estimator is proposed in the sense of Weighted Minimum Mean Square Error(WMMSE), and it is proven to be unbiased, consistent, and more accurate than the well-known covariance intersection method. Simulation results illustrate the effectiveness of the proposed algorithm by highlighting improvements in consistency and accuracy.展开更多
High-resolution sub-meter satellite data play an increasingly crucial role in the 3D real-scene China construction initiative.Current research on 3D reconstruction using high-resolution satellite data primarily focuse...High-resolution sub-meter satellite data play an increasingly crucial role in the 3D real-scene China construction initiative.Current research on 3D reconstruction using high-resolution satellite data primarily focuses on two approaches:Multi-stereo fusion and multi-view matching.While algorithms based on these two methodologies for multi-view image 3D reconstruction have reached relative maturity,no systematic comparison has been conducted specifically on satellite data to evaluate the relative merits of multi-stereo fusion versus multi-view matching methods.This paper conducts a comparative analysis of the practical accuracy of both approaches using high-resolution satellite datasets from diverse geographical regions.To ensure fairness in accuracy comparison,both methodologies employ non-local dense matching for cost optimization.Results demonstrate that the multi-stereo fusion method outperforms multi-view matching in all evaluation metrics,exhibiting approximately 1.2%higher average matching accuracy and 10.7%superior elevation precision in the experimental datasets.Therefore,for 3D modeling applications using satellite data,we recommend adopting the multi-stereo fusion approach for digital surface model(DSM)product generation.展开更多
Multi-source information fusion (MSIF) is imported into structural damage diagnosis methods to improve the validity of damage detection. After the introduction of the basic theory, the function model, classification...Multi-source information fusion (MSIF) is imported into structural damage diagnosis methods to improve the validity of damage detection. After the introduction of the basic theory, the function model, classifications and mathematical methods of MSIF, a structural damage detection method based on MSIF is presented, which is to fuse two or more damage character vectors from different structural damage diagnosis methods on the character-level. In an experiment of concrete plates, modal information is measured and analyzed. The structural damage detection method based on MSIF is taken to localize cracks of concrete plates and it is proved to be effective. Results of damage detection by the method based on MSIF are compared with those from the modal strain energy method and the flexibility method. Damage, which can hardly be detected by using the single damage identification method, can be diagnosed by the damage detection method based on the character-level MSIF technique. Meanwhile multi-location damage can be identified by the method based on MSIF. This method is sensitive to structural damage and different mathematical methods for MSIF have different preconditions and applicabilities for diversified structures. How to choose mathematical methods for MSIF should be discussed in detail in health monitoring systems of actual structures.展开更多
The problem of the unmanned surface vessel (USV) path planning in static and dynamic obstacle environments is addressed in this paper. Multi-behavior fusion based potential field method is proposed, which contains thr...The problem of the unmanned surface vessel (USV) path planning in static and dynamic obstacle environments is addressed in this paper. Multi-behavior fusion based potential field method is proposed, which contains three behaviors: goal-seeking, boundary-memory following and dynamic-obstacle avoidance. Then, different activation conditions are designed to determine the current behavior. Meanwhile, information on the positions, velocities and the equation of motion for obstacles are detected and calculated by sensor data. Besides, memory information is introduced into the boundary following behavior to enhance cognition capability for the obstacles, and avoid local minima problem caused by the potential field method. Finally, the results of theoretical analysis and simulation show that the collision-free path can be generated for USV within different obstacle environments, and further validated the performance and effectiveness of the presented strategy.展开更多
Satellite remote sensing of inland water body requires a high spatial resolution and a multiband narrow spectral resolution, which makes the fusion between panchromatic(PAN) and multi-spectral(MS) images particularly ...Satellite remote sensing of inland water body requires a high spatial resolution and a multiband narrow spectral resolution, which makes the fusion between panchromatic(PAN) and multi-spectral(MS) images particularly important. Taking the Daquekou section of the Qiantang River as an observation target, four conventional fusion methods widely accepted in satellite image processing, including pan sharpening(PS), principal component analysis(PCA), Gram-Schmidt(GS), and wavelet fusion(WF), are utilized to fuse MS and PAN images of GF-1.The results of subjective and objective evaluation methods application indicate that GS performs the best,followed by the PCA, the WF and the PS in the order of descending. The existence of a large area of the water body is a dominant factor impacting the fusion performance. Meanwhile, the ability of retaining spatial and spectral informations is an important factor affecting the fusion performance of different fusion methods. The fundamental difference of reflectivity information acquisition between water and land is the reason for the failure of conventional fusion methods for land observation such as the PS to be used in the presence of the large water body. It is suggested that the adoption of the conventional fusion methods in the observing water body as the main target should be taken with caution. The performances of the fusion methods need re-assessment when the large-scale water body is present in the remote sensing image or when the research aims for the water body observation.展开更多
For quantitatively explaining the correlations between the vascular plant species abundance (VPSA) and habitat factors, a spatial simulation method has been developed to simulate the distribution of VPSA on the Qingha...For quantitatively explaining the correlations between the vascular plant species abundance (VPSA) and habitat factors, a spatial simulation method has been developed to simulate the distribution of VPSA on the Qinghai-Tibet Plateau. In this paper, the vascular plant type, land cover, mean annual biotemperature, average total annual precipitation, topographic relief, patch connectivity and ecological diversity index were selected to screen the best correlation equation between the VPSA and habitat factors on the basis of 37 national nature reserves on the Qinghai-Tibet Plateau. The research results show that the coefficient of determination between VPSA and habitat factors is 0.94, and the mean error is 2.21 types per km<sup>2</sup>. The distribution of VPSA gradually decreases from southeast to northwest, and reduces with increasing altitude except the desert area of Qaidam Basin. Furthermore, the scenarios of VPSA on the Qinghai-Tibet Plateau during the periods from 1981 to 2010 (T0), from 2011 to 2040 (T2), from 2041 to 2070 (T3) and from 2071 to 2100 (T4) were simulated by combining the land cover change and the climatic scenarios of CMIP5 RCP2.6, RCP4.5 and RCP8.5. The simulated results show that the VPSA would generally decrease on the Qinghai-Tibet Plateau from T0 to T4. The VPSA has the largest change ratio under RCP8.5 scenario, and the smallest change ratio under RCP2.6 scenario. In general, the dynamic change of habitat factors would directly affect the spatial distribution of VPSA on the Qinghai- Tibet Plateau in the future.展开更多
It is known that the exploitation of opencast coal mines has seriously damaged the environments in the semi-arid areas.Vegetation status can reliably reflect the ecological degeneration and restoration in the opencast...It is known that the exploitation of opencast coal mines has seriously damaged the environments in the semi-arid areas.Vegetation status can reliably reflect the ecological degeneration and restoration in the opencast mining areas in the semi-arid areas.Long-time series MODIS NDVI data are widely used to simulate the vegetation cover to reflect the disturbance and restoration of local ecosystems.In this study, both qualitative(linear regression method and coefficient of variation(CoV)) and quantitative(spatial buffer analysis, and change amplitude and the rate of change in the average NDVI) analyses were conducted to analyze the spatio-temporal dynamics of vegetation during 2000–2017 in Jungar Banner of Inner Mongolia Autonomous Region, China, at the large(Jungar Banner and three mine groups) and small(three types of functional areas: opencast coal mining excavation areas, reclamation areas and natural areas) scales.The results show that the rates of change in the average NDVI in the reclamation areas(20%–60%) and opencast coal mining excavation areas(10%–20%) were considerably higher than that in the natural areas(<7%).The vegetation in the reclamation areas experienced a trend of increase(3–5 a after reclamation)-decrease(the sixth year of reclamation)-stability.The vegetation in Jungar Banner has a spatial heterogeneity under the influences of mining and reclamation activities.The ratio of vegetation improvement area to vegetation degradation area in the west, southwest and east mine groups during 2000–2017 was 8:1, 20:1 and 33:1, respectively.The regions with the high CoV of NDVI above 0.45 were mainly distributed around the opencast coal mining excavation areas, and the regions with the CoV of NDVI above 0.25 were mostly located in areas with low(28.8%) and medium-low(10.2%) vegetation cover.The average disturbance distances of mining activities on vegetation in the three mine groups(west, southwest and east) were 800, 800 and 1000 m, respectively.The greater the scale of mining, the farther the disturbance distances of mining activities on vegetation.We conclude that vegetation reclamation will certainly compensate for the negative impacts of opencast coal mining activities on vegetation.Sufficient attention should be paid to the proportional allocation of plant species(herbs and shrubs) in the reclamation areas, and the restored vegetation in these areas needs to be protected for more than 6 a.Then, as the repair time increased, the vegetation condition of the reclamation areas would exceed that of the natural areas.展开更多
Weighted fusion algorithms, which can be applied in the area of multi-sensor data fusion, are advanced based on weighted least square method. A weighted fusion algorithm, in which the relationship between weight coeff...Weighted fusion algorithms, which can be applied in the area of multi-sensor data fusion, are advanced based on weighted least square method. A weighted fusion algorithm, in which the relationship between weight coefficients and measurement noise is established, is proposed by giving attention to the correlation of measurement noise. Then a simplified weighted fusion algorithm is deduced on the assumption that measurement noise is uncorrelated. In addition, an algorithm, which can adjust the weight coefficients in the simplified algorithm by making estimations of measurement noise from measurements, is presented. It is proved by emulation and experiment that the precision performance of the multi-sensor system based on these algorithms is better than that of the multi-sensor system based on other algorithms.展开更多
In the paper, the rational breather soliton and kink solitary wave solution of the (2+1)-dimensional PBLMP equation are obtained by adopting Hirota bilinear method and selecting different test functions. Furthermore, ...In the paper, the rational breather soliton and kink solitary wave solution of the (2+1)-dimensional PBLMP equation are obtained by adopting Hirota bilinear method and selecting different test functions. Furthermore, it has been found that the fusion and degeneration of the kink solitary wave occur when interaction between the rational breather soliton and the kink solitary wave happens. These phenomena are very helpful in researching soliton dynamical complexity in the higher dimensional systems.展开更多
The control rod drive mechanism(CRDM)is an essential part of the control and safety protection system of pressurized water reactors.Current CRDM simulations are mostly performed collectively using a single method,igno...The control rod drive mechanism(CRDM)is an essential part of the control and safety protection system of pressurized water reactors.Current CRDM simulations are mostly performed collectively using a single method,ignoring the influence of multiple motion units and the differences in various features among them,which strongly affect the efficiency and accuracy of the simulations.In this study,we constructed a flow field fusion simulation method based on model features by combining key motion unit analysis and various simulation methods and then applied the method to the CRDM simulation process.CRDM performs motion unit decomposition through the structural hierarchy of function-movement-action method,and the key meta-actions are identified as the nodes in the flow field simulation.We established a fused feature-based multimethod simulation process and processed the simulation methods and data according to the features of the fluid domain space and the structural complexity to obtain the fusion simulation results.Compared to traditional simulation methods and real measurements,the simulation method provides advantages in terms of simulation efficiency and accuracy.展开更多
基金support for this work was supported by Key Lab of Intelligent and Green Flexographic Printing under Grant ZBKT202301.
文摘Current spatio-temporal action detection methods lack sufficient capabilities in extracting and comprehending spatio-temporal information. This paper introduces an end-to-end Adaptive Cross-Scale Fusion Encoder-Decoder (ACSF-ED) network to predict the action and locate the object efficiently. In the Adaptive Cross-Scale Fusion Spatio-Temporal Encoder (ACSF ST-Encoder), the Asymptotic Cross-scale Feature-fusion Module (ACCFM) is designed to address the issue of information degradation caused by the propagation of high-level semantic information, thereby extracting high-quality multi-scale features to provide superior features for subsequent spatio-temporal information modeling. Within the Shared-Head Decoder structure, a shared classification and regression detection head is constructed. A multi-constraint loss function composed of one-to-one, one-to-many, and contrastive denoising losses is designed to address the problem of insufficient constraint force in predicting results with traditional methods. This loss function enhances the accuracy of model classification predictions and improves the proximity of regression position predictions to ground truth objects. The proposed method model is evaluated on the popular dataset UCF101-24 and JHMDB-21. Experimental results demonstrate that the proposed method achieves an accuracy of 81.52% on the Frame-mAP metric, surpassing current existing methods.
基金Supported by the National Key R&D Program of China (No.2023YFC3008100)the National Natural Science Foundation of China (No.U23A2033)
文摘Considering the difficulty of integrating the depth points of nautical charts of the East China Sea into a global high-precision Grid Digital Elevation Model(Grid-DEM),we proposed a“Fusion based on Image Recognition(FIR)”method for multi-sourced depth data fusion,and used it to merge the electronic nautical chart dataset(referred to as Chart2014 in this paper)with the global digital elevation dataset(referred to as Globalbath2002 in this paper).Compared to the traditional fusion of two datasets by direct combination and interpolation,the new Grid-DEM formed by FIR can better represent the data characteristics of Chart2014,reduce the calculation difficulty,and be more intuitive,and,the choice of different interpolation methods in FIR and the influence of the“exclusion radius R”parameter were discussed.FIR avoids complex calculations of spatial distances among points from different sources,and instead uses spatial exclusion map to perform one-step screening based on the exclusion radius R,which greatly improved the fusion status of a reliable dataset.The fusion results of different experiments were analyzed statistically with root mean square error and mean relative error,showing that the interpolation methods based on Delaunay triangulation are more suitable for the fusion of nautical chart depth of China,and factors such as the point density distribution of multiple source data,accuracy,interpolation method,and various terrain conditions should be fully considered when selecting the exclusion radius R.
基金Funding from the Key Research and development plan of Shaanxi Province"Human robot interaction technology and implementation of bionic robotic arm based on remote operation"(2023-ZDLGY-24).
文摘With the advancement of human-computer interaction,surface electromyography(sEMG)-based gesture recognition has garnered increasing attention.However,effectively utilizing the spatio-temporal dependencies in sEMG signals and integrating multiple key features remain significant challenges for existing techniques.To address this issue,we propose a model named the Two-Stream Hybrid Spatio-Temporal Fusion Network(TS-HSTFNet).Specifically,we design a dynamic spatio-temporal graph convolution module that employs an adaptive dynamic adjacency matrix to explore the spatial dynamic patterns in the sEMG signals fully.Additionally,a spatio-temporal attention fusion module is designed to fully utilize the potential correlations among multiple features for the final fusion.The results indicate that the proposed TS-HSTFNet model achieves 84.96%and 88.08%accuracy on the Ninapro DB2 and Ninapro DB5 datasets,respectively,demonstrating high precision in gesture recognition.Our work emphasizes the importance of extracting spatio-temporal features in gesture recognition and provides a novel approach for multi-source information fusion.
基金supported in part by the Research Fund of Guangxi Key Lab of Multi-Source Information Mining&Security(MIMS21-M-02).
文摘False data injection attack(FDIA)can affect the state estimation of the power grid by tampering with the measured value of the power grid data,and then destroying the stable operation of the smart grid.Existing work usually trains a detection model by fusing the data-driven features from diverse power data streams.Data-driven features,however,cannot effectively capture the differences between noisy data and attack samples.As a result,slight noise disturbances in the power grid may cause a large number of false detections for FDIA attacks.To address this problem,this paper designs a deep collaborative self-attention network to achieve robust FDIA detection,in which the spatio-temporal features of cascaded FDIA attacks are fully integrated.Firstly,a high-order Chebyshev polynomials-based graph convolution module is designed to effectively aggregate the spatio information between grid nodes,and the spatial self-attention mechanism is involved to dynamically assign attention weights to each node,which guides the network to pay more attention to the node information that is conducive to FDIA detection.Furthermore,the bi-directional Long Short-Term Memory(LSTM)network is introduced to conduct time series modeling and long-term dependence analysis for power grid data and utilizes the temporal self-attention mechanism to describe the time correlation of data and assign different weights to different time steps.Our designed deep collaborative network can effectively mine subtle perturbations from spatiotemporal feature information,efficiently distinguish power grid noise from FDIA attacks,and adapt to diverse attack intensities.Extensive experiments demonstrate that our method can obtain an efficient detection performance over actual load data from New York Independent System Operator(NYISO)in IEEE 14,IEEE 39,and IEEE 118 bus systems,and outperforms state-of-the-art FDIA detection schemes in terms of detection accuracy and robustness.
文摘In order to obtain more accurate precipitation data and better simulate the precipitation on the Tibetan Plateau,the simulation capability of 14 Coupled Model Intercomparison Project Phase 6(CMIP6)models of historical precipitation(1982-2014)on the Qinghai-Tibetan Plateau was evaluated in this study.Results indicate that all models exhibit an overestimation of precipitation through the analysis of the Taylor index,temporal and spatial statistical parameters.To correct the overestimation,a fusion correction method combining the Backpropagation Neural Network Correction(BP)and Quantum Mapping(QM)correction,named BQ method,was proposed.With this method,the historical precipitation of each model was corrected in space and time,respectively.The correction results were then analyzed in time,space,and analysis of variance(ANOVA)with those corrected by the BP and QM methods,respectively.Finally,the fusion correction method results for each model were compared with the Climatic Research Unit(CRU)data for significance analysis to obtain the trends of precipitation increase and decrease for each model.The results show that the IPSL-CM6A-LR model is relatively good in simulating historical precipitation on the Qinghai-Tibetan Plateau(R=0.7,RSME=0.15)among the uncorrected data.In terms of time,the total precipitation corrected by the fusion method has the same interannual trend and the closest precipitation values to the CRU data;In terms of space,the annual average precipitation corrected by the fusion method has the smallest difference with the CRU data,and the total historical annual average precipitation is not significantly different from the CRU data,which is better than BP and QM.Therefore,the correction effect of the fusion method on the historical precipitation of each model is better than that of the QM and BP methods.The precipitation in the central and northeastern parts of the plateau shows a significant increasing trend.The correlation coefficients between monthly precipitation and site-detected precipitation for all models after BQ correction exceed 0.8.
基金funded by National Natural Science Foundation of China(Grant Nos.42272333,42277147).
文摘Refined 3D modeling of mine slopes is pivotal for precise prediction of geological hazards.Aiming at the inadequacy of existing single modeling methods in comprehensively representing the overall and localized characteristics of mining slopes,this study introduces a new method that fuses model data from Unmanned aerial vehicles(UAV)tilt photogrammetry and 3D laser scanning through a data alignment algorithm based on control points.First,the mini batch K-Medoids algorithm is utilized to cluster the point cloud data from ground 3D laser scanning.Then,the elbow rule is applied to determine the optimal cluster number(K0),and the feature points are extracted.Next,the nearest neighbor point algorithm is employed to match the feature points obtained from UAV tilt photogrammetry,and the internal point coordinates are adjusted through the distanceweighted average to construct a 3D model.Finally,by integrating an engineering case study,the K0 value is determined to be 8,with a matching accuracy between the two model datasets ranging from 0.0669 to 1.0373 mm.Therefore,compared with the modeling method utilizing K-medoids clustering algorithm,the new modeling method significantly enhances the computational efficiency,the accuracy of selecting the optimal number of feature points in 3D laser scanning,and the precision of the 3D model derived from UAV tilt photogrammetry.This method provides a research foundation for constructing mine slope model.
基金funded by the Committee of Science of the Ministry of Science andHigher Education of the Republic of Kazakhstan(Grant No.BR24993166).
文摘While automatic image captioning systems have made notable progress in the past few years,generating captions that fully convey sentiment remains a considerable challenge.Although existing models achieve strong performance in visual recognition and factual description,they often fail to account for the emotional context that is naturally present in human-generated captions.To address this gap,we propose the Sentiment-Driven Caption Generator(SDCG),which combines transformer-based visual and textual processing withmulti-level fusion.RoBERTa is used for extracting sentiment from textual input,while visual features are handled by the Vision Transformer(ViT).These features are fused using several fusion approaches,including Concatenation,Attention,Visual-Sentiment Co-Attention(VSCA),and Cross-Attention.Our experiments demonstrate that SDCG significantly outperforms baseline models such as the Generalized Image Transformer(GIT),which achieves 82.01%,and Bootstrapping Language-Image Pre-training(BLIP),which achieves 83.07%,in sentiment accuracy.While SDCG achieves 94.52%sentiment accuracy and improves scores in BLEU and ROUGE-L,the model demonstrates clear advantages.More importantly,the captions aremore natural,as they incorporate emotional cues and contextual awareness,making them resemble those written by a human.
文摘Semi-crystalline polymer laser powder bed fusion(L-PBF)has recently attracted increasing interest due to its potential for fabricating complex geometry.However,a more comprehensive understanding of the underlying physics during L-PBF is required to better control the properties of the final part.This work proposed a multi-layer numerical model to study the temperature and phase evolution during the polyamide-12(PA12)L-PBF process.The Descend and Parallel Chord methods were introduced to improve the convergence of the non-linear thermal solver.The level-set-based mesh adaptation strategy,governed by multi-physical fields,was applied to alleviate the calculation and accurately track the phase evolution.The processing simulation on the dog-bone model revealed that preheating temperature significantly influences the crystallization behavior.Finally,the multi-layer simulation demonstrated that such a developed numerical model can be used to study the phase transformation during powder layer updating and the cyclic laser sintering phenomena.Moreover,the numerical study suggested that crystallization occurs slowly during the L-PBF process.
基金supported by the National Natural Science Foundation of China (grant numbers 42293351, and U2468221)。
文摘This paper addresses the accuracy and timeliness limitations of traditional comprehensive prediction methods by proposing an approach of decision-level fusion of multisource data.A risk prediction indicator system was established for water and mud inrush in tunnels by analyzing advanced prediction data for specifi c tunnel segments.Additionally,the indicator weights were determined using the analytic hierarchy process combined with the Huber weighting method.Subsequently,a multisource data decision-layer fusion algorithm was utilized to generate fused imaging results for tunnel water and mud inrush risk predictions.Meanwhile,risk analysis was performed for different tunnel sections to achieve spatial and temporal complementarity within the indicator system and optimize redundant information.Finally,model feasibility was validated using the CZ Project Sejila Mountain Tunnel segment as a case study,yielding favorable risk prediction results and enabling effi cient information fusion and support for construction decision-making.
基金supported by the National Natural Science Foundation of China(No.12375314)。
文摘Tungsten is considered the most promising plasma-facing material for fusion reactors with exceptional performance.Under certain conditions,activated tungsten dust can be generated through plasma–wall interactions and released into the atmosphere.Activated tungsten migrates downward in the soil after atmospheric deposition.However,effective methods for evaluating the environmental dose of gamma rays emitted by activated tungsten are still lacking.Consequently,a method for evaluating the air-absorbed dose rate of activated tungsten dust was proposed considering soil attenuation.Key parameters including the mass attenuation coefficient and energy absorption build-up factor were determined for the main gamma ray energies of radionuclides within the activated tungsten dust.Additionally,air-absorbed dose rates were calculated by assuming that radioactive sources were located at different soil depths and radii.It was found that a soil depth of 50 cm significantly attenuated the environmental dose by 99.9%,whereas the air-absorbed dose rates within the horizontal distance of 500 cm accounted for 91%of the total dose rate.Therefore,this study underscored the importance of soil attenuation in environmental dose assessments,which must be carefully re-examined for the safety analysis of fusion reactors.
基金supported by the National Natural Science Foundation of China (Nos. 62276204, 62203343)。
文摘This study investigates a consistent fusion algorithm for distributed multi-rate multi-sensor systems operating in feedback-memory configurations, where each sensor's sampling period is uniform and an integer multiple of the state update period. The focus is on scenarios where the correlations among Measurement Noises(MNs) from different sensors are unknown. Firstly, a non-augmented local estimator that applies to sampling cases is designed to provide unbiased Local Estimates(LEs) at the fusion points. Subsequently, a measurement-equivalent approach is then developed to parameterize the correlation structure between LEs and reformulate LEs into a unified form, thereby constraining the correlations arising from MNs to an admissible range. Simultaneously, a family of upper bounds on the joint error covariance matrix of LEs is derived based on the constrained correlations, avoiding the need to calculate the exact error cross-covariance matrix of LEs. Finally, a sequential fusion estimator is proposed in the sense of Weighted Minimum Mean Square Error(WMMSE), and it is proven to be unbiased, consistent, and more accurate than the well-known covariance intersection method. Simulation results illustrate the effectiveness of the proposed algorithm by highlighting improvements in consistency and accuracy.
文摘High-resolution sub-meter satellite data play an increasingly crucial role in the 3D real-scene China construction initiative.Current research on 3D reconstruction using high-resolution satellite data primarily focuses on two approaches:Multi-stereo fusion and multi-view matching.While algorithms based on these two methodologies for multi-view image 3D reconstruction have reached relative maturity,no systematic comparison has been conducted specifically on satellite data to evaluate the relative merits of multi-stereo fusion versus multi-view matching methods.This paper conducts a comparative analysis of the practical accuracy of both approaches using high-resolution satellite datasets from diverse geographical regions.To ensure fairness in accuracy comparison,both methodologies employ non-local dense matching for cost optimization.Results demonstrate that the multi-stereo fusion method outperforms multi-view matching in all evaluation metrics,exhibiting approximately 1.2%higher average matching accuracy and 10.7%superior elevation precision in the experimental datasets.Therefore,for 3D modeling applications using satellite data,we recommend adopting the multi-stereo fusion approach for digital surface model(DSM)product generation.
基金The National High Technology Research and Develop-ment Program of China(863Program)(No.2006AA04Z416)the Na-tional Science Fund for Distinguished Young Scholars(No.50725828)the Excellent Dissertation Program for Doctoral Degree of Southeast University(No.0705)
文摘Multi-source information fusion (MSIF) is imported into structural damage diagnosis methods to improve the validity of damage detection. After the introduction of the basic theory, the function model, classifications and mathematical methods of MSIF, a structural damage detection method based on MSIF is presented, which is to fuse two or more damage character vectors from different structural damage diagnosis methods on the character-level. In an experiment of concrete plates, modal information is measured and analyzed. The structural damage detection method based on MSIF is taken to localize cracks of concrete plates and it is proved to be effective. Results of damage detection by the method based on MSIF are compared with those from the modal strain energy method and the flexibility method. Damage, which can hardly be detected by using the single damage identification method, can be diagnosed by the damage detection method based on the character-level MSIF technique. Meanwhile multi-location damage can be identified by the method based on MSIF. This method is sensitive to structural damage and different mathematical methods for MSIF have different preconditions and applicabilities for diversified structures. How to choose mathematical methods for MSIF should be discussed in detail in health monitoring systems of actual structures.
基金financially supported by the National Natural Science Foundation of China(Grant No.51879049)DK-I Dynamic Positioning System Console Project
文摘The problem of the unmanned surface vessel (USV) path planning in static and dynamic obstacle environments is addressed in this paper. Multi-behavior fusion based potential field method is proposed, which contains three behaviors: goal-seeking, boundary-memory following and dynamic-obstacle avoidance. Then, different activation conditions are designed to determine the current behavior. Meanwhile, information on the positions, velocities and the equation of motion for obstacles are detected and calculated by sensor data. Besides, memory information is introduced into the boundary following behavior to enhance cognition capability for the obstacles, and avoid local minima problem caused by the potential field method. Finally, the results of theoretical analysis and simulation show that the collision-free path can be generated for USV within different obstacle environments, and further validated the performance and effectiveness of the presented strategy.
基金The National Key Research and Development Program of China under contract Nos 2016YFC1400901 and 2018YFC1406600the National Natural Science Foundation of China under contract No.40706057+1 种基金the Environmental Protection and Science and Technology Plan Project of Zhejiang Province of China under contract No.2013A021the Research Center for Air Pollution and Health of Zhejiang University
文摘Satellite remote sensing of inland water body requires a high spatial resolution and a multiband narrow spectral resolution, which makes the fusion between panchromatic(PAN) and multi-spectral(MS) images particularly important. Taking the Daquekou section of the Qiantang River as an observation target, four conventional fusion methods widely accepted in satellite image processing, including pan sharpening(PS), principal component analysis(PCA), Gram-Schmidt(GS), and wavelet fusion(WF), are utilized to fuse MS and PAN images of GF-1.The results of subjective and objective evaluation methods application indicate that GS performs the best,followed by the PCA, the WF and the PS in the order of descending. The existence of a large area of the water body is a dominant factor impacting the fusion performance. Meanwhile, the ability of retaining spatial and spectral informations is an important factor affecting the fusion performance of different fusion methods. The fundamental difference of reflectivity information acquisition between water and land is the reason for the failure of conventional fusion methods for land observation such as the PS to be used in the presence of the large water body. It is suggested that the adoption of the conventional fusion methods in the observing water body as the main target should be taken with caution. The performances of the fusion methods need re-assessment when the large-scale water body is present in the remote sensing image or when the research aims for the water body observation.
基金National Key R&D Program of China,No.2017YFA0603702,No.2018YFC0507200National Natural Science Foundation of China,No.41271406,No.91325204Innovation Project of LREIS(O88RA600YA)
文摘For quantitatively explaining the correlations between the vascular plant species abundance (VPSA) and habitat factors, a spatial simulation method has been developed to simulate the distribution of VPSA on the Qinghai-Tibet Plateau. In this paper, the vascular plant type, land cover, mean annual biotemperature, average total annual precipitation, topographic relief, patch connectivity and ecological diversity index were selected to screen the best correlation equation between the VPSA and habitat factors on the basis of 37 national nature reserves on the Qinghai-Tibet Plateau. The research results show that the coefficient of determination between VPSA and habitat factors is 0.94, and the mean error is 2.21 types per km<sup>2</sup>. The distribution of VPSA gradually decreases from southeast to northwest, and reduces with increasing altitude except the desert area of Qaidam Basin. Furthermore, the scenarios of VPSA on the Qinghai-Tibet Plateau during the periods from 1981 to 2010 (T0), from 2011 to 2040 (T2), from 2041 to 2070 (T3) and from 2071 to 2100 (T4) were simulated by combining the land cover change and the climatic scenarios of CMIP5 RCP2.6, RCP4.5 and RCP8.5. The simulated results show that the VPSA would generally decrease on the Qinghai-Tibet Plateau from T0 to T4. The VPSA has the largest change ratio under RCP8.5 scenario, and the smallest change ratio under RCP2.6 scenario. In general, the dynamic change of habitat factors would directly affect the spatial distribution of VPSA on the Qinghai- Tibet Plateau in the future.
基金supported by the National Key Research and Development Program of China (2016YFC0501107)the Project of Ordos Science and Technology Program (2017006)the Special Project of Science and Technology Basic Work of Ministry of Science and Technology of China (2014FY110800)
文摘It is known that the exploitation of opencast coal mines has seriously damaged the environments in the semi-arid areas.Vegetation status can reliably reflect the ecological degeneration and restoration in the opencast mining areas in the semi-arid areas.Long-time series MODIS NDVI data are widely used to simulate the vegetation cover to reflect the disturbance and restoration of local ecosystems.In this study, both qualitative(linear regression method and coefficient of variation(CoV)) and quantitative(spatial buffer analysis, and change amplitude and the rate of change in the average NDVI) analyses were conducted to analyze the spatio-temporal dynamics of vegetation during 2000–2017 in Jungar Banner of Inner Mongolia Autonomous Region, China, at the large(Jungar Banner and three mine groups) and small(three types of functional areas: opencast coal mining excavation areas, reclamation areas and natural areas) scales.The results show that the rates of change in the average NDVI in the reclamation areas(20%–60%) and opencast coal mining excavation areas(10%–20%) were considerably higher than that in the natural areas(<7%).The vegetation in the reclamation areas experienced a trend of increase(3–5 a after reclamation)-decrease(the sixth year of reclamation)-stability.The vegetation in Jungar Banner has a spatial heterogeneity under the influences of mining and reclamation activities.The ratio of vegetation improvement area to vegetation degradation area in the west, southwest and east mine groups during 2000–2017 was 8:1, 20:1 and 33:1, respectively.The regions with the high CoV of NDVI above 0.45 were mainly distributed around the opencast coal mining excavation areas, and the regions with the CoV of NDVI above 0.25 were mostly located in areas with low(28.8%) and medium-low(10.2%) vegetation cover.The average disturbance distances of mining activities on vegetation in the three mine groups(west, southwest and east) were 800, 800 and 1000 m, respectively.The greater the scale of mining, the farther the disturbance distances of mining activities on vegetation.We conclude that vegetation reclamation will certainly compensate for the negative impacts of opencast coal mining activities on vegetation.Sufficient attention should be paid to the proportional allocation of plant species(herbs and shrubs) in the reclamation areas, and the restored vegetation in these areas needs to be protected for more than 6 a.Then, as the repair time increased, the vegetation condition of the reclamation areas would exceed that of the natural areas.
文摘Weighted fusion algorithms, which can be applied in the area of multi-sensor data fusion, are advanced based on weighted least square method. A weighted fusion algorithm, in which the relationship between weight coefficients and measurement noise is established, is proposed by giving attention to the correlation of measurement noise. Then a simplified weighted fusion algorithm is deduced on the assumption that measurement noise is uncorrelated. In addition, an algorithm, which can adjust the weight coefficients in the simplified algorithm by making estimations of measurement noise from measurements, is presented. It is proved by emulation and experiment that the precision performance of the multi-sensor system based on these algorithms is better than that of the multi-sensor system based on other algorithms.
基金Supported by National Natural Science Foundation of China under Grant No.11361048
文摘In the paper, the rational breather soliton and kink solitary wave solution of the (2+1)-dimensional PBLMP equation are obtained by adopting Hirota bilinear method and selecting different test functions. Furthermore, it has been found that the fusion and degeneration of the kink solitary wave occur when interaction between the rational breather soliton and the kink solitary wave happens. These phenomena are very helpful in researching soliton dynamical complexity in the higher dimensional systems.
基金supported by the National Natural Science Foundation of China (No. 52075350)the Special City School Strategic Cooperation Project of Sichuan University and Zigong (No.2021CDZG-3)
文摘The control rod drive mechanism(CRDM)is an essential part of the control and safety protection system of pressurized water reactors.Current CRDM simulations are mostly performed collectively using a single method,ignoring the influence of multiple motion units and the differences in various features among them,which strongly affect the efficiency and accuracy of the simulations.In this study,we constructed a flow field fusion simulation method based on model features by combining key motion unit analysis and various simulation methods and then applied the method to the CRDM simulation process.CRDM performs motion unit decomposition through the structural hierarchy of function-movement-action method,and the key meta-actions are identified as the nodes in the flow field simulation.We established a fused feature-based multimethod simulation process and processed the simulation methods and data according to the features of the fluid domain space and the structural complexity to obtain the fusion simulation results.Compared to traditional simulation methods and real measurements,the simulation method provides advantages in terms of simulation efficiency and accuracy.