False Data Injection Attacks(FDIAs)pose a critical security threat to modern power grids,corrupting state estimation and enabling malicious control actions that can lead to severe consequences,including cascading fail...False Data Injection Attacks(FDIAs)pose a critical security threat to modern power grids,corrupting state estimation and enabling malicious control actions that can lead to severe consequences,including cascading failures,large-scale blackouts,and significant economic losses.While detecting attacks is important,accurately localizing compromised nodes or measurements is even more critical,as it enables timely mitigation,targeted response,and enhanced system resilience beyond what detection alone can offer.Existing research typically models topological features using fixed structures,which can introduce irrelevant information and affect the effectiveness of feature extraction.To address this limitation,this paper proposes an FDIA localization model with adaptive neighborhood selection,which dynamically captures spatial dependencies of the power grid by adjusting node relationships based on data-driven similarities.The improved Transformer is employed to pre-fuse global spatial features of the graph,enriching the feature representation.To improve spatio-temporal correlation extraction for FDIA localization,the proposed model employs dilated causal convolution with a gating mechanism combined with graph convolution to capture and fuse long-range temporal features and adaptive topological features.This fully exploits the temporal dynamics and spatial dependencies inherent in the power grid.Finally,multi-source information is integrated to generate highly robust node embeddings,enhancing FDIA detection and localization.Experiments are conducted on IEEE 14,57,and 118-bus systems,and the results demonstrate that the proposed model substantially improves the accuracy of FDIA localization.Additional experiments are conducted to verify the effectiveness and robustness of the proposed model.展开更多
Source localization of focal electrical activity from scalp electroencephalogram (sEEG) signal is generally modeled as an inverse problem that is highly ill-posed. In this paper, a novel source localization method is ...Source localization of focal electrical activity from scalp electroencephalogram (sEEG) signal is generally modeled as an inverse problem that is highly ill-posed. In this paper, a novel source localization method is proposed to model the EEG inverse problem using spatio-temporal long-short term memory recurrent neural networks (LSTM). The network model consists of two parts, sEEG encoding and source decoding, to model the sEEG signal and receive the regression of source location. As there does not exist enough annotated sEEG signals correspond to specific source locations, simulated data is generated with forward model using finite element method (FEM) to act as a part of training signals. A framework for source localization is proposed to estimate the source position based on simulated training data. Experiments are done on simulated testing data. The results on simulated data exhibit good robustness on noise signal, and the proposed network solves the EEG inverse problem with spatio-temporal deep network. The result show that the proposed method overcomes the highly ill-posed linear inverse problem with data driven learning.展开更多
Due to water conflicts and allocation in the Lancang-Mekong River Basin(LMRB),the spatio-temporal differentiation of total water resources and the natural-human influence need to be clarified.This work investigated LM...Due to water conflicts and allocation in the Lancang-Mekong River Basin(LMRB),the spatio-temporal differentiation of total water resources and the natural-human influence need to be clarified.This work investigated LMRB's terrestrial water storage anomaly(TWSA)and its spatio-temporal dynamics during 2002–2020.Considering the effects of natural factors and human activities,the respective contributions of climate variability and human activities to terrestrial water storage change(TWSC)were separated.Results showed that:(1)LMRB's TWSA decreased by 0.3158 cm/a.(2)TWSA showed a gradual increase in distribution from southwest of MRB to middle LMRB and from northeast of LRB to middle LMRB.TWSA positively changed in Myanmar while slightly changed in Laos and China.It negatively changed in Vietnam,Thailand and Cambodia.(3)TWSA components decreased in a descending order of soil moisture,groundwater and precipitation.(4)Natural factors had a substantial and spatial differentiated influence on TWSA over the LMRB.(5)Climate variability contributed 79%of TWSC in the LMRB while human activities contributed 21%with an increasing impact after 2008.The TWSC of upstream basin countries was found to be controlled by climate variability while Vietnam and Cambodia's TWSC has been controlled by human activities since 2012.展开更多
This paper introduces techniques in Gaussian process regression model for spatiotemporal data collected from complex systems.This study focuses on extracting local structures and then constructing surrogate models bas...This paper introduces techniques in Gaussian process regression model for spatiotemporal data collected from complex systems.This study focuses on extracting local structures and then constructing surrogate models based on Gaussian process assumptions.The proposed Dynamic Gaussian Process Regression(DGPR)consists of a sequence of local surrogate models related to each other.In DGPR,the time-based spatial clustering is carried out to divide the systems into sub-spatio-temporal parts whose interior has similar variation patterns,where the temporal information is used as the prior information for training the spatial-surrogate model.The DGPR is robust and especially suitable for the loosely coupled model structure,also allowing for parallel computation.The numerical results of the test function show the effectiveness of DGPR.Furthermore,the shock tube problem is successfully approximated under different phenomenon complexity.展开更多
This paper deals mainly with the existence and asymptotic behavior of traveling waves in a SIRH model with spatio-temporal delay and nonlocal dispersal based on Schauder’s fixed-point theorem and analysis techniques,...This paper deals mainly with the existence and asymptotic behavior of traveling waves in a SIRH model with spatio-temporal delay and nonlocal dispersal based on Schauder’s fixed-point theorem and analysis techniques,which generalize the results of nonlocal SIRH models without relapse and delay.In particular,the difficulty of obtaining the asymptotic behavior of traveling waves for the appearance of spatio-temporal delay is overcome by the use of integral techniques and analysis techniques.Finally,the more general nonexistence result of traveling waves is also included.展开更多
The ability to accurately predict urban traffic flows is crucial for optimising city operations.Consequently,various methods for forecasting urban traffic have been developed,focusing on analysing historical data to u...The ability to accurately predict urban traffic flows is crucial for optimising city operations.Consequently,various methods for forecasting urban traffic have been developed,focusing on analysing historical data to understand complex mobility patterns.Deep learning techniques,such as graph neural networks(GNNs),are popular for their ability to capture spatio-temporal dependencies.However,these models often become overly complex due to the large number of hyper-parameters involved.In this study,we introduce Dynamic Multi-Graph Spatial-Temporal Graph Neural Ordinary Differential Equation Networks(DMST-GNODE),a framework based on ordinary differential equations(ODEs)that autonomously discovers effective spatial-temporal graph neural network(STGNN)architectures for traffic prediction tasks.The comparative analysis of DMST-GNODE and baseline models indicates that DMST-GNODE model demonstrates superior performance across multiple datasets,consistently achieving the lowest Root Mean Square Error(RMSE)and Mean Absolute Error(MAE)values,alongside the highest accuracy.On the BKK(Bangkok)dataset,it outperformed other models with an RMSE of 3.3165 and an accuracy of 0.9367 for a 20-min interval,maintaining this trend across 40 and 60 min.Similarly,on the PeMS08 dataset,DMST-GNODE achieved the best performance with an RMSE of 19.4863 and an accuracy of 0.9377 at 20 min,demonstrating its effectiveness over longer periods.The Los_Loop dataset results further emphasise this model’s advantage,with an RMSE of 3.3422 and an accuracy of 0.7643 at 20 min,consistently maintaining superiority across all time intervals.These numerical highlights indicate that DMST-GNODE not only outperforms baseline models but also achieves higher accuracy and lower errors across different time intervals and datasets.展开更多
Spectrum-based fault localization (SBFL) generates a ranked list of suspicious elements by using the program execution spectrum, but the excessive number of elements ranked in parallel results in low localization accu...Spectrum-based fault localization (SBFL) generates a ranked list of suspicious elements by using the program execution spectrum, but the excessive number of elements ranked in parallel results in low localization accuracy. Most researchers consider intra-class dependencies to improve localization accuracy. However, some studies show that inter-class method call type faults account for more than 20%, which means such methods still have certain limitations. To solve the above problems, this paper proposes a two-phase software fault localization based on relational graph convolutional neural networks (Two-RGCNFL). Firstly, in Phase 1, the method call dependence graph (MCDG) of the program is constructed, the intra-class and inter-class dependencies in MCDG are extracted by using the relational graph convolutional neural network, and the classifier is used to identify the faulty methods. Then, the GraphSMOTE algorithm is improved to alleviate the impact of class imbalance on classification accuracy. Aiming at the problem of parallel ranking of element suspicious values in traditional SBFL technology, in Phase 2, Doc2Vec is used to learn static features, while spectrum information serves as dynamic features. A RankNet model based on siamese multi-layer perceptron is constructed to score and rank statements in the faulty method. This work conducts experiments on 5 real projects of Defects4J benchmark. Experimental results show that, compared with the traditional SBFL technique and two baseline methods, our approach improves the Top-1 accuracy by 262.86%, 29.59% and 53.01%, respectively, which verifies the effectiveness of Two-RGCNFL. Furthermore, this work verifies the importance of inter-class dependencies through ablation experiments.展开更多
The fast growth of mobile autonomous machines from traditional equipment to unmanned autonomous vehicles has fueled the demand for accurate and reliable localization solutions in diverse application domains.Ultra Wide...The fast growth of mobile autonomous machines from traditional equipment to unmanned autonomous vehicles has fueled the demand for accurate and reliable localization solutions in diverse application domains.Ultra Wide Band(UWB)technology has emerged as a promising candidate for addressing this need,offering high precision,immunity to multipath interference,and robust performance in challenging environments.In this comprehensive survey,we systematically explore UWB-based localization for mobile autonomous machines,spanning from fundamental principles to future trends.To the best of our knowledge,this review paper stands as the pioneer in systematically dissecting the algorithms of UWB-based localization for mobile autonomous machines,covering a spectrum from bottom-ranging schemes to advanced sensor fusion,error mitigation,and optimization techniques.By synthesizing existing knowledge,evaluating current methodologies,and highlighting future trends,this review aims to catalyze progress and innovation in the field,unlocking new opportunities for mobile autonomous machine applications across diverse industries and domains.Thus,it serves as a valuable resource for researchers,practitioners,and stakeholders interested in advancing the state-of-the-art UWB-based localization for mobile autonomous machines.展开更多
The proposed hybrid optimization algorithm integrates particle swarm optimizatio(PSO)with Ant Colony Optimization(ACO)to improve a number of pitfalls within PSO methods traditionally considered and/or applied to indus...The proposed hybrid optimization algorithm integrates particle swarm optimizatio(PSO)with Ant Colony Optimization(ACO)to improve a number of pitfalls within PSO methods traditionally considered and/or applied to industrial robots.Particle Swarm Optimization may frequently suffer from local optima and inaccuracies in identifying the geometric parameters,which are necessary for applications requiring high-accuracy performances.The proposed approach integrates pheromone-based learning of ACO with the D-H method of developing an error model;hence,the global search effectiveness together with the convergence accuracy is further improved.Comparison studies of the hybrid PSO-ACO algorithm show higher precision and effectiveness in the optimization of geometric error parameters compared to the traditional methods.This is a remarkable reduction of localization errors,thus yielding accuracy and reliability in industrial robotic systems,as the results show.This approach improves performance in those applications that demand high geometric calibration by reducing the geometric error.The paper provides an overview of input for developing robotics and automation,giving importance to precision in industrial engineering.The proposed hybrid methodology is a good way to enhance the working accuracy and effectiveness of industrial robots and shall enable their wide application to complex tasks that require a high degree of accuracy.展开更多
Ecological monitoring vehicles are equipped with a range of sensors and monitoring devices designed to gather data on ecological and environmental factors.These vehicles are crucial in various fields,including environ...Ecological monitoring vehicles are equipped with a range of sensors and monitoring devices designed to gather data on ecological and environmental factors.These vehicles are crucial in various fields,including environmental science research,ecological and environmental monitoring projects,disaster response,and emergency management.A key method employed in these vehicles for achieving high-precision positioning is LiDAR(lightlaser detection and ranging)-Visual Simultaneous Localization and Mapping(SLAM).However,maintaining highprecision localization in complex scenarios,such as degraded environments or when dynamic objects are present,remains a significant challenge.To address this issue,we integrate both semantic and texture information from LiDAR and cameras to enhance the robustness and efficiency of data registration.Specifically,semantic information simplifies the modeling of scene elements,reducing the reliance on dense point clouds,which can be less efficient.Meanwhile,visual texture information complements LiDAR-Visual localization by providing additional contextual details.By incorporating semantic and texture details frompaired images and point clouds,we significantly improve the quality of data association,thereby increasing the success rate of localization.This approach not only enhances the operational capabilities of ecological monitoring vehicles in complex environments but also contributes to improving the overall efficiency and effectiveness of ecological monitoring and environmental protection efforts.展开更多
This work presents a method for the three-dimensional localization of individual shallow NV center in diamond,leveraging the near-field quenching effect of a gold tip.Our experimental setup involves the use of an atom...This work presents a method for the three-dimensional localization of individual shallow NV center in diamond,leveraging the near-field quenching effect of a gold tip.Our experimental setup involves the use of an atomic force microscope to precisely move the gold tip close to the NV center,while simultaneously employing a home-made confocal microscope to monitor the fluorescence of the NV center.This approach allows for lateral super-resolution,achieving a full width at half maximum(FWHM)of 38.0 nm and a location uncertainty of 0.7 nm.Additionally,we show the potential of this method for determining the depth of the NV centers.We also attempt to determine the depth of the NV centers in combination with finite-difference time-domain(FDTD)simulations.Compared to other depth determination methods,this approach allows for simultaneous lateral and longitudinal localization of individual NV centers,and holds promise for facilitating manipulation of the local environment surrounding the NV center.展开更多
Automatic Dependent Surveillance-Broadcast(ADS-B)technology,with its open signal sharing,faces substantial security risks from false signals and spoofing attacks when broadcasting Unmanned Aerial Vehicle(UAV)informati...Automatic Dependent Surveillance-Broadcast(ADS-B)technology,with its open signal sharing,faces substantial security risks from false signals and spoofing attacks when broadcasting Unmanned Aerial Vehicle(UAV)information.This paper proposes a security position verification technique based on Multilateration(MLAT)to detect false signals,ensuring UAV safety and reliable airspace operations.First,the proposed method estimates the current position of the UAV by calculating the Time Difference of Arrival(TDOA),Time Sum of Arrival(TSOA),and Angle of Arrival(AOA)information.Then,this estimated position is compared with the ADS-B message to eliminate false UAV signals.Furthermore,a localization model based on TDOA/TSOA/AOA is established by utilizing reliable reference sources for base station time synchronization.Additionally,an improved Chan-Taylor algorithm is developed,incorporating the Constrained Weighted Least Squares(CWLS)method to initialize UAV position calculations.Finally,a false signal detection method is proposed to distinguish between true and false positioning targets.Numerical simulation results indicate that,at a positioning error threshold of 150 m,the improved Chan-Taylor algorithm based on TDOA/TSOA/AOA achieves 100%accuracy coverage,significantly enhancing localization precision.And the proposed false signal detection method achieves a detection accuracy rate of at least 90%within a 50-meter error range.展开更多
Objective To investigate the spatiotemporal patterns and socioeconomic factors influencing the incidence of tuberculosis(TB)in the Guangdong Province between 2010 and 2019.Method Spatial and temporal variations in TB ...Objective To investigate the spatiotemporal patterns and socioeconomic factors influencing the incidence of tuberculosis(TB)in the Guangdong Province between 2010 and 2019.Method Spatial and temporal variations in TB incidence were mapped using heat maps and hierarchical clustering.Socioenvironmental influencing factors were evaluated using a Bayesian spatiotemporal conditional autoregressive(ST-CAR)model.Results Annual incidence of TB in Guangdong decreased from 91.85/100,000 in 2010 to 53.06/100,000in 2019.Spatial hotspots were found in northeastern Guangdong,particularly in Heyuan,Shanwei,and Shantou,while Shenzhen,Dongguan,and Foshan had the lowest rates in the Pearl River Delta.The STCAR model showed that the TB risk was lower with higher per capita Gross Domestic Product(GDP)[Relative Risk(RR),0.91;95%Confidence Interval(CI):0.86–0.98],more the ratio of licensed physicians and physician(RR,0.94;95%CI:0.90-0.98),and higher per capita public expenditure(RR,0.94;95%CI:0.90–0.97),with a marginal effect of population density(RR,0.86;95%CI:0.86–1.00).Conclusion The incidence of TB in Guangdong varies spatially and temporally.Areas with poor economic conditions and insufficient healthcare resources are at an increased risk of TB infection.Strategies focusing on equitable health resource distribution and economic development are the key to TB control.展开更多
This study examines the effects of rapid land use changes in India,with a specific focus on Sonipat District in Haryana—a region undergoing significant urban expansion.Over the past two decades,rural landscapes in So...This study examines the effects of rapid land use changes in India,with a specific focus on Sonipat District in Haryana—a region undergoing significant urban expansion.Over the past two decades,rural landscapes in Sonipat have undergone notable transformation,as open spaces and agricultural lands are increasingly converted into residential colonies,commercial hubs,and industrial zones.While such changes reflect economic development and urban growth,they also raise critical concerns about sustainability,especially in terms of food security,groundwater depletion,and environmental degradation.The study examines land use changes between 2000 and 2024 using remote sensing techniques and spatial analysis.It further incorporates secondary data and insights from community-level interactions to assess the socio-economic and ecological impacts of this transformation.The findings indicate rising land fragmentation,loss of agricultural livelihoods,pressure on civic infrastructure,and increasing pollution—factors that threaten long-term regional sustainability.The study underscores the urgent need to reconcile urban development with environmental and social sustainability.By offering a detailed case study of Sonipat,this research contributes to the broader discourse on India’s urbanisation pathways.It aims to provide policymakers,planners,and researchers with evidence-based recommendations to manage land transitions more responsibly,promoting urban growth models that ensure ecological integrity,equitable development,and long-term resilience.展开更多
As Deepfake technology continues to evolve,the distinction between real and fake content becomes increasingly blurred.Most existing Deepfake video detectionmethods rely on single-frame facial image features,which limi...As Deepfake technology continues to evolve,the distinction between real and fake content becomes increasingly blurred.Most existing Deepfake video detectionmethods rely on single-frame facial image features,which limits their ability to capture temporal differences between frames.Current methods also exhibit limited generalization capabilities,struggling to detect content generated by unknown forgery algorithms.Moreover,the diversity and complexity of forgery techniques introduced by Artificial Intelligence Generated Content(AIGC)present significant challenges for traditional detection frameworks,whichmust balance high detection accuracy with robust performance.To address these challenges,we propose a novel Deepfake detection framework that combines a two-stream convolutional network with a Vision Transformer(ViT)module to enhance spatio-temporal feature representation.The ViT model extracts spatial features from the forged video,while the 3D convolutional network captures temporal features.The 3D convolution enables cross-frame feature extraction,allowing the model to detect subtle facial changes between frames.The confidence scores from both the ViT and 3D convolution submodels are fused at the decision layer,enabling themodel to effectively handle unknown forgery techniques.Focusing on Deepfake videos and GAN-generated images,the proposed approach is evaluated on two widely used public face forgery datasets.Compared to existing state-of-theartmethods,it achieves higher detection accuracy and better generalization performance,offering a robust solution for deepfake detection in real-world scenarios.展开更多
Sandfly fever is a viral infectious disease transmitted by sand flies that is widely prevalent in tropical and subtropical regions.Previous studies on its infection mechanism,immune response and diagnosis and treatmen...Sandfly fever is a viral infectious disease transmitted by sand flies that is widely prevalent in tropical and subtropical regions.Previous studies on its infection mechanism,immune response and diagnosis and treatment methods were lack of systematic.This study applied spatio-temporal omics technology to comprehensively explain the dynamic changes of immunity in the incubation period,exacerbation period,peak period and recovery period of Sandfl y fever,and integrated with diff erent coping strategies.To provide new research ideas for its overall research.展开更多
Agriculture holds a pivotal position in the economic fabric of every nation,yet concerns about agricultural carbon emission intensity(ACI)have become a major hurdle to achieving global economic sustainability.Focusing...Agriculture holds a pivotal position in the economic fabric of every nation,yet concerns about agricultural carbon emission intensity(ACI)have become a major hurdle to achieving global economic sustainability.Focusing on 31 provincial-level regions in China,this study uses the Exploratory Spatio-temporal Data Analysis(ESTDA)and Panel Quantile Regression(PQR)model to analyze the spatio-temporal interaction characteristics and influencing factors of ACI in China from 2004 to 2023.The findings are as follows:(1)ACI showed an overall downward trend,and the spatial distribution pattern was characterized by“high in the western region and low along the southeastern coast”.Although the overall disparity tended to converge,some high-carbon-intensity regions exhibited extreme trends.ACI displayed clear spatial directionality,with the spatial center shifting steadily toward the northeast.(2)Regions in the northwest,northeast,and central-south parts exhibited strong local spatial structural dynamics,and the local spatial dependence of ACI in each region showed a nonlinear trend.Generally speaking,the spatial association pattern demonstrated a certain degree of inertia in spatial transfer,reflecting strong path dependence or spatial lock-in characteristics.(3)Optimization of industrial structure and improvement in agricultural mechanization will increase ACI,while economic development can effectively reduce it.The impact of urbanization on ACI exhibits a nonlinear pattern.The coordinated development of economic growth and urbanization significantly reduces ACI,with a stronger emission reduction observed in regions with low ACI.The optimization of industrial structure,when combined with urbanization and environmental regulation,contributes to significant emission reductions particularly in high-ACI areas.Similarly,the synergy between agricultural mechanization and urbanization effectively lowers emissions in low-ACI regions,though this effect diminishes in areas with higher ACI.展开更多
Sloping farmland,particularly in mountainous and hilly areas,constitutes a significant component of regional farmland resources.An investigation into the spatio-temporal pattern of sloping farmland and its influencing...Sloping farmland,particularly in mountainous and hilly areas,constitutes a significant component of regional farmland resources.An investigation into the spatio-temporal pattern of sloping farmland and its influencing factors in China is imperative for the efficient utilization of farmland and the optimization of land space.We used land use transfer matrix,geographically weighted regression model and geographical detector to conduct this study.Results showed that sloping farmland in China firstly decreased and then increased from 2000 to 2020.The proportion of sloping farmland decreased radially outward from Sichuan basin to the surrounding areas.Change rates of sloping farmland with different slopes varied and the slope with 6°-15°underwent the fastest changes.The influencing factors of farmland at various slope degrees were different.For sloping farmland below 15°,land use intensity and elevation had the greatest contribution.For sloping farmland between 15°and 25°,elevation,land use intensity,and population density were the main influencing factors.Sloping farmland above 25°was mostly affected by natural factors.This study can provide scientific basis for rational development and protection of sloping farmland.展开更多
Current spatio-temporal action detection methods lack sufficient capabilities in extracting and comprehending spatio-temporal information. This paper introduces an end-to-end Adaptive Cross-Scale Fusion Encoder-Decode...Current spatio-temporal action detection methods lack sufficient capabilities in extracting and comprehending spatio-temporal information. This paper introduces an end-to-end Adaptive Cross-Scale Fusion Encoder-Decoder (ACSF-ED) network to predict the action and locate the object efficiently. In the Adaptive Cross-Scale Fusion Spatio-Temporal Encoder (ACSF ST-Encoder), the Asymptotic Cross-scale Feature-fusion Module (ACCFM) is designed to address the issue of information degradation caused by the propagation of high-level semantic information, thereby extracting high-quality multi-scale features to provide superior features for subsequent spatio-temporal information modeling. Within the Shared-Head Decoder structure, a shared classification and regression detection head is constructed. A multi-constraint loss function composed of one-to-one, one-to-many, and contrastive denoising losses is designed to address the problem of insufficient constraint force in predicting results with traditional methods. This loss function enhances the accuracy of model classification predictions and improves the proximity of regression position predictions to ground truth objects. The proposed method model is evaluated on the popular dataset UCF101-24 and JHMDB-21. Experimental results demonstrate that the proposed method achieves an accuracy of 81.52% on the Frame-mAP metric, surpassing current existing methods.展开更多
Electrocardiogram (ECG) analysis is critical for detecting arrhythmias, but traditional methods struggle with large-scale Electrocardiogram data and rare arrhythmia events in imbalanced datasets. These methods fail to...Electrocardiogram (ECG) analysis is critical for detecting arrhythmias, but traditional methods struggle with large-scale Electrocardiogram data and rare arrhythmia events in imbalanced datasets. These methods fail to perform multi-perspective learning of temporal signals and Electrocardiogram images, nor can they fully extract the latent information within the data, falling short of the accuracy required by clinicians. Therefore, this paper proposes an innovative hybrid multimodal spatiotemporal neural network to address these challenges. The model employs a multimodal data augmentation framework integrating visual and signal-based features to enhance the classification performance of rare arrhythmias in imbalanced datasets. Additionally, the spatiotemporal fusion module incorporates a spatiotemporal graph convolutional network to jointly model temporal and spatial features, uncovering complex dependencies within the Electrocardiogram data and improving the model’s ability to represent complex patterns. In experiments conducted on the MIT-BIH arrhythmia dataset, the model achieved 99.95% accuracy, 99.80% recall, and a 99.78% F1 score. The model was further validated for generalization using the clinical INCART arrhythmia dataset, and the results demonstrated its effectiveness in terms of both generalization and robustness.展开更多
基金supported by National Key Research and Development Plan of China(No.2022YFB3103304).
文摘False Data Injection Attacks(FDIAs)pose a critical security threat to modern power grids,corrupting state estimation and enabling malicious control actions that can lead to severe consequences,including cascading failures,large-scale blackouts,and significant economic losses.While detecting attacks is important,accurately localizing compromised nodes or measurements is even more critical,as it enables timely mitigation,targeted response,and enhanced system resilience beyond what detection alone can offer.Existing research typically models topological features using fixed structures,which can introduce irrelevant information and affect the effectiveness of feature extraction.To address this limitation,this paper proposes an FDIA localization model with adaptive neighborhood selection,which dynamically captures spatial dependencies of the power grid by adjusting node relationships based on data-driven similarities.The improved Transformer is employed to pre-fuse global spatial features of the graph,enriching the feature representation.To improve spatio-temporal correlation extraction for FDIA localization,the proposed model employs dilated causal convolution with a gating mechanism combined with graph convolution to capture and fuse long-range temporal features and adaptive topological features.This fully exploits the temporal dynamics and spatial dependencies inherent in the power grid.Finally,multi-source information is integrated to generate highly robust node embeddings,enhancing FDIA detection and localization.Experiments are conducted on IEEE 14,57,and 118-bus systems,and the results demonstrate that the proposed model substantially improves the accuracy of FDIA localization.Additional experiments are conducted to verify the effectiveness and robustness of the proposed model.
基金supported by the National Natural Science Foundation of China (No. 61672070, 61501007, 11675199, 61572004 and 81501155)the Key Project of Beijing Municipal Education Commission (No. KZ201910005008)+3 种基金general project of science and technology project of Beijing Municipal Education Commission (No. KM201610005023)the Beijing Municipal Natural Science Foundation (No. 4182005)Clinical Technology Innovation Program of Beijing Municipal Administration of Hospitals (No. XMLX201805)Beijing Municipal Science & Tech Commission (No. Z171100000117004)
文摘Source localization of focal electrical activity from scalp electroencephalogram (sEEG) signal is generally modeled as an inverse problem that is highly ill-posed. In this paper, a novel source localization method is proposed to model the EEG inverse problem using spatio-temporal long-short term memory recurrent neural networks (LSTM). The network model consists of two parts, sEEG encoding and source decoding, to model the sEEG signal and receive the regression of source location. As there does not exist enough annotated sEEG signals correspond to specific source locations, simulated data is generated with forward model using finite element method (FEM) to act as a part of training signals. A framework for source localization is proposed to estimate the source position based on simulated training data. Experiments are done on simulated testing data. The results on simulated data exhibit good robustness on noise signal, and the proposed network solves the EEG inverse problem with spatio-temporal deep network. The result show that the proposed method overcomes the highly ill-posed linear inverse problem with data driven learning.
基金National Natural Science Foundation of China,No.42161006Yunnan Fundamental Research Projects No.202201AT070094,No.202301BF070001-004+1 种基金Special Project for High-level Talents of Yunnan Province for Young Top Talents,No.C6213001159European Research Council(ERC)Starting-Grant STORIES,No.101040939。
文摘Due to water conflicts and allocation in the Lancang-Mekong River Basin(LMRB),the spatio-temporal differentiation of total water resources and the natural-human influence need to be clarified.This work investigated LMRB's terrestrial water storage anomaly(TWSA)and its spatio-temporal dynamics during 2002–2020.Considering the effects of natural factors and human activities,the respective contributions of climate variability and human activities to terrestrial water storage change(TWSC)were separated.Results showed that:(1)LMRB's TWSA decreased by 0.3158 cm/a.(2)TWSA showed a gradual increase in distribution from southwest of MRB to middle LMRB and from northeast of LRB to middle LMRB.TWSA positively changed in Myanmar while slightly changed in Laos and China.It negatively changed in Vietnam,Thailand and Cambodia.(3)TWSA components decreased in a descending order of soil moisture,groundwater and precipitation.(4)Natural factors had a substantial and spatial differentiated influence on TWSA over the LMRB.(5)Climate variability contributed 79%of TWSC in the LMRB while human activities contributed 21%with an increasing impact after 2008.The TWSC of upstream basin countries was found to be controlled by climate variability while Vietnam and Cambodia's TWSC has been controlled by human activities since 2012.
基金co-supported by the National Natural Science Foundation of China(No.12101608)the NSAF(No.U2230208)the Hunan Provincial Innovation Foundation for Postgraduate,China(No.CX20220034).
文摘This paper introduces techniques in Gaussian process regression model for spatiotemporal data collected from complex systems.This study focuses on extracting local structures and then constructing surrogate models based on Gaussian process assumptions.The proposed Dynamic Gaussian Process Regression(DGPR)consists of a sequence of local surrogate models related to each other.In DGPR,the time-based spatial clustering is carried out to divide the systems into sub-spatio-temporal parts whose interior has similar variation patterns,where the temporal information is used as the prior information for training the spatial-surrogate model.The DGPR is robust and especially suitable for the loosely coupled model structure,also allowing for parallel computation.The numerical results of the test function show the effectiveness of DGPR.Furthermore,the shock tube problem is successfully approximated under different phenomenon complexity.
基金supported by the NSF of China(11761046)Science and Technology Plan Foundation of Gansu Province of China(20JR5RA411)Foundation of A Hundred Youth Talents Training Program of Lanzhou Jiaotong University。
文摘This paper deals mainly with the existence and asymptotic behavior of traveling waves in a SIRH model with spatio-temporal delay and nonlocal dispersal based on Schauder’s fixed-point theorem and analysis techniques,which generalize the results of nonlocal SIRH models without relapse and delay.In particular,the difficulty of obtaining the asymptotic behavior of traveling waves for the appearance of spatio-temporal delay is overcome by the use of integral techniques and analysis techniques.Finally,the more general nonexistence result of traveling waves is also included.
文摘The ability to accurately predict urban traffic flows is crucial for optimising city operations.Consequently,various methods for forecasting urban traffic have been developed,focusing on analysing historical data to understand complex mobility patterns.Deep learning techniques,such as graph neural networks(GNNs),are popular for their ability to capture spatio-temporal dependencies.However,these models often become overly complex due to the large number of hyper-parameters involved.In this study,we introduce Dynamic Multi-Graph Spatial-Temporal Graph Neural Ordinary Differential Equation Networks(DMST-GNODE),a framework based on ordinary differential equations(ODEs)that autonomously discovers effective spatial-temporal graph neural network(STGNN)architectures for traffic prediction tasks.The comparative analysis of DMST-GNODE and baseline models indicates that DMST-GNODE model demonstrates superior performance across multiple datasets,consistently achieving the lowest Root Mean Square Error(RMSE)and Mean Absolute Error(MAE)values,alongside the highest accuracy.On the BKK(Bangkok)dataset,it outperformed other models with an RMSE of 3.3165 and an accuracy of 0.9367 for a 20-min interval,maintaining this trend across 40 and 60 min.Similarly,on the PeMS08 dataset,DMST-GNODE achieved the best performance with an RMSE of 19.4863 and an accuracy of 0.9377 at 20 min,demonstrating its effectiveness over longer periods.The Los_Loop dataset results further emphasise this model’s advantage,with an RMSE of 3.3422 and an accuracy of 0.7643 at 20 min,consistently maintaining superiority across all time intervals.These numerical highlights indicate that DMST-GNODE not only outperforms baseline models but also achieves higher accuracy and lower errors across different time intervals and datasets.
基金funded by the Youth Fund of the National Natural Science Foundation of China(Grant No.42261070).
文摘Spectrum-based fault localization (SBFL) generates a ranked list of suspicious elements by using the program execution spectrum, but the excessive number of elements ranked in parallel results in low localization accuracy. Most researchers consider intra-class dependencies to improve localization accuracy. However, some studies show that inter-class method call type faults account for more than 20%, which means such methods still have certain limitations. To solve the above problems, this paper proposes a two-phase software fault localization based on relational graph convolutional neural networks (Two-RGCNFL). Firstly, in Phase 1, the method call dependence graph (MCDG) of the program is constructed, the intra-class and inter-class dependencies in MCDG are extracted by using the relational graph convolutional neural network, and the classifier is used to identify the faulty methods. Then, the GraphSMOTE algorithm is improved to alleviate the impact of class imbalance on classification accuracy. Aiming at the problem of parallel ranking of element suspicious values in traditional SBFL technology, in Phase 2, Doc2Vec is used to learn static features, while spectrum information serves as dynamic features. A RankNet model based on siamese multi-layer perceptron is constructed to score and rank statements in the faulty method. This work conducts experiments on 5 real projects of Defects4J benchmark. Experimental results show that, compared with the traditional SBFL technique and two baseline methods, our approach improves the Top-1 accuracy by 262.86%, 29.59% and 53.01%, respectively, which verifies the effectiveness of Two-RGCNFL. Furthermore, this work verifies the importance of inter-class dependencies through ablation experiments.
文摘The fast growth of mobile autonomous machines from traditional equipment to unmanned autonomous vehicles has fueled the demand for accurate and reliable localization solutions in diverse application domains.Ultra Wide Band(UWB)technology has emerged as a promising candidate for addressing this need,offering high precision,immunity to multipath interference,and robust performance in challenging environments.In this comprehensive survey,we systematically explore UWB-based localization for mobile autonomous machines,spanning from fundamental principles to future trends.To the best of our knowledge,this review paper stands as the pioneer in systematically dissecting the algorithms of UWB-based localization for mobile autonomous machines,covering a spectrum from bottom-ranging schemes to advanced sensor fusion,error mitigation,and optimization techniques.By synthesizing existing knowledge,evaluating current methodologies,and highlighting future trends,this review aims to catalyze progress and innovation in the field,unlocking new opportunities for mobile autonomous machine applications across diverse industries and domains.Thus,it serves as a valuable resource for researchers,practitioners,and stakeholders interested in advancing the state-of-the-art UWB-based localization for mobile autonomous machines.
文摘The proposed hybrid optimization algorithm integrates particle swarm optimizatio(PSO)with Ant Colony Optimization(ACO)to improve a number of pitfalls within PSO methods traditionally considered and/or applied to industrial robots.Particle Swarm Optimization may frequently suffer from local optima and inaccuracies in identifying the geometric parameters,which are necessary for applications requiring high-accuracy performances.The proposed approach integrates pheromone-based learning of ACO with the D-H method of developing an error model;hence,the global search effectiveness together with the convergence accuracy is further improved.Comparison studies of the hybrid PSO-ACO algorithm show higher precision and effectiveness in the optimization of geometric error parameters compared to the traditional methods.This is a remarkable reduction of localization errors,thus yielding accuracy and reliability in industrial robotic systems,as the results show.This approach improves performance in those applications that demand high geometric calibration by reducing the geometric error.The paper provides an overview of input for developing robotics and automation,giving importance to precision in industrial engineering.The proposed hybrid methodology is a good way to enhance the working accuracy and effectiveness of industrial robots and shall enable their wide application to complex tasks that require a high degree of accuracy.
基金supported by the project“GEF9874:Strengthening Coordinated Approaches to Reduce Invasive Alien Species(lAS)Threats to Globally Significant Agrobiodiversity and Agroecosystems in China”funding from the Excellent Talent Training Funding Project in Dongcheng District,Beijing,with project number 2024-dchrcpyzz-9.
文摘Ecological monitoring vehicles are equipped with a range of sensors and monitoring devices designed to gather data on ecological and environmental factors.These vehicles are crucial in various fields,including environmental science research,ecological and environmental monitoring projects,disaster response,and emergency management.A key method employed in these vehicles for achieving high-precision positioning is LiDAR(lightlaser detection and ranging)-Visual Simultaneous Localization and Mapping(SLAM).However,maintaining highprecision localization in complex scenarios,such as degraded environments or when dynamic objects are present,remains a significant challenge.To address this issue,we integrate both semantic and texture information from LiDAR and cameras to enhance the robustness and efficiency of data registration.Specifically,semantic information simplifies the modeling of scene elements,reducing the reliance on dense point clouds,which can be less efficient.Meanwhile,visual texture information complements LiDAR-Visual localization by providing additional contextual details.By incorporating semantic and texture details frompaired images and point clouds,we significantly improve the quality of data association,thereby increasing the success rate of localization.This approach not only enhances the operational capabilities of ecological monitoring vehicles in complex environments but also contributes to improving the overall efficiency and effectiveness of ecological monitoring and environmental protection efforts.
基金supported by the National Natural Science Foundation of China(T2325023,92265204,12104447)the National Key R&D Program of China(2023YFF0718400)+1 种基金the Innovation Program for Quantum Science and Technology(2021ZD0302200)the Fundamental Research Funds for the Central Universities。
文摘This work presents a method for the three-dimensional localization of individual shallow NV center in diamond,leveraging the near-field quenching effect of a gold tip.Our experimental setup involves the use of an atomic force microscope to precisely move the gold tip close to the NV center,while simultaneously employing a home-made confocal microscope to monitor the fluorescence of the NV center.This approach allows for lateral super-resolution,achieving a full width at half maximum(FWHM)of 38.0 nm and a location uncertainty of 0.7 nm.Additionally,we show the potential of this method for determining the depth of the NV centers.We also attempt to determine the depth of the NV centers in combination with finite-difference time-domain(FDTD)simulations.Compared to other depth determination methods,this approach allows for simultaneous lateral and longitudinal localization of individual NV centers,and holds promise for facilitating manipulation of the local environment surrounding the NV center.
基金supported by the National Natural Science Foundation of China(Nos.U2441250,62301380,and 62231027)Natural Science Basic Research Program of Shaanxi,China(2024JC-JCQN-63)+3 种基金the Key Research and Development Program of Shaanxi,China(No.2023-YBGY-249)the Guangxi Key Research and Development Program,China(No.2022AB46002)the China Postdoctoral Science Foundation(No.2022M722504 and 2024T170696)the Innovation Capability Support Program of Shaanxi,China(No.2024RS-CXTD-01).
文摘Automatic Dependent Surveillance-Broadcast(ADS-B)technology,with its open signal sharing,faces substantial security risks from false signals and spoofing attacks when broadcasting Unmanned Aerial Vehicle(UAV)information.This paper proposes a security position verification technique based on Multilateration(MLAT)to detect false signals,ensuring UAV safety and reliable airspace operations.First,the proposed method estimates the current position of the UAV by calculating the Time Difference of Arrival(TDOA),Time Sum of Arrival(TSOA),and Angle of Arrival(AOA)information.Then,this estimated position is compared with the ADS-B message to eliminate false UAV signals.Furthermore,a localization model based on TDOA/TSOA/AOA is established by utilizing reliable reference sources for base station time synchronization.Additionally,an improved Chan-Taylor algorithm is developed,incorporating the Constrained Weighted Least Squares(CWLS)method to initialize UAV position calculations.Finally,a false signal detection method is proposed to distinguish between true and false positioning targets.Numerical simulation results indicate that,at a positioning error threshold of 150 m,the improved Chan-Taylor algorithm based on TDOA/TSOA/AOA achieves 100%accuracy coverage,significantly enhancing localization precision.And the proposed false signal detection method achieves a detection accuracy rate of at least 90%within a 50-meter error range.
基金supported by the Guangdong Provincial Clinical Research Center for Tuberculosis(No.2020B1111170014)。
文摘Objective To investigate the spatiotemporal patterns and socioeconomic factors influencing the incidence of tuberculosis(TB)in the Guangdong Province between 2010 and 2019.Method Spatial and temporal variations in TB incidence were mapped using heat maps and hierarchical clustering.Socioenvironmental influencing factors were evaluated using a Bayesian spatiotemporal conditional autoregressive(ST-CAR)model.Results Annual incidence of TB in Guangdong decreased from 91.85/100,000 in 2010 to 53.06/100,000in 2019.Spatial hotspots were found in northeastern Guangdong,particularly in Heyuan,Shanwei,and Shantou,while Shenzhen,Dongguan,and Foshan had the lowest rates in the Pearl River Delta.The STCAR model showed that the TB risk was lower with higher per capita Gross Domestic Product(GDP)[Relative Risk(RR),0.91;95%Confidence Interval(CI):0.86–0.98],more the ratio of licensed physicians and physician(RR,0.94;95%CI:0.90-0.98),and higher per capita public expenditure(RR,0.94;95%CI:0.90–0.97),with a marginal effect of population density(RR,0.86;95%CI:0.86–1.00).Conclusion The incidence of TB in Guangdong varies spatially and temporally.Areas with poor economic conditions and insufficient healthcare resources are at an increased risk of TB infection.Strategies focusing on equitable health resource distribution and economic development are the key to TB control.
文摘This study examines the effects of rapid land use changes in India,with a specific focus on Sonipat District in Haryana—a region undergoing significant urban expansion.Over the past two decades,rural landscapes in Sonipat have undergone notable transformation,as open spaces and agricultural lands are increasingly converted into residential colonies,commercial hubs,and industrial zones.While such changes reflect economic development and urban growth,they also raise critical concerns about sustainability,especially in terms of food security,groundwater depletion,and environmental degradation.The study examines land use changes between 2000 and 2024 using remote sensing techniques and spatial analysis.It further incorporates secondary data and insights from community-level interactions to assess the socio-economic and ecological impacts of this transformation.The findings indicate rising land fragmentation,loss of agricultural livelihoods,pressure on civic infrastructure,and increasing pollution—factors that threaten long-term regional sustainability.The study underscores the urgent need to reconcile urban development with environmental and social sustainability.By offering a detailed case study of Sonipat,this research contributes to the broader discourse on India’s urbanisation pathways.It aims to provide policymakers,planners,and researchers with evidence-based recommendations to manage land transitions more responsibly,promoting urban growth models that ensure ecological integrity,equitable development,and long-term resilience.
基金supported by National Natural Science Foundation of China(Nos.62477026,62177029,61807020)Humanities and Social Sciences Research Program of the Ministry of Education of China(No.23YJAZH047)the Startup Foundation for Introducing Talent of Nanjing University of Posts and Communications under Grant NY222034.
文摘As Deepfake technology continues to evolve,the distinction between real and fake content becomes increasingly blurred.Most existing Deepfake video detectionmethods rely on single-frame facial image features,which limits their ability to capture temporal differences between frames.Current methods also exhibit limited generalization capabilities,struggling to detect content generated by unknown forgery algorithms.Moreover,the diversity and complexity of forgery techniques introduced by Artificial Intelligence Generated Content(AIGC)present significant challenges for traditional detection frameworks,whichmust balance high detection accuracy with robust performance.To address these challenges,we propose a novel Deepfake detection framework that combines a two-stream convolutional network with a Vision Transformer(ViT)module to enhance spatio-temporal feature representation.The ViT model extracts spatial features from the forged video,while the 3D convolutional network captures temporal features.The 3D convolution enables cross-frame feature extraction,allowing the model to detect subtle facial changes between frames.The confidence scores from both the ViT and 3D convolution submodels are fused at the decision layer,enabling themodel to effectively handle unknown forgery techniques.Focusing on Deepfake videos and GAN-generated images,the proposed approach is evaluated on two widely used public face forgery datasets.Compared to existing state-of-theartmethods,it achieves higher detection accuracy and better generalization performance,offering a robust solution for deepfake detection in real-world scenarios.
基金College Students Innovation and Entrepreneurship Training Program(X202511049398)College Students Innovation and Entrepreneurship Training Program(X202511049201)+1 种基金College Students Innovation and Entrepreneurship Training Program(X202511258005S)University-Level Research Funding Program of Hainan Science and Technology Vocational University(HKKY2024-87)。
文摘Sandfly fever is a viral infectious disease transmitted by sand flies that is widely prevalent in tropical and subtropical regions.Previous studies on its infection mechanism,immune response and diagnosis and treatment methods were lack of systematic.This study applied spatio-temporal omics technology to comprehensively explain the dynamic changes of immunity in the incubation period,exacerbation period,peak period and recovery period of Sandfl y fever,and integrated with diff erent coping strategies.To provide new research ideas for its overall research.
基金National Natural Science Foundation of China,No.42230106,No.42171250State Key Laboratory of Earth Surface Processes and Resource Ecology,No.2022-ZD-04。
文摘Agriculture holds a pivotal position in the economic fabric of every nation,yet concerns about agricultural carbon emission intensity(ACI)have become a major hurdle to achieving global economic sustainability.Focusing on 31 provincial-level regions in China,this study uses the Exploratory Spatio-temporal Data Analysis(ESTDA)and Panel Quantile Regression(PQR)model to analyze the spatio-temporal interaction characteristics and influencing factors of ACI in China from 2004 to 2023.The findings are as follows:(1)ACI showed an overall downward trend,and the spatial distribution pattern was characterized by“high in the western region and low along the southeastern coast”.Although the overall disparity tended to converge,some high-carbon-intensity regions exhibited extreme trends.ACI displayed clear spatial directionality,with the spatial center shifting steadily toward the northeast.(2)Regions in the northwest,northeast,and central-south parts exhibited strong local spatial structural dynamics,and the local spatial dependence of ACI in each region showed a nonlinear trend.Generally speaking,the spatial association pattern demonstrated a certain degree of inertia in spatial transfer,reflecting strong path dependence or spatial lock-in characteristics.(3)Optimization of industrial structure and improvement in agricultural mechanization will increase ACI,while economic development can effectively reduce it.The impact of urbanization on ACI exhibits a nonlinear pattern.The coordinated development of economic growth and urbanization significantly reduces ACI,with a stronger emission reduction observed in regions with low ACI.The optimization of industrial structure,when combined with urbanization and environmental regulation,contributes to significant emission reductions particularly in high-ACI areas.Similarly,the synergy between agricultural mechanization and urbanization effectively lowers emissions in low-ACI regions,though this effect diminishes in areas with higher ACI.
基金supported by the Key Laboratory of Natural Resources Monitoring and Supervision in Southern Hilly Region,Ministry of Natural Resources(NRMSSHR2023Y02)Yunnan Key Laboratory of Plateau Geographic Processes and Environmental Changes,Faculty of Geography,Yunnan Normal University(PGPEC2304)China Scholarship Council。
文摘Sloping farmland,particularly in mountainous and hilly areas,constitutes a significant component of regional farmland resources.An investigation into the spatio-temporal pattern of sloping farmland and its influencing factors in China is imperative for the efficient utilization of farmland and the optimization of land space.We used land use transfer matrix,geographically weighted regression model and geographical detector to conduct this study.Results showed that sloping farmland in China firstly decreased and then increased from 2000 to 2020.The proportion of sloping farmland decreased radially outward from Sichuan basin to the surrounding areas.Change rates of sloping farmland with different slopes varied and the slope with 6°-15°underwent the fastest changes.The influencing factors of farmland at various slope degrees were different.For sloping farmland below 15°,land use intensity and elevation had the greatest contribution.For sloping farmland between 15°and 25°,elevation,land use intensity,and population density were the main influencing factors.Sloping farmland above 25°was mostly affected by natural factors.This study can provide scientific basis for rational development and protection of sloping farmland.
基金support for this work was supported by Key Lab of Intelligent and Green Flexographic Printing under Grant ZBKT202301.
文摘Current spatio-temporal action detection methods lack sufficient capabilities in extracting and comprehending spatio-temporal information. This paper introduces an end-to-end Adaptive Cross-Scale Fusion Encoder-Decoder (ACSF-ED) network to predict the action and locate the object efficiently. In the Adaptive Cross-Scale Fusion Spatio-Temporal Encoder (ACSF ST-Encoder), the Asymptotic Cross-scale Feature-fusion Module (ACCFM) is designed to address the issue of information degradation caused by the propagation of high-level semantic information, thereby extracting high-quality multi-scale features to provide superior features for subsequent spatio-temporal information modeling. Within the Shared-Head Decoder structure, a shared classification and regression detection head is constructed. A multi-constraint loss function composed of one-to-one, one-to-many, and contrastive denoising losses is designed to address the problem of insufficient constraint force in predicting results with traditional methods. This loss function enhances the accuracy of model classification predictions and improves the proximity of regression position predictions to ground truth objects. The proposed method model is evaluated on the popular dataset UCF101-24 and JHMDB-21. Experimental results demonstrate that the proposed method achieves an accuracy of 81.52% on the Frame-mAP metric, surpassing current existing methods.
基金supported by The Henan Province Science and Technology Research Project(242102211046)the Key Scientific Research Project of Higher Education Institutions in Henan Province(25A520039)+1 种基金theNatural Science Foundation project of Zhongyuan Institute of Technology(K2025YB011)the Zhongyuan University of Technology Graduate Education and Teaching Reform Research Project(JG202424).
文摘Electrocardiogram (ECG) analysis is critical for detecting arrhythmias, but traditional methods struggle with large-scale Electrocardiogram data and rare arrhythmia events in imbalanced datasets. These methods fail to perform multi-perspective learning of temporal signals and Electrocardiogram images, nor can they fully extract the latent information within the data, falling short of the accuracy required by clinicians. Therefore, this paper proposes an innovative hybrid multimodal spatiotemporal neural network to address these challenges. The model employs a multimodal data augmentation framework integrating visual and signal-based features to enhance the classification performance of rare arrhythmias in imbalanced datasets. Additionally, the spatiotemporal fusion module incorporates a spatiotemporal graph convolutional network to jointly model temporal and spatial features, uncovering complex dependencies within the Electrocardiogram data and improving the model’s ability to represent complex patterns. In experiments conducted on the MIT-BIH arrhythmia dataset, the model achieved 99.95% accuracy, 99.80% recall, and a 99.78% F1 score. The model was further validated for generalization using the clinical INCART arrhythmia dataset, and the results demonstrated its effectiveness in terms of both generalization and robustness.