Automatic segmentation of landslides from remote sensing imagery is challenging because traditional machine learning and early CNN-based models often fail to generalize across heterogeneous landscapes,where segmentati...Automatic segmentation of landslides from remote sensing imagery is challenging because traditional machine learning and early CNN-based models often fail to generalize across heterogeneous landscapes,where segmentation maps contain sparse and fragmented landslide regions under diverse geographical conditions.To address these issues,we propose a lightweight dual-stream siamese deep learning framework that integrates optical and topographical data fusion with an adaptive decoder,guided multimodal fusion,and deep supervision.The framework is built upon the synergistic combination of cross-attention,gated fusion,and sub-pixel upsampling within a unified dual-stream architecture specifically optimized for landslide segmentation,enabling efficient context modeling and robust feature exchange between modalities.The decoder captures long-range context at deeper levels using lightweight cross-attention and refines spatial details at shallower levels through attention-gated skip fusion,enabling precise boundary delineation and fewer false positives.The gated fusion further enhances multimodal integration of optical and topographical cues,and the deep supervision stabilizes training and improves generalization.Moreover,to mitigate checkerboard artifacts,a learnable sub-pixel upsampling is devised to replace the traditional transposed convolution.Despite its compact design with fewer parameters,the model consistently outperforms state-of-the-art baselines.Experiments on two benchmark datasets,Landslide4Sense and Bijie,confirm the effectiveness of the framework.On the Bijie dataset,it achieves an F1-score of 0.9110 and an intersection over union(IoU)of 0.8839.These results highlight its potential for accurate large-scale landslide inventory mapping and real-time disaster response.The implementation is publicly available at https://github.com/mishaown/DiGATe-UNet-LandSlide-Segmentation(accessed on 3 November 2025).展开更多
As the penetration ratio of wind power in active distribution networks continues to increase,the system exhibits some characteristics such as randomness and volatility.Fast and accurate short-term wind power predictio...As the penetration ratio of wind power in active distribution networks continues to increase,the system exhibits some characteristics such as randomness and volatility.Fast and accurate short-term wind power prediction is essential for algorithms like scheduling and optimization control.Based on the spatio-temporal features of Numerical Weather Prediction(NWP)data,it proposes the WVMD_DSN(Whale Optimization Algorithm,Variational Mode Decomposition,Dual Stream Network)model.The model first applies Pearson correlation coefficient(PCC)to choose some NWP features with strong correlation to wind power to form the feature set.Then,it decomposes the feature set using Variational Mode Decomposition(VMD)to eliminate the nonstationarity and obtains Intrinsic Mode Functions(IMFs).Here Whale Optimization Algorithm(WOA)is applied to optimise the key parameters of VMD,namely the number of mode components K and penalty factor a.Finally,incorporating attention mechanism(AM),Squeeze-Excitation Network(SENet),and Bidirectional Gated Recurrent Unit(BiGRU),it constructs the dual-stream network(DSN)for short-term wind power prediction.Comparative experiments demonstrate that the WVMD_DSN model outperforms existing baseline algorithms and exhibits good generalization performance.The relevant code is available at https://github.com/ruanyuyuan/Wind-power-forecast.git(accessed on 20 August 2024).展开更多
The accuracy of photovoltaic(PV)power prediction is significantly influenced by meteorological and environmental factors.To enhance ultra-short-term forecasting precision,this paper proposes an interpretable feedback ...The accuracy of photovoltaic(PV)power prediction is significantly influenced by meteorological and environmental factors.To enhance ultra-short-term forecasting precision,this paper proposes an interpretable feedback prediction method based on a parallel dual-stream Temporal Convolutional Network-Bidirectional Long Short-Term Memory(TCN-BiLSTM)architecture incorporating a spatiotemporal attention mechanism.Firstly,during data preprocessing,the optimal historical time window is determined through autocorrelation analysis while highly correlated features are selected as model inputs using Pearson correlation coefficients.Subsequently,a parallel dual-stream TCN-BiLSTM model is constructed where the TCN branch extracts localized transient features and the BiLSTM branch captures long-term periodic patterns,with spatiotemporal attention dynamically weighting spatiotemporal dependencies.Finally,Shapley Additive explanations(SHAP)additive analysis quantifies feature contribution rates and provides optimization feedback to the model.Validation using operational data from a PV power station in Northeast China demonstrates that compared to conventional deep learning models,the proposed method achieves a 17.6%reduction in root mean square error(RMSE),a 5.4%decrease in training time consumption,and a 4.78%improvement in continuous ranked probability score(CRPS),exhibiting significant advantages in both prediction accuracy and generalization capability.This approach enhances the application effectiveness of ultra-short-term PV power forecasting while simultaneously improving prediction accuracy and computational efficiency.展开更多
Human pose estimation is crucial across diverse applications,from healthcare to human-computer interaction.Integrating inertial measurement units(IMUs)with monocular vision methods holds great potential for leveraging...Human pose estimation is crucial across diverse applications,from healthcare to human-computer interaction.Integrating inertial measurement units(IMUs)with monocular vision methods holds great potential for leveraging complementary modalities;however,existing approaches are often limited by IMU drift,noise,and underutilization of visual information.To address these limitations,we propose a novel dual-stream feature extraction framework that effectively combines temporal IMU data and single-view image features for improved pose estimation.Short-term dependencies in IMU sequences are captured with convolutional layers,while a Transformerbased architecture models long-range temporal dynamics.To mitigate IMU drift and inter-sensor inconsistencies,a complementary filtering module is introduced alongside a cross-channel interaction mechanism.Features from the IMU and image streams are then fused via a dedicated fusion module and further refined utilizing a high-precision regression head for accurate pose prediction.Experimental results on benchmark datasets demonstrate that our method significantly outperforms existing techniques in terms of estimation,accuracy,and robustness,validating the effectiveness of our dual-stream architecture.展开更多
基金funded by the National Natural Science Foundation of China,grant number 62262045the Fundamental Research Funds for the Central Universities,grant number 2023CDJYGRH-YB11the Open Funding of SUGON Industrial Control and Security Center,grant number CUIT-SICSC-2025-03.
文摘Automatic segmentation of landslides from remote sensing imagery is challenging because traditional machine learning and early CNN-based models often fail to generalize across heterogeneous landscapes,where segmentation maps contain sparse and fragmented landslide regions under diverse geographical conditions.To address these issues,we propose a lightweight dual-stream siamese deep learning framework that integrates optical and topographical data fusion with an adaptive decoder,guided multimodal fusion,and deep supervision.The framework is built upon the synergistic combination of cross-attention,gated fusion,and sub-pixel upsampling within a unified dual-stream architecture specifically optimized for landslide segmentation,enabling efficient context modeling and robust feature exchange between modalities.The decoder captures long-range context at deeper levels using lightweight cross-attention and refines spatial details at shallower levels through attention-gated skip fusion,enabling precise boundary delineation and fewer false positives.The gated fusion further enhances multimodal integration of optical and topographical cues,and the deep supervision stabilizes training and improves generalization.Moreover,to mitigate checkerboard artifacts,a learnable sub-pixel upsampling is devised to replace the traditional transposed convolution.Despite its compact design with fewer parameters,the model consistently outperforms state-of-the-art baselines.Experiments on two benchmark datasets,Landslide4Sense and Bijie,confirm the effectiveness of the framework.On the Bijie dataset,it achieves an F1-score of 0.9110 and an intersection over union(IoU)of 0.8839.These results highlight its potential for accurate large-scale landslide inventory mapping and real-time disaster response.The implementation is publicly available at https://github.com/mishaown/DiGATe-UNet-LandSlide-Segmentation(accessed on 3 November 2025).
基金the Science and Technology Project of State Grid Corporation of China under Grant 5400-202117142A-0-0-00the National Natural Science Foundation of China under Grant 62372242.
文摘As the penetration ratio of wind power in active distribution networks continues to increase,the system exhibits some characteristics such as randomness and volatility.Fast and accurate short-term wind power prediction is essential for algorithms like scheduling and optimization control.Based on the spatio-temporal features of Numerical Weather Prediction(NWP)data,it proposes the WVMD_DSN(Whale Optimization Algorithm,Variational Mode Decomposition,Dual Stream Network)model.The model first applies Pearson correlation coefficient(PCC)to choose some NWP features with strong correlation to wind power to form the feature set.Then,it decomposes the feature set using Variational Mode Decomposition(VMD)to eliminate the nonstationarity and obtains Intrinsic Mode Functions(IMFs).Here Whale Optimization Algorithm(WOA)is applied to optimise the key parameters of VMD,namely the number of mode components K and penalty factor a.Finally,incorporating attention mechanism(AM),Squeeze-Excitation Network(SENet),and Bidirectional Gated Recurrent Unit(BiGRU),it constructs the dual-stream network(DSN)for short-term wind power prediction.Comparative experiments demonstrate that the WVMD_DSN model outperforms existing baseline algorithms and exhibits good generalization performance.The relevant code is available at https://github.com/ruanyuyuan/Wind-power-forecast.git(accessed on 20 August 2024).
基金funded by the National Natural Science Foundation of China(NSFC)(No.62066024)funded by Basic Scientific Research Projects of Higher Education Institutions in Liaoning Province(LJ212411632063)the National Undergraduate Training Program for Innovation and Entrepreneurship(S202511632045).
文摘The accuracy of photovoltaic(PV)power prediction is significantly influenced by meteorological and environmental factors.To enhance ultra-short-term forecasting precision,this paper proposes an interpretable feedback prediction method based on a parallel dual-stream Temporal Convolutional Network-Bidirectional Long Short-Term Memory(TCN-BiLSTM)architecture incorporating a spatiotemporal attention mechanism.Firstly,during data preprocessing,the optimal historical time window is determined through autocorrelation analysis while highly correlated features are selected as model inputs using Pearson correlation coefficients.Subsequently,a parallel dual-stream TCN-BiLSTM model is constructed where the TCN branch extracts localized transient features and the BiLSTM branch captures long-term periodic patterns,with spatiotemporal attention dynamically weighting spatiotemporal dependencies.Finally,Shapley Additive explanations(SHAP)additive analysis quantifies feature contribution rates and provides optimization feedback to the model.Validation using operational data from a PV power station in Northeast China demonstrates that compared to conventional deep learning models,the proposed method achieves a 17.6%reduction in root mean square error(RMSE),a 5.4%decrease in training time consumption,and a 4.78%improvement in continuous ranked probability score(CRPS),exhibiting significant advantages in both prediction accuracy and generalization capability.This approach enhances the application effectiveness of ultra-short-term PV power forecasting while simultaneously improving prediction accuracy and computational efficiency.
基金support provided by the European University of Atlantic.
文摘Human pose estimation is crucial across diverse applications,from healthcare to human-computer interaction.Integrating inertial measurement units(IMUs)with monocular vision methods holds great potential for leveraging complementary modalities;however,existing approaches are often limited by IMU drift,noise,and underutilization of visual information.To address these limitations,we propose a novel dual-stream feature extraction framework that effectively combines temporal IMU data and single-view image features for improved pose estimation.Short-term dependencies in IMU sequences are captured with convolutional layers,while a Transformerbased architecture models long-range temporal dynamics.To mitigate IMU drift and inter-sensor inconsistencies,a complementary filtering module is introduced alongside a cross-channel interaction mechanism.Features from the IMU and image streams are then fused via a dedicated fusion module and further refined utilizing a high-precision regression head for accurate pose prediction.Experimental results on benchmark datasets demonstrate that our method significantly outperforms existing techniques in terms of estimation,accuracy,and robustness,validating the effectiveness of our dual-stream architecture.