Accurately predicting the State of Health(SOH)of lithium-ion batteries is a critical challenge to ensure their reliability and safety in energy storage systems,such as electric vehicles and renewable energy grids.The ...Accurately predicting the State of Health(SOH)of lithium-ion batteries is a critical challenge to ensure their reliability and safety in energy storage systems,such as electric vehicles and renewable energy grids.The intricate battery degradation process is influenced by evolving spatial and temporal interactions among health indicators.Existing methods often fail to capture the dynamic interactions between health indicators over time,resulting in limited predictive accuracy.To address these challenges,we propose a novel framework,Dynamic Graph Learning with Spatial-Temporal Fusion Attention(DGL-STFA),which transforms health indicator series time-data into time-evolving graph representations.The framework employs multi-scale convolutional neural networks to capture diverse temporal patterns,a self-attention mechanism to construct dynamic adjacency matrices that adapt over time,and a temporal attention mechanism to identify and prioritize key moments that influence battery degradation.This combination enables DGL-STFA to effectively model both dynamic spatial relationships and long-term temporal dependencies,enhancing SOH prediction accuracy.Extensive experiments were conducted on the NASA and CALCE battery datasets,comparing this framework with traditional time-series prediction methods and other graph-based prediction methods.The results demonstrate that our framework significantly improves prediction accuracy,with a mean absolute error more than 30%lower than other methods.Further analysis demonstrated the robustness of DGL-STFA across various battery life stages,including early,mid,and end-of-life phases.These results highlight the capability of DGL-STFA to accurately predict SOH,addressing critical challenges in advancing battery health monitoring for energy storage applications.展开更多
Bird's-eye-view(BEV)perception is a core technology for autonomous driving systems.However,existing solutions face the dilemma of high costs associated with multimodal methods and limited performance of vision-onl...Bird's-eye-view(BEV)perception is a core technology for autonomous driving systems.However,existing solutions face the dilemma of high costs associated with multimodal methods and limited performance of vision-only approaches.To address this issue,this paper proposes a framework named“a lightweight pure visual BEV perception method based on dual distillation of spatial-temporal knowledge”.This framework innovatively designs a lightweight vision-only student model based on Res Net,which leverages a dual distillation mechanism to learn from a powerful teacher model that integrates temporal information from both image and light detection and ranging(LiDAR)modalities.Specifically,we distill efficient multi-modal feature extraction and spatial fusion capabilities from the BEVFusion model,and distill advanced temporal information fusion and spatiotemporal attention mechanisms from the BEVFormer model.This dual distillation strategy enables the student model to achieve perception performance close to that of multi-modal models without relying on Li DAR.Experimental results on the nu Scenes dataset demonstrate that the proposed model significantly outperforms classical vision-only algorithms,achieves comparable performance to current state-of-the-art vision-only methods on the nu Scenes detection leaderboard in terms of both mean average precision(mAP)and the nu Scenes detection score(NDS)metrics,and exhibits notable advantages in inference computational efficiency.Although the proposed dual-teacher paradigm incurs higher offline training costs compared to single-model approaches,it yields a streamlined and highly efficient student model suitable for resource-constrained real-time deployment.This provides an effective pathway toward low-cost,high-performance autonomous driving perception systems.展开更多
Smart grid substation operations often take place in hazardous environments and pose significant threats to the safety of power personnel.Relying solely on manual supervision can lead to inadequate oversight.In respon...Smart grid substation operations often take place in hazardous environments and pose significant threats to the safety of power personnel.Relying solely on manual supervision can lead to inadequate oversight.In response to the demand for technology to identify improper operations in substation work scenarios,this paper proposes a substation safety action recognition technology to avoid the misoperation and enhance the safety management.In general,this paper utilizes a dual-branch transformer network to extract spatial and temporal information from the video dataset of operational behaviors in complex substation environments.Firstly,in order to capture the spatial-temporal correlation of people's behaviors in smart grid substation,we devise a sparse attention module and a segmented linear attention module that are embedded into spatial branch transformer and temporal branch transformer respectively.To avoid the redundancy of spatial and temporal information,we fuse the temporal and spatial features using a tensor decomposition fusion module by a decoupled manner.Experimental results indicate that our proposed method accurately detects improper operational behaviors in substation work scenarios,outperforming other existing methods in terms of detection and recognition accuracy.展开更多
As a significant city in the Yangtze River Delta regions,Hefei has experienced rapid changes in the sources of air pollution due to its high-speed economic development and urban expansion.However,there has been limite...As a significant city in the Yangtze River Delta regions,Hefei has experienced rapid changes in the sources of air pollution due to its high-speed economic development and urban expansion.However,there has been limited research in recent years on the spatial-temporal distribution and emission of its atmospheric pollutants.To address this,this study conducted mobile observations of urban roads using the Mobile-DOAS instrument from June 2021 to May 2022.The monitoring results exhibit a favourable consistent with TROPOMI satellite data and ground monitoring station data.Temporally,there were pronounced seasonal variations in air pollutants.Spatially,high concentration of HCHO and NO_(2)were closely associated with traffic congestion on roadways,while heightened SO_(2)levels were attributed to winter heating and industrial emissions.The study also revealed that with the implementation of road policies,the average vehicle speed increased by 95.4%,while the NO concentration decreased by 54.4%.In the estimation of urban NO_(x)emission flux,it was observed that in temporal terms,compared with inventory data,the emissions calculated viamobile measurements exhibitedmore distinct seasonal patterns,with the highest emission rate of 349 g/sec in winter and the lowest of 142 g/sec in summer.In spatial terms,the significant difference in emissions between the inner and outer ring roads also suggests the presence of the city’s primary NO_(x)emission sources in the area between these two rings.This study offers data support for formulating the next phase of air pollution control measures in urban areas.展开更多
In the current situation of decelerating economic expansion,examining the digital economy(DE)as a novel economic model is beneficial for the local economy’s sustainable and high-quality development(HQD).We analyzed p...In the current situation of decelerating economic expansion,examining the digital economy(DE)as a novel economic model is beneficial for the local economy’s sustainable and high-quality development(HQD).We analyzed panel data from the Yellow River(YR)region from 2013 to 2021 and discovered notable spatial variances in the composite index and coupling coordination of the two systems.Specifically,the downstream region exhibited the highest coupling coordination,while the upstream region had the lowest.We identified that favorable factors such as economic development,innovation,industrial upgrading,and government intervention can bolster the coupling.Our findings provide a valuable framework for promoting DE and HQD in the YR region.展开更多
Accurate traffic flow prediction has a profound impact on modern traffic management. Traffic flow has complex spatial-temporal correlations and periodicity, which poses difficulties for precise prediction. To address ...Accurate traffic flow prediction has a profound impact on modern traffic management. Traffic flow has complex spatial-temporal correlations and periodicity, which poses difficulties for precise prediction. To address this problem, a Multi-head Self-attention and Spatial-Temporal Graph Convolutional Network (MSSTGCN) for multiscale traffic flow prediction is proposed. Firstly, to capture the hidden traffic periodicity of traffic flow, traffic flow is divided into three kinds of periods, including hourly, daily, and weekly data. Secondly, a graph attention residual layer is constructed to learn the global spatial features across regions. Local spatial-temporal dependence is captured by using a T-GCN module. Thirdly, a transformer layer is introduced to learn the long-term dependence in time. A position embedding mechanism is introduced to label position information for all traffic sequences. Thus, this multi-head self-attention mechanism can recognize the sequence order and allocate weights for different time nodes. Experimental results on four real-world datasets show that the MSSTGCN performs better than the baseline methods and can be successfully adapted to traffic prediction tasks.展开更多
Visible and infrared(RGB-IR)fusion object detection plays an important role in security,disaster relief,etc.In recent years,deep-learning-based RGB-IR fusion detection methods have been developing rapidly,but still st...Visible and infrared(RGB-IR)fusion object detection plays an important role in security,disaster relief,etc.In recent years,deep-learning-based RGB-IR fusion detection methods have been developing rapidly,but still struggle to deal with the complex and changing scenarios captured by drones,mainly due to two reasons:(A)RGB-IR fusion detectors are susceptible to inferior inputs that degrade performance and stability.(B)RGB-IR fusion detectors are susceptible to redundant features that reduce accuracy and efficiency.In this paper,an innovative RGB-IR fusion detection framework based on global-local feature optimization,named GLFDet,is proposed to improve the detection performance and efficiency of drone-captured objects.The key components of GLFDet include a Global Feature Optimization(GFO)module,a Local Feature Optimization(LFO)module and a Channel Separation Fusion(CSF)module.Specifically,GFO calculates the information content of the input image from the frequency domain and optimizes the features holistically.Then,LFO dynamically selects high-value features and filters out low-value features before fusion,which significantly improves the efficiency of fusion.Finally,CSF fuses the RGB and IR features across the corresponding channels,which avoids the rearrangement of the channel relationships and enhances the model stability.Extensive experimental results show that the proposed method achieves the best performance on three popular RGB-IR datasets Drone Vehicle,VEDAI,and LLVIP.In addition,GLFDet is more lightweight than other comparable models,making it more appealing to edge devices such as drones.The code is available at https://github.com/lao chen330/GLFDet.展开更多
Spatial-temporal traffic prediction technology is crucial for network planning,resource allocation optimizing,and user experience improving.With the development of virtual network operators,multi-operator collaboratio...Spatial-temporal traffic prediction technology is crucial for network planning,resource allocation optimizing,and user experience improving.With the development of virtual network operators,multi-operator collaborations,and edge computing,spatial-temporal traffic data has taken on a distributed nature.Consequently,noncentralized spatial-temporal traffic prediction solutions have emerged as a recent research focus.Currently,the majority of research typically adopts federated learning methods to train traffic prediction models distributed on each base station.This method reduces additional burden on communication systems.However,this method has a drawback:it cannot handle irregular traffic data.Due to unstable wireless network environments,device failures,insufficient storage resources,etc.,data missing inevitably occurs during the process of collecting traffic data.This results in the irregular nature of distributed traffic data.Yet,commonly used traffic prediction models such as Recurrent Neural Networks(RNN)and Long Short-Term Memory(LSTM)typically assume that the data is complete and regular.To address the challenge of handling irregular traffic data,this paper transforms irregular traffic prediction into problems of estimating latent variables and generating future traffic.To solve the aforementioned problems,this paper introduces split learning to design a structured distributed learning framework.The framework comprises a Global-level Spatial structure mining Model(GSM)and several Nodelevel Generative Models(NGMs).NGM and GSM represent Seq2Seq models deployed on the base station and graph neural network models deployed on the cloud or central controller.Firstly,the time embedding layer in NGM establishes the mapping relationship between irregular traffic data and regular latent temporal feature variables.Secondly,GSM collects statistical feature parameters of latent temporal feature variables from various nodes and executes graph embedding for spatial-temporal traffic data.Finally,NGM generates future traffic based on latent temporal and spatial feature variables.The introduction of the time attention mechanism enhances the framework’s capability to handle irregular traffic data.Graph attention network introduces spatially correlated base station traffic feature information into local traffic prediction,which compensates for missing information in local irregular traffic data.The proposed framework effectively addresses the distributed prediction issues of irregular traffic data.By testing on real world datasets,the proposed framework improves traffic prediction accuracy by 35%compared to other commonly used distributed traffic prediction methods.展开更多
A dual-phase synergistic enhancement method was adopted to strengthen the Al-Mn-Mg-Sc-Zr alloy fabricated by laser powder bed fusion(LPBF)by leveraging the unique advantages of Er and TiB_(2).Spherical powders of 0.5w...A dual-phase synergistic enhancement method was adopted to strengthen the Al-Mn-Mg-Sc-Zr alloy fabricated by laser powder bed fusion(LPBF)by leveraging the unique advantages of Er and TiB_(2).Spherical powders of 0.5wt%Er-1wt%TiB_(2)/Al-Mn-Mg-Sc-Zr nanocomposite were prepared using vacuum homogenization technique,and the density of samples prepared through the LPBF process reached 99.8%.The strengthening and toughening mechanisms of Er-TiB_(2)were investigated.The results show that Al_(3)Er diffraction peaks are detected by X-ray diffraction analysis,and texture strength decreases according to electron backscatter diffraction results.The added Er and TiB_(2)nano-reinforcing phases act as heterogeneous nucleation sites during the LPBF forming process,hindering grain growth and effectively refining the grains.After incorporating the Er-TiB_(2)dual-phase nano-reinforcing phases,the tensile strength and elongation at break of the LPBF-deposited samples reach 550 MPa and 18.7%,which are 13.4%and 26.4%higher than those of the matrix material,respectively.展开更多
The process of nuclear fusion in the presence of a laser field was theoretically analyzed.The analysis is applicable to most fusion reactions and different types of currently available intense lasers,from X-ray free-e...The process of nuclear fusion in the presence of a laser field was theoretically analyzed.The analysis is applicable to most fusion reactions and different types of currently available intense lasers,from X-ray free-electron lasers to solid-state near-infrared lasers.Laser fields were shown to enhance the fusion yields,and the mechanism of this enhancement was explained.Low-frequency lasers are more efficient in enhancing fusion than high-frequency lasers.The calculation results show enhancements of fusion yields by orders of magnitude with currently available intense low-frequency laser fields.The temperature requirement for controlled nuclear fusion may be reduced with the aid of intense laser fields.展开更多
Fault diagnosis of rolling bearings is crucial for ensuring the stable operation of mechanical equipment and production safety in industrial environments.However,due to the nonlinearity and non-stationarity of collect...Fault diagnosis of rolling bearings is crucial for ensuring the stable operation of mechanical equipment and production safety in industrial environments.However,due to the nonlinearity and non-stationarity of collected vibration signals,single-modal methods struggle to capture fault features fully.This paper proposes a rolling bearing fault diagnosis method based on multi-modal information fusion.The method first employs the Hippopotamus Optimization Algorithm(HO)to optimize the number of modes in Variational Mode Decomposition(VMD)to achieve optimal modal decomposition performance.It combines Convolutional Neural Networks(CNN)and Gated Recurrent Units(GRU)to extract temporal features from one-dimensional time-series signals.Meanwhile,the Markovian Transition Field(MTF)is used to transform one-dimensional signals into two-dimensional images for spatial feature mining.Through visualization techniques,the effectiveness of generated images from different parameter combinations is compared to determine the optimal parameter configuration.A multi-modal network(GSTCN)is constructed by integrating Swin-Transformer and the Convolutional Block Attention Module(CBAM),where the attention module is utilized to enhance fault features.Finally,the fault features extracted from different modalities are deeply fused and fed into a fully connected layer to complete fault classification.Experimental results show that the GSTCN model achieves an average diagnostic accuracy of 99.5%across three datasets,significantly outperforming existing comparison methods.This demonstrates that the proposed model has high diagnostic precision and good generalization ability,providing an efficient and reliable solution for rolling bearing fault diagnosis.展开更多
Parkinson’s disease remains a major clinical issue in terms of early detection,especially during its prodromal stage when symptoms are not evident or not distinct.To address this problem,we proposed a new deep learni...Parkinson’s disease remains a major clinical issue in terms of early detection,especially during its prodromal stage when symptoms are not evident or not distinct.To address this problem,we proposed a new deep learning 2-based approach for detecting Parkinson’s disease before any of the overt symptoms develop during their prodromal stage.We used 5 publicly accessible datasets,including UCI Parkinson’s Voice,Spiral Drawings,PaHaW,NewHandPD,and PPMI,and implemented a dual stream CNN–BiLSTM architecture with Fisher-weighted feature merging and SHAP-based explanation.The findings reveal that the model’s performance was superior and achieved 98.2%,a F1-score of 0.981,and AUC of 0.991 on the UCI Voice dataset.The model’s performance on the remaining datasets was also comparable,with up to a 2–7 percent betterment in accuracy compared to existing strong models such as CNN–RNN–MLP,ILN–GNet,and CASENet.Across the evidence,the findings back the diagnostic promise of micro-tremor assessment and demonstrate that combining temporal and spatial features with a scatter-based segment for a multi-modal approach can be an effective and scalable platform for an“early,”interpretable PD screening system.展开更多
A low-temperature-resistant and high-strength stainless-steel jacket is a key component in the superconducting magnet of a fusion reactor.The development of cryogenic structural materials with high strength and toughn...A low-temperature-resistant and high-strength stainless-steel jacket is a key component in the superconducting magnet of a fusion reactor.The development of cryogenic structural materials with high strength and toughness poses a challenge for the future development of high-field superconducting magnets in fusion reactors.The yield strength of the International Thermonuclear Experimental Reactor developed for low-temperature structural materials at 4.2K is below 1100MPa,which fails to meet the demand for structural components with yield strengths exceeding 1500MPa at 4.2K in the future fusion reactors.CHSN01(formerly N50H),which is a low-temperature structural material developed in China,exhibits exceptional strength and toughness,thereby making it highly promising for practical applications.Recently,a 30 t jacket measuring approximately 5000m in total length was produced.Its low-temperature mechanical properties were tested using a sampling method to ensure compliance with application requirements.This paper presents the experimental data of the CHSN01 jacket and tests of the physical properties of the material in the temperature range of 4–300 K.The physical properties were unaffected by magnetic field.Furthermore,this paper discusses the feasibility of employing CHSN01 as a cryogenic structural material capable of withstanding high magnetic fields in next-generation fusion reactors.展开更多
Traffic sign detection is a critical component of driving systems.Single-stage network-based traffic sign detection algorithms,renowned for their fast detection speeds and high accuracy,have become the dominant approa...Traffic sign detection is a critical component of driving systems.Single-stage network-based traffic sign detection algorithms,renowned for their fast detection speeds and high accuracy,have become the dominant approach in current practices.However,in complex and dynamic traffic scenes,particularly with smaller traffic sign objects,challenges such as missed and false detections can lead to reduced overall detection accuracy.To address this issue,this paper proposes a detection algorithm that integrates edge and shape information.Recognizing that traffic signs have specific shapes and distinct edge contours,this paper introduces an edge feature extraction branch within the backbone network,enabling adaptive fusion with features of the same hierarchical level.Additionally,a shape prior convolution module is designed to replaces the first two convolutional modules of the backbone network,aimed at enhancing the model's perception ability for specific shape objects and reducing its sensitivity to background noise.The algorithm was evaluated on the CCTSDB and TT100k datasets,and compared to YOLOv8s,the mAP50 values increased by 3.0%and 10.4%,respectively,demonstrating the effectiveness of the proposed method in improving the accuracy of traffic sign detection.展开更多
AIM:To investigate the effects of binocular fusional C-optotypes(positive/negative)and 2D planar C-optotypes on the amplitude and stability of transient accommodation(TAC)in adults,and to provide a basis for non-conta...AIM:To investigate the effects of binocular fusional C-optotypes(positive/negative)and 2D planar C-optotypes on the amplitude and stability of transient accommodation(TAC)in adults,and to provide a basis for non-contact myopia intervention.METHODS:This was a self-controlled study.Using redblue 3D technology,four experimental stages were set up:Test A[fixating on the 1 m negative fusional C-optotypes,8△base-in(BI)],Test B(fixating on the 5 m planar C-optotypes),Test C(fixating on the 1 m planar C-optotypes),and Test D[fixating on the 1 m positive fusional C-optotypes,20△base-out(BO)].A WAM-5500 open-field autorefractor was used to measure TAC and accommodative microfluctuations[evaluated via interquartile range(IQR)and median-based coefficient of variation(CVmed)].Additionally,the convergence accommodation to convergence(CA/C)ratio was calculated,and a visual fatigue questionnaire was administered to assess participants’subjective visual comfort.RESULTS:A total of 21 subjects(7 males,14 females;aged 23-41y)with normal binocular visual function were enrolled.The results showed that the TAC increased gradually across the four stages,and these values were Test A(-0.35±0.26 D)<Test B(-0.46±0.24 D)<Test C(-0.77±0.32 D)<Test D(-1.38±0.31 D).There were significant overall differences(F=56.136,P<0.001).Compared with Test C,Test A reduced TAC by 0.42 D(P<0.05),while Test D increased it by 0.61 D(P<0.001).There was no significant intergroup difference in accommodative fluctuation amplitude(all P>0.05),but the fluctuation stability of Test D showed a significant difference between the first 20s and the second 20s(P=0.017).The CA/C ratio was significantly higher in Test D(0.05±0.02 D/△)than in Test A(0.03±0.02 D/△,P=0.007),indicating stronger accommodation-convergence linkage during positive fusional fixation.The visual fatigue scores of all stages were low(median 0-1),with Test D slightly higher than Test B and Test C(P<0.05).No linear correlation was found between TAC and age(all r<0.1,P>0.05).CONCLUSION:Negative fusional C-optotypes induce ciliary muscle relaxation to reduce TAC,while positive fusional C-optotypes enhance accommodation-convergence coordination to increase TAC.The red-blue 3D-based noncontact training mode exhibits good safety(median visual fatigue scores:0-1 across all tests)and provides a novel dual-directional(relaxation-activation)strategy for myopia prevention and control.展开更多
Camouflaged Object Detection(COD)aims to identify objects that share highly similar patterns—such as texture,intensity,and color—with their surrounding environment.Due to their intrinsic resemblance to the backgroun...Camouflaged Object Detection(COD)aims to identify objects that share highly similar patterns—such as texture,intensity,and color—with their surrounding environment.Due to their intrinsic resemblance to the background,camouflaged objects often exhibit vague boundaries and varying scales,making it challenging to accurately locate targets and delineate their indistinct edges.To address this,we propose a novel camouflaged object detection network called Edge-Guided and Multi-scale Fusion Network(EGMFNet),which leverages edge-guided multi-scale integration for enhanced performance.The model incorporates two innovative components:a Multi-scale Fusion Module(MSFM)and an Edge-Guided Attention Module(EGA).These designs exploit multi-scale features to uncover subtle cues between candidate objects and the background while emphasizing camouflaged object boundaries.Moreover,recognizing the rich contextual information in fused features,we introduce a Dual-Branch Global Context Module(DGCM)to refine features using extensive global context,thereby generatingmore informative representations.Experimental results on four benchmark datasets demonstrate that EGMFNet outperforms state-of-the-art methods across five evaluation metrics.Specifically,on COD10K,our EGMFNet-P improves F_(β)by 4.8 points and reduces mean absolute error(MAE)by 0.006 compared with ZoomNeXt;on NC4K,it achieves a 3.6-point increase in F_(β).OnCAMO and CHAMELEON,it obtains 4.5-point increases in F_(β),respectively.These consistent gains substantiate the superiority and robustness of EGMFNet.展开更多
基金sponsored by the National Key Research and Development Program of China(No.2023YFB4606200)Key Program of Science and Technology of Yunnan Province,China (No.202302AB080020)Key Project of Shanghai Zhangjiang National Independent hnovation Demonstration Zone,China(No.ZJ2021-2D-006).
文摘Accurately predicting the State of Health(SOH)of lithium-ion batteries is a critical challenge to ensure their reliability and safety in energy storage systems,such as electric vehicles and renewable energy grids.The intricate battery degradation process is influenced by evolving spatial and temporal interactions among health indicators.Existing methods often fail to capture the dynamic interactions between health indicators over time,resulting in limited predictive accuracy.To address these challenges,we propose a novel framework,Dynamic Graph Learning with Spatial-Temporal Fusion Attention(DGL-STFA),which transforms health indicator series time-data into time-evolving graph representations.The framework employs multi-scale convolutional neural networks to capture diverse temporal patterns,a self-attention mechanism to construct dynamic adjacency matrices that adapt over time,and a temporal attention mechanism to identify and prioritize key moments that influence battery degradation.This combination enables DGL-STFA to effectively model both dynamic spatial relationships and long-term temporal dependencies,enhancing SOH prediction accuracy.Extensive experiments were conducted on the NASA and CALCE battery datasets,comparing this framework with traditional time-series prediction methods and other graph-based prediction methods.The results demonstrate that our framework significantly improves prediction accuracy,with a mean absolute error more than 30%lower than other methods.Further analysis demonstrated the robustness of DGL-STFA across various battery life stages,including early,mid,and end-of-life phases.These results highlight the capability of DGL-STFA to accurately predict SOH,addressing critical challenges in advancing battery health monitoring for energy storage applications.
基金supported by the National Natural Science Foundation of China(42476084,62203456,42276199)the Stable Support Project of National Key Laboratory(WDZC 20245250302)the National Key R&D Program of China(2024YFC2813502,2024YFC2813302)。
文摘Bird's-eye-view(BEV)perception is a core technology for autonomous driving systems.However,existing solutions face the dilemma of high costs associated with multimodal methods and limited performance of vision-only approaches.To address this issue,this paper proposes a framework named“a lightweight pure visual BEV perception method based on dual distillation of spatial-temporal knowledge”.This framework innovatively designs a lightweight vision-only student model based on Res Net,which leverages a dual distillation mechanism to learn from a powerful teacher model that integrates temporal information from both image and light detection and ranging(LiDAR)modalities.Specifically,we distill efficient multi-modal feature extraction and spatial fusion capabilities from the BEVFusion model,and distill advanced temporal information fusion and spatiotemporal attention mechanisms from the BEVFormer model.This dual distillation strategy enables the student model to achieve perception performance close to that of multi-modal models without relying on Li DAR.Experimental results on the nu Scenes dataset demonstrate that the proposed model significantly outperforms classical vision-only algorithms,achieves comparable performance to current state-of-the-art vision-only methods on the nu Scenes detection leaderboard in terms of both mean average precision(mAP)and the nu Scenes detection score(NDS)metrics,and exhibits notable advantages in inference computational efficiency.Although the proposed dual-teacher paradigm incurs higher offline training costs compared to single-model approaches,it yields a streamlined and highly efficient student model suitable for resource-constrained real-time deployment.This provides an effective pathway toward low-cost,high-performance autonomous driving perception systems.
文摘Smart grid substation operations often take place in hazardous environments and pose significant threats to the safety of power personnel.Relying solely on manual supervision can lead to inadequate oversight.In response to the demand for technology to identify improper operations in substation work scenarios,this paper proposes a substation safety action recognition technology to avoid the misoperation and enhance the safety management.In general,this paper utilizes a dual-branch transformer network to extract spatial and temporal information from the video dataset of operational behaviors in complex substation environments.Firstly,in order to capture the spatial-temporal correlation of people's behaviors in smart grid substation,we devise a sparse attention module and a segmented linear attention module that are embedded into spatial branch transformer and temporal branch transformer respectively.To avoid the redundancy of spatial and temporal information,we fuse the temporal and spatial features using a tensor decomposition fusion module by a decoupled manner.Experimental results indicate that our proposed method accurately detects improper operational behaviors in substation work scenarios,outperforming other existing methods in terms of detection and recognition accuracy.
基金supported by the National Natural Science Foundation of China(Nos.U19A2044,42105132,42030609,41975037,and 42105133)the National Key Research and Development Program of China(No.2022YFC3703502)+1 种基金the Plan for Anhui Major Provincial Science&Technology Project(No.202203a07020003)Hefei Ecological Environment Bureau Project(No.2020BFFFD01804).
文摘As a significant city in the Yangtze River Delta regions,Hefei has experienced rapid changes in the sources of air pollution due to its high-speed economic development and urban expansion.However,there has been limited research in recent years on the spatial-temporal distribution and emission of its atmospheric pollutants.To address this,this study conducted mobile observations of urban roads using the Mobile-DOAS instrument from June 2021 to May 2022.The monitoring results exhibit a favourable consistent with TROPOMI satellite data and ground monitoring station data.Temporally,there were pronounced seasonal variations in air pollutants.Spatially,high concentration of HCHO and NO_(2)were closely associated with traffic congestion on roadways,while heightened SO_(2)levels were attributed to winter heating and industrial emissions.The study also revealed that with the implementation of road policies,the average vehicle speed increased by 95.4%,while the NO concentration decreased by 54.4%.In the estimation of urban NO_(x)emission flux,it was observed that in temporal terms,compared with inventory data,the emissions calculated viamobile measurements exhibitedmore distinct seasonal patterns,with the highest emission rate of 349 g/sec in winter and the lowest of 142 g/sec in summer.In spatial terms,the significant difference in emissions between the inner and outer ring roads also suggests the presence of the city’s primary NO_(x)emission sources in the area between these two rings.This study offers data support for formulating the next phase of air pollution control measures in urban areas.
基金supported by the National Office for Philosophy and Social Sciences(grant reference 22&ZD067).
文摘In the current situation of decelerating economic expansion,examining the digital economy(DE)as a novel economic model is beneficial for the local economy’s sustainable and high-quality development(HQD).We analyzed panel data from the Yellow River(YR)region from 2013 to 2021 and discovered notable spatial variances in the composite index and coupling coordination of the two systems.Specifically,the downstream region exhibited the highest coupling coordination,while the upstream region had the lowest.We identified that favorable factors such as economic development,innovation,industrial upgrading,and government intervention can bolster the coupling.Our findings provide a valuable framework for promoting DE and HQD in the YR region.
基金supported by the National Natural Science Foundation of China(Grant Nos.62472149,62376089,62202147)Hubei Provincial Science and Technology Plan Project(2023BCB04100).
文摘Accurate traffic flow prediction has a profound impact on modern traffic management. Traffic flow has complex spatial-temporal correlations and periodicity, which poses difficulties for precise prediction. To address this problem, a Multi-head Self-attention and Spatial-Temporal Graph Convolutional Network (MSSTGCN) for multiscale traffic flow prediction is proposed. Firstly, to capture the hidden traffic periodicity of traffic flow, traffic flow is divided into three kinds of periods, including hourly, daily, and weekly data. Secondly, a graph attention residual layer is constructed to learn the global spatial features across regions. Local spatial-temporal dependence is captured by using a T-GCN module. Thirdly, a transformer layer is introduced to learn the long-term dependence in time. A position embedding mechanism is introduced to label position information for all traffic sequences. Thus, this multi-head self-attention mechanism can recognize the sequence order and allocate weights for different time nodes. Experimental results on four real-world datasets show that the MSSTGCN performs better than the baseline methods and can be successfully adapted to traffic prediction tasks.
基金supported by the National Natural Science Foundation of China(No.62276204)the Fundamental Research Funds for the Central Universities,China(No.YJSJ24011)+1 种基金the Natural Science Basic Research Program of Shaanxi,China(Nos.2022JM-340 and 2023-JC-QN-0710)the China Postdoctoral Science Foundation(Nos.2020T130494 and 2018M633470)。
文摘Visible and infrared(RGB-IR)fusion object detection plays an important role in security,disaster relief,etc.In recent years,deep-learning-based RGB-IR fusion detection methods have been developing rapidly,but still struggle to deal with the complex and changing scenarios captured by drones,mainly due to two reasons:(A)RGB-IR fusion detectors are susceptible to inferior inputs that degrade performance and stability.(B)RGB-IR fusion detectors are susceptible to redundant features that reduce accuracy and efficiency.In this paper,an innovative RGB-IR fusion detection framework based on global-local feature optimization,named GLFDet,is proposed to improve the detection performance and efficiency of drone-captured objects.The key components of GLFDet include a Global Feature Optimization(GFO)module,a Local Feature Optimization(LFO)module and a Channel Separation Fusion(CSF)module.Specifically,GFO calculates the information content of the input image from the frequency domain and optimizes the features holistically.Then,LFO dynamically selects high-value features and filters out low-value features before fusion,which significantly improves the efficiency of fusion.Finally,CSF fuses the RGB and IR features across the corresponding channels,which avoids the rearrangement of the channel relationships and enhances the model stability.Extensive experimental results show that the proposed method achieves the best performance on three popular RGB-IR datasets Drone Vehicle,VEDAI,and LLVIP.In addition,GLFDet is more lightweight than other comparable models,making it more appealing to edge devices such as drones.The code is available at https://github.com/lao chen330/GLFDet.
基金supported by the Beijing Natural Science Foundation(Certificate Number:L234025).
文摘Spatial-temporal traffic prediction technology is crucial for network planning,resource allocation optimizing,and user experience improving.With the development of virtual network operators,multi-operator collaborations,and edge computing,spatial-temporal traffic data has taken on a distributed nature.Consequently,noncentralized spatial-temporal traffic prediction solutions have emerged as a recent research focus.Currently,the majority of research typically adopts federated learning methods to train traffic prediction models distributed on each base station.This method reduces additional burden on communication systems.However,this method has a drawback:it cannot handle irregular traffic data.Due to unstable wireless network environments,device failures,insufficient storage resources,etc.,data missing inevitably occurs during the process of collecting traffic data.This results in the irregular nature of distributed traffic data.Yet,commonly used traffic prediction models such as Recurrent Neural Networks(RNN)and Long Short-Term Memory(LSTM)typically assume that the data is complete and regular.To address the challenge of handling irregular traffic data,this paper transforms irregular traffic prediction into problems of estimating latent variables and generating future traffic.To solve the aforementioned problems,this paper introduces split learning to design a structured distributed learning framework.The framework comprises a Global-level Spatial structure mining Model(GSM)and several Nodelevel Generative Models(NGMs).NGM and GSM represent Seq2Seq models deployed on the base station and graph neural network models deployed on the cloud or central controller.Firstly,the time embedding layer in NGM establishes the mapping relationship between irregular traffic data and regular latent temporal feature variables.Secondly,GSM collects statistical feature parameters of latent temporal feature variables from various nodes and executes graph embedding for spatial-temporal traffic data.Finally,NGM generates future traffic based on latent temporal and spatial feature variables.The introduction of the time attention mechanism enhances the framework’s capability to handle irregular traffic data.Graph attention network introduces spatially correlated base station traffic feature information into local traffic prediction,which compensates for missing information in local irregular traffic data.The proposed framework effectively addresses the distributed prediction issues of irregular traffic data.By testing on real world datasets,the proposed framework improves traffic prediction accuracy by 35%compared to other commonly used distributed traffic prediction methods.
基金Shaanxi Province Qin Chuangyuan“Scientist+Engineer”Team Construction Project(2022KXJ-071)2022 Qin Chuangyuan Achievement Transformation Incubation Capacity Improvement Project(2022JH-ZHFHTS-0012)+8 种基金Shaanxi Province Key Research and Development Plan-“Two Chains”Integration Key Project-Qin Chuangyuan General Window Industrial Cluster Project(2023QCY-LL-02)Xixian New Area Science and Technology Plan(2022-YXYJ-003,2022-XXCY-010)2024 Scientific Research Project of Shaanxi National Defense Industry Vocational and Technical College(Gfy24-07)Shaanxi Vocational and Technical Education Association 2024 Vocational Education Teaching Reform Research Topic(2024SZX354)National Natural Science Foundation of China(U24A20115)2024 Shaanxi Provincial Education Department Service Local Special Scientific Research Program Project-Industrialization Cultivation Project(24JC005,24JC063)Shaanxi Province“14th Five-Year Plan”Education Science Plan,2024 Project(SGH24Y3181)National Key Research and Development Program of China(2023YFB4606400)Longmen Laboratory Frontier Exploration Topics Project(LMQYTSKT003)。
文摘A dual-phase synergistic enhancement method was adopted to strengthen the Al-Mn-Mg-Sc-Zr alloy fabricated by laser powder bed fusion(LPBF)by leveraging the unique advantages of Er and TiB_(2).Spherical powders of 0.5wt%Er-1wt%TiB_(2)/Al-Mn-Mg-Sc-Zr nanocomposite were prepared using vacuum homogenization technique,and the density of samples prepared through the LPBF process reached 99.8%.The strengthening and toughening mechanisms of Er-TiB_(2)were investigated.The results show that Al_(3)Er diffraction peaks are detected by X-ray diffraction analysis,and texture strength decreases according to electron backscatter diffraction results.The added Er and TiB_(2)nano-reinforcing phases act as heterogeneous nucleation sites during the LPBF forming process,hindering grain growth and effectively refining the grains.After incorporating the Er-TiB_(2)dual-phase nano-reinforcing phases,the tensile strength and elongation at break of the LPBF-deposited samples reach 550 MPa and 18.7%,which are 13.4%and 26.4%higher than those of the matrix material,respectively.
基金supported by the National Natural Science Foundation of China(Nos.12405288,12374241,12474484,U2330401,12088101)the Natural Science Foundation of Top Talent of SZTU(No.GDRC202526)。
文摘The process of nuclear fusion in the presence of a laser field was theoretically analyzed.The analysis is applicable to most fusion reactions and different types of currently available intense lasers,from X-ray free-electron lasers to solid-state near-infrared lasers.Laser fields were shown to enhance the fusion yields,and the mechanism of this enhancement was explained.Low-frequency lasers are more efficient in enhancing fusion than high-frequency lasers.The calculation results show enhancements of fusion yields by orders of magnitude with currently available intense low-frequency laser fields.The temperature requirement for controlled nuclear fusion may be reduced with the aid of intense laser fields.
基金funded by the Jilin Provincial Department of Science and Technology,grant number 20230101208JC.
文摘Fault diagnosis of rolling bearings is crucial for ensuring the stable operation of mechanical equipment and production safety in industrial environments.However,due to the nonlinearity and non-stationarity of collected vibration signals,single-modal methods struggle to capture fault features fully.This paper proposes a rolling bearing fault diagnosis method based on multi-modal information fusion.The method first employs the Hippopotamus Optimization Algorithm(HO)to optimize the number of modes in Variational Mode Decomposition(VMD)to achieve optimal modal decomposition performance.It combines Convolutional Neural Networks(CNN)and Gated Recurrent Units(GRU)to extract temporal features from one-dimensional time-series signals.Meanwhile,the Markovian Transition Field(MTF)is used to transform one-dimensional signals into two-dimensional images for spatial feature mining.Through visualization techniques,the effectiveness of generated images from different parameter combinations is compared to determine the optimal parameter configuration.A multi-modal network(GSTCN)is constructed by integrating Swin-Transformer and the Convolutional Block Attention Module(CBAM),where the attention module is utilized to enhance fault features.Finally,the fault features extracted from different modalities are deeply fused and fed into a fully connected layer to complete fault classification.Experimental results show that the GSTCN model achieves an average diagnostic accuracy of 99.5%across three datasets,significantly outperforming existing comparison methods.This demonstrates that the proposed model has high diagnostic precision and good generalization ability,providing an efficient and reliable solution for rolling bearing fault diagnosis.
基金supported via funding from Prince Sattam bin Abdulaziz University project number(PSAU/2025/03/32440).
文摘Parkinson’s disease remains a major clinical issue in terms of early detection,especially during its prodromal stage when symptoms are not evident or not distinct.To address this problem,we proposed a new deep learning 2-based approach for detecting Parkinson’s disease before any of the overt symptoms develop during their prodromal stage.We used 5 publicly accessible datasets,including UCI Parkinson’s Voice,Spiral Drawings,PaHaW,NewHandPD,and PPMI,and implemented a dual stream CNN–BiLSTM architecture with Fisher-weighted feature merging and SHAP-based explanation.The findings reveal that the model’s performance was superior and achieved 98.2%,a F1-score of 0.981,and AUC of 0.991 on the UCI Voice dataset.The model’s performance on the remaining datasets was also comparable,with up to a 2–7 percent betterment in accuracy compared to existing strong models such as CNN–RNN–MLP,ILN–GNet,and CASENet.Across the evidence,the findings back the diagnostic promise of micro-tremor assessment and demonstrate that combining temporal and spatial features with a scatter-based segment for a multi-modal approach can be an effective and scalable platform for an“early,”interpretable PD screening system.
基金supported in part by the National Natural Science Foundation of China(No.12305196)Anhui Provincial Natural Science Foundation(No.2308085QA23)+1 种基金Open Fund of Magnetic confinement Fusion Laboratory of Anhui Province(No.2023AMF03003)Science Foundation of Institute of Plasma Physics,Chinese Academy of Sciences(No.DSJJ-2024-10).
文摘A low-temperature-resistant and high-strength stainless-steel jacket is a key component in the superconducting magnet of a fusion reactor.The development of cryogenic structural materials with high strength and toughness poses a challenge for the future development of high-field superconducting magnets in fusion reactors.The yield strength of the International Thermonuclear Experimental Reactor developed for low-temperature structural materials at 4.2K is below 1100MPa,which fails to meet the demand for structural components with yield strengths exceeding 1500MPa at 4.2K in the future fusion reactors.CHSN01(formerly N50H),which is a low-temperature structural material developed in China,exhibits exceptional strength and toughness,thereby making it highly promising for practical applications.Recently,a 30 t jacket measuring approximately 5000m in total length was produced.Its low-temperature mechanical properties were tested using a sampling method to ensure compliance with application requirements.This paper presents the experimental data of the CHSN01 jacket and tests of the physical properties of the material in the temperature range of 4–300 K.The physical properties were unaffected by magnetic field.Furthermore,this paper discusses the feasibility of employing CHSN01 as a cryogenic structural material capable of withstanding high magnetic fields in next-generation fusion reactors.
基金supported by the National Natural Science Foundation of China(Grant Nos.62572057,62272049,U24A20331)Beijing Natural Science Foundation(Grant Nos.4232026,4242020)Academic Research Projects of Beijing Union University(Grant No.ZK10202404).
文摘Traffic sign detection is a critical component of driving systems.Single-stage network-based traffic sign detection algorithms,renowned for their fast detection speeds and high accuracy,have become the dominant approach in current practices.However,in complex and dynamic traffic scenes,particularly with smaller traffic sign objects,challenges such as missed and false detections can lead to reduced overall detection accuracy.To address this issue,this paper proposes a detection algorithm that integrates edge and shape information.Recognizing that traffic signs have specific shapes and distinct edge contours,this paper introduces an edge feature extraction branch within the backbone network,enabling adaptive fusion with features of the same hierarchical level.Additionally,a shape prior convolution module is designed to replaces the first two convolutional modules of the backbone network,aimed at enhancing the model's perception ability for specific shape objects and reducing its sensitivity to background noise.The algorithm was evaluated on the CCTSDB and TT100k datasets,and compared to YOLOv8s,the mAP50 values increased by 3.0%and 10.4%,respectively,demonstrating the effectiveness of the proposed method in improving the accuracy of traffic sign detection.
文摘AIM:To investigate the effects of binocular fusional C-optotypes(positive/negative)and 2D planar C-optotypes on the amplitude and stability of transient accommodation(TAC)in adults,and to provide a basis for non-contact myopia intervention.METHODS:This was a self-controlled study.Using redblue 3D technology,four experimental stages were set up:Test A[fixating on the 1 m negative fusional C-optotypes,8△base-in(BI)],Test B(fixating on the 5 m planar C-optotypes),Test C(fixating on the 1 m planar C-optotypes),and Test D[fixating on the 1 m positive fusional C-optotypes,20△base-out(BO)].A WAM-5500 open-field autorefractor was used to measure TAC and accommodative microfluctuations[evaluated via interquartile range(IQR)and median-based coefficient of variation(CVmed)].Additionally,the convergence accommodation to convergence(CA/C)ratio was calculated,and a visual fatigue questionnaire was administered to assess participants’subjective visual comfort.RESULTS:A total of 21 subjects(7 males,14 females;aged 23-41y)with normal binocular visual function were enrolled.The results showed that the TAC increased gradually across the four stages,and these values were Test A(-0.35±0.26 D)<Test B(-0.46±0.24 D)<Test C(-0.77±0.32 D)<Test D(-1.38±0.31 D).There were significant overall differences(F=56.136,P<0.001).Compared with Test C,Test A reduced TAC by 0.42 D(P<0.05),while Test D increased it by 0.61 D(P<0.001).There was no significant intergroup difference in accommodative fluctuation amplitude(all P>0.05),but the fluctuation stability of Test D showed a significant difference between the first 20s and the second 20s(P=0.017).The CA/C ratio was significantly higher in Test D(0.05±0.02 D/△)than in Test A(0.03±0.02 D/△,P=0.007),indicating stronger accommodation-convergence linkage during positive fusional fixation.The visual fatigue scores of all stages were low(median 0-1),with Test D slightly higher than Test B and Test C(P<0.05).No linear correlation was found between TAC and age(all r<0.1,P>0.05).CONCLUSION:Negative fusional C-optotypes induce ciliary muscle relaxation to reduce TAC,while positive fusional C-optotypes enhance accommodation-convergence coordination to increase TAC.The red-blue 3D-based noncontact training mode exhibits good safety(median visual fatigue scores:0-1 across all tests)and provides a novel dual-directional(relaxation-activation)strategy for myopia prevention and control.
基金financially supported byChongqingUniversity of Technology Graduate Innovation Foundation(Grant No.gzlcx20253267).
文摘Camouflaged Object Detection(COD)aims to identify objects that share highly similar patterns—such as texture,intensity,and color—with their surrounding environment.Due to their intrinsic resemblance to the background,camouflaged objects often exhibit vague boundaries and varying scales,making it challenging to accurately locate targets and delineate their indistinct edges.To address this,we propose a novel camouflaged object detection network called Edge-Guided and Multi-scale Fusion Network(EGMFNet),which leverages edge-guided multi-scale integration for enhanced performance.The model incorporates two innovative components:a Multi-scale Fusion Module(MSFM)and an Edge-Guided Attention Module(EGA).These designs exploit multi-scale features to uncover subtle cues between candidate objects and the background while emphasizing camouflaged object boundaries.Moreover,recognizing the rich contextual information in fused features,we introduce a Dual-Branch Global Context Module(DGCM)to refine features using extensive global context,thereby generatingmore informative representations.Experimental results on four benchmark datasets demonstrate that EGMFNet outperforms state-of-the-art methods across five evaluation metrics.Specifically,on COD10K,our EGMFNet-P improves F_(β)by 4.8 points and reduces mean absolute error(MAE)by 0.006 compared with ZoomNeXt;on NC4K,it achieves a 3.6-point increase in F_(β).OnCAMO and CHAMELEON,it obtains 4.5-point increases in F_(β),respectively.These consistent gains substantiate the superiority and robustness of EGMFNet.