To improve image quality under low-illumination conditions, this paper proposes a novel low-light image enhancement method based on multi-illumination estimation and multi-scale fusion (MIMS). First, the illumination is processed by contrast-limited adaptive histogram equalization (CLAHE), an adaptive complementary gamma function (ACG), and an adaptive detail-preserving S-curve (ADPS), respectively, to obtain three components. Then, the fusion-relevant features, exposure, and color contrast are selected as the weight maps. Subsequently, these components and weight maps are fused at multiple scales to generate the enhanced illumination. Finally, the enhanced image is obtained by multiplying the enhanced illumination by the reflectance. Compared with existing approaches, the proposed method achieves average increases of 0.81% and 2.89% in the structural similarity index measurement (SSIM) and peak signal-to-noise ratio (PSNR), and decreases of 6.17% and 32.61% in the natural image quality evaluator (NIQE) and gradient magnitude similarity deviation (GMSD), respectively.
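The last step of the MIMS abstract above (multiplying an enhanced illumination by the reflectance) can be illustrated per pixel. This is only a minimal sketch under stated assumptions: the illumination is taken as the maximum RGB channel, and a plain gamma curve stands in for the paper's CLAHE/ACG/ADPS components and multi-scale fusion, none of which are reproduced here. The function name `enhance_pixel` and its parameters are hypothetical.

```python
def enhance_pixel(rgb, gamma=0.5, eps=1e-6):
    """Retinex-style enhancement of one RGB pixel with values in [0, 1].

    Assumptions (not from the paper): illumination = max channel,
    and a simple gamma curve replaces the fused enhanced illumination.
    """
    illumination = max(rgb)
    # Reflectance is the pixel with the illumination divided out.
    reflectance = [c / (illumination + eps) for c in rgb]
    # Brighten the illumination; gamma < 1 lifts dark values.
    enhanced_illumination = illumination ** gamma
    # Recombine: enhanced image = enhanced illumination * reflectance.
    return [min(1.0, r * enhanced_illumination) for r in reflectance]


if __name__ == "__main__":
    print(enhance_pixel((0.04, 0.02, 0.01)))
```

Because only the illumination is modified, the channel ratios of the pixel (its color) are preserved while its brightness increases.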
Images taken in dim environments frequently exhibit insufficient brightness, noise, color shifts, and loss of detail. These problems pose significant challenges to dark image enhancement. Current approaches, while effective in global illumination modeling, often struggle to suppress noise and preserve structural details at the same time, especially under heterogeneous lighting. Furthermore, misalignment between luminance and color channels makes accurate enhancement harder. In response to these difficulties, we introduce a single-stage framework, M2ATNet, built on multi-scale multi-attention and a Transformer architecture. First, to address texture blurring and residual noise, we design a multi-scale multi-attention denoising module (MMAD), which is applied separately to the luminance and color channels to enhance structural and texture modeling. Second, to solve the misalignment of the luminance and color channels, we introduce the multi-channel feature fusion Transformer (CFFT) module, which recovers dark details and corrects color shifts through cross-channel alignment and deep feature interaction. To guide the model to learn more stably and efficiently, we also combine multiple loss functions into a hybrid loss term. We extensively evaluate the proposed method on standard datasets, including LOL-v1, LOL-v2, DICM, LIME, and NPE. Evaluation in terms of numerical metrics and visual quality demonstrates that M2ATNet consistently outperforms existing advanced approaches. Ablation studies further confirm the critical roles played by the MMAD and CFFT modules in detail preservation and visual fidelity under challenging illumination-deficient environments.
Video emotion recognition is widely used because it matches the temporal characteristics of human emotional expression, but existing models have significant shortcomings. On the one hand, Transformer multi-head self-attention modeling of global temporal dependency suffers from high computational overhead and feature similarity. On the other hand, fixed-size convolution kernels are often used, which perceive emotional regions of different scales poorly. Therefore, this paper proposes a video emotion recognition model that combines multi-scale region-aware convolution with temporal interactive sampling. In the spatial dimension, multi-branch large-kernel stripe convolution is used to perceive emotional region features at different scales, and attention weights are generated for each scale feature. In the temporal dimension, multi-layer odd-even down-sampling is performed on the time series, and odd-even sub-sequence interaction is performed to solve the problem of feature similarity, while reducing computational cost owing to the linear relationship between sampling and convolution overhead. The model was tested on CMU-MOSI, CMU-MOSEI, and Hume Reaction, where the Acc-2 reached 83.4%, 85.2%, and 81.2%, respectively. The experimental results show that the model can significantly improve the accuracy of emotion recognition.
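The odd-even down-sampling and sub-sequence interaction described in the abstract above can be illustrated on a plain list of numbers. This is a hedged sketch only: the abstract does not specify the interaction operator, so a fixed additive/subtractive exchange with a hypothetical weight `alpha` is assumed here in place of whatever learned convolutions the model uses.

```python
def split_odd_even(seq):
    """Down-sample a sequence into its even-indexed and odd-indexed halves."""
    return list(seq[0::2]), list(seq[1::2])


def interact(even, odd, alpha=0.5):
    """Exchange information between the two halves.

    Assumption: a fixed linear exchange stands in for the learned
    interaction of the actual model. Unpaired trailing items pass through.
    """
    n = min(len(even), len(odd))
    new_even = [even[i] + alpha * odd[i] for i in range(n)] + even[n:]
    new_odd = [odd[i] - alpha * even[i] for i in range(n)] + odd[n:]
    return new_even, new_odd


if __name__ == "__main__":
    even, odd = split_odd_even([0, 1, 2, 3, 4])
    print(even, odd)            # the two half-length sub-sequences
    print(interact(even, odd))  # each half now carries information from the other
```

Each split halves the sequence length, so stacking several such layers shortens the sequences that later convolutions must process.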
Camouflaged Object Detection (COD) aims to identify objects that share highly similar patterns (such as texture, intensity, and color) with their surrounding environment. Because of their intrinsic resemblance to the background, camouflaged objects often exhibit vague boundaries and varying scales, making it challenging to locate targets accurately and delineate their indistinct edges. To address this, we propose a novel camouflaged object detection network called Edge-Guided and Multi-scale Fusion Network (EGMFNet), which leverages edge-guided multi-scale integration for enhanced performance. The model incorporates two innovative components: a Multi-scale Fusion Module (MSFM) and an Edge-Guided Attention Module (EGA). These designs exploit multi-scale features to uncover subtle cues between candidate objects and the background while emphasizing camouflaged object boundaries. Moreover, recognizing the rich contextual information in fused features, we introduce a Dual-Branch Global Context Module (DGCM) to refine features using extensive global context, thereby generating more informative representations. Experimental results on four benchmark datasets demonstrate that EGMFNet outperforms state-of-the-art methods across five evaluation metrics. Specifically, on COD10K, our EGMFNet-P improves F_(β) by 4.8 points and reduces mean absolute error (MAE) by 0.006 compared with ZoomNeXt; on NC4K, it achieves a 3.6-point increase in F_(β). On CAMO and CHAMELEON, it obtains 4.5-point increases in F_(β). These consistent gains substantiate the superiority and robustness of EGMFNet.
Accurate and efficient detection of building changes in remote sensing imagery is crucial for urban planning, disaster emergency response, and resource management. However, existing methods face challenges such as spectral similarity between buildings and backgrounds, sensor variations, and insufficient computational efficiency. To address these challenges, this paper proposes a novel Multi-scale Efficient Wavelet-based Change Detection Network (MewCDNet), which integrates the advantages of Convolutional Neural Networks and Transformers, balances computational costs, and achieves high-performance building change detection. The network employs EfficientNet-B4 as the backbone for hierarchical feature extraction, integrates multi-level feature maps through a multi-scale fusion strategy, and incorporates two key modules: Cross-temporal Difference Detection (CTDD) and Cross-scale Wavelet Refinement (CSWR). CTDD adopts a dual-branch architecture that combines pixel-wise differencing with semantic-aware Euclidean distance weighting to enhance the distinction between true changes and background noise. CSWR integrates the Haar-based Discrete Wavelet Transform with multi-head cross-attention mechanisms, enabling cross-scale feature fusion while significantly improving edge localization and suppressing spurious changes. Extensive experiments on four benchmark datasets demonstrate MewCDNet's superiority over comparison methods, with F1 scores of 91.54% on LEVIR, 93.70% on WHUCD, and 64.96% on S2Looking for building change detection. Furthermore, MewCDNet exhibits optimal performance on the multi-class SYSU dataset (F1: 82.71%), highlighting its exceptional generalization capability.
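The Haar-based Discrete Wavelet Transform mentioned for the CSWR module above can be shown in its simplest form. The following is a minimal one-level, one-dimensional sketch of the standard orthonormal Haar transform; how MewCDNet applies it to two-dimensional feature maps and combines it with cross-attention is not described in the abstract and is not modeled here. The function name `haar_dwt_1d` is hypothetical.

```python
import math


def haar_dwt_1d(signal):
    """One level of the orthonormal Haar DWT on an even-length sequence.

    Returns (approximation, detail): scaled pairwise sums capture the
    low-frequency content, scaled pairwise differences capture edges.
    """
    if len(signal) % 2 != 0:
        raise ValueError("signal length must be even")
    scale = 1.0 / math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) * scale for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) * scale for i in range(0, len(signal), 2)]
    return approx, detail


if __name__ == "__main__":
    approx, detail = haar_dwt_1d([4.0, 4.0, 2.0, 0.0])
    print(approx, detail)
```

The detail coefficients are zero wherever neighboring samples are equal and nonzero at transitions, which is why wavelet sub-bands are a natural fit for edge localization.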
Tomato is a major economic crop worldwide, and diseases on tomato leaves can significantly reduce both yield and quality. Traditional manual inspection is inefficient and highly subjective, making it difficult to meet the requirements of early disease identification in complex natural environments. To address this issue, this study proposes an improved YOLO11-based model, YOLO-SPDNet (Scale Sequence Fusion, Position-Channel Attention, and Dual Enhancement Network). The model integrates the SEAM (Self-Ensembling Attention Mechanism) semantic enhancement module, the MLCA (Mixed Local Channel Attention) lightweight attention mechanism, and the SPA (Scale-Position-Detail Awareness) module composed of SSFF (Scale Sequence Feature Fusion), TFE (Triple Feature Encoding), and CPAM (Channel and Position Attention Mechanism). These enhancements strengthen fine-grained lesion detection while keeping the model lightweight. Experimental results show that YOLO-SPDNet achieves an accuracy of 91.8%, a recall of 86.5%, and an mAP@0.5 of 90.6% on the test set, with a computational complexity of 12.5 GFLOPs. Furthermore, the model reaches a real-time inference speed of 987 FPS, making it suitable for deployment on mobile agricultural terminals and online monitoring systems. Comparative analysis and ablation studies further validate the reliability and practical applicability of the proposed model in complex natural scenes.
The Qingtongxia Irrigation District in Ningxia is an important hydrological and ecological region. To assess its ecological environment quality from 2001 to 2021 across multiple scales and identify driving factors, a modified remote sensing ecological index (MRSEI) was developed by incorporating evapotranspiration. Spatial and temporal patterns were analyzed using the coefficient of variation, spatial autocorrelation, and semi-variogram methods, while influencing factors were explored via the optimal parameter geographical detector model. The MRSEI's first principal component loadings and rankings aligned with those of RSEI (average contribution: 81.31%), effectively reflecting spatiotemporal variations. At sub-irrigation district and landscape scales, ecological quality was slightly lower than at the district level but remained stable. Moderate and good ecological grades accounted for 36.28% and 33.38% of the area, respectively, at the district scale, and the moderate grade reached 70.48% on smaller scales. Spatial heterogeneity intensified with decreasing scale, and human activity lost explanatory power below a 5 km range. Human factors mainly drove ecological differentiation at the district scale, while natural factors dominated at finer scales. The MRSEI offers a novel tool for ecological assessment in arid/semi-arid areas and supports scale-adapted ecological protection strategies.
Semantic segmentation for mixed scenes of aerial remote sensing and road traffic is one of the key technologies for visual perception in flying cars. State-of-the-art (SOTA) semantic segmentation methods have made remarkable achievements in both fine-grained segmentation and real-time performance. However, they still struggle with the huge differences in scale and semantic categories found in mixed scenes of aerial remote sensing and road traffic, and there is little related research. To address this issue, this paper proposes a semantic segmentation model specifically for mixed datasets of aerial remote sensing and road traffic scenes. First, a novel decoding-recoding multi-scale feature iterative refinement structure is proposed, which uses the re-integration and continuous enhancement of multi-scale information to deal with the huge scale differences between cross-domain scenes, while using a fully convolutional structure to meet the lightweight and real-time requirements. Second, a well-designed cross-window attention mechanism combined with a global information integration decoding block forms an enhanced global context perception, which can capture the long-range dependencies and multi-scale global context information of different scenes, thereby achieving fine-grained semantic segmentation. The proposed method is tested on a large-scale mixed dataset of aerial remote sensing and road traffic scenes. The results confirm that it can deal effectively with large scale differences in cross-domain scenes. Its segmentation accuracy surpasses that of the SOTA methods while meeting the real-time requirements.
Distributed Denial of Service (DDoS) attacks are among the most severe threats to network infrastructure, and their evolving complexity sometimes lets them bypass traditional detection algorithms. Current Machine Learning (ML) techniques for DDoS attack detection normally rely on statistical features of network traffic, such as packet sizes and inter-arrival times. However, such techniques sometimes fail to capture complicated relations among traffic flows. In this paper, we present a new multi-scale ensemble strategy based on Graph Neural Networks (GNNs) for improving DDoS detection. Our technique divides traffic into macro- and micro-level elements, allowing different GNN models to capture both coarse-scale anomalies and subtle, stealthy attack patterns. By modeling network traffic as graph-structured data, GNNs efficiently learn intricate relations among network entities. The proposed ensemble learning algorithm combines the results of several GNNs to improve generalization, robustness, and scalability. Extensive experiments on three benchmark datasets (UNSW-NB15, CICIDS2017, and CICDDoS2019) show that our approach outperforms traditional machine learning and deep learning models in detecting both high-rate and low-rate (stealthy) DDoS attacks, with significant improvements in accuracy and recall. These findings demonstrate the proposed method's applicability and robustness for real-world deployment in contexts where several DDoS patterns coexist.
One-dimensional ensemble dispersion entropy (EDE_(1D)) is an effective nonlinear dynamic analysis method for measuring the complexity of time series. However, it is restricted to assessing the complexity of one-dimensional time series (TS_(1D)), and it extracts complexity features at only a single scale. To address these problems, a new nonlinear dynamic analysis method termed two-dimensional composite multi-scale ensemble Gramian dispersion entropy (CMEGDE_(2D)) is proposed in this paper. First, the TS_(1D) is transformed by Gramian angular fields (GAF) into a two-dimensional image (I_(2D)) with richer internal data structures and geometric features, which preserves the global characteristics and time dependence of vibration signals. Second, the I_(2D) is analyzed at multiple scales through the composite coarse-graining method, which overcomes the limitation of a single scale and provides greater stability than traditional coarse-graining methods. Subsequently, a new fault diagnosis method for rolling bearings is proposed, using CMEGDE_(2D) for fault feature extraction and a chicken swarm algorithm optimized support vector machine (CSO-SVM) for fault pattern identification. Simulation signals and two rolling bearing data sets are used to verify the effectiveness of the proposed fault diagnosis method. The results demonstrate that the proposed method has stronger discrimination ability, higher fault diagnosis accuracy, and better stability than the compared methods.
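The first step in the abstract above, turning a one-dimensional series into a two-dimensional image with a Gramian angular field, can be sketched with the commonly used summation variant (GASF). Whether the paper uses the summation or the difference variant is not stated, so the choice here is an assumption, as is the function name.

```python
import math


def gramian_angular_summation_field(series):
    """Encode a 1-D series as a 2-D matrix G[i][j] = cos(phi_i + phi_j).

    The series is first rescaled to [-1, 1]; each value x is then mapped
    to the angle phi = arccos(x). A constant series is mapped to zeros.
    """
    lo, hi = min(series), max(series)
    if hi == lo:
        scaled = [0.0] * len(series)
    else:
        scaled = [2.0 * (x - lo) / (hi - lo) - 1.0 for x in series]
    # Clamp against floating-point drift before taking arccos.
    phi = [math.acos(max(-1.0, min(1.0, x))) for x in scaled]
    return [[math.cos(a + b) for b in phi] for a in phi]


if __name__ == "__main__":
    for row in gramian_angular_summation_field([0.0, 1.0, 2.0]):
        print([round(v, 3) for v in row])
```

The resulting matrix is symmetric, and moving along its main diagonal follows the original time order, which is how the image retains the temporal dependence of the signal.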
Defect detection in printed circuit boards (PCB) remains challenging due to the difficulty of identifying small-scale defects, the inefficiency of conventional approaches, and the interference from complex backgrounds. To address these issues, this paper proposes SIM-Net, an enhanced detection framework derived from YOLOv11. The model integrates SPDConv to preserve fine-grained features for small object detection, introduces a novel convolutional partial attention module (C2PAM) to suppress redundant background information and highlight salient regions, and employs a multi-scale fusion network (MFN) with a multi-grain contextual module (MGCT) to strengthen contextual representation and accelerate inference. Experimental evaluations demonstrate that SIM-Net achieves 92.4% mAP, 92% accuracy, and 89.4% recall with an inference speed of 75.1 FPS, outperforming existing state-of-the-art methods. These results confirm the robustness and real-time applicability of SIM-Net for PCB defect inspection.
Advanced healthcare monitors for air pollution applications face a significant challenge in balancing high-performance filtration with multifunctional smart integration. Electrospun triboelectric nanogenerators (TENG) show significant potential for use under such difficult circumstances. We have constructed a high-performance TENG using a novel multi-scale nanofiber architecture. Nylon 66 (PA66) and chitosan quaternary ammonium salt (HACC) composites were prepared by electrospinning, forming PA66/H multi-scale nanofiber membranes composed of nanofibers (≈73 nm) and submicron fibers (≈123 nm). The PA66/H multi-scale nanofiber membrane serves as the positive electrode and an electrospun PVDF-HFP nanofiber membrane as the negative electrode, together forming a respiration-driven PVDF-HFP@PA66/H TENG. The resulting PVDF-HFP@PA66/H TENG-based air filter uses electrostatic adsorption and physical interception mechanisms, achieving a PM_(0.3) filtration efficiency over 99% with a pressure drop of only 48 Pa. In addition, the PVDF-HFP@PA66/H TENG exhibits excellent stability in high-humidity environments, with filtration efficiency reduced by less than 1%. At the same time, the TENG is self-powered through periodic contact and separation driven by breathing, which helps keep the filtration efficiency stable over the long term. Beyond air filtration, the TENG can also monitor health in real time by capturing human breathing signals without an external power supply. This integrated system combines high-efficiency air filtration, self-powered operation, and health monitoring, presenting an innovative solution for air purification, smart protective equipment, and portable health monitoring. These findings highlight the potential of this technology for diverse applications, offering a promising direction for advancing multifunctional air filtration systems.
The development of metallic mineral resources generates a significant amount of solid waste, such as tailings and waste rock. Cemented tailings and waste-rock backfill (CTWB) is an effective method for managing and disposing of this mining waste. This study employs a macro-meso-micro testing method to investigate the effects of the waste rock grading index (WGI) and loading rate (LR) on the uniaxial compressive strength (UCS), pore structure, and micromorphology of CTWB materials. Pore structures were analyzed using scanning electron microscopy (SEM) and mercury intrusion porosimetry (MIP). The particles (pores) and cracks analysis system (PCAS) software was used to quantitatively characterize the multi-scale micropores in the SEM images. The key findings indicate that the macroscopic results (UCS) of CTWB materials correspond to the microscopic results (pore structure and micromorphology). Changes in porosity largely depend on the waste rock grading index and loading rate. The inclusion of waste rock initially increases and then decreases the UCS, while porosity first decreases and then increases, with a critical waste rock grading index of 0.6. As the loading rate increases, UCS initially rises and then falls, while porosity gradually increases. Based on MIP and SEM results, at a waste rock grading index of 0.6, the most probable pore diameters, total pore area (TPA), pore number (PN), maximum pore area (MPA), and area probability distribution index (APDI) are minimized, while the average pore form factor (APF) and fractal dimension of pore porosity distribution (FDPD) are maximized, indicating the most compact pore structure. At a loading rate of 12.0 mm/min, the most probable pore diameters, TPA, PN, MPA, APF, and APDI reach their maximum values, while FDPD reaches its minimum value. Finally, the mechanism of CTWB materials during compression is analyzed, based on the quantitative results of UCS and porosity. The research findings play a crucial role in ensuring the successful application of CTWB materials in deep metal mines.
Orthogonal time frequency space (OTFS) modulation is a novel modulation scheme that can cope effectively with the high Doppler spread caused by high mobility. Because it modulates data in the delay-Doppler (DD) domain and makes full use of the sparsity of that domain, it has been widely studied for designing efficient channel estimation and signal detection schemes. In this paper, we design a novel superimposed pilot pattern with a transition band, which replaces the guard zero-symbols of the traditional embedded pilot (EP), and perform a two-stage channel estimation. In the first stage, we exploit the dispersion characteristics of the OTFS signal in the DD domain and use a threshold decision to obtain a coarse channel estimate. In the second stage, we use the coarse estimate for iterative signal detection and accurate channel estimation. During the second stage, we exploit the sparsity of the channel in the DD domain, remodel the received signal as a sparse channel vector multiplied by a channel coefficient matrix, and introduce a Doppler index segmentation factor (DISF) to subdivide the Doppler index and thereby handle fractional Doppler. Simulations reveal that the proposed scheme has higher spectral efficiency than the traditional EP scheme and a lower peak-to-average power ratio (PAPR) than the traditional superimposed pilot scheme.
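The peak-to-average power ratio (PAPR) used as a comparison metric in the abstract above has a simple standard definition: the peak instantaneous power of the transmitted samples divided by their mean power, usually quoted in dB. A minimal sketch follows; the function name `papr_db` is hypothetical and the sample values are illustrative, not taken from the paper.

```python
import math


def papr_db(samples):
    """Peak-to-average power ratio of real or complex samples, in dB."""
    powers = [abs(s) ** 2 for s in samples]
    mean_power = sum(powers) / len(powers)
    if mean_power == 0:
        raise ValueError("signal has zero power")
    return 10.0 * math.log10(max(powers) / mean_power)


if __name__ == "__main__":
    print(papr_db([1 + 1j, 1 - 1j, -1 + 1j, -1 - 1j]))  # constant-envelope symbols
    print(papr_db([2.0, 0.0, 0.0, 0.0]))                # one dominant sample
```

A high-power pilot superimposed on data raises the peak relative to the mean, which is why pilot design affects this metric.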
This study proposes a multi-scale simplified residual convolutional neural network (MS-SRCNN) for the precise prediction of Mg-Nd binary alloy compositions from scanning electron microscope (SEM) images. A multi-scale data structure is established by spatially aligning and stacking SEM images at different magnifications. The MS-SRCNN reduces computational runtime by over 90% compared with traditional architectures such as ResNet50, VGG16, and VGG19, without compromising prediction accuracy. The model also delivers better predictive performance, achieving a >5% increase in R^(2) compared with single-scale models. Furthermore, the MS-SRCNN exhibits robust composition prediction capability across other Mg-based binary alloys, including Mg-La, Mg-Sn, Mg-Ce, Mg-Sm, Mg-Ag, and Mg-Y, which underscores its generalization and extrapolation potential. This research establishes a non-destructive, microstructure-informed composition analysis framework, reduces characterization time compared with traditional experimental methods, and provides insights into the composition-microstructure relationship in diverse material systems.
With the rapid expansion of drone applications, accurate detection of objects in aerial imagery has become crucial for intelligent transportation, urban management, and emergency rescue missions. However, existing methods face numerous challenges in practical deployment, including scale variation handling, feature degradation, and complex backgrounds. To address these issues, we propose Edge-enhanced and Detail-Capturing You Only Look Once (EHDC-YOLO), a novel framework for object detection in Unmanned Aerial Vehicle (UAV) imagery. Based on the You Only Look Once version 11 nano (YOLOv11n) baseline, EHDC-YOLO systematically introduces several architectural enhancements: (1) a Multi-Scale Edge Enhancement (MSEE) module that leverages multi-scale pooling and edge information to enhance boundary feature extraction; (2) an Enhanced Feature Pyramid Network (EFPN) that integrates P2-level features with Cross Stage Partial (CSP) structures and OmniKernel convolutions for better fine-grained representation; and (3) a Dynamic Head (DyHead) with multi-dimensional attention mechanisms for enhanced cross-scale modeling and perspective adaptability. Comprehensive experiments on the Vision meets Drones for Detection (VisDrone-DET) 2019 dataset demonstrate that EHDC-YOLO achieves significant improvements, increasing mean Average Precision (mAP)@0.5 from 33.2% to 46.1% (an absolute improvement of 12.9 percentage points) and mAP@0.5:0.95 from 19.5% to 28.0% (an absolute improvement of 8.5 percentage points) compared with the YOLOv11n baseline, while maintaining a reasonable parameter count (2.81 M vs. the baseline's 2.58 M). Further ablation studies confirm the effectiveness of each proposed component, while visualization results highlight EHDC-YOLO's superior performance in detecting objects and handling occlusions in complex drone scenarios.
Impact craters are important for understanding lunar geologic evolution and surface erosion rates, among other functions. However, micro impact craters are numerous and their morphological characteristics are not obvious, resulting in low detection accuracy by deep learning models. Therefore, we propose a new multi-scale fusion crater detection algorithm (MSF-CDA) based on YOLO11 to improve the accuracy of lunar impact crater detection, especially for small craters with a diameter of <1 km. Using images taken by the LROC (Lunar Reconnaissance Orbiter Camera) at the Chang'e-4 (CE-4) landing area, we constructed three separate datasets for craters with diameters of 0-70 m, 70-140 m, and >140 m. We then trained three submodels separately with these three datasets. Additionally, we designed a slicing-amplifying-slicing strategy to enhance the ability to extract features from small craters. To handle redundant predictions, we proposed a new Non-Maximum Suppression with Area Filtering method to fuse the results for overlapping targets across the multi-scale submodels. Our MSF-CDA method achieved high detection performance, with Precision, Recall, and F1 score values of 0.991, 0.987, and 0.989, respectively, effectively addressing the problems caused by the sparse features and sample imbalance of small craters. MSF-CDA can provide strong data support for more in-depth study of the geological evolution of the lunar surface and for finer geological age estimation. The strategy can also be used to detect other small objects with sparse features and sample imbalance problems. We detected approximately 500,000 impact craters in an area of approximately 214 km^(2) around the CE-4 landing area. By statistically analyzing the new data, we updated the distribution function of the number and diameter of impact craters. Finally, we identified the most suitable lighting conditions for detecting impact crater targets by analyzing the effect of different lighting conditions on detection accuracy.
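The abstract above names a Non-Maximum Suppression with Area Filtering step for merging submodel outputs but does not define it. The following is a hedged sketch of one plausible reading, not the authors' algorithm: detections whose box area lies outside an allowed range are dropped, and standard greedy IoU-based NMS is applied to the rest. The function names, the `(x1, y1, x2, y2, score)` tuple layout, and all thresholds are assumptions.

```python
def box_area(box):
    """Area of an axis-aligned box (x1, y1, x2, y2)."""
    return max(0.0, box[2] - box[0]) * max(0.0, box[3] - box[1])


def iou(a, b):
    """Intersection over union of two boxes (x1, y1, x2, y2)."""
    inter = box_area((max(a[0], b[0]), max(a[1], b[1]),
                      min(a[2], b[2]), min(a[3], b[3])))
    union = box_area(a) + box_area(b) - inter
    return inter / union if union > 0 else 0.0


def nms_with_area_filter(detections, iou_thr=0.5,
                         min_area=0.0, max_area=float("inf")):
    """Greedy NMS over (x1, y1, x2, y2, score) tuples after an area filter."""
    # Step 1 (assumed): discard boxes outside the allowed area range.
    candidates = [d for d in detections
                  if min_area <= box_area(d[:4]) <= max_area]
    # Step 2: keep boxes in descending score order unless they overlap
    # an already kept box by at least iou_thr.
    candidates.sort(key=lambda d: d[4], reverse=True)
    kept = []
    for det in candidates:
        if all(iou(det[:4], k[:4]) < iou_thr for k in kept):
            kept.append(det)
    return kept


if __name__ == "__main__":
    dets = [(0, 0, 10, 10, 0.90), (1, 1, 11, 11, 0.80),
            (50, 50, 52, 52, 0.70), (100, 100, 200, 200, 0.95)]
    print(nms_with_area_filter(dets, min_area=10, max_area=1000))
```

In a multi-submodel setting, one area range per submodel would restrict each model to the crater sizes it was trained on before the overlapping results are merged.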
In GNSS-denied environments, signals of opportunity (SOP) offer an efficient and passive solution for navigation and positioning by using ambient signals. Nevertheless, conventional SOP techniques face significant challenges in real-time processing, especially under sub-Nyquist sampling conditions, because of high data acquisition rates and off-grid errors. To address this, this paper proposes the signal reconstruction and kernel sparse encoding (SRKSE) model, a novel general framework for high-precision parameter estimation. By combining compressed sensing with a deep unfolding network, the SRKSE model not only achieves robust signal reconstruction but also reduces quantization errors. Key innovations of SRKSE include dual cross-attention mechanisms for enhanced feature extraction, sinc sparse kernel encoding to minimize quantization errors, and a custom loss function for balanced optimization. With these advancements, SRKSE achieves up to a 650-fold improvement in time of arrival (TOA) estimation accuracy while operating at just 1% of the Nyquist sampling rate. SRKSE surpasses both conventional and deep learning-based techniques in accuracy and efficiency, especially under sub-Nyquist sampling conditions. Simulations and real-world experiments confirm the reliability and potential of SRKSE for real-time applications in IoT and wireless communication.
The growing use of lithium-ion batteries in electric transportation and grid-scale storage systems has intensified the need for accurate and highly generalizable state-of-health (SOH) estimation. Conventional approaches often suffer from reduced accuracy under dynamically uncertain state-of-charge (SOC) operating ranges and heterogeneous aging stresses. This study presents a unified SOH estimation framework that integrates physics-informed modeling, subspace identification, and Transformer-based learning. A reduced-order model is derived from simplified electrochemical dynamics, providing an interpretable and computationally efficient representation of battery behavior. Subspace identification across a wide SOC and SOH range yields degradation-sensitive features, which the Transformer uses to capture long-range aging dynamics via multi-head self-attention. Experiments on LiFePO_(4) cells under joint-cell training show consistently accurate SOH estimation, with a maximum error of 1.39%, demonstrating the framework's effectiveness in decoupling SOC and SOH effects. In cross-cell validation, where training and validation are performed on different cells, the model maintains a maximum error of 2.06%, confirming strong generalization to unseen aging trajectories. Comparative experiments on LiFePO_(4) and public LiCoO_(2) datasets confirm the framework's cross-chemistry applicability. By extracting low-dimensional, physically interpretable features via subspace identification, the framework significantly reduces training cost while maintaining high SOH estimation accuracy, outperforming conventional data-driven models that lack physical guidance.
Presented in this study is a novel method for estimating the depth of a single underwater source in shallow water using vector sensors. The approach leverages the depth distribution of the broadband Stokes parameters to estimate source depth accurately. Unlike traditional matched field processing (MFP) and matched mode processing (MMP), the proposed approach can estimate source depth directly from the data received by sensors without requiring complete environmental information. First, the broadband Stokes parameters (BSP) are established using normal mode theory. Then the nonstationary phase approximation is used to simplify the theoretical derivation, which is necessary when dealing with broadband integrals. Additionally, the range terms of the BSP are eliminated by normalization. Analysis of the depth distribution of the normalized broadband Stokes parameters (NBSP) shows that the NBSP exhibit extreme values at the source depth, which can be used for source depth estimation. The proposed depth estimation method is therefore based on searching for the peaks of the NBSP. Simulations show that this method is effective in relatively simple shallow water environments. Finally, the effects of source range, frequency bandwidth, sound speed profile (SSP), water depth, and signal-to-noise ratio (SNR) are studied. The findings indicate that the proposed method can accurately estimate the source depth when the SNR is greater than -5 dB and does not need to consider model mismatch issues. Additionally, variations in environmental parameters have minimal impact on estimation accuracy. Compared to MFP, the proposed method requires a higher SNR but demonstrates superior robustness against fluctuations in environmental parameters.
Funding: Supported by the National Key R&D Program of China (No. 2022YFB3205101) and NSAF (No. U2230116).
Abstract: To improve image quality under low illumination conditions, a novel low-light image enhancement method based on multi-illumination estimation and multi-scale fusion (MIMS) is proposed in this paper. First, the illumination is processed by contrast-limited adaptive histogram equalization (CLAHE), an adaptive complementary gamma function (ACG), and an adaptive detail-preserving S-curve (ADPS), respectively, to obtain three components. Then, the fusion-relevant features, exposure, and color contrast are selected as the weight maps. Subsequently, these components and weight maps are fused at multiple scales to generate the enhanced illumination. Finally, the enhanced images are obtained by multiplying the enhanced illumination and the reflectance. Compared with existing approaches, the proposed method achieves an average increase of 0.81% and 2.89% in the structural similarity index measurement (SSIM) and peak signal-to-noise ratio (PSNR), and a decrease of 6.17% and 32.61% in the natural image quality evaluator (NIQE) and gradient magnitude similarity deviation (GMSD), respectively.
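The last step of the abstract above (enhanced image = enhanced illumination × reflectance) can be sketched in a few lines. This is only an illustration under stated assumptions: the illumination is taken as the per-pixel channel maximum, and a single fixed gamma curve stands in for the paper's CLAHE, ACG, and ADPS branches and their multi-scale fusion. The function names and the `gamma` value are hypothetical and do not come from the paper.

```python
def enhance_pixel(rgb, gamma=0.5, eps=1e-6):
    """Brighten one RGB pixel (floats in [0, 1]) via an illumination/reflectance split."""
    illum = max(rgb)                      # assumed illumination estimate: channel maximum
    if illum < eps:                       # fully black pixel: nothing to recover
        return (0.0, 0.0, 0.0)
    reflect = [c / illum for c in rgb]    # reflectance = image / illumination
    enhanced_illum = illum ** gamma       # stand-in for the fused, enhanced illumination
    # Recombine: enhanced image = enhanced illumination * reflectance
    return tuple(min(1.0, r * enhanced_illum) for r in reflect)


def enhance_image(image, gamma=0.5):
    """Apply enhance_pixel to a nested list of RGB tuples (rows of pixels)."""
    return [[enhance_pixel(p, gamma) for p in row] for row in image]


dark = [[(0.04, 0.02, 0.01), (0.0, 0.0, 0.0)]]
bright = enhance_image(dark)
```

Because only the illumination is remapped, the ratios between the colour channels of each pixel are unchanged, which is the usual motivation for working on illumination rather than on the RGB values directly.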
Funding: Funded by the National Natural Science Foundation of China, grant numbers 52374156 and 62476005.
Abstract: Images taken in dim environments frequently exhibit issues such as insufficient brightness, noise, color shifts, and loss of detail. These problems pose significant challenges to dark image enhancement tasks. Current approaches, while effective in global illumination modeling, often struggle to simultaneously suppress noise and preserve structural details, especially under heterogeneous lighting. Furthermore, misalignment between luminance and color channels introduces additional challenges to accurate enhancement. In response to these difficulties, we introduce a single-stage framework, M2ATNet, built on multi-scale multi-attention and a Transformer architecture. First, to address texture blurring and residual noise, we design a multi-scale multi-attention denoising module (MMAD), which is applied separately to the luminance and color channels to enhance structural and texture modeling capabilities. Second, to solve the misalignment of the luminance and color channels, we introduce the multi-channel feature fusion Transformer (CFFT) module, which effectively recovers dark details and corrects color shifts through cross-channel alignment and deep feature interaction. To guide the model to learn more stably and efficiently, we also fuse multiple types of loss functions into a hybrid loss term. We extensively evaluate the proposed method on various standard datasets, including LOL-v1, LOL-v2, DICM, LIME, and NPE. Evaluations in terms of numerical metrics and visual quality demonstrate that M2ATNet consistently outperforms existing advanced approaches. Ablation studies further confirm the critical roles played by the MMAD and CFFT modules in detail preservation and visual fidelity under challenging illumination-deficient environments.
Funding: Supported in part by the National Natural Science Foundation of China under Grants 62272236 and 62376128, and in part by the Natural Science Foundation of Jiangsu Province under Grants BK20201136 and BK20191401.
Abstract: Video emotion recognition is widely used because it aligns with the temporal characteristics of human emotional expression, but existing models have significant shortcomings. On the one hand, Transformer multi-head self-attention modeling of global temporal dependency suffers from high computational overhead and feature similarity. On the other hand, fixed-size convolution kernels are often used, which have weak perception ability for emotional regions of different scales. Therefore, this paper proposes a video emotion recognition model that combines multi-scale region-aware convolution with temporal interactive sampling. In the spatial dimension, multi-branch large-kernel stripe convolution is used to perceive emotional region features at different scales, and attention weights are generated for each scale feature. In the temporal dimension, multi-layer odd-even down-sampling is performed on the time series, and odd-even sub-sequence interaction is performed to solve the problem of feature similarity, while reducing computational cost owing to the linear relationship between sampling and convolution overhead. The model was tested on CMU-MOSI, CMU-MOSEI, and Hume Reaction, where Acc-2 reached 83.4%, 85.2%, and 81.2%, respectively. The experimental results show that the model can significantly improve the accuracy of emotion recognition.
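The multi-layer odd-even down-sampling mentioned in the abstract above can be illustrated with plain list slicing. This is a minimal sketch of the splitting step only; the learned interaction between odd and even sub-sequences is not specified in the abstract and is not modelled here. The function names are hypothetical.

```python
def odd_even_split(seq):
    """Split a sequence into its even-indexed and odd-indexed sub-sequences."""
    return seq[0::2], seq[1::2]


def multilayer_split(seq, layers):
    """Apply odd-even splitting repeatedly; each layer halves the sub-sequence length."""
    level = [list(seq)]
    for _ in range(layers):
        next_level = []
        for sub in level:
            even, odd = odd_even_split(sub)
            next_level.extend([even, odd])
        level = next_level
    return level


frames = list(range(8))              # stand-in for 8 time steps of features
print(multilayer_split(frames, 2))   # four sub-sequences of length 2
```

Each layer doubles the number of sub-sequences while halving their length, so any per-sub-sequence operation whose cost is linear in length keeps the same total cost per layer.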
Funding: Financially supported by the Chongqing University of Technology Graduate Innovation Foundation (Grant No. gzlcx20253267).
Abstract: Camouflaged Object Detection (COD) aims to identify objects that share highly similar patterns (such as texture, intensity, and color) with their surrounding environment. Due to their intrinsic resemblance to the background, camouflaged objects often exhibit vague boundaries and varying scales, making it challenging to accurately locate targets and delineate their indistinct edges. To address this, we propose a novel camouflaged object detection network called Edge-Guided and Multi-scale Fusion Network (EGMFNet), which leverages edge-guided multi-scale integration for enhanced performance. The model incorporates two innovative components: a Multi-scale Fusion Module (MSFM) and an Edge-Guided Attention Module (EGA). These designs exploit multi-scale features to uncover subtle cues between candidate objects and the background while emphasizing camouflaged object boundaries. Moreover, recognizing the rich contextual information in fused features, we introduce a Dual-Branch Global Context Module (DGCM) to refine features using extensive global context, thereby generating more informative representations. Experimental results on four benchmark datasets demonstrate that EGMFNet outperforms state-of-the-art methods across five evaluation metrics. Specifically, on COD10K, our EGMFNet-P improves F_β by 4.8 points and reduces mean absolute error (MAE) by 0.006 compared with ZoomNeXt; on NC4K, it achieves a 3.6-point increase in F_β. On CAMO and CHAMELEON, it obtains 4.5-point increases in F_β. These consistent gains substantiate the superiority and robustness of EGMFNet.
Funding: Supported by the Henan Province Key R&D Project under Grant 241111210400; the Henan Provincial Science and Technology Research Project under Grants 252102211047, 252102211062, 252102211055, and 232102210069; the Jiangsu Provincial Scheme Double Initiative Plan JSS-CBS20230474; the XJTLU RDF-21-02-008; the Science and Technology Innovation Project of Zhengzhou University of Light Industry under Grant 23XNKJTD0205; and the Higher Education Teaching Reform Research and Practice Project of Henan Province under Grant 2024SJGLX0126.
Abstract: Accurate and efficient detection of building changes in remote sensing imagery is crucial for urban planning, disaster emergency response, and resource management. However, existing methods face challenges such as spectral similarity between buildings and backgrounds, sensor variations, and insufficient computational efficiency. To address these challenges, this paper proposes a novel Multi-scale Efficient Wavelet-based Change Detection Network (MewCDNet), which integrates the advantages of Convolutional Neural Networks and Transformers, balances computational costs, and achieves high-performance building change detection. The network employs EfficientNet-B4 as the backbone for hierarchical feature extraction, integrates multi-level feature maps through a multi-scale fusion strategy, and incorporates two key modules: Cross-temporal Difference Detection (CTDD) and Cross-scale Wavelet Refinement (CSWR). CTDD adopts a dual-branch architecture that combines pixel-wise differencing with semantic-aware Euclidean distance weighting to enhance the distinction between true changes and background noise. CSWR integrates the Haar-based Discrete Wavelet Transform with multi-head cross-attention mechanisms, enabling cross-scale feature fusion while significantly improving edge localization and suppressing spurious changes. Extensive experiments on four benchmark datasets demonstrate MewCDNet's superiority over comparison methods, with F1 scores of 91.54% on LEVIR, 93.70% on WHUCD, and 64.96% on S2Looking for building change detection. Furthermore, MewCDNet exhibits optimal performance on the multi-class SYSU dataset (F1: 82.71%), highlighting its exceptional generalization capability.
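The Haar-based Discrete Wavelet Transform referred to in the abstract above can be illustrated with a single-level 2D decomposition. The sketch below uses the standard orthonormal Haar filters on non-overlapping 2×2 blocks and is not taken from the paper; how CSWR feeds the sub-bands into cross-attention is not described in the abstract. The function name and the sub-band keys are hypothetical.

```python
def haar_dwt2(image):
    """One-level orthonormal 2D Haar transform of a nested list with even height and width.

    Returns four half-resolution sub-bands: the approximation plus the
    column-difference, row-difference, and diagonal detail bands.
    """
    rows, cols = len(image), len(image[0])
    if rows % 2 or cols % 2:
        raise ValueError("image height and width must be even")
    bands = {"approx": [], "col_diff": [], "row_diff": [], "diag": []}
    for i in range(0, rows, 2):
        out = {name: [] for name in bands}
        for j in range(0, cols, 2):
            a, b = image[i][j], image[i][j + 1]
            c, d = image[i + 1][j], image[i + 1][j + 1]
            out["approx"].append((a + b + c + d) / 2)    # low-pass in both directions
            out["col_diff"].append((a - b + c - d) / 2)  # difference between columns
            out["row_diff"].append((a + b - c - d) / 2)  # difference between rows
            out["diag"].append((a - b - c + d) / 2)      # diagonal detail
        for name in bands:
            bands[name].append(out[name])
    return bands


block = [[1, 1],
         [3, 3]]
print(haar_dwt2(block))
```

In this example the only non-zero detail band is `row_diff`, because the block changes from top to bottom but not from left to right; the detail bands respond to edges, which is why wavelet sub-bands are commonly used for edge localization.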
Funding: Tianmin Tianyuan Boutique Vegetable Industry Technology Service Station (Grant No. 2024120011003081); Development of Environmental Monitoring and Traceability System for Wuqing Agricultural Production Areas (Grant No. 2024120011001866).
Abstract: Tomato is a major economic crop worldwide, and diseases on tomato leaves can significantly reduce both yield and quality. Traditional manual inspection is inefficient and highly subjective, making it difficult to meet the requirements of early disease identification in complex natural environments. To address this issue, this study proposes an improved YOLO11-based model, YOLO-SPDNet (Scale Sequence Fusion, Position-Channel Attention, and Dual Enhancement Network). The model integrates the SEAM (Self-Ensembling Attention Mechanism) semantic enhancement module, the MLCA (Mixed Local Channel Attention) lightweight attention mechanism, and the SPA (Scale-Position-Detail Awareness) module composed of SSFF (Scale Sequence Feature Fusion), TFE (Triple Feature Encoding), and CPAM (Channel and Position Attention Mechanism). These enhancements strengthen fine-grained lesion detection while keeping the model lightweight. Experimental results show that YOLO-SPDNet achieves an accuracy of 91.8%, a recall of 86.5%, and an mAP@0.5 of 90.6% on the test set, with a computational complexity of 12.5 GFLOPs. Furthermore, the model reaches a real-time inference speed of 987 FPS, making it suitable for deployment on mobile agricultural terminals and online monitoring systems. Comparative analysis and ablation studies further validate the reliability and practical applicability of the proposed model in complex natural scenes.
Funding: National Key Research & Development Program of China, No. 2021YFC3201201; Ningxia Key Research and Development Program (Special Talents), No. 2023BSB03021; Natural Science Foundation of Ningxia, No. 2023AAC05014; University First-Class Discipline Construction Project of Ningxia, No. NXYLXK2021A03.
Abstract: The Qingtongxia Irrigation District in Ningxia is an important hydrological and ecological region. To assess its ecological environment quality from 2001 to 2021 across multiple scales and identify driving factors, a modified remote sensing ecological index (MRSEI) was developed by incorporating evapotranspiration. Spatial and temporal patterns were analyzed using the coefficient of variation, spatial autocorrelation, and semi-variogram methods, while influencing factors were explored via the optimal parameter geographical detector model. The MRSEI's first principal component loadings and rankings aligned with those of RSEI (average contribution: 81.31%), effectively reflecting spatiotemporal variations. At the sub-irrigation district and landscape scales, ecological quality was slightly lower than at the district level but remained stable. Moderate and good ecological grades accounted for 36.28% and 33.38% of the area, respectively, at the district scale, and the moderate grade reached 70.48% at smaller scales. Spatial heterogeneity intensified with decreasing scale, and human activity lost explanatory power below a 5 km range. Human factors mainly drove ecological differentiation at the district scale, while natural factors dominated at finer scales. The MRSEI offers a novel tool for ecological assessment in arid and semi-arid areas and supports scale-adapted ecological protection strategies.
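The coefficient of variation used above to describe temporal stability is simply the standard deviation divided by the mean of a pixel's index values over the study years. The following sketch assumes the population standard deviation and an invented series of yearly index values; neither choice is stated in the abstract, and the function name is hypothetical.

```python
import statistics


def coefficient_of_variation(values):
    """Return std / mean for a series of positive index values (population std assumed)."""
    mean = statistics.fmean(values)
    if mean == 0:
        raise ValueError("coefficient of variation is undefined for a zero mean")
    return statistics.pstdev(values) / mean


# Hypothetical yearly index values for one pixel (illustrative numbers only)
yearly_index = [2, 4, 4, 4, 5, 5, 7, 9]
print(coefficient_of_variation(yearly_index))
```

A lower value indicates a more stable index over time, so mapping this quantity per pixel highlights where ecological quality fluctuated most.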
Funding: Supported by the National Key Research and Development of China (No. 2022YFB2503400).
Abstract: Semantic segmentation for mixed scenes of aerial remote sensing and road traffic is one of the key technologies for the visual perception of flying cars. State-of-the-art (SOTA) semantic segmentation methods have made remarkable achievements in both fine-grained segmentation and real-time performance. However, they still face great challenges from the huge differences in scale and semantic categories found in mixed scenes of aerial remote sensing and road traffic, and there is little related research. To address this issue, this paper proposes a semantic segmentation model specifically for mixed datasets of aerial remote sensing and road traffic scenes. First, a novel decoding-recoding multi-scale feature iterative refinement structure is proposed, which uses the re-integration and continuous enhancement of multi-scale information to deal effectively with the huge scale differences between cross-domain scenes, while using a fully convolutional structure to meet lightweight and real-time requirements. Second, a well-designed cross-window attention mechanism combined with a global information integration decoding block forms an enhanced global context perception, which can effectively capture the long-range dependencies and multi-scale global context information of different scenes, thereby achieving fine-grained semantic segmentation. The proposed method is tested on a large-scale mixed dataset of aerial remote sensing and road traffic scenes. The results confirm that it can deal effectively with the large scale differences in cross-domain scenes. Its segmentation accuracy surpasses that of the SOTA methods while meeting real-time requirements.
Abstract: Distributed Denial of Service (DDoS) attacks are one of the most severe threats to network infrastructure, sometimes bypassing traditional diagnosis algorithms because of their evolving complexity. Present Machine Learning (ML) techniques for DDoS attack diagnosis normally apply statistical features of network traffic such as packet sizes and inter-arrival times. However, such techniques sometimes fail to capture complicated relations among various traffic flows. In this paper, we present a new multi-scale ensemble strategy based on Graph Neural Networks (GNNs) for improving DDoS detection. Our technique divides traffic into macro- and micro-level elements, allowing different GNN models to capture both coarse-scale anomalies and subtle, stealthy attack patterns. By modeling network traffic as graph-structured data, GNNs efficiently learn intricate relations among network entities. The proposed ensemble learning algorithm combines the results of several GNNs to improve generalization, robustness, and scalability. Extensive experiments on three benchmark datasets (UNSW-NB15, CICIDS2017, and CICDDoS2019) show that our approach outperforms traditional machine learning and deep learning models in detecting both high-rate and low-rate (stealthy) DDoS attacks, with significant improvements in accuracy and recall. These findings demonstrate the proposed method's applicability and robustness for real-world implementation in contexts where several DDoS patterns coexist.
Funding: Supported by the National Natural Science Foundation of China (Grant No. 51975004) and the Outstanding Youth Fund of Universities in Anhui Province of China (Grant No. 2022AH020032).
Abstract: One-dimensional ensemble dispersion entropy (EDE_1D) is an effective nonlinear dynamic analysis method for measuring the complexity of time series. However, it is restricted to assessing the complexity of one-dimensional time series (TS_1D), and its complexity features are extracted at only a single scale. To address these problems, a new nonlinear dynamic analysis method termed two-dimensional composite multi-scale ensemble Gramian dispersion entropy (CMEGDE_2D) is proposed in this paper. First, the TS_1D is transformed into a two-dimensional image (I_2D) by using Gramian angular fields (GAF), which provide more internal data structure and geometric features and preserve the global characteristics and time dependence of vibration signals. Second, the I_2D is analyzed at multiple scales through the composite coarse-graining method, which overcomes the limitation of a single scale and provides greater stability than traditional coarse-graining methods. Subsequently, a new fault diagnosis method for rolling bearings is proposed, based on CMEGDE_2D for fault feature extraction and the chicken swarm algorithm optimized support vector machine (CSO-SVM) for fault pattern identification. Simulation signals and two data sets of rolling bearings are used to verify the effectiveness of the proposed fault diagnosis method. The results demonstrate that the proposed method has stronger discrimination ability, higher fault diagnosis accuracy, and better stability than the other compared methods.
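The Gramian angular field step in the abstract above turns a 1D series into a 2D image. A minimal sketch of the commonly used summation variant (GASF) follows: rescale the series to [-1, 1], map each value to an angle with arccos, and fill the image with cos(φ_i + φ_j). Whether the paper uses the summation or the difference variant is not stated in the abstract, so the choice here is an assumption, and the function names are hypothetical.

```python
import math


def rescale_to_unit_interval(series):
    """Linearly rescale a series to [-1, 1]; a constant series cannot be rescaled."""
    lo, hi = min(series), max(series)
    if hi == lo:
        raise ValueError("constant series cannot be rescaled")
    return [(2 * x - hi - lo) / (hi - lo) for x in series]


def gramian_angular_summation_field(series):
    """Return the GASF image G[i][j] = cos(phi_i + phi_j) as a nested list."""
    scaled = rescale_to_unit_interval(series)
    # Clamp before acos to guard against tiny floating-point overshoot
    phi = [math.acos(max(-1.0, min(1.0, v))) for v in scaled]
    return [[math.cos(a + b) for b in phi] for a in phi]


signal = [0.0, 1.0, 2.0]             # stand-in for a short vibration segment
image = gramian_angular_summation_field(signal)
```

The resulting image is symmetric, and each diagonal entry cos(2φ_i) equals 2x_i² − 1 for the rescaled value x_i, so the diagonal records the magnitude of each sample while off-diagonal entries encode pairwise temporal relations.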
Abstract: Defect detection in printed circuit boards (PCB) remains challenging due to the difficulty of identifying small-scale defects, the inefficiency of conventional approaches, and the interference from complex backgrounds. To address these issues, this paper proposes SIM-Net, an enhanced detection framework derived from YOLOv11. The model integrates SPDConv to preserve fine-grained features for small object detection, introduces a novel convolutional partial attention module (C2PAM) to suppress redundant background information and highlight salient regions, and employs a multi-scale fusion network (MFN) with a multi-grain contextual module (MGCT) to strengthen contextual representation and accelerate inference. Experimental evaluations demonstrate that SIM-Net achieves 92.4% mAP, 92% accuracy, and 89.4% recall with an inference speed of 75.1 FPS, outperforming existing state-of-the-art methods. These results confirm the robustness and real-time applicability of SIM-Net for PCB defect inspection.
Funding: Financial support from the National Key Research and Development Program of China (2022YFB3804905), the National Natural Science Foundation of China (22375047, 22378068, and 22378071), the Natural Science Foundation of Fujian Province (2022J01568), and the 111 Project (No. D17005).
Abstract: Advanced healthcare monitors for air pollution applications pose a significant challenge in balancing high-performance filtration with multifunctional smart integration. Electrospun triboelectric nanogenerators (TENG) offer significant potential for use under such difficult circumstances. We have successfully constructed a high-performance TENG using a novel multi-scale nanofiber architecture. Nylon 66 (PA66) and chitosan quaternary ammonium salt (HACC) composites were prepared by electrospinning, forming PA66/H multi-scale nanofiber membranes composed of nanofibers (≈73 nm) and submicron fibers (≈123 nm). The PA66/H multi-scale nanofiber membrane served as the positive electrode and an electrospun PVDF-HFP nanofiber membrane as the negative electrode, together forming a respiration-driven PVDF-HFP@PA66/H TENG. The resulting PVDF-HFP@PA66/H TENG-based air filter uses electrostatic adsorption and physical interception mechanisms, achieving a PM_0.3 filtration efficiency of over 99% with a pressure drop of only 48 Pa. In addition, the PVDF-HFP@PA66/H TENG exhibits excellent stability in high-humidity environments, with filtration efficiency reduced by less than 1%. At the same time, the TENG achieves periodic contact and separation driven by breathing and is therefore self-powered, which helps ensure the long-term stability of the filtration efficiency. Beyond air filtration, the TENG can also monitor health in real time by capturing human breathing signals without an external power supply. This integrated system combines high-efficiency air filtration, self-powered operation, and health monitoring, presenting an innovative solution for air purification, smart protective equipment, and portable health monitoring. These findings highlight the potential of this technology for diverse applications, offering a promising direction for advancing multifunctional air filtration systems.
Funding: Project (2022YFC2904103) supported by the National Key Research and Development Program of China; Projects (52374112, 52274108) supported by the National Natural Science Foundation of China; Projects (BX20220036, BX20230041) supported by the Postdoctoral Innovation Talents Support Program, China; Project (2232080) supported by the Beijing Natural Science Foundation, China.
Abstract: The development of metallic mineral resources generates a significant amount of solid waste, such as tailings and waste rock. Cemented tailings and waste-rock backfill (CTWB) is an effective method for managing and disposing of this mining waste. This study employs a macro-meso-micro testing method to investigate the effects of the waste rock grading index (WGI) and loading rate (LR) on the uniaxial compressive strength (UCS), pore structure, and micromorphology of CTWB materials. Pore structures were analyzed using scanning electron microscopy (SEM) and mercury intrusion porosimetry (MIP). The particles (pores) and cracks analysis system (PCAS) software was used to quantitatively characterize the multi-scale micropores in the SEM images. The key findings indicate that the macroscopic results (UCS) of CTWB materials correspond to the microscopic results (pore structure and micromorphology). Changes in porosity largely depend on the waste rock grading index and loading rate. As the waste rock grading index increases, the UCS first increases and then decreases, while porosity first decreases and then increases, with a critical waste rock grading index of 0.6. As the loading rate increases, UCS initially rises and then falls, while porosity gradually increases. Based on the MIP and SEM results, at a waste rock grading index of 0.6, the most probable pore diameters, total pore area (TPA), pore number (PN), maximum pore area (MPA), and area probability distribution index (APDI) are minimized, while the average pore form factor (APF) and fractal dimension of pore porosity distribution (FDPD) are maximized, indicating the most compact pore structure. At a loading rate of 12.0 mm/min, the most probable pore diameters, TPA, PN, MPA, APF, and APDI reach their maximum values, while FDPD reaches its minimum value. Finally, the mechanism of CTWB materials during compression is analyzed based on the quantitative results of UCS and porosity. The research findings play a crucial role in ensuring the successful application of CTWB materials in deep metal mines.
Funding: Supported by the National Natural Science Foundation (NNSF) of China under Grant 62001351, the Foundation of National Key Laboratory of Electromagnetic Environment (6142403220202), and the Stability Support Fund for Basic Military Industrial Research Institutes (A240104130).
Abstract: Orthogonal time frequency space (OTFS) modulation is a novel modulation scheme that can effectively cope with the high Doppler spread caused by high mobility. Because it modulates data in the delay-Doppler (DD) domain and makes full use of the sparse characteristics of the DD domain, it has been widely studied for the design of efficient channel estimation and signal detection schemes. In this paper, we design a novel superimposed pilot pattern with a transition band, which replaces the guard zero-symbols of the traditional embedded pilot (EP), and perform a two-stage channel estimation. In the first stage, we fully exploit the dispersion characteristics of the OTFS signal in the DD domain and use a threshold decision to make a coarse channel estimate. In the second stage, we use the results of the coarse estimation for iterative signal detection and accurate channel estimation. During the second stage, we make full use of the sparsity of the channel in the DD domain, remodel the received signal as a sparse channel vector multiplied by a channel coefficient matrix, and introduce a Doppler index segmentation factor (DISF) to subdivide the Doppler index and solve the problem of fractional Doppler. Simulations reveal that the proposed scheme has higher spectral efficiency than the traditional EP scheme and a lower peak-to-average power ratio (PAPR) than the traditional superimposed pilot scheme.
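The peak-to-average power ratio (PAPR) compared in the abstract above has a standard definition: the largest instantaneous sample power divided by the mean sample power, usually quoted in decibels. The sketch below computes it for a list of complex baseband samples; the sample values are invented for illustration and the function name is hypothetical.

```python
import math


def papr_db(samples):
    """Return the PAPR in dB of a sequence of complex baseband samples."""
    powers = [abs(s) ** 2 for s in samples]
    mean_power = sum(powers) / len(powers)
    if mean_power == 0:
        raise ValueError("PAPR is undefined for an all-zero signal")
    return 10 * math.log10(max(powers) / mean_power)


constant_envelope = [1, 1j, -1, -1j]   # every sample has unit power
single_spike = [2, 0, 0, 0]            # all energy concentrated in one sample
print(papr_db(constant_envelope), papr_db(single_spike))
```

A constant-envelope signal has a PAPR of 0 dB, while concentrating energy in a few samples raises it. This is the general reason a strong superimposed pilot can increase PAPR, although the abstract does not give the specific power allocation used.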
Funding: Funded by the National Natural Science Foundation of China (No. 52204407), the Natural Science Foundation of Jiangsu Province (No. BK20220595), and the China Postdoctoral Science Foundation (No. 2022M723689).
Abstract: This study proposes a multi-scale simplified residual convolutional neural network (MS-SRCNN) for the precise prediction of Mg-Nd binary alloy compositions from scanning electron microscope (SEM) images. A multi-scale data structure is established by spatially aligning and stacking SEM images at different magnifications. The MS-SRCNN reduces computational runtime by over 90% compared to traditional architectures such as ResNet50, VGG16, and VGG19, without compromising prediction accuracy. The model demonstrates superior predictive performance, achieving a >5% increase in R^2 compared to single-scale models. Furthermore, the MS-SRCNN exhibits robust composition prediction capability across other Mg-based binary alloys, including Mg-La, Mg-Sn, Mg-Ce, Mg-Sm, Mg-Ag, and Mg-Y, thereby emphasizing its generalization and extrapolation potential. This research establishes a non-destructive, microstructure-informed composition analysis framework, reduces characterization time compared to traditional experimental methods, and provides insights into the composition-microstructure relationship in diverse material systems.
Abstract: With the rapid expansion of drone applications, accurate detection of objects in aerial imagery has become crucial for intelligent transportation, urban management, and emergency rescue missions. However, existing methods face numerous challenges in practical deployment, including scale variation handling, feature degradation, and complex backgrounds. To address these issues, we propose Edge-enhanced and Detail-Capturing You Only Look Once (EHDC-YOLO), a novel framework for object detection in Unmanned Aerial Vehicle (UAV) imagery. Based on the You Only Look Once version 11 nano (YOLOv11n) baseline, EHDC-YOLO systematically introduces several architectural enhancements: (1) a Multi-Scale Edge Enhancement (MSEE) module that leverages multi-scale pooling and edge information to enhance boundary feature extraction; (2) an Enhanced Feature Pyramid Network (EFPN) that integrates P2-level features with Cross Stage Partial (CSP) structures and OmniKernel convolutions for better fine-grained representation; and (3) a Dynamic Head (DyHead) with multi-dimensional attention mechanisms for enhanced cross-scale modeling and perspective adaptability. Comprehensive experiments on the Vision meets Drones for Detection (VisDrone-DET) 2019 dataset demonstrate that EHDC-YOLO achieves significant improvements, increasing mean Average Precision (mAP)@0.5 from 33.2% to 46.1% (an absolute improvement of 12.9 percentage points) and mAP@0.5:0.95 from 19.5% to 28.0% (an absolute improvement of 8.5 percentage points) compared with the YOLOv11n baseline, while maintaining a reasonable parameter count (2.81 M vs. the baseline's 2.58 M). Further ablation studies confirm the effectiveness of each proposed component, while visualization results highlight EHDC-YOLO's superior performance in detecting objects and handling occlusions in complex drone scenarios.
Funding: The National Key Research and Development Program of China (Grant No. 2022YFF0711400) and the National Space Science Data Center Youth Open Project (Grant No. NSSDC2302001).
Abstract: Impact craters are important for understanding lunar geologic evolution and surface erosion rates, among other functions. However, micro impact craters are numerous and their morphological characteristics are not obvious, resulting in low detection accuracy by deep learning models. We therefore propose a new multi-scale fusion crater detection algorithm (MSF-CDA) based on YOLO11 to improve the accuracy of lunar impact crater detection, especially for small craters with a diameter of <1 km. Using images taken by the LROC (Lunar Reconnaissance Orbiter Camera) at the Chang'e-4 (CE-4) landing area, we constructed three separate datasets for craters with diameters of 0-70 m, 70-140 m, and >140 m, and trained three submodels separately on these datasets. Additionally, we designed a slicing-amplifying-slicing strategy to enhance the ability to extract features from small craters. To handle redundant predictions, we proposed a new Non-Maximum Suppression with Area Filtering method to fuse the results for overlapping targets across the multi-scale submodels. Our MSF-CDA method achieved high detection performance, with Precision, Recall, and F1 score values of 0.991, 0.987, and 0.989, respectively, effectively addressing the problems caused by the limited features and sample imbalance of small craters. MSF-CDA can provide strong data support for more in-depth study of the geological evolution of the lunar surface and for finer geological age estimation. The strategy can also be used to detect other small objects with limited features and sample imbalance problems. We detected approximately 500,000 impact craters in an area of approximately 214 km² around the CE-4 landing area. By statistically analyzing the new data, we updated the distribution function of the number and diameter of impact craters. Finally, we identified the most suitable lighting conditions for detecting impact crater targets by analyzing the effect of different lighting conditions on the detection accuracy.
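The abstract above does not spell out how Non-Maximum Suppression with Area Filtering works, so the following is only a sketch of one plausible reading: detections from several submodels are pooled, boxes whose area falls outside an allowed range are discarded, and standard greedy IoU-based NMS removes the remaining duplicates. The function names, thresholds, and box values are hypothetical and are not taken from the paper.

```python
def box_area(box):
    """Area of an axis-aligned box given as (x1, y1, x2, y2)."""
    x1, y1, x2, y2 = box
    return max(0.0, x2 - x1) * max(0.0, y2 - y1)


def iou(a, b):
    """Intersection over union of two axis-aligned boxes."""
    inter = box_area((max(a[0], b[0]), max(a[1], b[1]),
                      min(a[2], b[2]), min(a[3], b[3])))
    union = box_area(a) + box_area(b) - inter
    return inter / union if union > 0 else 0.0


def nms_with_area_filter(detections, iou_thr=0.5, min_area=0.0, max_area=float("inf")):
    """Greedy NMS over (box, score) pairs after dropping boxes outside the area range."""
    candidates = [d for d in detections if min_area <= box_area(d[0]) <= max_area]
    candidates.sort(key=lambda d: d[1], reverse=True)   # highest score first
    kept = []
    for box, score in candidates:
        # Keep a box only if it does not overlap too much with any kept box
        if all(iou(box, k[0]) < iou_thr for k in kept):
            kept.append((box, score))
    return kept


pooled = [((0, 0, 10, 10), 0.9), ((1, 1, 11, 11), 0.8),
          ((50, 50, 52, 52), 0.7), ((100, 100, 120, 120), 0.6)]
print(nms_with_area_filter(pooled, iou_thr=0.5, min_area=10))
```

In this toy run the second box is suppressed as a duplicate of the first (IoU ≈ 0.68), and the third is dropped by the area filter, which mimics discarding predictions that fall outside the size range a submodel was trained on.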
Funding: National Key Laboratory of Unmanned Aerial Vehicle Technology (No. 202408); Key Laboratory of Smart Earth (No. KF2023ZD01-05).
Abstract: In GNSS-denied environments, signals of opportunity (SOP) offer an efficient and passive solution for navigation and positioning by utilizing ambient signals. Nevertheless, conventional SOP techniques face significant challenges in real-time processing, especially under sub-Nyquist sampling conditions, due to high data acquisition rates and off-grid errors. To address this, this paper proposes the signal reconstruction and kernel sparse encoding (SRKSE) model, a novel general framework for high-precision parameter estimation. By combining compressed sensing with a deep unfolding network, the SRKSE model not only achieves robust signal reconstruction but also effectively reduces quantization errors. Key innovations of SRKSE include dual cross-attention mechanisms for enhanced feature extraction, sinc sparse kernel encoding to minimize quantization errors, and a custom loss function for balanced optimization. With these advancements, SRKSE achieves up to a 650-fold improvement in time of arrival (TOA) estimation accuracy while operating at just 1% of the Nyquist sampling rate. SRKSE surpasses both conventional and deep learning-based techniques in accuracy and efficiency, especially under sub-Nyquist sampling conditions. Simulations and real-world experiments confirm the reliability and potential of SRKSE for real-time applications in IoT and wireless communication.
Funding: Supported by the National Natural Science Foundation of China (No. 52207228), the Beijing Natural Science Foundation, China (No. 3224070), and the National Natural Science Foundation of China (No. 52077208).
Abstract: The growing use of lithium-ion batteries in electric transportation and grid-scale storage systems has intensified the need for accurate and highly generalizable state-of-health (SOH) estimation. Conventional approaches often suffer from reduced accuracy under dynamically uncertain state-of-charge (SOC) operating ranges and heterogeneous aging stresses. This study presents a unified SOH estimation framework that integrates physics-informed modeling, subspace identification, and Transformer-based learning. A reduced-order model is derived from simplified electrochemical dynamics, providing an interpretable and computationally efficient representation of battery behavior. Subspace identification across a wide SOC and SOH range yields degradation-sensitive features, which the Transformer uses to capture long-range aging dynamics via multi-head self-attention. Experiments on LiFePO₄ cells under joint-cell training show consistently accurate SOH estimation, with a maximum error of 1.39%, demonstrating the framework’s effectiveness in decoupling SOC and SOH effects. In cross-cell validation, where training and validation are performed on different cells, the model maintains a maximum error of 2.06%, confirming strong generalization to unseen aging trajectories. Comparative experiments on LiFePO₄ and public LiCoO₂ datasets confirm the framework’s cross-chemistry applicability. By extracting low-dimensional, physically interpretable features via subspace identification, the framework significantly reduces training cost while maintaining high SOH estimation accuracy, outperforming conventional data-driven models that lack physical guidance.
Funding: Project supported by the National Natural Science Foundation of China (Grant Nos. 12274348 and 12004335) and the National Key Research and Development Program of China (Grant No. 2024YFC2813800).
Abstract: This study presents a novel method for estimating the depth of a single underwater source in shallow water using vector sensors. The approach leverages the depth distribution of the broadband Stokes parameters to estimate source depth accurately. Unlike traditional matched field processing (MFP) and matched mode processing (MMP), the proposed approach can estimate source depth directly from the data received by the sensors without requiring complete environmental information. First, the broadband Stokes parameters (BSP) are established using normal mode theory. Then the nonstationary phase approximation is used to simplify the theoretical derivation, which is necessary when dealing with broadband integrals. Additionally, the range terms of the BSP are eliminated by normalization. Analysis of the depth distribution of the normalized broadband Stokes parameters (NBSP) shows that the NBSP exhibit extreme values at the source depth, which can be used for source depth estimation. The proposed depth estimation method is therefore based on searching for the peaks of the NBSP. Simulations show that this method is effective in relatively simple shallow water environments. Finally, the effects of source range, frequency bandwidth, sound speed profile (SSP), water depth, and signal-to-noise ratio (SNR) are studied. The findings indicate that the proposed method can accurately estimate the source depth when the SNR is greater than -5 dB and does not need to consider model mismatch issues. Additionally, variations in environmental parameters have minimal impact on estimation accuracy. Compared to MFP, the proposed method requires a higher SNR but demonstrates superior robustness against fluctuations in environmental parameters.