Unmanned aerial vehicle(UAV)-borne gamma-ray spectrum survey plays a crucial role in geological mapping,radioactive mineral exploration,and environmental monitoring.However,raw data are often compromised by flight and...Unmanned aerial vehicle(UAV)-borne gamma-ray spectrum survey plays a crucial role in geological mapping,radioactive mineral exploration,and environmental monitoring.However,raw data are often compromised by flight and instrument background noise,as well as detector resolution limitations,which affect the accuracy of geological interpretations.This study aims to explore the application of the Real-ESRGAN algorithm in the super-resolution reconstruction of UAV-borne gamma-ray spectrum images to enhance spatial resolution and the quality of geological feature visualization.We conducted super-resolution reconstruction experiments with 2×,4×and 6×magnification using the Real-ESRGAN algorithm,comparing the results with three other mainstream algorithms(SRCNN,SRGAN,FSRCNN)to verify the superiority in image quality.The experimental results indicate that Real-ESRGAN achieved a structural similarity index(SSIM)value of 0.950 at 2×magnification,significantly higher than the other algorithms,demonstrating its advantage in detail preservation.Furthermore,Real-ESRGAN effectively reduced ringing and overshoot artifacts,enhancing the clarity of geological structures and mineral deposit sites,thus providing high-quality visual information for geological exploration.展开更多
In this article,the authors explore the online updating estimation for general estimating equations(EEs)in heterogeneous streaming data settings.The framework is based on more conservative model assumptions,leading to...In this article,the authors explore the online updating estimation for general estimating equations(EEs)in heterogeneous streaming data settings.The framework is based on more conservative model assumptions,leading to more robust estimations and preventing misspecification.The authors establish the standard renewable estimation under blockwise heterogeneity assumption,which can correctly specify model in some sense.To mitigate heterogeneity and enhance estimation accuracy,the authors propose two novel online detection and fusion strategies,with corresponding algorithms provided.Theoretical properties of the proposed methods are demonstrated in the context of small block sizes.Extensive numerical experiments validate the theoretical findings.Real data analysis of the Ford Gobike docked bike-sharing dataset verifies the feasibility and robustness of the proposed methods.展开更多
Single Image Super-Resolution(SISR)seeks to reconstruct high-resolution(HR)images from lowresolution(LR)inputs,thereby enhancing visual fidelity and the perception of fine details.While Transformer-based models—such ...Single Image Super-Resolution(SISR)seeks to reconstruct high-resolution(HR)images from lowresolution(LR)inputs,thereby enhancing visual fidelity and the perception of fine details.While Transformer-based models—such as SwinIR,Restormer,and HAT—have recently achieved impressive results in super-resolution tasks by capturing global contextual information,these methods often suffer from substantial computational and memory overhead,which limits their deployment on resource-constrained edge devices.To address these challenges,we propose a novel lightweight super-resolution network,termed Binary Attention-Guided Information Distillation(BAID),which integrates frequency-aware modeling with a binary attention mechanism to significantly reduce computational complexity and parameter count whilemaintaining strong reconstruction performance.The network combines a high–low frequency decoupling strategy with a local–global attention sharing mechanism,enabling efficient compression of redundant computations through binary attention guidance.At the core of the architecture lies the Attention-Guided Distillation Block(AGDB),which retains the strengths of the information distillation framework while introducing a sparse binary attention module to enhance both inference efficiency and feature representation.Extensive×4 superresolution experiments on four standard benchmarks—Set5,Set14,BSD100,and Urban100—demonstrate that BAID achieves Peak Signal-to-Noise Ratio(PSNR)values of 32.13,28.51,27.47,and 26.15,respectively,with only 1.22 million parameters and 26.1 G Floating-Point Operations(FLOPs),outperforming other state-of-the-art lightweight methods such as Information Multi-Distillation Network(IMDN)and Residual Feature Distillation Network(RFDN).These results highlight the proposed model’s ability to deliver high-quality image reconstruction while offering strong deployment efficiency,making it well-suited for image restoration tasks in resource-limited environments.展开更多
This study develops a surrogate super-resolution(SR)framework that accelerates finite element method(FEM)-based computational fluid dynamics(CFD)using deep learning.High-resolution(HR)FEM-based CFDremains computationa...This study develops a surrogate super-resolution(SR)framework that accelerates finite element method(FEM)-based computational fluid dynamics(CFD)using deep learning.High-resolution(HR)FEM-based CFDremains computationally prohibitive for time-sensitive applications,including patient-specific aneurysm hemodynamics where rapid turnaround is valuable.The proposed pipeline learns to reconstruct HR velocity-magnitude fields fromlow-resolution(LR)FEM solutions generated under the same governing equations and boundary conditions.It consistsof three modules:(i)offline pre-training of a residual network on representative vascular geometries;(ii)lightweightfine-tuning to adapt the pretrained model to geometric variability,including patient-specific aneurysm morphologies;and(iii)an unstructured-to-structured sampling strategy with region-of-interest upsampling that concentrates resolution in flow-critical zones(e.g.,the aneurysm sac)rather than the full domain.This targeted reconstruction substantiallyreduces inference and post-processing cost while preserving key HR flow features.Experiments on cerebral aneurysmmodels show that HR velocity-magnitude fields can be recovered with accuracy comparable to direct HR simulationsat less than 1%of the direct HR simulation cost per analysis(LR simulation and SR inference),while adaptation to newgeometries requires only lightweight fine-tuning with limited target-specific HR data.While clinical endpoints andadditional variables(e.g.,pressure or wall-based metrics)are left for future work,the results indicate that the proposedsurrogate SR approach can streamline FEM-based CFD workflows toward near real-time hemodynamic analysis acrossmorphologically similar vascular models.展开更多
High-resolution remote sensing imagery is essential for critical applications such as precision agriculture,urban management planning,and military reconnaissance.Although significant progress has been made in singleim...High-resolution remote sensing imagery is essential for critical applications such as precision agriculture,urban management planning,and military reconnaissance.Although significant progress has been made in singleimage super-resolution(SISR)using generative adversarial networks(GANs),existing approaches still face challenges in recovering high-frequency details,effectively utilizing features,maintaining structural integrity,and ensuring training stability—particularly when dealing with the complex textures characteristic of remote sensing imagery.To address these limitations,this paper proposes the Improved ResidualModule and AttentionMechanism Network(IRMANet),a novel architecture specifically designed for remote sensing image reconstruction.IRMANet builds upon the Super-Resolution Generative Adversarial Network(SRGAN)framework and introduces several key innovations.First,the Enhanced Residual Unit(ERU)enhances feature reuse and stabilizes training through deep residual connections.Second,the Self-Attention Residual Block(SARB)incorporates a self-attentionmechanism into the Improved Residual Module(IRM)to effectivelymodel long-range dependencies and automatically emphasize salient features.Additionally,the IRM adopts amulti-scale feature fusion strategy to facilitate synergistic interactions between local detail and global semantic information.The effectiveness of each component is validated through ablation studies,while comprehensive comparative experiments on standard remote sensing datasets demonstrate that IRMANet significantly outperforms both the baseline and state-of-the-art methods in terms of perceptual quality and quantitative metrics.Specifically,compared to the baseline model,at a magnification factor of 2,IRMANet achieves an improvement of 0.24 dB in peak signal-to-noise ratio(PSNR)and 0.54 in structural similarity index(SSIM);at a magnification factor of 4,it achieves gains of 0.22 dB in PSNR and 0.51 in SSIM.These results confirm that the proposedmethod effectively enhances detail representation and structural reconstruction accuracy in complex remote sensing scenarios,offering robust technical support for high-precision detection and identification of both military and civilian aircraft.展开更多
Remote sensing image super-resolution technology is pivotal for enhancing image quality in critical applications including environmental monitoring,urban planning,and disaster assessment.However,traditional methods ex...Remote sensing image super-resolution technology is pivotal for enhancing image quality in critical applications including environmental monitoring,urban planning,and disaster assessment.However,traditional methods exhibit deficiencies in detail recovery and noise suppression,particularly when processing complex landscapes(e.g.,forests,farmlands),leading to artifacts and spectral distortions that limit practical utility.To address this,we propose an enhanced Super-Resolution Generative Adversarial Network(SRGAN)framework featuring three key innovations:(1)Replacement of L1/L2 loss with a robust Charbonnier loss to suppress noise while preserving edge details via adaptive gradient balancing;(2)A multi-loss joint optimization strategy dynamically weighting Charbonnier loss(β=0.5),Visual Geometry Group(VGG)perceptual loss(α=1),and adversarial loss(γ=0.1)to synergize pixel-level accuracy and perceptual quality;(3)A multi-scale residual network(MSRN)capturing cross-scale texture features(e.g.,forest canopies,mountain contours).Validated on Sentinel-2(10 m)and SPOT-6/7(2.5 m)datasets covering 904 km2 in Motuo County,Xizang,our method outperforms the SRGAN baseline(SR4RS)with Peak Signal-to-Noise Ratio(PSNR)gains of 0.29 dB and Structural Similarity Index(SSIM)improvements of 3.08%on forest imagery.Visual comparisons confirm enhanced texture continuity despite marginal Learned Perceptual Image Patch Similarity(LPIPS)increases.The method significantly improves noise robustness and edge retention in complex geomorphology,demonstrating 18%faster response in forest fire early warning and providing high-resolution support for agricultural/urban monitoring.Future work will integrate spectral constraints and lightweight architectures.展开更多
The orthogonal time frequency space(OTFS)modulation is a novel modulation scheme that can effectively cope with the high Doppler expansion caused by high mobility.Since it modulates data on delay-Doppler(DD)domain and...The orthogonal time frequency space(OTFS)modulation is a novel modulation scheme that can effectively cope with the high Doppler expansion caused by high mobility.Since it modulates data on delay-Doppler(DD)domain and makes full use of the sparse characteristics of DD domain,it has been widely studied to design efficient channel estimation and signal detection schemes.In this paper,we design a novel superimposed pilot pattern with transition band,which replaces the traditional embedded pilot(EP)guard zero-symbols,and perform a two-stage channel estimation.In the first stage,we fully utilize the dispersion characteristics of OTFS signal in DD domain,and use threshold decision to make coarse channel estimation.In the second stage,we use the results of the coarse estimation for iterative signal detection and accurate channel estimation.During the second stage,we make full use of the sparsity of the channel in DD domain,remodel the received signal into the form of sparse channel vector multiplied by channel coefficient matrix,and introduce Doppler index segmentation factor(DISF)to subdivide the Doppler index to solve the problem of fractional Doppler.Simulations reveal that,the scheme proposed in this paper has higher spectral efficiency compared with traditional EP scheme and lower peak-to-average power ratio(PAPR)compared with traditional superimposed pilot scheme.展开更多
Real-time multi-person pose estimation(MPE)built upon neural network architectures aims to simultaneously detect multiple human instances and regress joint coordinates in dynamic scenes.However,due to factors such as ...Real-time multi-person pose estimation(MPE)built upon neural network architectures aims to simultaneously detect multiple human instances and regress joint coordinates in dynamic scenes.However,due to factors such as high model complexity and limited expression of keypoint information,both the efficiency and accuracy of real-time MPE remain to be improved.To mitigate the adverse impacts caused by the aforementioned issues,this work develops FSEM-Pose,a real-time MPE model rooted in the YOLOv10 framework.In detail,first,FSEM-Pose upgrades the backbone module of the baseline network by introducing the Feature Shuffling-Convolution(FS-Conv),which effectively reduces the backbone size while maximizing the retention of spatial information from the input image.Second,FSEM-Pose incorporates a Feature Saliency Enhancement Module(FSEM)to strengthen the feature encoding of human keypoints,thereby improving the accuracy of pose estimation.Finally,FSEM-Pose further enhances inference efficiency via a lightweight optimization of the head using shared convolutional layers.Our method achieves competitive results across multiple accuracy and efficiency metrics on the MS COCO 2017 and CrowdPose datasets.While being lightweight in design,it improves average precision(AP)by 2.1%and 2.5%,respectively.展开更多
In GNSS-denied environments,signals of opportunity(SOP)offer an efficient and passive solution for navigation and positioning by utilizing ambient signals.Nevertheless,conventional SOP techniques face significant chal...In GNSS-denied environments,signals of opportunity(SOP)offer an efficient and passive solution for navigation and positioning by utilizing ambient signals.Nevertheless,conventional SOP techniques face significant challenges in real-time processing,especially under sub-Nyquist sampling conditions,due to high data acquisition rates and offgrid errors.To address this,this paper proposes the signal reconstruction and kernel sparse encoding(SRKSE)model,a novel general framework for high-precision parameter estimation.By combining compressed sensing with a deep unfolding network,the SRKSE model not only achieves robust signal reconstruction but also effectively reduces quantization errors.Key innovations of SRKSE include dual crossattention mechanisms for enhanced feature extraction,sinc sparse kernel encoding to minimize quantization errors,and a custom loss function for balanced optimization.With these advancements,SRKSE achieves up to a 650-fold improvement in time of arrival(TOA)estimation accuracy while operating at just 1%of the Nyquist sampling rate.The SRKSE surpasses both conventional and deep learning-based techniques in accuracy and efficiency,especially when operating under sub-Nyquist sampling conditions.Simulations and real-world experiments confirm the reliability and potential of SRKSE for real-time applications in IoT and wireless communication.展开更多
Lithium-ion(Li-ion)batteries stand as the dominant energy storage solution,despite their widespread adoption,precisely determining the state of charge(SOC)continues to pose significant difficulties,with direct implica...Lithium-ion(Li-ion)batteries stand as the dominant energy storage solution,despite their widespread adoption,precisely determining the state of charge(SOC)continues to pose significant difficulties,with direct implications for battery safety,operational reliability,and overall performance.Current SOC estimation techniques often demonstrate limited accuracy,particularly when confronted with complex operational scenarios and wide temperature variations,where their generalization capacity and dynamic adaptation prove insufficient.To address these shortcomings,this work presents a PSO-TCN-Transformer network model for SOC estimation.This research uses the Particle Swarm Optimization(PSO)method to automatically configure the architectural parameters of the Temporal Convolutional Network(TCN)and Transformer components.This automated optimization enhances the model’s ability to represent the dynamically evolving nature of SOC.Additionally,this integrated framework significantly increases the model’s capacity to capture SOC dynamics in complex operational scenarios.During training and evaluation using a comprehensive dataset that covers complex operating conditions and a broad temperature spanning from−20℃ to 40℃,the proposed model achieves a root mean square error(RMSE)of less than 0.6%,a maximum absolute error(MAXE)below 4.0%,and a coefficient of determination(R^(2))of 99.99%.Additional comparative experiments on data from an energy storage company further verify the model’s superior performance,with an RMSE of 1.18%and an MAXE of 1.95%.The implications of this work extend to the development of optimization strategies and hybrid architectures,providing insights that can be adapted for state estimation across a range of complex dynamic systems.展开更多
The growing use of lithium-ion batteries in electric transportation and grid-scale storage systems has intensified the need for accurate and highly generalizable state-of-health(SOH)estimation.Conventional approaches ...The growing use of lithium-ion batteries in electric transportation and grid-scale storage systems has intensified the need for accurate and highly generalizable state-of-health(SOH)estimation.Conventional approaches often suffer from reduced accuracy under dynamically uncertain state-of-charge(SOC)operating ranges and heterogeneous aging stresses.This study presents a unified SOH estimation framework that integrates physics-informed modeling,subspace identification,and Transformer-based learning.A reduced-order model is derived from simplified electrochemical dynamics,providing an interpretable and computationally efficient representation of battery behavior.Subspace identification across a wide SOC and SOH range yields degradation-sensitive features,which the Transformer uses to capture long-range aging dynamics via multi-head self-attention.Experiments on LiFePO4 cells under joint-cell training show consistently accurate SOH estimation,with a maximum error of 1.39%,demonstrating the framework’s effectiveness in decoupling SOC and SOH effects.In cross-cell validation,where training and validation are performed on different cells,the model maintains a maximum error of 2.06%,confirming strong generalization to unseen aging trajectories.Comparative experiments on LiFePO_(4)and public LiCoO_(2)datasets confirm the framework’s cross-chemistry applicability.By extracting low-dimensional,physically interpretable features via subspace identification,the framework significantly reduces training cost while maintaining high SOH estimation accuracy,outperforming conventional data-driven models lacking physical guidance.展开更多
Presented in this study is a novel method for estimating the depth of single underwater source in shallow water,utilizing vector sensors.The approach leverages the depth distribution of the broadband Stokes parameters...Presented in this study is a novel method for estimating the depth of single underwater source in shallow water,utilizing vector sensors.The approach leverages the depth distribution of the broadband Stokes parameters to estimate source depth accurately.Unlike traditional matched field processing(MFP)and matched mode processing(MMP),the proposed approach can estimate source depth directly from the data received by sensors without requiring complete environmental information.Firstly,the broadband Stokes parameters(BSP)are established using the normal mode theory.Then the nonstationary phase approximation is used to simplify the theoretical derivation,which is necessary when dealing with broadband integrals.Additionally,range terms of the BSP are eliminated by normalization.By analyzing the depth distribution of the normalized broadband Stokes parameters(NBSP),it is found that the NBSP exhibit extreme values at the source depth,which can be used for source depth estimation.So the proposed depth estimation method is based on searching the peaks of the NBSP.Simulations show that this method is effective in relatively simple shallow water environments.Finally,the effect of source range,frequency bandwidth,sound speed profile(SSP),water depth,and signal-to-noise ratio(SNR)are studied.The findings indicate that the proposed method can accurately estimate the source depth when the SNR is greater than-5 d B and does not need to consider model mismatch issues.Additionally,variations in environmental parameters have minimal impact on estimation accuracy.Compared to MFP,the proposed method requires a higher SNR,but demonstrates superior robustness against fluctuations in environmental parameters.展开更多
Considering the impact of terminal impact time constraints and the state information of maneuvering targets on the guidance accuracy in multi-UAV cooperative guidance,this paper proposes an impact time cooperative con...Considering the impact of terminal impact time constraints and the state information of maneuvering targets on the guidance accuracy in multi-UAV cooperative guidance,this paper proposes an impact time cooperative control guidance law(ITCCG)that combines the optimal error dynamics with an improved adaptive cubature Kalman filter(IACKF)algorithm.First,a terminal impact time feedback term is introduced into proportional navigation guidance based on the relative virtual guidance model,and terminal time control is achieved through optimal error dynamics.Then,the Huber loss function is used to reduce the impact of measurement outliers,and the diagonal decomposition is applied to address the issue of non-positive definite matrices that cannot undergo Cholesky decomposition.Finally,the ITCCG and IACKF algorithms combined achieve multi-UAV time-cooperated guidance based on maneuvering target state estimation.Simulation results show that the proposed algorithm effectively reduces the target state estimation error and achieves cooperative guidance within the desired time frame.展开更多
The 6D pose estimation of objects is of great significance for the intelligent assembly and sorting of industrial parts.In the industrial robot production scenarios,the 6D pose estimation of industrial parts mainly fa...The 6D pose estimation of objects is of great significance for the intelligent assembly and sorting of industrial parts.In the industrial robot production scenarios,the 6D pose estimation of industrial parts mainly faces two challenges:one is the loss of information and interference caused by occlusion and stacking in the sorting scenario,the other is the difficulty of feature extraction due to the weak texture of industrial parts.To address the above problems,this paper proposes an attention-based pixel-level voting network for 6D pose estimation of weakly textured industrial parts,namely CB-PVNet.On the one hand,the voting scheme can predict the keypoints of affected pixels,which improves the accuracy of keypoint localization even in scenarios such as weak texture and partial occlusion.On the other hand,the attention mechanism can extract interesting features of the object while suppressing useless features of surroundings.Extensive comparative experiments were conducted on both public datasets(including LINEMOD,Occlusion LINEMOD and T-LESS datasets)and self-made datasets.The experimental results indicate that the proposed network CB-PVNet can achieve accuracy of ADD(-s)comparable to state-of-the-art using only RGB images while ensuring real-time performance.Additionally,we also conducted robot grasping experiments in the real world.The balance between accuracy and computational efficiency makes the method well-suited for applications in industrial automation.展开更多
This paper aims to develop a unified Bayesian approach for clustered data analysis when observations are subject to missingness at random.The authors consider a general framework in which the parameters of interest ar...This paper aims to develop a unified Bayesian approach for clustered data analysis when observations are subject to missingness at random.The authors consider a general framework in which the parameters of interest are defined through estimating equations,and the probability of missingness follows a general parametric form.The generalized method of moments framework is employed to derive an optimal combination of inverse-probability-weighted estimating equations for the parameters of interest and score equations for propensity score.Using this framework,the authors develop a quasi-Bayesian analysis for clustered samples with missing values.A unified model selection approach is also proposed to compare models characterized by different moment conditions.The authors systematically evaluate the large-sample properties of the proposed quasi-posterior density with both fixed and shrinking priors and establish the selection consistency of the proposed model selection criterion.The proposed results are valid under very mild conditions and offer significant advantages for parameters defined through non-smooth estimating functions.Extensive numerical studies demonstrate that the proposed method performs exceptionally well in finite samples.展开更多
Quantile regression(QR)has become an important tool to measure dependence of response variable's quantiles on a number of predictors for heterogeneous data,especially heavy-tailed data and outliers.However,it is q...Quantile regression(QR)has become an important tool to measure dependence of response variable's quantiles on a number of predictors for heterogeneous data,especially heavy-tailed data and outliers.However,it is quite challenging to make statistical inference on distributed high-dimensional QR with missing data due to the distributed nature,sparsity and missingness of data and nondifferentiable quantile loss function.To overcome the challenge,this paper develops a communicationefficient method to select variables and estimate parameters by utilizing a smooth function to approximate the non-differentiable quantile loss function and incorporating the idea of the inverse probability weighting and the penalty function.The proposed approach has three merits.First,it is both computationally and communicationally efficient because only the first-and second-order information of the approximate objective function are communicated at each iteration.Second,the proposed estimators possess the oracle property after a limited number of iterations without constraint on the number of machines.Third,the proposed method simultaneously selects variables and estimates parameters within a distributed framework,ensuring robustness to the specified response probability or propensity score function of the missing data mechanism.Simulation studies and a real example are used to illustrate the effectiveness of the proposed methodologies.展开更多
(Quasi-)closed-form results for the statistical properties of unmanned aerial vehicle(UAV)airto-ground channels are derived for the first time using a novel spatial-vector-based method from a threedimensional(3-D)arbi...(Quasi-)closed-form results for the statistical properties of unmanned aerial vehicle(UAV)airto-ground channels are derived for the first time using a novel spatial-vector-based method from a threedimensional(3-D)arbitrary-elevation one-cylinder model.The derived results include a closed-form expression for the space-time correlation function and some quasi-closed-form ones for the space-Doppler power spectrum density,the level crossing rate,and the average fading duration,which are shown to be the generalizations of those previously obtained from the two-dimensional(2-D)one-ring model and the 3-D low-elevation one-cylinder model for terrestrial mobile-to-mobile channels.The close agreements between the theoretical results and the simulations as well as the measurements validate the utility of the derived channel statistics.Based on the derived expressions,the impacts of some parameters on the channel characteristics are investigated in an effective,efficient,and explicable way,which leads to a general guideline on the manual parameter estimation from the measurement description.展开更多
Human pose estimation is crucial across diverse applications,from healthcare to human-computer interaction.Integrating inertial measurement units(IMUs)with monocular vision methods holds great potential for leveraging...Human pose estimation is crucial across diverse applications,from healthcare to human-computer interaction.Integrating inertial measurement units(IMUs)with monocular vision methods holds great potential for leveraging complementary modalities;however,existing approaches are often limited by IMU drift,noise,and underutilization of visual information.To address these limitations,we propose a novel dual-stream feature extraction framework that effectively combines temporal IMU data and single-view image features for improved pose estimation.Short-term dependencies in IMU sequences are captured with convolutional layers,while a Transformerbased architecture models long-range temporal dynamics.To mitigate IMU drift and inter-sensor inconsistencies,a complementary filtering module is introduced alongside a cross-channel interaction mechanism.Features from the IMU and image streams are then fused via a dedicated fusion module and further refined utilizing a high-precision regression head for accurate pose prediction.Experimental results on benchmark datasets demonstrate that our method significantly outperforms existing techniques in terms of estimation,accuracy,and robustness,validating the effectiveness of our dual-stream architecture.展开更多
The estimation of the Number of Sources(NoS)is a significant challenge in signal processing,particularly due to the impact of colored noise on the performance of NoS estimation.This paper proposes a Multidimensional F...The estimation of the Number of Sources(NoS)is a significant challenge in signal processing,particularly due to the impact of colored noise on the performance of NoS estimation.This paper proposes a Multidimensional Feature Network(MFNet)which is designed for NoS estimation by extracting features of the sampled received signals and Sampled Covariance Matrix(SCM).The MFNet treats the raw signal and the SCM as two different types of data,and is able to achieve NoS estimation under colored noise and imperfect array.MFNet employs the Gated Recurrent Unit(GRU)to capture sequential information from the original signal data and to construct the Pseudo Covariance Matrix(PCM).Subsequently,various dimensional features,including eigenvalues and the Gerschgorin disk radius,are extracted from both the PCM and SCM,which are then jointly input into the subsequent network.An overall accuracy of 82%can be achieved after network training.The ablation experimental results demonstrate the effectiveness of multiple inputs.And simulation results demonstrate that the proposed MFNet achieves higher estimation accuracy compared to existing algorithms and exhibits greater robustness against colored noise.展开更多
Accurately estimating depth from underwater monocular images is essential for the target tracking task of unmanned underwater vehicles.This work proposes a method based on the Lpg-Lap Unet architecture.First,the Unet ...Accurately estimating depth from underwater monocular images is essential for the target tracking task of unmanned underwater vehicles.This work proposes a method based on the Lpg-Lap Unet architecture.First,the Unet architecture integrates Laplacian pyramid depth residuals and Sobel operators to improve the boundary details in depth images,which may suffer from the feature loss caused by upsampling and the blurriness of underwater images.Multiscale local planar guidance layers then fully exploit the intermediate depth features,and a comprehensive loss function ensures robustness and accuracy.Experimental results on benchmarks demonstrate the effectiveness of Lpg-Lap Unet and its superior performance over state-of-the-art models.An underwater target tracking system is then designed to further validate its real-time capabilities in the AirSim simulation platform.展开更多
基金supported by the National Natural Science Foundation of China(Nos.12205044 and 12265003)2024 Jiangxi Province Civil-Military Integration Research Institute‘BeiDou+’Project Subtopic(No.2024JXRH0Y06).
文摘Unmanned aerial vehicle(UAV)-borne gamma-ray spectrum survey plays a crucial role in geological mapping,radioactive mineral exploration,and environmental monitoring.However,raw data are often compromised by flight and instrument background noise,as well as detector resolution limitations,which affect the accuracy of geological interpretations.This study aims to explore the application of the Real-ESRGAN algorithm in the super-resolution reconstruction of UAV-borne gamma-ray spectrum images to enhance spatial resolution and the quality of geological feature visualization.We conducted super-resolution reconstruction experiments with 2×,4×and 6×magnification using the Real-ESRGAN algorithm,comparing the results with three other mainstream algorithms(SRCNN,SRGAN,FSRCNN)to verify the superiority in image quality.The experimental results indicate that Real-ESRGAN achieved a structural similarity index(SSIM)value of 0.950 at 2×magnification,significantly higher than the other algorithms,demonstrating its advantage in detail preservation.Furthermore,Real-ESRGAN effectively reduced ringing and overshoot artifacts,enhancing the clarity of geological structures and mineral deposit sites,thus providing high-quality visual information for geological exploration.
基金supported in part by the National Natural Science Foundation of China under Grant No.12471281in part by the National Statistical Science Research Project under Grant No.2022LD03。
文摘In this article,the authors explore the online updating estimation for general estimating equations(EEs)in heterogeneous streaming data settings.The framework is based on more conservative model assumptions,leading to more robust estimations and preventing misspecification.The authors establish the standard renewable estimation under blockwise heterogeneity assumption,which can correctly specify model in some sense.To mitigate heterogeneity and enhance estimation accuracy,the authors propose two novel online detection and fusion strategies,with corresponding algorithms provided.Theoretical properties of the proposed methods are demonstrated in the context of small block sizes.Extensive numerical experiments validate the theoretical findings.Real data analysis of the Ford Gobike docked bike-sharing dataset verifies the feasibility and robustness of the proposed methods.
基金funded by Project of Sichuan Provincial Department of Science and Technology under 2025JDKP0150the Fundamental Research Funds for the Central Universities under 25CAFUC03093.
文摘Single Image Super-Resolution(SISR)seeks to reconstruct high-resolution(HR)images from lowresolution(LR)inputs,thereby enhancing visual fidelity and the perception of fine details.While Transformer-based models—such as SwinIR,Restormer,and HAT—have recently achieved impressive results in super-resolution tasks by capturing global contextual information,these methods often suffer from substantial computational and memory overhead,which limits their deployment on resource-constrained edge devices.To address these challenges,we propose a novel lightweight super-resolution network,termed Binary Attention-Guided Information Distillation(BAID),which integrates frequency-aware modeling with a binary attention mechanism to significantly reduce computational complexity and parameter count whilemaintaining strong reconstruction performance.The network combines a high–low frequency decoupling strategy with a local–global attention sharing mechanism,enabling efficient compression of redundant computations through binary attention guidance.At the core of the architecture lies the Attention-Guided Distillation Block(AGDB),which retains the strengths of the information distillation framework while introducing a sparse binary attention module to enhance both inference efficiency and feature representation.Extensive×4 superresolution experiments on four standard benchmarks—Set5,Set14,BSD100,and Urban100—demonstrate that BAID achieves Peak Signal-to-Noise Ratio(PSNR)values of 32.13,28.51,27.47,and 26.15,respectively,with only 1.22 million parameters and 26.1 G Floating-Point Operations(FLOPs),outperforming other state-of-the-art lightweight methods such as Information Multi-Distillation Network(IMDN)and Residual Feature Distillation Network(RFDN).These results highlight the proposed model’s ability to deliver high-quality image reconstruction while offering strong deployment efficiency,making it well-suited for image restoration tasks in resource-limited environments.
文摘This study develops a surrogate super-resolution(SR)framework that accelerates finite element method(FEM)-based computational fluid dynamics(CFD)using deep learning.High-resolution(HR)FEM-based CFDremains computationally prohibitive for time-sensitive applications,including patient-specific aneurysm hemodynamics where rapid turnaround is valuable.The proposed pipeline learns to reconstruct HR velocity-magnitude fields fromlow-resolution(LR)FEM solutions generated under the same governing equations and boundary conditions.It consistsof three modules:(i)offline pre-training of a residual network on representative vascular geometries;(ii)lightweightfine-tuning to adapt the pretrained model to geometric variability,including patient-specific aneurysm morphologies;and(iii)an unstructured-to-structured sampling strategy with region-of-interest upsampling that concentrates resolution in flow-critical zones(e.g.,the aneurysm sac)rather than the full domain.This targeted reconstruction substantiallyreduces inference and post-processing cost while preserving key HR flow features.Experiments on cerebral aneurysmmodels show that HR velocity-magnitude fields can be recovered with accuracy comparable to direct HR simulationsat less than 1%of the direct HR simulation cost per analysis(LR simulation and SR inference),while adaptation to newgeometries requires only lightweight fine-tuning with limited target-specific HR data.While clinical endpoints andadditional variables(e.g.,pressure or wall-based metrics)are left for future work,the results indicate that the proposedsurrogate SR approach can streamline FEM-based CFD workflows toward near real-time hemodynamic analysis acrossmorphologically similar vascular models.
基金funded by the Henan Province Key R&D Program Project,“Research and Application Demonstration of Class Ⅱ Superlattice Medium Wave High Temperature Infrared Detector Technology”,grant number 231111210400.
文摘High-resolution remote sensing imagery is essential for critical applications such as precision agriculture,urban management planning,and military reconnaissance.Although significant progress has been made in singleimage super-resolution(SISR)using generative adversarial networks(GANs),existing approaches still face challenges in recovering high-frequency details,effectively utilizing features,maintaining structural integrity,and ensuring training stability—particularly when dealing with the complex textures characteristic of remote sensing imagery.To address these limitations,this paper proposes the Improved ResidualModule and AttentionMechanism Network(IRMANet),a novel architecture specifically designed for remote sensing image reconstruction.IRMANet builds upon the Super-Resolution Generative Adversarial Network(SRGAN)framework and introduces several key innovations.First,the Enhanced Residual Unit(ERU)enhances feature reuse and stabilizes training through deep residual connections.Second,the Self-Attention Residual Block(SARB)incorporates a self-attentionmechanism into the Improved Residual Module(IRM)to effectivelymodel long-range dependencies and automatically emphasize salient features.Additionally,the IRM adopts amulti-scale feature fusion strategy to facilitate synergistic interactions between local detail and global semantic information.The effectiveness of each component is validated through ablation studies,while comprehensive comparative experiments on standard remote sensing datasets demonstrate that IRMANet significantly outperforms both the baseline and state-of-the-art methods in terms of perceptual quality and quantitative metrics.Specifically,compared to the baseline model,at a magnification factor of 2,IRMANet achieves an improvement of 0.24 dB in peak signal-to-noise ratio(PSNR)and 0.54 in structural similarity index(SSIM);at a magnification factor of 4,it achieves gains of 0.22 dB in PSNR and 0.51 in SSIM.These results confirm that the proposedmethod effectively enhances detail representation and structural reconstruction accuracy in complex remote sensing scenarios,offering robust technical support for high-precision detection and identification of both military and civilian aircraft.
基金This study was supported by:Inner Mongolia Academy of Forestry Sciences Open Research Project(Grant No.KF2024MS03)The Project to Improve the Scientific Research Capacity of the Inner Mongolia Academy of Forestry Sciences(Grant No.2024NLTS04)The Innovation and Entrepreneurship Training Program for Undergraduates of Beijing Forestry University(Grant No.X202410022268).
文摘Remote sensing image super-resolution technology is pivotal for enhancing image quality in critical applications including environmental monitoring,urban planning,and disaster assessment.However,traditional methods exhibit deficiencies in detail recovery and noise suppression,particularly when processing complex landscapes(e.g.,forests,farmlands),leading to artifacts and spectral distortions that limit practical utility.To address this,we propose an enhanced Super-Resolution Generative Adversarial Network(SRGAN)framework featuring three key innovations:(1)Replacement of L1/L2 loss with a robust Charbonnier loss to suppress noise while preserving edge details via adaptive gradient balancing;(2)A multi-loss joint optimization strategy dynamically weighting Charbonnier loss(β=0.5),Visual Geometry Group(VGG)perceptual loss(α=1),and adversarial loss(γ=0.1)to synergize pixel-level accuracy and perceptual quality;(3)A multi-scale residual network(MSRN)capturing cross-scale texture features(e.g.,forest canopies,mountain contours).Validated on Sentinel-2(10 m)and SPOT-6/7(2.5 m)datasets covering 904 km2 in Motuo County,Xizang,our method outperforms the SRGAN baseline(SR4RS)with Peak Signal-to-Noise Ratio(PSNR)gains of 0.29 dB and Structural Similarity Index(SSIM)improvements of 3.08%on forest imagery.Visual comparisons confirm enhanced texture continuity despite marginal Learned Perceptual Image Patch Similarity(LPIPS)increases.The method significantly improves noise robustness and edge retention in complex geomorphology,demonstrating 18%faster response in forest fire early warning and providing high-resolution support for agricultural/urban monitoring.Future work will integrate spectral constraints and lightweight architectures.
基金supported by National Natural Science Foundation(NNSF)of China under Grant 62001351the Foundation of National Key Laboratory of Electromagnetic Environment(6142403220202)the Stability Support Fund for Basic Military Industrial Research Institutes(A240104130).
文摘The orthogonal time frequency space(OTFS)modulation is a novel modulation scheme that can effectively cope with the high Doppler expansion caused by high mobility.Since it modulates data on delay-Doppler(DD)domain and makes full use of the sparse characteristics of DD domain,it has been widely studied to design efficient channel estimation and signal detection schemes.In this paper,we design a novel superimposed pilot pattern with transition band,which replaces the traditional embedded pilot(EP)guard zero-symbols,and perform a two-stage channel estimation.In the first stage,we fully utilize the dispersion characteristics of OTFS signal in DD domain,and use threshold decision to make coarse channel estimation.In the second stage,we use the results of the coarse estimation for iterative signal detection and accurate channel estimation.During the second stage,we make full use of the sparsity of the channel in DD domain,remodel the received signal into the form of sparse channel vector multiplied by channel coefficient matrix,and introduce Doppler index segmentation factor(DISF)to subdivide the Doppler index to solve the problem of fractional Doppler.Simulations reveal that,the scheme proposed in this paper has higher spectral efficiency compared with traditional EP scheme and lower peak-to-average power ratio(PAPR)compared with traditional superimposed pilot scheme.
基金supported by the Talent Startup Program of Huangshan University under Grant No.2025xkjq003Additional partial funding was gratefully received from the Scientific Research Project of the Anhui Provincial Department of Education under Grant No.2025AHGXZK40303.
文摘Real-time multi-person pose estimation(MPE)built upon neural network architectures aims to simultaneously detect multiple human instances and regress joint coordinates in dynamic scenes.However,due to factors such as high model complexity and limited expression of keypoint information,both the efficiency and accuracy of real-time MPE remain to be improved.To mitigate the adverse impacts caused by the aforementioned issues,this work develops FSEM-Pose,a real-time MPE model rooted in the YOLOv10 framework.In detail,first,FSEM-Pose upgrades the backbone module of the baseline network by introducing the Feature Shuffling-Convolution(FS-Conv),which effectively reduces the backbone size while maximizing the retention of spatial information from the input image.Second,FSEM-Pose incorporates a Feature Saliency Enhancement Module(FSEM)to strengthen the feature encoding of human keypoints,thereby improving the accuracy of pose estimation.Finally,FSEM-Pose further enhances inference efficiency via a lightweight optimization of the head using shared convolutional layers.Our method achieves competitive results across multiple accuracy and efficiency metrics on the MS COCO 2017 and CrowdPose datasets.While being lightweight in design,it improves average precision(AP)by 2.1%and 2.5%,respectively.
基金National Key Laboratory of Unmanned Aerial Vehicle Technology(No.202408)Key Laboratory of Smart Earth(No.KF2023ZD01-05)。
文摘In GNSS-denied environments,signals of opportunity(SOP)offer an efficient and passive solution for navigation and positioning by utilizing ambient signals.Nevertheless,conventional SOP techniques face significant challenges in real-time processing,especially under sub-Nyquist sampling conditions,due to high data acquisition rates and offgrid errors.To address this,this paper proposes the signal reconstruction and kernel sparse encoding(SRKSE)model,a novel general framework for high-precision parameter estimation.By combining compressed sensing with a deep unfolding network,the SRKSE model not only achieves robust signal reconstruction but also effectively reduces quantization errors.Key innovations of SRKSE include dual crossattention mechanisms for enhanced feature extraction,sinc sparse kernel encoding to minimize quantization errors,and a custom loss function for balanced optimization.With these advancements,SRKSE achieves up to a 650-fold improvement in time of arrival(TOA)estimation accuracy while operating at just 1%of the Nyquist sampling rate.The SRKSE surpasses both conventional and deep learning-based techniques in accuracy and efficiency,especially when operating under sub-Nyquist sampling conditions.Simulations and real-world experiments confirm the reliability and potential of SRKSE for real-time applications in IoT and wireless communication.
基金funded in part by the Doctoral Scientific Research Foundation of Beijing University of Civil Engineering and Architecture under Grant ZF15054in part by the Pyramid Talent Training Project of Beijing University of Civil Engineering and Architecture under Grant GJZJ20220802in part by the BUCEA Post Graduate Innovation Project under Grant PG2024095.
文摘Lithium-ion(Li-ion)batteries stand as the dominant energy storage solution,despite their widespread adoption,precisely determining the state of charge(SOC)continues to pose significant difficulties,with direct implications for battery safety,operational reliability,and overall performance.Current SOC estimation techniques often demonstrate limited accuracy,particularly when confronted with complex operational scenarios and wide temperature variations,where their generalization capacity and dynamic adaptation prove insufficient.To address these shortcomings,this work presents a PSO-TCN-Transformer network model for SOC estimation.This research uses the Particle Swarm Optimization(PSO)method to automatically configure the architectural parameters of the Temporal Convolutional Network(TCN)and Transformer components.This automated optimization enhances the model’s ability to represent the dynamically evolving nature of SOC.Additionally,this integrated framework significantly increases the model’s capacity to capture SOC dynamics in complex operational scenarios.During training and evaluation using a comprehensive dataset that covers complex operating conditions and a broad temperature spanning from−20℃ to 40℃,the proposed model achieves a root mean square error(RMSE)of less than 0.6%,a maximum absolute error(MAXE)below 4.0%,and a coefficient of determination(R^(2))of 99.99%.Additional comparative experiments on data from an energy storage company further verify the model’s superior performance,with an RMSE of 1.18%and an MAXE of 1.95%.The implications of this work extend to the development of optimization strategies and hybrid architectures,providing insights that can be adapted for state estimation across a range of complex dynamic systems.
基金supported by the National Natural Science Foundation of China(No.52207228)the Beijing Natural Science Foundation,China(No.3224070)the National Natural Science Foundation of China(No.52077208).
文摘The growing use of lithium-ion batteries in electric transportation and grid-scale storage systems has intensified the need for accurate and highly generalizable state-of-health(SOH)estimation.Conventional approaches often suffer from reduced accuracy under dynamically uncertain state-of-charge(SOC)operating ranges and heterogeneous aging stresses.This study presents a unified SOH estimation framework that integrates physics-informed modeling,subspace identification,and Transformer-based learning.A reduced-order model is derived from simplified electrochemical dynamics,providing an interpretable and computationally efficient representation of battery behavior.Subspace identification across a wide SOC and SOH range yields degradation-sensitive features,which the Transformer uses to capture long-range aging dynamics via multi-head self-attention.Experiments on LiFePO4 cells under joint-cell training show consistently accurate SOH estimation,with a maximum error of 1.39%,demonstrating the framework’s effectiveness in decoupling SOC and SOH effects.In cross-cell validation,where training and validation are performed on different cells,the model maintains a maximum error of 2.06%,confirming strong generalization to unseen aging trajectories.Comparative experiments on LiFePO_(4)and public LiCoO_(2)datasets confirm the framework’s cross-chemistry applicability.By extracting low-dimensional,physically interpretable features via subspace identification,the framework significantly reduces training cost while maintaining high SOH estimation accuracy,outperforming conventional data-driven models lacking physical guidance.
基金Project supported by the National Natural Science Foundation of China(Grant Nos.12274348 and 12004335)the National Key Research and Development Program of China(Grant No.2024YFC2813800)。
文摘Presented in this study is a novel method for estimating the depth of single underwater source in shallow water,utilizing vector sensors.The approach leverages the depth distribution of the broadband Stokes parameters to estimate source depth accurately.Unlike traditional matched field processing(MFP)and matched mode processing(MMP),the proposed approach can estimate source depth directly from the data received by sensors without requiring complete environmental information.Firstly,the broadband Stokes parameters(BSP)are established using the normal mode theory.Then the nonstationary phase approximation is used to simplify the theoretical derivation,which is necessary when dealing with broadband integrals.Additionally,range terms of the BSP are eliminated by normalization.By analyzing the depth distribution of the normalized broadband Stokes parameters(NBSP),it is found that the NBSP exhibit extreme values at the source depth,which can be used for source depth estimation.So the proposed depth estimation method is based on searching the peaks of the NBSP.Simulations show that this method is effective in relatively simple shallow water environments.Finally,the effect of source range,frequency bandwidth,sound speed profile(SSP),water depth,and signal-to-noise ratio(SNR)are studied.The findings indicate that the proposed method can accurately estimate the source depth when the SNR is greater than-5 d B and does not need to consider model mismatch issues.Additionally,variations in environmental parameters have minimal impact on estimation accuracy.Compared to MFP,the proposed method requires a higher SNR,but demonstrates superior robustness against fluctuations in environmental parameters.
基金supported by the Fundamental Research Funds for the Central Universities of China(FRF-TP-24-058A)with additional support from the National Key Laboratory of Helicopter Aeromechanics(2024-ZSJ-LB-02-02).
文摘Considering the impact of terminal impact time constraints and the state information of maneuvering targets on the guidance accuracy in multi-UAV cooperative guidance,this paper proposes an impact time cooperative control guidance law(ITCCG)that combines the optimal error dynamics with an improved adaptive cubature Kalman filter(IACKF)algorithm.First,a terminal impact time feedback term is introduced into proportional navigation guidance based on the relative virtual guidance model,and terminal time control is achieved through optimal error dynamics.Then,the Huber loss function is used to reduce the impact of measurement outliers,and the diagonal decomposition is applied to address the issue of non-positive definite matrices that cannot undergo Cholesky decomposition.Finally,the ITCCG and IACKF algorithms combined achieve multi-UAV time-cooperated guidance based on maneuvering target state estimation.Simulation results show that the proposed algorithm effectively reduces the target state estimation error and achieves cooperative guidance within the desired time frame.
基金supported by the Knowledge Innovation Program of Wuhan-Shuguang Project(Grant No.2023010201020443)the School-Level Scientific Research Project Funding Program of Jianghan University(Grant No.2022XKZX33)the Natural Science Foundation of Hubei Province(Grant No.2024AFB466).
文摘The 6D pose estimation of objects is of great significance for the intelligent assembly and sorting of industrial parts.In the industrial robot production scenarios,the 6D pose estimation of industrial parts mainly faces two challenges:one is the loss of information and interference caused by occlusion and stacking in the sorting scenario,the other is the difficulty of feature extraction due to the weak texture of industrial parts.To address the above problems,this paper proposes an attention-based pixel-level voting network for 6D pose estimation of weakly textured industrial parts,namely CB-PVNet.On the one hand,the voting scheme can predict the keypoints of affected pixels,which improves the accuracy of keypoint localization even in scenarios such as weak texture and partial occlusion.On the other hand,the attention mechanism can extract interesting features of the object while suppressing useless features of surroundings.Extensive comparative experiments were conducted on both public datasets(including LINEMOD,Occlusion LINEMOD and T-LESS datasets)and self-made datasets.The experimental results indicate that the proposed network CB-PVNet can achieve accuracy of ADD(-s)comparable to state-of-the-art using only RGB images while ensuring real-time performance.Additionally,we also conducted robot grasping experiments in the real world.The balance between accuracy and computational efficiency makes the method well-suited for applications in industrial automation.
基金supported by the National Key R&D Program of China under Grant No.2022YFA1003701the National Natural Science Foundation of China under Grant Nos.12331009 and 12071416the Yunnan Fundamental Research Projects under Grant No.202201AV070006。
文摘This paper aims to develop a unified Bayesian approach for clustered data analysis when observations are subject to missingness at random.The authors consider a general framework in which the parameters of interest are defined through estimating equations,and the probability of missingness follows a general parametric form.The generalized method of moments framework is employed to derive an optimal combination of inverse-probability-weighted estimating equations for the parameters of interest and score equations for propensity score.Using this framework,the authors develop a quasi-Bayesian analysis for clustered samples with missing values.A unified model selection approach is also proposed to compare models characterized by different moment conditions.The authors systematically evaluate the large-sample properties of the proposed quasi-posterior density with both fixed and shrinking priors and establish the selection consistency of the proposed model selection criterion.The proposed results are valid under very mild conditions and offer significant advantages for parameters defined through non-smooth estimating functions.Extensive numerical studies demonstrate that the proposed method performs exceptionally well in finite samples.
基金supported by the National Key R&D Program of China under Grant No.2022YFA1003701the Open Research Fund of Yunnan Key Laboratory of Statistical Modeling and Data Analysis,Yunnan University under Grant No.SMDAYB2023004。
文摘Quantile regression(QR)has become an important tool to measure dependence of response variable's quantiles on a number of predictors for heterogeneous data,especially heavy-tailed data and outliers.However,it is quite challenging to make statistical inference on distributed high-dimensional QR with missing data due to the distributed nature,sparsity and missingness of data and nondifferentiable quantile loss function.To overcome the challenge,this paper develops a communicationefficient method to select variables and estimate parameters by utilizing a smooth function to approximate the non-differentiable quantile loss function and incorporating the idea of the inverse probability weighting and the penalty function.The proposed approach has three merits.First,it is both computationally and communicationally efficient because only the first-and second-order information of the approximate objective function are communicated at each iteration.Second,the proposed estimators possess the oracle property after a limited number of iterations without constraint on the number of machines.Third,the proposed method simultaneously selects variables and estimates parameters within a distributed framework,ensuring robustness to the specified response probability or propensity score function of the missing data mechanism.Simulation studies and a real example are used to illustrate the effectiveness of the proposed methodologies.
基金supported in part by the National Key Research and Development Program of China(2021YFB2900501)in part by the Shaanxi Science and Technology Innovation Team(2023-CX-TD-03)+3 种基金in part by the Science and Technology Program of Shaanxi Province(2021GXLH-Z-038)in part by the Natural Science Foundation of Hunan Province(2023JJ40607 and 2023JJ50045)in part by the Scientific Research Foundation of Hunan Provincial Education Department(23B0713 and 24B0603)in part by the National Natural Science Foundation of China(62401371,62101275,and 62372070).
文摘(Quasi-)closed-form results for the statistical properties of unmanned aerial vehicle(UAV)airto-ground channels are derived for the first time using a novel spatial-vector-based method from a threedimensional(3-D)arbitrary-elevation one-cylinder model.The derived results include a closed-form expression for the space-time correlation function and some quasi-closed-form ones for the space-Doppler power spectrum density,the level crossing rate,and the average fading duration,which are shown to be the generalizations of those previously obtained from the two-dimensional(2-D)one-ring model and the 3-D low-elevation one-cylinder model for terrestrial mobile-to-mobile channels.The close agreements between the theoretical results and the simulations as well as the measurements validate the utility of the derived channel statistics.Based on the derived expressions,the impacts of some parameters on the channel characteristics are investigated in an effective,efficient,and explicable way,which leads to a general guideline on the manual parameter estimation from the measurement description.
基金support provided by the European University of Atlantic.
文摘Human pose estimation is crucial across diverse applications,from healthcare to human-computer interaction.Integrating inertial measurement units(IMUs)with monocular vision methods holds great potential for leveraging complementary modalities;however,existing approaches are often limited by IMU drift,noise,and underutilization of visual information.To address these limitations,we propose a novel dual-stream feature extraction framework that effectively combines temporal IMU data and single-view image features for improved pose estimation.Short-term dependencies in IMU sequences are captured with convolutional layers,while a Transformerbased architecture models long-range temporal dynamics.To mitigate IMU drift and inter-sensor inconsistencies,a complementary filtering module is introduced alongside a cross-channel interaction mechanism.Features from the IMU and image streams are then fused via a dedicated fusion module and further refined utilizing a high-precision regression head for accurate pose prediction.Experimental results on benchmark datasets demonstrate that our method significantly outperforms existing techniques in terms of estimation,accuracy,and robustness,validating the effectiveness of our dual-stream architecture.
基金supported by the National Natural Science Foundation of China(Nos.62171469,62071029)。
文摘The estimation of the Number of Sources(NoS)is a significant challenge in signal processing,particularly due to the impact of colored noise on the performance of NoS estimation.This paper proposes a Multidimensional Feature Network(MFNet)which is designed for NoS estimation by extracting features of the sampled received signals and Sampled Covariance Matrix(SCM).The MFNet treats the raw signal and the SCM as two different types of data,and is able to achieve NoS estimation under colored noise and imperfect array.MFNet employs the Gated Recurrent Unit(GRU)to capture sequential information from the original signal data and to construct the Pseudo Covariance Matrix(PCM).Subsequently,various dimensional features,including eigenvalues and the Gerschgorin disk radius,are extracted from both the PCM and SCM,which are then jointly input into the subsequent network.An overall accuracy of 82%can be achieved after network training.The ablation experimental results demonstrate the effectiveness of multiple inputs.And simulation results demonstrate that the proposed MFNet achieves higher estimation accuracy compared to existing algorithms and exhibits greater robustness against colored noise.
基金partially supported by the Natural Science Foundation of Shandong Province,China(No.ZR2023ME009)the National Natural Science Foundation of China(No.51909252)。
文摘Accurately estimating depth from underwater monocular images is essential for the target tracking task of unmanned underwater vehicles.This work proposes a method based on the Lpg-Lap Unet architecture.First,the Unet architecture integrates Laplacian pyramid depth residuals and Sobel operators to improve the boundary details in depth images,which may suffer from the feature loss caused by upsampling and the blurriness of underwater images.Multiscale local planar guidance layers then fully exploit the intermediate depth features,and a comprehensive loss function ensures robustness and accuracy.Experimental results on benchmarks demonstrate the effectiveness of Lpg-Lap Unet and its superior performance over state-of-the-art models.An underwater target tracking system is then designed to further validate its real-time capabilities in the AirSim simulation platform.