Orthogonal time frequency space (OTFS) modulation is a novel modulation scheme that can effectively cope with the high Doppler spread caused by high mobility. Since it modulates data in the delay-Doppler (DD) domain and makes full use of the sparsity of the DD domain, it has been widely studied to design efficient channel estimation and signal detection schemes. In this paper, we design a novel superimposed pilot pattern with a transition band, which replaces the guard zero-symbols of the traditional embedded pilot (EP) scheme, and perform a two-stage channel estimation. In the first stage, we fully utilize the dispersion characteristics of the OTFS signal in the DD domain and use a threshold decision to make a coarse channel estimate. In the second stage, we use the results of the coarse estimation for iterative signal detection and accurate channel estimation. During the second stage, we make full use of the sparsity of the channel in the DD domain, remodel the received signal as a sparse channel vector multiplied by a channel coefficient matrix, and introduce a Doppler index segmentation factor (DISF) to subdivide the Doppler index and solve the problem of fractional Doppler. Simulations reveal that the proposed scheme has higher spectral efficiency than the traditional EP scheme and a lower peak-to-average power ratio (PAPR) than the traditional superimposed pilot scheme.
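The threshold decision behind a coarse DD-domain channel estimate can be sketched as follows. This is a minimal illustration of the conventional embedded-pilot idea (a single pilot surrounded by guard cells), not the superimposed-pilot scheme of the paper. The grid size, tap positions, pilot amplitude, noise level, the 4-sigma threshold, and the function name `coarse_dd_estimate` are all assumptions made for the example.

```python
import numpy as np

def coarse_dd_estimate(y_dd, pilot_amp, noise_std, k_sigma=4.0):
    """Keep only delay-Doppler cells whose magnitude exceeds a noise-based threshold.

    y_dd is the received guard region around a single pilot, modeled here as
    y_dd[l, k] = h[l, k] * pilot_amp + noise (an assumed, simplified model).
    """
    threshold = k_sigma * noise_std            # assumed rule: k_sigma times the noise std
    mask = np.abs(y_dd) > threshold            # cells declared to contain a channel tap
    h_hat = np.where(mask, y_dd / pilot_amp, 0)  # scale detected cells back to tap gains
    return h_hat, mask

# Toy sparse channel with three taps on an 8 x 16 delay-Doppler grid (made-up values).
rng = np.random.default_rng(0)
n_delay, n_doppler = 8, 16
h = np.zeros((n_delay, n_doppler), dtype=complex)
h[0, 0] = 1.0
h[2, 3] = 0.6j
h[5, 14] = -0.4

pilot_amp = 10.0
noise_std = 0.1
noise = (noise_std / np.sqrt(2)) * (
    rng.standard_normal(h.shape) + 1j * rng.standard_normal(h.shape)
)
y = pilot_amp * h + noise

h_hat, mask = coarse_dd_estimate(y, pilot_amp, noise_std)
detected = sorted((int(l), int(k)) for l, k in zip(*np.nonzero(mask)))
print(detected)
```

Because the channel is sparse in the DD domain, only a handful of cells survive the threshold, which is what makes such a coarse estimate cheap enough to seed a later refinement stage.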
(Quasi-)closed-form results for the statistical properties of unmanned aerial vehicle (UAV) air-to-ground channels are derived for the first time using a novel spatial-vector-based method from a three-dimensional (3-D) arbitrary-elevation one-cylinder model. The derived results include a closed-form expression for the space-time correlation function and quasi-closed-form expressions for the space-Doppler power spectral density, the level crossing rate, and the average fading duration, which are shown to generalize those previously obtained from the two-dimensional (2-D) one-ring model and the 3-D low-elevation one-cylinder model for terrestrial mobile-to-mobile channels. The close agreement between the theoretical results, the simulations, and the measurements validates the utility of the derived channel statistics. Based on the derived expressions, the impacts of some parameters on the channel characteristics are investigated in an effective, efficient, and explicable way, which leads to a general guideline for manual parameter estimation from the measurement description.
Accurate time delay estimation of target echo signals is a critical component of underwater target localization. In active sonar systems, echo signal processing is vulnerable to the effects of reverberation and noise in the maritime environment. This paper proposes a novel method for estimating target time delay using multi-bright spot echoes, assuming the target's size and depth are known. To enhance the extraction of geometric features from the target echoes and mitigate the impact of reverberation and noise, the proposed approach employs the fractional order Fourier transform-frequency sliced wavelet transform to extract multi-bright spot echoes. Using the highlight model theory and the target size information, an observation matrix is constructed to represent multi-angle incident signals and obtain the theoretical scattered echo signals from different angles. To accurately estimate the target's time delay, waveform similarity coefficients and mean square error values between the theoretical return signals and the received signals are computed across various incident angles and time delays. Simulation results show that, compared to the conventional matched filter, the proposed algorithm reduces the relative error by 65.9% to 91.5% at a signal-to-noise ratio of −25 dB, and by 66.7% to 88.9% at a signal-to-reverberation ratio of −10 dB. This algorithm provides a new approach for the precise localization of submerged targets in shallow water environments.
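For reference, the conventional matched filter that serves as the baseline in this abstract can be sketched as a cross-correlation peak search. This shows only the baseline idea, not the proposed multi-bright spot method. The sampling rate, chirp parameters, echo amplitude, noise level, and the function name `matched_filter_delay` are assumptions chosen for illustration.

```python
import numpy as np

def matched_filter_delay(received, template, fs):
    """Estimate the echo delay (seconds) as the lag of the cross-correlation peak."""
    corr = np.correlate(received, template, mode="full")
    # In "full" mode, output index i corresponds to lag i - (len(template) - 1).
    lag = int(np.argmax(np.abs(corr))) - (len(template) - 1)
    return lag / fs

fs = 10_000.0                        # assumed sampling rate in Hz
t = np.arange(100) / fs              # 10 ms transmit pulse
# Linear FM chirp sweeping from 500 Hz to 2500 Hz (assumed waveform).
template = np.cos(2 * np.pi * (500 * t + 0.5 * 2e5 * t**2))

rng = np.random.default_rng(1)
true_delay_samples = 300
received = 0.3 * rng.standard_normal(1000)                            # additive noise
received[true_delay_samples:true_delay_samples + 100] += 0.8 * template  # single echo

delay_est = matched_filter_delay(received, template, fs)
print(delay_est)
```

A single correlation peak works well for an isolated point echo; reverberation and multiple highlights smear that peak, which is the weakness the abstract's method targets.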
With the rapid progress of artificial intelligence (AI) technology and the mobile internet, 3D hand pose estimation has become critical to various intelligent application areas, e.g., human-computer interaction. To avoid the low accuracy of single-modal estimation and the high complexity of traditional multi-modal 3D estimation, this paper proposes a novel multi-modal multi-view (MMV) 3D hand pose estimation system, which introduces a joint registration-before-translation (RT) and translation-before-registration (TR) conditional generative adversarial network (cGAN) to train a multi-modal registration network, and then employs multi-modal feature fusion to achieve high-quality estimation, with low hardware and software costs in both data acquisition and processing. Experimental results demonstrate that the MMV system is effective and feasible in various scenarios. The MMV system is promising for use in a broad range of intelligent application areas.
The estimation of fish mass is one of the most basic and important tasks in aquaculture. Acquiring the mass of fish at different growth stages is of great significance for feeding, monitoring the health status of fish, and making breeding plans to increase production. Existing estimation methods for fish mass are often confined to the 2D plane, where it is difficult to obtain 3D information on the fish, which leads to errors. To solve this problem, a multi-view method was proposed to obtain 3D information on the fish and predict the fish mass through a two-stage neural network with an edge-sensitive module. In the first stage, side- and downward-view images of the fish and some 3D information, such as side area, top area, length, deflection angle, and pitch angle, were captured through two vertically placed cameras to estimate the size of the fish. The area of the fish in the different views was then estimated accurately through a pre-trained image segmentation neural network with an edge-sensitive module. In the second stage, a fully connected neural network was constructed to regress the fish mass based on the 3D information obtained in the previous stage. The experimental results indicate that the proposed method can accurately estimate the fish mass and outperforms existing estimation methods.
Two-dimensional endoscopic images are susceptible to interferences such as specular reflections and monotonous texture illumination, hindering accurate three-dimensional lesion reconstruction by surgical robots. This study proposes a novel end-to-end disparity estimation model to address these challenges. Our approach combines a Pseudo-Siamese neural network architecture with pyramid dilated convolutions, integrating multi-scale image information to enhance robustness against lighting interferences. This study introduces a Pseudo-Siamese structure-based disparity regression model that simplifies left-right image comparison, improving accuracy and efficiency. The model was evaluated using a dataset of stereo endoscopic videos captured by the Da Vinci surgical robot, comprising simulated silicone heart sequences and real heart video data. Experimental results demonstrate significant improvement in the network's resistance to lighting interference without substantially increasing parameters. Moreover, the model exhibited faster convergence during training, contributing to overall performance enhancement. This study advances endoscopic image processing accuracy and has potential implications for surgical robot applications in complex environments.
Previous multi-view 3D human pose estimation methods neither correlate different human joints in each view nor explicitly model learnable correlations between the same joints in different views, meaning that skeleton structure information is not utilized and multi-view pose information is not completely fused. Moreover, existing graph convolutional operations do not consider the specificity of different joints and different views of pose information when processing skeleton graphs, so the correlation weights between nodes in the graph and their neighborhood nodes are shared. Existing Graph Convolutional Networks (GCNs) cannot efficiently extract global and deep-level skeleton structure information and view correlations. To solve these problems, pre-estimated multi-view 2D poses are organized as a multi-view skeleton graph that explicitly fuses skeleton priors and view correlations to handle the occlusion problem, with the skeleton-edge and symmetry-edge representing the structure correlations between adjacent joints in each view of the skeleton graph and the view-edge representing the view correlations between the same joints in different views. To make the graph convolution operation mine detailed and sufficient skeleton structure information and view correlations, different correlation weights are assigned to different categories of neighborhood nodes and further assigned to each node in the graph. Based on this graph convolution operation, a Residual Graph Convolution (RGC) module is designed as the basic module and combined with a simplified Hourglass architecture to construct the Hourglass-GCN as our 3D pose estimation network. Hourglass-GCN, with a symmetrical and concise architecture, processes three scales of multi-view skeleton graphs to efficiently extract local-to-global scale and shallow-to-deep level skeleton features. Experimental results on the common large-scale 3D pose datasets Human3.6M and MPI-INF-3DHP show that Hourglass-GCN outperforms several competitive methods in 3D pose estimation accuracy.
The burgeoning market for lithium-ion batteries has stimulated a growing need for more reliable battery performance monitoring. Accurate state-of-health (SOH) estimation is critical for ensuring battery operational performance. Despite numerous data-driven methods reported in existing research for battery SOH estimation, these methods often exhibit inconsistent performance across different application scenarios. To address this issue and overcome the performance limitations of individual data-driven models, integrating multiple models for SOH estimation has received considerable attention. Ensemble learning (EL) typically leverages the strengths of multiple base models to achieve more robust and accurate outputs. However, the lack of a clear review of current research hinders the further development of ensemble methods in SOH estimation. Therefore, this paper comprehensively reviews multi-model ensemble learning methods for battery SOH estimation. First, existing ensemble methods are systematically categorized into 6 classes based on their combination strategies. Different realizations and underlying connections are meticulously analyzed for each category of EL methods, highlighting distinctions, innovations, and typical applications. Subsequently, these ensemble methods are comprehensively compared in terms of base models, combination strategies, and publication trends. Evaluations across 6 dimensions underscore the outstanding performance of stacking-based ensemble methods. Following this, these ensemble methods are further inspected from the perspectives of weighted ensemble and diversity, aiming to inspire potential approaches for enhancing ensemble performance. Moreover, addressing challenges such as base model selection, measuring model robustness and uncertainty, and interpretability of ensemble models in practical applications is emphasized. Finally, future research prospects are outlined, specifically noting that deep learning ensemble is poised to advance ensemble methods for battery SOH estimation. The convergence of advanced machine learning with ensemble learning is anticipated to yield valuable avenues for research. Accelerated research in ensemble learning holds promising prospects for achieving more accurate and reliable battery SOH estimation under real-world conditions.
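A weighted ensemble, one of the perspectives this review inspects, can be sketched in a few lines. The example below uses inverse validation-MSE weights, which is only one common choice and is not claimed to be a scheme from the review. The SOH values, the three synthetic base-model outputs, and the function names are made up for illustration.

```python
import numpy as np

def inverse_mse_weights(val_preds, val_true):
    """Weight each base model by the inverse of its validation MSE, normalized to sum to 1.

    val_preds has shape (n_models, n_samples); val_true has shape (n_samples,).
    """
    mse = np.mean((val_preds - val_true) ** 2, axis=1)
    w = 1.0 / (mse + 1e-12)          # small constant guards against division by zero
    return w / w.sum()

def weighted_ensemble(preds, weights):
    """Combine base-model predictions (n_models, n_samples) into one estimate per sample."""
    return weights @ preds

# Toy SOH trajectory and three imperfect base models (made-up numbers).
soh_true = np.array([1.00, 0.95, 0.90, 0.85])
preds = np.vstack([
    soh_true + 0.01,                                   # small positive bias
    soh_true - 0.03,                                   # larger negative bias
    soh_true + 0.02 * np.array([1, -1, 1, -1]),        # alternating error
])

weights = inverse_mse_weights(preds, soh_true)
soh_ens = weighted_ensemble(preds, weights)
ens_mse = float(np.mean((soh_ens - soh_true) ** 2))
base_mse = np.mean((preds - soh_true) ** 2, axis=1)
print(weights, ens_mse, base_mse)
```

In this toy case the errors of the base models partly cancel, so the combined estimate has a lower error than any single model. That cancellation depends on the models being diverse, which is why the review treats diversity alongside weighting. In practice the weights would be fitted on held-out data rather than on the samples being evaluated.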
Premise: The combined effects of modern healthcare practices that prolong lifespan and declining birthrates have created unprecedented changes in age demographics worldwide, which are especially pronounced in Japan, South Korea, Europe, and North America. Since old age is the most significant predictor of dementia, global healthcare systems must rise to the challenge of providing care for those with neurodegenerative disorders.
Interference significantly impacts the performance of Global Navigation Satellite Systems (GNSS), highlighting the need for advanced interference localization technology to bolster anti-interference and defense capabilities. The Uniform Circular Array (UCA) enables concurrent estimation of the Direction of Arrival (DOA) in both azimuth and elevation. Given the paramount importance of stability and real-time performance in interference localization, this work proposes an approach to reduce the complexity and increase the robustness of DOA estimation. The proposed method reduces computational complexity by selecting a reduced number of array elements to reconstruct a non-uniform sparse array from a UCA. To ensure DOA estimation accuracy, the objective is to minimize the Cramér-Rao Bound (CRB), and the Spatial Correlation Coefficient (SCC) is incorporated as a constraint to mitigate side lobes. The optimization model is a quadratic fractional model, which is solved by Semi-Definite Relaxation (SDR). When the array has perturbations, the mathematical expressions for the CRB and SCC are re-derived to enhance the robustness of the reconstructed array. Simulation and hardware experiments validate the effectiveness of the proposed method in estimating interference DOA, showing high robustness and reductions in the hardware and computational costs associated with DOA estimation.
The application of high-performance imaging sensors in space-based space surveillance systems makes it possible to recognize space objects and estimate their poses using vision-based methods. In this paper, we propose a kernel regression-based method for joint multi-view space object recognition and pose estimation. We built a new simulated satellite image dataset named BUAA-SID 1.5 to test our method using different image representations. We evaluated our method on recognition-only tasks, pose estimation-only tasks, and joint recognition and pose estimation tasks. Experimental results show that our method outperforms the state of the art in space object recognition, and can recognize space objects and estimate their poses effectively and robustly against noise and lighting conditions.
Developing sensorless techniques for estimating battery expansion is essential for effective mechanical state monitoring, improving the accuracy of digital twin simulation and abnormality detection. Therefore, this paper presents a data-driven approach to expansion estimation using electromechanical coupled models with machine learning. The proposed method integrates reduced-order impedance models with data-driven mechanical models, coupling the electrochemical and mechanical states through the state of charge (SOC) and mechanical pressure within a state estimation framework. The coupling relationship was established through experimental insights into pressure-related impedance parameters and the nonlinear mechanical behavior with SOC and pressure. The data-driven model was interpreted by introducing a novel swelling coefficient defined by component stiffnesses to capture the nonlinear mechanical behavior across various mechanical constraints. Sensitivity analysis of the impedance model shows that updating model parameters with pressure can reduce the mean absolute error of simulated voltage by 20 mV and SOC estimation error by 2%. The results demonstrate the model's estimation capabilities, achieving a root mean square error of less than 1 kPa when the maximum expansion force is from 30 kPa to 120 kPa, outperforming calibrated stiffness models and other machine learning techniques. The model's robustness and generalizability are further supported by its effective handling of SOC estimation and pressure measurement errors. This work highlights the importance of the proposed framework in enhancing state estimation and fault diagnosis for lithium-ion batteries.
Cyber-physical systems (CPSs) are regarded as the backbone of the fourth industrial revolution, in which communication, physical processes, and computer technology are integrated. In modern industrial systems, CPSs are widely utilized across various domains, such as smart grids, smart healthcare systems, smart vehicles, and smart manufacturing, among others. Due to their unique spatial distribution, CPSs are highly vulnerable to cyber-attacks, which may result in severe performance degradation and even system instability. Consequently, the security concerns of CPSs have attracted significant attention in recent years. In this paper, a comprehensive survey on the security issues of CPSs under cyber-attacks is provided. Firstly, mathematical descriptions of various types of cyber-attacks are introduced in detail. Secondly, two types of secure estimation and control processing schemes, including robust methods and active methods, are reviewed. Thirdly, research findings related to secure control and estimation problems for different types of CPSs are summarized. Finally, the survey is concluded by outlining the challenges and suggesting potential research directions for the future.
The current multi-view video coding (MVC) reference model of the Joint Video Team (JVT) does not provide efficient rate control schemes. This paper presents a rate control algorithm for MVC that improves the quadratic rate-distortion (R-D) model. The bit rate is allocated among views based on correlation analysis. The proposed algorithm consists of three levels to control the bit rate more accurately, of which the frame layer allocates bits according to the frame complexity and the temporal activity. Extensive experiments show that the proposed algorithm can control the bit rate efficiently.
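The quadratic R-D model that this algorithm builds on is usually written as R = x1 · MAD / Q + x2 · MAD / Q², where R is the texture bit budget, MAD the predicted mean absolute difference of the frame, and Q the quantization step. The sketch below shows this classic form and its inversion for Q. It does not reproduce the improved model of the paper, which the abstract does not specify. The parameter values are made up, and positive x1 and x2 are assumed.

```python
import math

def bits_from_qstep(qstep, mad, x1, x2):
    """Classic quadratic R-D model: bits predicted for a given quantization step."""
    return x1 * mad / qstep + x2 * mad / qstep**2

def qstep_from_bits(target_bits, mad, x1, x2):
    """Invert the model: solve target_bits * Q^2 - x1*mad*Q - x2*mad = 0 for Q > 0."""
    if x2 == 0:
        return x1 * mad / target_bits          # model degenerates to first order
    disc = (x1 * mad) ** 2 + 4 * target_bits * x2 * mad
    return (x1 * mad + math.sqrt(disc)) / (2 * target_bits)

# Round trip with made-up model parameters.
mad, x1, x2 = 4.0, 1000.0, 5000.0
q = 12.5
r = bits_from_qstep(q, mad, x1, x2)
q_back = qstep_from_bits(r, mad, x1, x2)
print(r, q_back)
```

In a frame-layer controller, the bit budget assigned to a frame would be passed to `qstep_from_bits`, and x1 and x2 would be re-estimated from the bits actually produced by previously coded frames.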
Multi-view multi-person 3D human pose estimation is a hot topic in the field of human pose estimation due to its wide range of application scenarios. With the introduction of end-to-end direct regression methods, the field has entered a new stage of development. However, the regression results for joints that are more heavily influenced by external factors are not accurate enough, even for the best existing method. In this paper, we propose an effective feature recalibration module based on the channel attention mechanism and a relative optimal calibration strategy, which is applied to the multi-view multi-person 3D human pose estimation task to improve detection accuracy for joints that are more severely affected by external factors. Specifically, it achieves relative optimal weight adjustment of joint feature information through the recalibration module and strategy, which enables the model to learn the dependencies between joints and the dependencies between people and their corresponding joints. We call this method the Efficient Recalibration Network (ER-Net). Finally, experiments were conducted on two benchmark datasets for this task, Campus and Shelf, on which the PCP reached 97.3% and 98.3%, respectively.
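The abstract reports PCP without defining it. A commonly used definition of the percentage of correct parts counts a limb as correct when the mean endpoint error is at most a fraction alpha of the ground-truth limb length. The sketch below implements that common definition with alpha = 0.5. Whether the paper uses exactly this variant is an assumption, and the joints and limbs are toy values.

```python
import numpy as np

def pcp(pred, gt, limbs, alpha=0.5):
    """Percentage of correct parts for one person.

    pred, gt: arrays of shape (n_joints, 3); limbs: list of (joint_a, joint_b) index pairs.
    A limb is correct if the mean of its two endpoint errors is <= alpha * true limb length.
    """
    correct = 0
    for a, b in limbs:
        limb_len = np.linalg.norm(gt[a] - gt[b])
        err = 0.5 * (np.linalg.norm(pred[a] - gt[a]) + np.linalg.norm(pred[b] - gt[b]))
        correct += int(err <= alpha * limb_len)
    return correct / len(limbs)

# Toy skeleton: three joints on a line, two limbs of length 1 (made-up coordinates).
gt = np.array([[0.0, 0.0, 0.0], [0.0, 0.0, 1.0], [0.0, 0.0, 2.0]])
pred = gt.copy()
pred[1, 0] += 0.2      # small error on the middle joint
pred[2, 0] += 1.5      # large error on the last joint
limbs = [(0, 1), (1, 2)]

score = pcp(pred, gt, limbs)
print(score)
```

Here the first limb has a mean endpoint error of 0.1 and passes, while the second has 0.85 and fails, so the score is 0.5. Because the tolerance scales with limb length, short limbs are judged more strictly in absolute terms than long ones.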
The emergence of next generation networks (NextG), including 5G and beyond, is reshaping the technological landscape of cellular and mobile networks. These networks are sufficiently scaled to interconnect billions of users and devices. Researchers in academia and industry are focusing on technological advancements to achieve high-speed transmission, cell planning, and latency reduction to facilitate emerging applications such as virtual reality, the metaverse, smart cities, smart health, and autonomous vehicles. NextG continuously improves its network functionality to support these applications. Multiple input multiple output (MIMO) technology offers spectral efficiency, dependability, and overall performance in conjunction with NextG. This article proposes a secure channel estimation technique in MIMO topology using a norm-estimation model to provide comprehensive insights into protecting NextG network components against adversarial attacks. The technique aims to create long-lasting and secure NextG networks using this extended approach. The viability of MIMO applications and modern AI-driven methodologies to combat cybersecurity threats is explored in this research. Moreover, the proposed model demonstrates high performance in terms of reliability and accuracy, with a 20% reduction in the MalOut-RealOut-Diff metric compared to existing state-of-the-art techniques.
The reuse of liquid propellant rocket engines has increased the difficulty of their control and estimation. State and parameter Moving Horizon Estimation (MHE) is an optimization-based strategy that provides the necessary information for model predictive control. Despite the many advantages of MHE, long computation time has limited its applications for system-level models of liquid propellant rocket engines. To address this issue, we propose an asynchronous MHE method called advanced-multi-step MHE with Noise Covariance Estimation (amsMHE-NCE). This method computes the MHE problem asynchronously to obtain the states and parameters and can be applied to multi-threaded computations. In the background, the state and covariance estimation optimization problems are computed using multiple sampling times. In real time, sensitivity is used to quickly approximate state and parameter estimates. A covariance estimation method is developed using sensitivity to avoid redundant MHE problem calculations in case of sensor degradation during engine reuse. The amsMHE-NCE is validated through three cases based on the space shuttle main engine system-level model, and we demonstrate that it can provide more accurate real-time estimates of states and parameters compared to other commonly used estimation methods.
When estimating the capacity of lithium-ion batteries offline or online, it is essential to extract a health feature (HF) that can effectively characterize capacity degradation under both conventional ideal and complex dynamic operating conditions. However, the extraction of most HFs relies on complete charge-discharge cycle data, making them less adaptable to complex dynamic operating conditions. Existing mechanism HFs, while capable of characterizing capacity degradation from a mechanism perspective, suffer from limitations such as insufficient physical model expressiveness, high dimension, and redundancy of the mechanism HF. These issues increase the complexity of subsequent modeling of the relationship between HFs and capacity, thereby restricting their adoption in engineering practice. To fill this gap, this paper proposes a novel mechanism-based HF. Firstly, a multi-physical-field coupling model is developed to describe the interactions between the electrochemical, thermal, and aging behaviors of the battery. Secondly, based on the aging mechanism, the accumulated charge of lithium lost during the formation of the solid electrolyte interphase (SEI) film is extracted as the HF to provide a more intuitive representation of capacity degradation. Then, to reduce estimation errors caused by considering only a single aging mechanism, multiple representative regression models are employed to establish the mapping relationship between the mechanism HF and capacity, further enhancing the accuracy of the final results. Finally, the proposed method is implemented and validated using real battery data under three different types of operating conditions. Experimental results demonstrate that, compared to other commonly used HFs, the proposed HF exhibits significant competitive advantages in handling incomplete cycle data, unknown operating conditions, and capacity estimation models. The minimum estimation error under ideal conditions is 0.0074, and the minimum estimation error under complex dynamic conditions is 0.0268.
The beyond fifth-generation Internet of Things requires more capable channel coding schemes to achieve high-reliability, low-complexity, and low-latency communications. Theoretical analysis of the error-correction performance of channel codes is an important way of optimizing transmission reliability and efficiency. In this paper, efficient methods for estimating the block error rate (BLER) performance of rate-compatible polar codes (RCPC) are proposed under several scenarios. Firstly, the BLER performance of RCPC is evaluated in additive white Gaussian noise channels. This is further extended to the Rayleigh fading channel case using an equivalent estimation method. Moreover, for powerful decoders such as successive cancellation list decoding, the performance estimate is derived analytically based on the polar weight spectrum and BLER upper bounds. Theoretical evaluation and numerical simulation results show that the estimated performance fits the practical simulated results of RCPC well under the objective conditions, verifying the validity of the proposed performance estimation methods. Furthermore, application designs of the reliability estimation of RCPC are explored, particularly its advantages in signal-to-noise ratio (SNR) estimation and throughput efficiency optimization of polar-coded hybrid automatic repeat request.
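A BLER upper bound built from a weight spectrum can be illustrated with the textbook union bound for maximum-likelihood decoding of a binary linear code with BPSK over AWGN: BLER ≤ Σ_d A_d · Q(√(2 d R E_b/N_0)). This is a generic bound, not necessarily the one derived in the paper. The weight spectrum below is a made-up toy example and not that of any actual polar code.

```python
import math

def q_func(x):
    """Gaussian tail probability Q(x)."""
    return 0.5 * math.erfc(x / math.sqrt(2))

def bler_union_bound(weight_spectrum, rate, ebn0_db):
    """Union bound on BLER for BPSK over AWGN under ML decoding.

    weight_spectrum maps codeword weight d to the number A_d of codewords of that weight.
    """
    ebn0 = 10 ** (ebn0_db / 10)
    bound = sum(
        a_d * q_func(math.sqrt(2 * d * rate * ebn0))
        for d, a_d in weight_spectrum.items()
    )
    return min(1.0, bound)           # a probability bound above 1 is uninformative

# Hypothetical truncated weight spectrum for a rate-1/2 code (illustrative only).
toy_spectrum = {8: 48, 12: 256, 16: 1000}
bler_2db = bler_union_bound(toy_spectrum, rate=0.5, ebn0_db=2.0)
bler_4db = bler_union_bound(toy_spectrum, rate=0.5, ebn0_db=4.0)
print(bler_2db, bler_4db)
```

The low-weight terms dominate at high SNR, which is why knowing only the first few entries of the weight spectrum is often enough for a usable estimate there.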
Funding: Supported by the National Natural Science Foundation (NNSF) of China under Grant 62001351, the Foundation of National Key Laboratory of Electromagnetic Environment (6142403220202), and the Stability Support Fund for Basic Military Industrial Research Institutes (A240104130).
Funding: Supported in part by the National Key Research and Development Program of China (2021YFB2900501), in part by the Shaanxi Science and Technology Innovation Team (2023-CX-TD-03), in part by the Science and Technology Program of Shaanxi Province (2021GXLH-Z-038), in part by the Natural Science Foundation of Hunan Province (2023JJ40607 and 2023JJ50045), in part by the Scientific Research Foundation of Hunan Provincial Education Department (23B0713 and 24B0603), and in part by the National Natural Science Foundation of China (62401371, 62101275, and 62372070).
Funding: Supported by the State Key Laboratory of Acoustics and Marine Information, Chinese Academy of Sciences (SKL A202507).
Abstract: Accurate time delay estimation of target echo signals is a critical component of underwater target localization. In active sonar systems, echo signal processing is vulnerable to reverberation and noise in the maritime environment. This paper proposes a novel method for estimating target time delay using multi-bright-spot echoes, assuming the target's size and depth are known. To enhance the extraction of geometric features from the target echoes and mitigate the impact of reverberation and noise, the proposed approach employs the fractional-order Fourier transform and frequency-sliced wavelet transform to extract multi-bright-spot echoes. Using highlight model theory and the target size information, an observation matrix is constructed to represent multi-angle incident signals and obtain the theoretical scattered echo signals from different angles. To estimate the target's time delay accurately, waveform similarity coefficients and mean square error values between the theoretical return signals and the received signals are computed across various incident angles and time delays. Simulation results show that, compared with the conventional matched filter, the proposed algorithm reduces the relative error by 65.9%-91.5% at a signal-to-noise ratio of −25 dB, and by 66.7%-88.9% at a signal-to-reverberation ratio of −10 dB. This algorithm provides a new approach for the precise localization of submerged targets in shallow-water environments.
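For context, the conventional matched filter that the abstract uses as its baseline can be sketched as follows. This is only the baseline idea (slide a known transmit template over the received signal and pick the lag with the largest correlation magnitude), not the proposed multi-bright-spot method. The function name, the toy signals, and the sampling rate `fs` are assumptions for illustration.

```python
def matched_filter_delay(received, template):
    """Return the sample lag at which the template best matches the received signal."""
    best_lag, best_score = 0, float("-inf")
    for lag in range(len(received) - len(template) + 1):
        # Correlate the template with the received segment starting at this lag.
        score = sum(received[lag + i] * template[i] for i in range(len(template)))
        if abs(score) > best_score:
            best_lag, best_score = lag, abs(score)
    return best_lag


template = [1.0, -1.0, 1.0, 1.0]
# Attenuated echo (gain 0.8) starting at sample 5, surrounded by small disturbances.
received = [0.1, -0.05, 0.0, 0.05, -0.1, 0.8, -0.8, 0.8, 0.8, 0.0, 0.1, -0.05]

fs = 1000.0  # assumed sampling rate in Hz
lag = matched_filter_delay(received, template)
delay_seconds = lag / fs
print(f"estimated lag: {lag} samples, delay: {delay_seconds} s")
```

At low signal-to-noise or signal-to-reverberation ratios the correlation peak of this baseline becomes unreliable, which is the regime the abstract targets.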
Abstract: With the rapid progress of artificial intelligence (AI) technology and the mobile internet, 3D hand pose estimation has become critical to various intelligent application areas, e.g., human-computer interaction. To avoid the low accuracy of single-modal estimation and the high complexity of traditional multi-modal 3D estimation, this paper proposes a novel multi-modal multi-view (MMV) 3D hand pose estimation system. The system introduces a conditional generative adversarial network (cGAN) that jointly uses registration before translation (RT) and translation before registration (TR) to train a multi-modal registration network, and then employs multi-modal feature fusion to achieve high-quality estimation with low hardware and software costs in both data acquisition and processing. Experimental results demonstrate that the MMV system is effective and feasible in various scenarios and is promising for use in a broad range of intelligent applications.
Funding: Funded by the Guangdong Provincial Natural Science Foundation General Project (Grant No. 2023A1515011700), the GuangDong Basic and Applied Basic Research Foundation (Grant No. 2022A1515110007), the Guangdong Provincial Natural Science Foundation General Project (Grant No. 2023A1515012869), and the GDAS' Project of Science and Technology Development (Grant No. 2022GDASZH-2022010108).
Abstract: The estimation of fish mass is one of the most basic and important tasks in aquaculture. Acquiring the mass of fish at different growth stages is of great significance for feeding, monitoring fish health, and making breeding plans to increase production. Existing methods for estimating fish mass are often limited to the 2D plane and have difficulty obtaining 3D information about the fish, which introduces estimation error. To solve this problem, a multi-view method was proposed to obtain the 3D information of fish and predict the fish mass through a two-stage neural network with an edge-sensitive module. In the first stage, side-view and downward-view images of the fish were captured by two vertically placed cameras, together with 3D information such as side area, top area, length, deflection angle, and pitch angle, to estimate the size of the fish. The area of the fish in the different views was then estimated accurately by a pre-trained image segmentation neural network with an edge-sensitive module. In the second stage, a fully connected neural network was constructed to regress the fish mass from the 3D information obtained in the previous stage. The experimental results indicate that the proposed method can accurately estimate fish mass and outperforms existing estimation methods.
Funding: Supported by the Sichuan Science and Technology Program (2023YFSY0026, 2023YFH0004) and by the Institute of Information & Communications Technology Planning & Evaluation (IITP) grant funded by the Korean government (MSIT) (No. RS-2022-00155885, Artificial Intelligence Convergence Innovation Human Resources Development (Hanyang University ERICA)).
Abstract: Two-dimensional endoscopic images are susceptible to interferences such as specular reflections and monotonous texture illumination, which hinder accurate three-dimensional lesion reconstruction by surgical robots. This study proposes a novel end-to-end disparity estimation model to address these challenges. The approach combines a Pseudo-Siamese neural network architecture with pyramid dilated convolutions, integrating multi-scale image information to enhance robustness against lighting interference. The resulting Pseudo-Siamese disparity regression model simplifies left-right image comparison, improving accuracy and efficiency. The model was evaluated on a dataset of stereo endoscopic videos captured by the Da Vinci surgical robot, comprising simulated silicone heart sequences and real heart video data. Experimental results demonstrate a significant improvement in the network's resistance to lighting interference without a substantial increase in parameters. Moreover, the model converged faster during training, contributing to overall performance enhancement. This study advances endoscopic image processing accuracy and has potential implications for surgical robot applications in complex environments.
Funding: Supported in part by the National Natural Science Foundation of China under Grants 61973065, U20A20197, and 61973063.
Abstract: Previous multi-view 3D human pose estimation methods neither correlate different human joints in each view nor explicitly model learnable correlations between the same joints in different views, meaning that skeleton structure information is not utilized and multi-view pose information is not completely fused. Moreover, existing graph convolutional operations do not consider the specificity of different joints and different views of pose information when processing skeleton graphs, so the correlation weights between nodes in the graph and their neighborhood nodes are shared. Existing Graph Convolutional Networks (GCNs) cannot efficiently extract global and deep-level skeleton structure information and view correlations. To solve these problems, pre-estimated multi-view 2D poses are organized as a multi-view skeleton graph that explicitly fuses skeleton priors and view correlations to handle the occlusion problem, with the skeleton-edge and symmetry-edge representing the structure correlations between adjacent joints in each view of the skeleton graph and the view-edge representing the view correlations between the same joints in different views. To make the graph convolution operation mine elaborate and sufficient skeleton structure information and view correlations, different correlation weights are assigned to different categories of neighborhood nodes and further assigned to each node in the graph. Based on this graph convolution operation, a Residual Graph Convolution (RGC) module is designed as the basic module and combined with a simplified Hourglass architecture to construct the Hourglass-GCN as our 3D pose estimation network. Hourglass-GCN, with a symmetrical and concise architecture, processes three scales of multi-view skeleton graphs to efficiently extract local-to-global scale and shallow-to-deep level skeleton features. Experimental results on the common large 3D pose datasets Human3.6M and MPI-INF-3DHP show that Hourglass-GCN outperforms some excellent methods in 3D pose estimation accuracy.
Funding: National Natural Science Foundation of China (52075420), Fundamental Research Funds for the Central Universities (xzy022023049), and National Key Research and Development Program of China (2023YFB3408600).
Abstract: The burgeoning market for lithium-ion batteries has stimulated a growing need for more reliable battery performance monitoring. Accurate state-of-health (SOH) estimation is critical for ensuring battery operational performance. Despite the numerous data-driven methods reported for battery SOH estimation, these methods often exhibit inconsistent performance across different application scenarios. To address this issue and overcome the performance limitations of individual data-driven models, integrating multiple models for SOH estimation has received considerable attention. Ensemble learning (EL) typically leverages the strengths of multiple base models to achieve more robust and accurate outputs. However, the lack of a clear review of current research hinders the further development of ensemble methods in SOH estimation. Therefore, this paper comprehensively reviews multi-model ensemble learning methods for battery SOH estimation. First, existing ensemble methods are systematically categorized into 6 classes based on their combination strategies. Different realizations and underlying connections are analyzed for each category of EL methods, highlighting distinctions, innovations, and typical applications. Subsequently, these ensemble methods are compared in terms of base models, combination strategies, and publication trends. Evaluations across 6 dimensions underscore the outstanding performance of stacking-based ensemble methods. Following this, the ensemble methods are further examined from the perspectives of weighted ensemble and diversity, aiming to inspire potential approaches for enhancing ensemble performance. Moreover, the paper emphasizes the need to address challenges in practical applications such as base model selection, the measurement of model robustness and uncertainty, and the interpretability of ensemble models. Finally, future research prospects are outlined, specifically noting that deep learning ensembles are poised to advance ensemble methods for battery SOH estimation. The convergence of advanced machine learning with ensemble learning is anticipated to yield valuable avenues for research. Accelerated research in ensemble learning holds promising prospects for achieving more accurate and reliable battery SOH estimation under real-world conditions.
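The weighted-ensemble perspective mentioned above can be made concrete with a small sketch. It shows one common weighting rule, inverse validation error, chosen here purely for illustration; the review covers several combination strategies and this is not claimed to be any specific one of them. The model names, error values, and SOH predictions are made up.

```python
def inverse_error_weights(validation_mse):
    """Weight each base model by the inverse of its validation MSE, normalized to sum to 1."""
    raw = [1.0 / mse for mse in validation_mse]
    total = sum(raw)
    return [w / total for w in raw]


def weighted_soh(predictions, weights):
    """Combine base-model SOH predictions into one weighted estimate."""
    return sum(p * w for p, w in zip(predictions, weights))


# Hypothetical validation errors for two base models (e.g. a GPR and an LSTM).
weights = inverse_error_weights([0.01, 0.04])
# Hypothetical SOH predictions from the same two models for one battery cycle.
soh = weighted_soh([0.90, 0.80], weights)
print(weights, soh)
```

A stacking ensemble, which the review rates highly, would replace the fixed weighting rule with a meta-model trained on the base-model outputs.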
Funding: Funded by the Natural Sciences and Engineering Research Council of Canada (RGPIN: 2016-05964 & 2023-04283 to JHK), the University of Manitoba Tri-Agency Bridge Funding (#57289 to JHK), and the Ricard Foundation's Baxter Bursary (to JP).
Abstract: Premise: The combined effects of modern healthcare practices, which prolong lifespan, and declining birthrates have created unprecedented changes in age demographics worldwide that are especially pronounced in Japan, South Korea, Europe, and North America. Since old age is the most significant predictor of dementia, global healthcare systems must rise to the challenge of providing care for those with neurodegenerative disorders.
Funding: Financial support from the National Key Research and Development Program of China (No. 2023YFB3907001), the National Natural Science Foundation of China (Nos. U2233217, 62371029), and the UK Engineering and Physical Sciences Research Council (EPSRC), China (Nos. EP/M026981/1, EP/T021063/1 and EP/T024917/).
Abstract: Interference significantly impacts the performance of Global Navigation Satellite Systems (GNSS), highlighting the need for advanced interference localization technology to bolster anti-interference and defense capabilities. The Uniform Circular Array (UCA) enables concurrent estimation of the Direction of Arrival (DOA) in both azimuth and elevation. Given the paramount importance of stability and real-time performance in interference localization, this work proposes an innovative approach to reduce the complexity and increase the robustness of DOA estimation. The proposed method reduces computational complexity by selecting a reduced number of array elements to reconstruct a non-uniform sparse array from a UCA. To ensure DOA estimation accuracy, the objective is to minimize the Cramér-Rao Bound (CRB), and the Spatial Correlation Coefficient (SCC) is incorporated as a constraint to mitigate side lobes. The optimization model is a quadratic fractional model, which is solved by Semi-Definite Relaxation (SDR). For arrays with perturbations, the mathematical expressions for the CRB and SCC are re-derived to enhance the robustness of the reconstructed array. Simulation and hardware experiments validate the effectiveness of the proposed method in estimating interference DOA, showing high robustness and reductions in the hardware and computational costs associated with DOA estimation.
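As background for the array reconstruction described above, the following sketch builds the standard far-field steering vector of a UCA and of a sparse subset of its elements. It does not perform the CRB/SCC optimization or the SDR step from the abstract. The angle convention (polar angle measured from the array normal), the element count, the radius, and the chosen subset `[0, 2, 3, 6]` are assumptions for illustration.

```python
import cmath
import math


def uca_steering_vector(num_elements, radius_wavelengths, polar, azimuth, active=None):
    """Far-field steering vector of a uniform circular array.

    polar is measured from the array normal and azimuth in the array plane
    (both in radians). active optionally lists the element indices that are
    kept, which models a sparse array reconstructed from the full UCA.
    """
    indices = range(num_elements) if active is None else active
    vector = []
    for m in indices:
        gamma = 2.0 * math.pi * m / num_elements  # angular position of element m
        phase = (2.0 * math.pi * radius_wavelengths
                 * math.sin(polar) * math.cos(azimuth - gamma))
        vector.append(cmath.exp(1j * phase))
    return vector


# Full 8-element UCA with radius of half a wavelength, source in the array plane.
full = uca_steering_vector(8, 0.5, math.pi / 2, 0.0)
# Hypothetical sparse selection of 4 elements out of 8.
sparse = uca_steering_vector(8, 0.5, math.pi / 2, 0.0, active=[0, 2, 3, 6])
# A source on the array normal reaches every element with the same phase.
broadside = uca_steering_vector(8, 0.5, 0.0, 1.0)
print(len(full), len(sparse))
```

Using fewer elements shortens every steering vector and shrinks the covariance matrix, which is where the computational saving comes from; the price is a different CRB and side-lobe pattern, hence the optimization in the abstract.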
Funding: Co-supported by the National Natural Science Foundation of China (Grant Nos. 61371134, 61071137) and the National Basic Research Program of China (No. 2010CB327900).
Abstract: The application of high-performance imaging sensors in space-based space surveillance systems makes it possible to recognize space objects and estimate their poses using vision-based methods. In this paper, we propose a kernel regression-based method for joint multi-view space object recognition and pose estimation. We built a new simulated satellite image dataset named BUAA-SID 1.5 to test our method using different image representations. We evaluated our method on recognition-only tasks, pose estimation-only tasks, and joint recognition and pose estimation tasks. Experimental results show that our method outperforms state-of-the-art methods in space object recognition, and can recognize space objects and estimate their poses effectively and robustly against noise and varying lighting conditions.
Funding: This work was supported by the Fund for Excellent Youth Scholars of China (Grant No. 52222708) and the National Natural Science Foundation of China (Grant No. 51977007). Part of this work was supported by the research project "SPEED" (03XP0585) at RWTH Aachen University, funded by the German Federal Ministry of Education and Research (BMBF).
Abstract: Developing sensorless techniques for estimating battery expansion is essential for effective mechanical state monitoring and for improving the accuracy of digital twin simulation and abnormality detection. Therefore, this paper presents a data-driven approach to expansion estimation using electromechanical coupled models with machine learning. The proposed method integrates reduced-order impedance models with data-driven mechanical models, coupling the electrochemical and mechanical states through the state of charge (SOC) and mechanical pressure within a state estimation framework. The coupling relationship was established through experimental insights into pressure-related impedance parameters and the nonlinear mechanical behavior with SOC and pressure. The data-driven model was interpreted by introducing a novel swelling coefficient defined by component stiffnesses to capture the nonlinear mechanical behavior across various mechanical constraints. Sensitivity analysis of the impedance model shows that updating model parameters with pressure can reduce the mean absolute error of the simulated voltage by 20 mV and the SOC estimation error by 2%. The results demonstrate the model's estimation capabilities, achieving a root mean square error of less than 1 kPa when the maximum expansion force ranges from 30 kPa to 120 kPa, outperforming calibrated stiffness models and other machine learning techniques. The model's robustness and generalizability are further supported by its effective handling of SOC estimation and pressure measurement errors. This work highlights the importance of the proposed framework in enhancing state estimation and fault diagnosis for lithium-ion batteries.
Abstract: Cyber-physical systems (CPSs) are regarded as the backbone of the fourth industrial revolution, in which communication, physical processes, and computer technology are integrated. In modern industrial systems, CPSs are widely utilized across various domains, such as smart grids, smart healthcare systems, smart vehicles, and smart manufacturing. Due to their unique spatial distribution, CPSs are highly vulnerable to cyber-attacks, which may result in severe performance degradation and even system instability. Consequently, the security concerns of CPSs have attracted significant attention in recent years. In this paper, a comprehensive survey on the security issues of CPSs under cyber-attacks is provided. Firstly, mathematical descriptions of various types of cyber-attacks are introduced in detail. Secondly, two types of secure estimation and control processing schemes, namely robust methods and active methods, are reviewed. Thirdly, research findings related to secure control and estimation problems for different types of CPSs are summarized. Finally, the survey concludes by outlining the challenges and suggesting potential research directions for the future.
Funding: Supported by the National Natural Science Foundation of China (Grant Nos. 60832003, 60672052, 60902085, 60972137), the Key Project of Shanghai Municipal Education Commission (Grant No. 09ZZ90), the Natural Science Foundation of Shanghai (Grant No. 09ZR1412500), the Innovation Foundation of Shanghai University (Grant Nos. 10YZ09, SHUCX091061), and the Shuguang Plan of Shanghai Education Development Foundation (Grant No. 06SG43).
Abstract: The current multi-view video coding (MVC) reference model of the joint video team (JVT) does not provide efficient rate control schemes. This paper presents a rate control algorithm for MVC that improves the quadratic rate-distortion (R-D) model. Bit rate is allocated among views based on correlation analysis. The proposed algorithm consists of three levels to control the bit rate more accurately, of which the frame layer allocates bits according to the frame complexity and the temporal activity. Extensive experiments show that the proposed algorithm can control the bit rate efficiently.
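The abstract builds on the quadratic R-D model. As a hedged illustration, the sketch below uses the classical form of that model, T = X1·MAD/Q + X2·MAD/Q², and solves it for the quantization step Q given a frame's bit budget. The paper's improved model is not reproduced here; the coefficient values, the MAD value, and the function names are assumptions.

```python
import math


def bits_from_qstep(q, mad, x1, x2):
    """Classical quadratic R-D model: texture bits for a given quantization step."""
    return x1 * mad / q + x2 * mad / (q * q)


def qstep_for_target_bits(target_bits, mad, x1, x2):
    """Solve target_bits = x1*mad/q + x2*mad/q**2 for the positive root q."""
    a = x1 * mad
    b = x2 * mad
    # Rearranged: target_bits*q**2 - a*q - b = 0, keep the positive solution.
    return (a + math.sqrt(a * a + 4.0 * target_bits * b)) / (2.0 * target_bits)


# Hypothetical model coefficients and predicted mean absolute difference (MAD).
x1, x2, mad = 1000.0, 2000.0, 2.0
target = bits_from_qstep(20.0, mad, x1, x2)   # bits produced at Q = 20
q = qstep_for_target_bits(target, mad, x1, x2)
print(target, q)
```

In a layered rate controller, the frame-level bit budget passed as `target_bits` would come from the view-level and group-level allocation stages.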
Funding: Supported in part by the Key Program of NSFC (Grant No. U1908214), the Special Project of Central Government Guiding Local Science and Technology Development (Grant No. 2021JH6/10500140), the Program for the Liaoning Distinguished Professor, the Program for Innovative Research Team in University of Liaoning Province (LT2020015), Dalian (2021RT06), and Dalian University (XLJ202010), the Science and Technology Innovation Fund of Dalian (Grant No. 2020JJ25CY001), and the Dalian University Scientific Research Platform Project (No. 202101YB03).
Abstract: Multi-view multi-person 3D human pose estimation is a hot topic in the field of human pose estimation due to its wide range of application scenarios. With the introduction of end-to-end direct regression methods, the field has entered a new stage of development. However, the regression results for joints that are more heavily influenced by external factors are not accurate enough, even for the best existing method. In this paper, we propose an effective feature recalibration module based on the channel attention mechanism, together with a relative optimal calibration strategy, and apply them to the multi-view multi-person 3D human pose estimation task to improve detection accuracy for joints that are more severely affected by external factors. Specifically, the recalibration module and strategy achieve a relative optimal weight adjustment of joint feature information, which enables the model to learn the dependencies between joints and the dependencies between people and their corresponding joints. We call this method the Efficient Recalibration Network (ER-Net). Finally, experiments were conducted on two benchmark datasets for this task, Campus and Shelf, on which the PCP reached 97.3% and 98.3%, respectively.
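The PCP (Percentage of Correct Parts) figures reported above can be read with the commonly used definition in mind: a body part counts as correct when the mean error of its two endpoint joints is at most half the ground-truth part length. The sketch below implements that definition on a toy skeleton; the exact evaluation protocol of the paper is not given in the abstract, so the threshold `alpha=0.5` and the data are assumptions.

```python
import math


def pcp(pred, gt, parts, alpha=0.5):
    """Percentage of Correct Parts for one person.

    pred and gt map joint ids to 3D points; parts lists (joint_a, joint_b) pairs.
    """
    correct = 0
    for a, b in parts:
        part_length = math.dist(gt[a], gt[b])
        mean_error = (math.dist(pred[a], gt[a]) + math.dist(pred[b], gt[b])) / 2.0
        if mean_error <= alpha * part_length:
            correct += 1
    return 100.0 * correct / len(parts)


# Toy skeleton with three joints along the z-axis and two parts of length 1.
gt = {0: (0.0, 0.0, 0.0), 1: (0.0, 0.0, 1.0), 2: (0.0, 0.0, 2.0)}
pred = {0: (0.0, 0.0, 0.1), 1: (0.0, 0.0, 1.1), 2: (0.0, 0.0, 3.5)}
score = pcp(pred, gt, parts=[(0, 1), (1, 2)])
print(score)
```

Because a single badly localized joint can fail every part it belongs to, joints that are strongly affected by occlusion or other external factors weigh heavily on this metric.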
Funding: Funding from King Saud University through Researchers Supporting Project number (RSP2024R387), King Saud University, Riyadh, Saudi Arabia.
Abstract: The emergence of next generation networks (NextG), including 5G and beyond, is reshaping the technological landscape of cellular and mobile networks. These networks are sufficiently scaled to interconnect billions of users and devices. Researchers in academia and industry are focusing on technological advancements to achieve high-speed transmission, cell planning, and latency reduction to facilitate emerging applications such as virtual reality, the metaverse, smart cities, smart health, and autonomous vehicles. NextG continuously improves its network functionality to support these applications. Multiple input multiple output (MIMO) technology offers spectral efficiency, dependability, and overall performance in conjunction with NextG. This article proposes a secure channel estimation technique in MIMO topology using a norm-estimation model to provide comprehensive insights into protecting NextG network components against adversarial attacks. The technique aims to create long-lasting and secure NextG networks using this extended approach. The viability of MIMO applications and modern AI-driven methodologies for combating cybersecurity threats is explored in this research. Moreover, the proposed model demonstrates high performance in terms of reliability and accuracy, with a 20% reduction in the MalOut-RealOut-Diff metric compared to existing state-of-the-art techniques.
Funding: Supported by the National Natural Science Foundation of China (Nos. 62120106003 and 62173301).
Abstract: The reuse of liquid propellant rocket engines has increased the difficulty of their control and estimation. State and parameter Moving Horizon Estimation (MHE) is an optimization-based strategy that provides the necessary information for model predictive control. Despite the many advantages of MHE, long computation time has limited its application to system-level models of liquid propellant rocket engines. To address this issue, we propose an asynchronous MHE method called advanced-multi-step MHE with Noise Covariance Estimation (amsMHE-NCE). This method solves the MHE problem asynchronously to obtain the states and parameters and can be applied to multi-threaded computations. In the background, the state and covariance estimation optimization problems are solved over multiple sampling times. In real time, sensitivity is used to quickly approximate the state and parameter estimates. A covariance estimation method is developed using sensitivity to avoid redundant MHE problem calculations in case of sensor degradation during engine reuse. The amsMHE-NCE is validated through three cases based on the space shuttle main engine system-level model, and we demonstrate that it can provide more accurate real-time estimates of states and parameters than other commonly used estimation methods.
Funding: Supported by the National Natural Science Foundation of China (NSFC, No. 62303031) and the Fundamental Research Funds for the Central Universities.
Abstract: When estimating the capacity of lithium-ion batteries offline or online, it is essential to extract a health feature (HF) that can effectively characterize capacity degradation under both conventional ideal and complex dynamic operating conditions. However, the extraction of most HFs relies on complete charge-discharge cycle data, making them less adaptable to complex dynamic operating conditions. Existing mechanism HFs, while capable of characterizing capacity degradation from a mechanism perspective, suffer from limitations such as insufficient expressiveness of the physical model, high dimensionality, and redundancy. These issues increase the complexity of subsequently modeling the relationship between HFs and capacity, thereby restricting their adoption in engineering practice. To fill this gap, this paper proposes a novel mechanism-based HF. Firstly, a multi-physics coupling model is developed to describe the interactions between the electrochemical, thermal, and aging behaviors of the battery. Secondly, based on the aging mechanism, the accumulated charge of lithium lost during the formation of the solid electrolyte interphase (SEI) film is extracted as the HF to provide a more intuitive representation of capacity degradation. Then, to reduce estimation errors caused by considering only a single aging mechanism, multiple representative regression models are employed to establish the mapping between the mechanism HF and capacity, further enhancing the accuracy of the final results. Finally, the proposed method is implemented and validated using real battery data under three different types of operating conditions. Experimental results demonstrate that, compared with other commonly used HFs, the proposed HF exhibits significant competitive advantages in handling incomplete cycle data, unknown operating conditions, and different capacity estimation models. The minimum estimation error under ideal conditions is 0.0074, and the minimum estimation error under complex dynamic conditions is 0.0268.
Funding: Supported by the National Natural Science Foundation of China (No. 62201596) and the Research Planning Project of National University of Defense Technology (ZK22-45).
Abstract: The beyond-fifth-generation Internet of Things requires more capable channel coding schemes to achieve high-reliability, low-complexity, and low-latency communications. Theoretical analysis of the error-correction performance of channel codes is an important way of optimizing transmission reliability and efficiency. In this paper, efficient methods for estimating the block error rate (BLER) performance of rate-compatible polar codes (RCPC) are proposed under several scenarios. Firstly, the BLER performance of RCPC is evaluated in additive white Gaussian noise channels. This is then extended to the Rayleigh fading channel using an equivalent estimation method. Moreover, for powerful decoders such as successive cancellation list decoding, the performance estimate is derived analytically from the polar weight spectrum and BLER upper bounds. Theoretical evaluation and numerical simulation results show that the estimated performance fits the simulated results of RCPC well under the considered conditions, verifying the validity of the proposed performance estimation methods. Furthermore, applications of the RCPC reliability estimation are explored, particularly its advantages for signal-to-noise ratio (SNR) estimation and for throughput efficiency optimization of polar-coded hybrid automatic repeat request.
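One standard way to estimate polar-code BLER under successive cancellation (SC) decoding, given here only as background and not as the paper's method, combines the error probabilities of the information subchannels. If p_i is the error probability of subchannel i assuming all earlier bits were decoded correctly, the sum of the p_i is an upper bound on the BLER, and 1 − ∏(1 − p_i) is a widely used approximation that treats the subchannel errors as independent. The probabilities below are placeholder values, not outputs of a real code construction.

```python
def bler_estimates(subchannel_error_probs):
    """Return (product_estimate, union_bound) for SC decoding.

    subchannel_error_probs holds the error probability of each information
    subchannel, e.g. obtained from a Gaussian approximation (assumed given).
    """
    no_error = 1.0
    for p in subchannel_error_probs:
        no_error *= 1.0 - p
    product_estimate = 1.0 - no_error
    union_bound = min(1.0, sum(subchannel_error_probs))
    return product_estimate, union_bound


# Placeholder error probabilities for two information subchannels.
estimate, bound = bler_estimates([0.01, 0.02])
print(estimate, bound)
```

For list decoding, where the abstract relies on the polar weight spectrum, a weight-spectrum-based union bound would take the place of this subchannel-based estimate.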