Abstract: Missing data presents a crucial challenge in data analysis, especially in high-dimensional datasets, where it often leads to biased conclusions and degraded model performance. In this study, we present a novel autoencoder-based imputation framework that integrates a composite loss function to enhance robustness and precision. The proposed loss combines (i) a guided, masked mean squared error focusing on missing entries; (ii) a noise-aware regularization term to improve resilience against data corruption; and (iii) a variance penalty to encourage expressive yet stable reconstructions. We evaluate the proposed model across four missingness mechanisms, namely Missing Completely at Random, Missing at Random, Missing Not at Random, and Missing Not at Random with quantile censorship, under systematically varied feature counts, sample sizes, and missingness ratios ranging from 5% to 60%. Four publicly available real-world datasets (Stroke Prediction, Pima Indians Diabetes, Cardiovascular Disease, and Framingham Heart Study) were used, and the results show that the proposed model consistently outperforms baseline methods, including traditional and deep learning-based techniques. An ablation study reveals the additive value of each component in the loss function. Additionally, we assessed the downstream utility of imputed data through classification tasks, where datasets imputed by the proposed method yielded the highest receiver operating characteristic area under the curve scores across all scenarios. The model demonstrates strong scalability and robustness, improving performance with larger datasets and higher feature counts. These results underscore the capacity of the proposed method to produce not only numerically accurate but also semantically useful imputations, making it a promising solution for robust data recovery in clinical applications.
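The abstract names the three loss terms but does not give their formulas, so the following is only a minimal sketch of how such a composite loss could be assembled. Every functional form, the weights `lam_noise` and `lam_var`, and all function names are assumptions made for illustration, not the authors' definitions. Entries are assumed to be hidden artificially during training, so their true values are known.

```python
def masked_mse(x_true, x_hat, hidden):
    """Mean squared error over entries flagged as hidden (1 = artificially masked)."""
    errs = [(t - p) ** 2 for t, p, h in zip(x_true, x_hat, hidden) if h]
    return sum(errs) / len(errs) if errs else 0.0


def variance(values):
    """Population variance of a flat list of numbers."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


def composite_loss(x_true, x_hat, x_hat_noisy, hidden,
                   lam_noise=0.1, lam_var=0.01):
    """Assumed composite loss: masked MSE + noise consistency + variance gap.

    x_hat       -- reconstruction from the clean (masked) input
    x_hat_noisy -- reconstruction from a noise-perturbed copy of the input
    """
    # (i) reconstruction error restricted to the hidden entries
    recon = masked_mse(x_true, x_hat, hidden)
    # (ii) assumed noise-aware term: clean and noisy reconstructions should agree
    noise = sum((a - b) ** 2 for a, b in zip(x_hat, x_hat_noisy)) / len(x_hat)
    # (iii) assumed variance penalty: reconstruction spread should match the data
    var_gap = (variance(x_hat) - variance(x_true)) ** 2
    return recon + lam_noise * noise + lam_var * var_gap
```

In this sketch a perfect, noise-insensitive reconstruction gives a loss of zero, and each term can be switched off by setting its weight to zero, which mirrors the kind of ablation the abstract mentions.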
Funding: Supported in part by the National Natural Science Foundation of China (62573387), the Natural Science Foundation of Zhejiang Province, China (LY24F030004), and the Fundamental Research Funds of Zhejiang Sci-Tech University (25222139-Y).
Abstract: The partial least squares (PLS) model maximizes the covariance between process variables and quality variables, making it widely used in quality-related fault detection. However, traditional PLS methods focus primarily on linear processes, leading to poor performance in dynamic nonlinear processes. In this paper, a novel quality-related fault detection method, named DiCAE-PLS, is developed by combining a dynamic-inner convolutional autoencoder with PLS. In the proposed DiCAE-PLS method, latent features are first extracted by the dynamic-inner convolutional autoencoder (DiCAE) to capture process dynamics and nonlinearity from the process variables. Then, a PLS model is established to relate the extracted latent features to the final product quality. To detect quality-related faults, Hotelling's T^2 statistic is employed. The developed quality-related fault detection method is applied to the widely used industrial benchmark of the Tennessee.
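As an illustration of the monitoring step, here is a minimal sketch of Hotelling's T^2 computed on latent scores. It assumes the scores are mutually uncorrelated, so the covariance matrix is diagonal and T^2 reduces to a sum of squared, variance-scaled deviations. The function names and the toy data are hypothetical and are not taken from the paper.

```python
def fit_score_stats(train_scores):
    """Per-component mean and sample variance of fault-free training scores."""
    n = len(train_scores)
    k = len(train_scores[0])
    means = [sum(row[a] for row in train_scores) / n for a in range(k)]
    variances = [
        sum((row[a] - means[a]) ** 2 for row in train_scores) / (n - 1)
        for a in range(k)
    ]
    return means, variances


def hotelling_t2(score, means, variances):
    """T^2 for one sample, assuming uncorrelated score components."""
    return sum((t - m) ** 2 / v for t, m, v in zip(score, means, variances))


# Toy fault-free scores with two uncorrelated latent components.
train = [[-1.0, 2.0], [-1.0, -2.0], [1.0, 2.0], [1.0, -2.0]]
means, variances = fit_score_stats(train)
print(hotelling_t2([2.0, 4.0], means, variances))
```

A sample is flagged when its T^2 exceeds a control limit, which is commonly derived from an F-distribution at a chosen confidence level. With correlated scores, the full inverse covariance matrix would be needed in place of the per-component variances.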
Abstract: In the field of intelligent air combat, real-time and accurate recognition of within-visual-range (WVR) maneuver actions is the foundation for constructing autonomous decision-making systems. However, existing methods face two major challenges: traditional feature engineering suffers from insufficient effective dimensionality in the feature space due to kinematic coupling, making it difficult to distinguish essential differences between maneuvers, while end-to-end deep learning models lack controllability in implicit feature learning and fail to model high-order long-range temporal dependencies. This paper proposes a trajectory feature pre-extraction method based on a Long-range Masked Autoencoder (LMAE), incorporating three key innovations: (1) Random Fragment High-ratio Masking (RFH-Mask), which forces the model to learn long-range temporal correlations by masking 80% of the trajectory data while retaining continuous fragments; (2) a Kalman Filter-Guided Objective Function (KFG-OF), which integrates trajectory continuity constraints to align the feature space with kinematic principles; and (3) a Two-stage Decoupled Architecture, which enables efficient and controllable feature learning through unsupervised pre-training and frozen-feature transfer. Experimental results demonstrate that LMAE significantly improves the average recognition accuracy for 20-class maneuvers compared to traditional end-to-end models, while significantly accelerating convergence. The contributions of this work lie in introducing high-masking-rate autoencoders into low-information-density trajectory analysis, proposing a feature engineering framework with enhanced controllability and efficiency, and providing a novel technical pathway for intelligent air combat decision-making systems.
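The masking idea can be sketched as follows: split the trajectory into fixed-length fragments, keep a random 20% of them fully visible, and mask the rest, so that every visible region is a continuous run of time steps. The fixed fragment length, the function name, and the parameters are assumptions for illustration; the abstract does not specify how RFH-Mask chooses its fragments.

```python
import random


def fragment_mask(seq_len, frag_len, mask_ratio=0.8, seed=None):
    """Return a 0/1 list over time steps (1 = masked, 0 = visible).

    Whole fragments of `frag_len` consecutive steps are kept or masked
    together, so visible data always forms continuous runs.
    """
    rng = random.Random(seed)
    n_frag = -(-seq_len // frag_len)  # ceiling division
    n_keep = max(1, round(n_frag * (1 - mask_ratio)))
    kept = set(rng.sample(range(n_frag), n_keep))
    return [0 if (i // frag_len) in kept else 1 for i in range(seq_len)]


mask = fragment_mask(seq_len=100, frag_len=5, mask_ratio=0.8, seed=0)
print(sum(mask) / len(mask))  # fraction of masked time steps
```

Masking whole fragments rather than isolated points means the model cannot fill a gap by interpolating from immediate neighbors, which is one plausible reading of why the scheme encourages long-range temporal modeling.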
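Abstract: [Objective] Mineral resources are an important material basis for human survival and economic development, and mine monitoring and mine-monitoring models matter greatly for the efficient development of mineral resources and the environmental protection of mining areas. Given the complex backgrounds, varied target scales, and clustered small targets of open-pit mining areas, this study aims to build a lightweight model that balances monitoring accuracy and efficiency, improving the accuracy and efficiency of monitoring target surface features in mining areas. [Methods] Because existing remote sensing datasets suffer from limited sample variety and narrow geographic coverage, this paper builds the OMTSFD (Open-pit Mine Typical Surface Features Dataset) from 0.9 m Tianditu and 1.8 m Google imagery, covering six major open-pit coal mine bases with different climatic backgrounds, large extents, and multiple surface feature types, and proposes an improved YOLO11-DAE algorithm for model training and validation. First, the C3K2-DBB module is introduced into the backbone network and the feature pyramid to strengthen multi-scale feature capture. Second, the ADown module replaces the network's downsampling convolutions, which strengthens the representation of different features and reduces detail loss in low-contrast scenes. Finally, the efficient E_Detect detection head reduces model complexity and parameter count, making the model lightweight. [Results] Experiments show that YOLO11-DAE runs at 528.100 frames per second (FPS), giving fast inference. Precision (P), recall (R), F1-score (F1), and mean average precision (mAP) reach 0.932, 0.894, 0.913, and 0.950, respectively, significantly better than YOLOv5n, YOLOv8n, and YOLOv10n, and 7.600%, 10.000%, 8.800%, and 8.000% higher, respectively, than YOLOv11n. [Conclusions] YOLO11-DAE meets the requirements of real-time monitoring in mining areas and suits target recognition in complex multi-scale, multi-background scenes. It achieves high accuracy and a low miss rate, balancing model applicability and real-time performance.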
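The reported F1-score can be checked against the reported precision and recall, since F1 is their harmonic mean. The short sketch below uses only the figures quoted in the abstract; the function name is illustrative.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Precision 0.932 and recall 0.894 are the values quoted in the abstract.
print(round(f1_score(0.932, 0.894), 3))  # 0.913
```

The result matches the F1 of 0.913 quoted in the abstract, so the three figures are mutually consistent.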
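Abstract: [Objective] To design a missing-value imputation method based on FIML and DAE, the clustering-based comprehensive information selective filtering encoder data imputation algorithm (CFSM-DAE), for imputing missing data in rice germplasm resources. [Methods] Clustering is used to limit the influence of outliers on the algorithm, and a selective filtering layer identifies high-quality estimates and reduces the influence of low-quality ones. Conventional DAE frameworks usually have no selective filtering layer, so all estimates are treated as equally important and high-quality and low-quality estimates cannot be told apart. To further improve estimation accuracy, the study adopts an ensemble framework that combines full information maximum likelihood (FIML) with a multi-adversarial autoencoder (DAE), giving CFSM-DAE. On top of the selective filtering layer, filling is adaptive: when an estimate does not meet the set threshold, the FIML imputation strategy is used to keep the imputed result stable and precise, further improving overall estimation accuracy. Simulated data and a real rice germplasm resource dataset were studied under three missing-data mechanisms (missing at random (MAR), missing completely at random (MCAR), and missing not at random (MNAR)), and CFSM-DAE was compared with several common imputation algorithms: full information maximum likelihood (FIML), adversarial autoencoder (DAE), K-nearest neighbors (KNN), random forest (RF), and multiple imputation by chained equations (MICE). [Results] On simulated data, CFSM-DAE achieved S_RME=0.0676, E_MA=0.0093, and R^2=0.9958; on rice germplasm resource data, it achieved S_RME=0.0395, E_MA=0.0078, and R^2=0.8913. By comparison, the S_RME of DAE on the two kinds of data was 0.8896 and 0.7707, the E_MA of KNN was 0.1183 and 0.1305, and the R^2 of FIML was 0.3382 and 0.7321. CFSM-DAE therefore improved on the other algorithms across several evaluation metrics, on both the simulated data and the rice germplasm resource data. [Conclusions] By combining clustering, selective filtering, and full information maximum likelihood, CFSM-DAE significantly improves the imputation accuracy for missing values in rice germplasm resource data, showing its effectiveness and potential for complex missing-value problems.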
Funding: Supported by the National Key Research and Development Program of China (2023YFB3307800), the National Natural Science Foundation of China (62394343, 62373155), the Major Science and Technology Project of Xinjiang (No. 2022A01006-4), the State Key Laboratory of Industrial Control Technology, China (Grant No. ICT2024A26), and the Fundamental Research Funds for the Central Universities.
Abstract: Deep learning has been widely used to model soft sensors in modern industrial processes with nonlinear variables and uncertainty. Owing to its outstanding ability for high-level feature extraction, the stacked autoencoder (SAE) has been widely used to improve the accuracy of soft sensor models. However, as the number of network layers increases, SAE may suffer serious information loss, which affects the modeling performance of soft sensors. Moreover, datasets typically contain very few labeled samples, which is challenging for traditional neural networks. In this paper, a multi-scale feature fused stacked autoencoder (MFF-SAE) is proposed for feature representation related to hierarchical output, integrating stacked autoencoder, mutual information (MI), and multi-scale feature fusion (MFF) strategies. Based on correlation analysis between output and input variables, critical hidden variables are extracted from the original variables in each autoencoder's input layer and are given correspondingly varying weights. In addition, an integration strategy based on multi-scale feature fusion is adopted to mitigate the information loss that comes with deeper networks. The MFF-SAE units are then stacked to form deep networks. Two practical industrial processes are used to evaluate the performance of MFF-SAE. Simulation results indicate that, compared with other cutting-edge techniques, the proposed method can considerably enhance the accuracy of soft sensor modeling, reducing the root mean square error (RMSE) by 71.8%, 17.1% and 64.7%, 15.1%, respectively.
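Abstract: To address the low computational efficiency and low estimation accuracy of existing battery state estimation methods for electric vehicles, a model is proposed to estimate the state of charge (SOC) and state of health (SOH) of electric vehicle batteries. A stacked denoising autoencoder (SDAE) cleans abnormal and missing values in the voltage, current, and temperature data to reduce their effect on estimation accuracy. Dynamic channel pruning (DCP) is introduced to sparsify the Informer model, improving the performance and stability of the pruned model. The cleaned data are fed into the DCPInformer model to estimate SOC and SOH accurately. Experimental results show that the proposed SDAE-DCPInformer model achieves a mean absolute error and a root mean square error of 0.25% and 0.38% for SOC, and of 0.51% and 0.64% for SOH. Compared with models such as the conventional Transformer, the proposed model predicts SOC and SOH faster, with improved estimation accuracy and better stability and generalization.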
Funding: Supported by the National Key Research and Development Program of China (2022YFB3103500), the National Natural Science Foundation of China (62473033, 62571027), in part by the Beijing Natural Science Foundation (L231012), and the State Scholarship Fund from the China Scholarship Council.
Abstract: Task-oriented point cloud sampling aims to select a representative subset from the input, tailored to specific application scenarios and task requirements. However, existing approaches rarely tackle the problem of redundancy caused by local structural similarities in 3D objects, which limits the performance of sampling. To address this issue, this paper introduces a novel task-oriented point cloud masked autoencoder-based sampling network (Point-MASNet), inspired by the masked autoencoder mechanism. Point-MASNet employs a voxel-based random non-overlapping masking strategy, which allows the model to selectively learn and capture distinctive local structural features from the input data. This approach effectively mitigates redundancy and enhances the representativeness of the sampled subset. In addition, we propose a lightweight, symmetrically structured keypoint reconstruction network, designed as an autoencoder. This network is optimized to efficiently extract latent features while enabling refined reconstructions. Extensive experiments demonstrate that Point-MASNet achieves competitive sampling performance across classification, registration, and reconstruction tasks.
Funding: Beijing Natural Science Foundation-Haidian Original Innovation Joint Foundation, Grant/Award Number: L192016; Joint Funds of the National Natural Science Foundation of China, Grant/Award Number: U21A20489; National Natural Science Foundation of China, Grant/Award Number: 62003330; Shenzhen Fundamental Research Funds, Grant/Award Numbers: JCYJ20220818101608019, JCYJ20190807170407391, JCYJ20180507182415428; Natural Science Foundation of Guangdong Province, Grant/Award Number: 2019A1515011699; Guangdong-Hong Kong-Macao Joint Laboratory of Human-Machine Intelligence-Synergy Systems, Shenzhen Institute of Advanced Technology.
Abstract: Osteoporosis is a major cause of bone fracture and can be characterised by both mass loss and microstructure deterioration of the bone. Osteoporosis is currently assessed by measuring bone mineral density, which cannot reveal the pathological condition at the mesoscale. To obtain mesoscale information from computed tomography (CT), a super-resolution (SR) approach for volumetric imaging data is required. A deep learning model, AESR3D, is proposed to recover high-resolution (HR) Micro-CT from low-resolution Micro-CT and to perform unsupervised segmentation for better trabecular observation and measurement. A new regularised overcomplete autoencoder framework for the SR task is proposed and theoretically analysed. It achieves the best structural similarity on the trabecular CT SR task compared with state-of-the-art models from both natural and medical image SR. The HR and SR images show a high correlation (r=0.996, intraclass correlation coefficient=0.917) on trabecular bone morphological indicators. The results also demonstrate the effectiveness of the regularisation framework when training a large-capacity model.
Funding: Funded by the Centre for Advanced Modelling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and IT, University of Technology Sydney, and supported by the Researchers Supporting Project, King Saud University, Riyadh, Saudi Arabia, under Project RSP2025 R14.
Abstract: Electronic nose measurements and thermal images are effective ways to diagnose the presence of gases in real time. Multimodal fusion of these modalities can result in highly accurate diagnostic systems. Low-cost thermal imaging software produces low-resolution thermal images in grayscale format, hence necessitating methods for improving the resolution and colorizing the images. The objective of this paper is to develop and train a super-resolution generative adversarial network for improving the resolution of the thermal images, followed by a sparse autoencoder for colorization of thermal images and a multimodal convolutional neural network for gas detection using electronic nose and thermal images. The dataset used comprises 6400 thermal images and electronic nose measurements for four classes. A multimodal Convolutional Neural Network (CNN) comprising an EfficientNetB2 pre-trained model was developed using both early and late feature fusion. The Super Resolution Generative Adversarial Network (SRGAN) model was developed and trained on low- and high-resolution thermal images. A sparse autoencoder was trained on the grayscale and colorized thermal images. The SRGAN achieved a Structural Similarity Index (SSIM) of 90.28, a Peak Signal-to-Noise Ratio (PSNR) of 68.74, and a Mean Absolute Error (MAE) of 0.066. The autoencoder model produced an MAE of 0.035, a Mean Squared Error (MSE) of 0.006, and a Root Mean Squared Error (RMSE) of 0.0705. The multimodal CNN, trained on these images and electronic nose measurements using both early and late fusion techniques, achieved accuracies of 97.89% and 98.55%, respectively. Hence, the proposed framework can be of great aid for integration with low-cost software to generate high-quality thermal camera images and highly accurate detection of gases in real time.
Funding: Support of CNPq (Brazilian Ministry of Science and Technology Agency), of CAPES (Higher Education Improvement Agency), of FAPESP (São Paulo Research Foundation) under grant #2022/13045-1, of VALE Catedra Under Rail, and of Base Funding UIDB/04708/2020 and Programmatic Funding UIDP/04708/2020 of the CONSTRUCT "Instituto de I&D em Estruturas e Construções".
Abstract: High-speed railway bridges are essential components of any railway transportation system that should keep adequate levels of serviceability and safety. In this context, drive-by methodologies have emerged as a feasible and cost-effective monitoring solution for detecting damage on railway bridges while minimizing train operation interruptions. Moreover, integrating advanced sensor technologies and machine learning algorithms has significantly enhanced structural health monitoring (SHM) for bridges. Despite being increasingly used in traditional SHM applications, studies using autoencoders within drive-by methodologies are rare, especially in the railway field. This study presents a novel approach for drive-by damage detection in high-speed railway (HSR) bridges. The methodology relies on acceleration records collected from multiple bridge crossings by an operational train equipped with onboard sensors. Log-Mel spectrogram features derived from the acceleration records are used together with sparse autoencoders for computing statistical distribution-based damage indexes. Numerical simulations were performed on a 3D vehicle-track-bridge interaction system model implemented in Matlab to evaluate the robustness and effectiveness of the proposed approach, considering several damage scenarios, vehicle speeds, and environmental and operational variations, such as multiple track irregularities and varying measurement noise. The results show that the proposed approach can successfully detect damage, as well as characterize its severity, especially for very early-stage damage. This demonstrates the high potential of applying Mel-frequency damage-sensitive features associated with machine learning algorithms in the drive-by condition assessment of high-speed railway bridges.
Funding: National Key Research and Development Program of China, Grant/Award Number: 2018YFB2101003; National Natural Science Foundation of China, Grant/Award Numbers: 51991395, U1806226, 51778033, 51822802, 71901011, U1811463, 51991391; Science and Technology Major Project of Beijing, Grant/Award Number: Z191100002519012.
Abstract: Predicting future mechanical behaviors is crucial for preventing structural disasters. In underground construction especially, structural mechanical behaviors are affected by multiple internal and external factors because of the complex conditions. Because existing models fail to take all of these factors into account and struggle to predict multiple time series accurately at the same time, this study proposes an improved prediction model based on an autoencoder fused with a long- and short-term time-series network and driven by massive monitoring data. The proposed model was then formalized on multiple time series of strain monitoring data. A comparison with a classical baseline and an ablation experiment were also conducted to verify the effectiveness of the prediction model. The results indicate that the proposed model is clearly superior in predicting the future mechanical behaviors of structures. As a case study, the model was applied to the Nanjing Dinghuaimen tunnel to predict future strain variation on different time scales.