Missing data presents a crucial challenge in data analysis,especially in high-dimensional datasets,where missing data often leads to biased conclusions and degraded model performance.In this study,we present a novel a...Missing data presents a crucial challenge in data analysis,especially in high-dimensional datasets,where missing data often leads to biased conclusions and degraded model performance.In this study,we present a novel autoencoder-based imputation framework that integrates a composite loss function to enhance robustness and precision.The proposed loss combines(i)a guided,masked mean squared error focusing on missing entries;(ii)a noise-aware regularization term to improve resilience against data corruption;and(iii)a variance penalty to encourage expressive yet stable reconstructions.We evaluate the proposed model across four missingness mechanisms,such as Missing Completely at Random,Missing at Random,Missing Not at Random,and Missing Not at Random with quantile censorship,under systematically varied feature counts,sample sizes,and missingness ratios ranging from 5%to 60%.Four publicly available real-world datasets(Stroke Prediction,Pima Indians Diabetes,Cardiovascular Disease,and Framingham Heart Study)were used,and the obtained results show that our proposed model consistently outperforms baseline methods,including traditional and deep learning-based techniques.An ablation study reveals the additive value of each component in the loss function.Additionally,we assessed the downstream utility of imputed data through classification tasks,where datasets imputed by the proposed method yielded the highest receiver operating characteristic area under the curve scores across all scenarios.The model demonstrates strong scalability and robustness,improving performance with larger datasets and higher feature counts.These results underscore the capacity of the proposed method to produce not only numerically accurate but also semantically useful imputations,making it a promising solution for robust data recovery in clinical applications.展开更多
Partial least squares (PLS) model maximizes the covariance between process variables and quality variables,making it widely used in quality-related fault detection.However,traditional PLS methods focus primarily on li...Partial least squares (PLS) model maximizes the covariance between process variables and quality variables,making it widely used in quality-related fault detection.However,traditional PLS methods focus primarily on linear processes,leading to poor performance in dynamic nonlinear processes.In this paper,a novel quality-related fault detection method,named DiCAE-PLS,is developed by combining dynamic-inner convolutional autoencoder with PLS.In the proposed DiCAE-PLS method,latent features are first extracted through dynamic-inner convolutional autoencoder (DiCAE) to capture process dynamics and nonlinearity from process variables.Then,a PLS model is established to build the relationship between the extracted latent features and the final product quality.To detect quality-related faults,Hotelling's T^(2) statistic is employed.The developed quality-related fault detection is applied to the widely used industrial benchmark of the Tennessee.展开更多
In the field of intelligent air combat,real-time and accurate recognition of within-visual-range(WVR)maneuver actions serves as the foundational cornerstone for constructing autonomous decision-making systems.However,...In the field of intelligent air combat,real-time and accurate recognition of within-visual-range(WVR)maneuver actions serves as the foundational cornerstone for constructing autonomous decision-making systems.However,existing methods face two major challenges:traditional feature engineering suffers from insufficient effective dimensionality in the feature space due to kinematic coupling,making it difficult to distinguish essential differences between maneuvers,while end-to-end deep learning models lack controllability in implicit feature learning and fail to model high-order long-range temporal dependencies.This paper proposes a trajectory feature pre-extraction method based on a Long-range Masked Autoencoder(LMAE),incorporating three key innovations:(1)Random Fragment High-ratio Masking(RFH-Mask),which enforces the model to learn long-range temporal correlations by masking 80%of trajectory data while retaining continuous fragments;(2)Kalman Filter-Guided Objective Function(KFG-OF),integrating trajectory continuity constraints to align the feature space with kinematic principles;and(3)Two-stage Decoupled Architecture,enabling efficient and controllable feature learning through unsupervised pre-training and frozen-feature transfer.Experimental results demonstrate that LMAE significantly improves the average recognition accuracy for 20-class maneuvers compared to traditional end-to-end models,while significantly accelerating convergence speed.The contributions of this work lie in:introducing high-masking-rate autoencoders into low-informationdensity trajectory analysis,proposing a feature engineering framework with enhanced controllability and efficiency,and providing a novel technical pathway for intelligent air combat decision-making systems.展开更多
为提升高风电渗透率电网的频率稳定性与逐步惯性控制SIC参数自适应性,针对功率恢复阶段二次频率跌落的核心问题,采用变分自编码器与门控循环单元融合架构,提出一种结合灰狼优化算法样本生成、变分自编码器非线性特征提取与降噪、门控循...为提升高风电渗透率电网的频率稳定性与逐步惯性控制SIC参数自适应性,针对功率恢复阶段二次频率跌落的核心问题,采用变分自编码器与门控循环单元融合架构,提出一种结合灰狼优化算法样本生成、变分自编码器非线性特征提取与降噪、门控循环单元动态映射建模的风电调频优化控制方法。通过改进IEEE39节点系统构建仿真模型,验证所提方法的抑制效果与鲁棒性。结果表明:相较于传统智能方法,文中方法控制参数预测的关键误差指标最大降低53.3%,在5~25 d B噪声环境下综合性能指标稳定高于0.85,25%数据缺失场景下仍保持0.7以上评分,系统频率二次跌落幅度显著减小,为高比例风电并网系统提供了兼具自适应与抗干扰能力的调频解决方案。展开更多
剩余寿命(Remaining Useful Life, RUL)预测对于维护工业设备的可靠性和安全性至关重要,但现有的RUL预测方法在处理高维传感器数据以及捕捉时间退化模式方面仍然面临诸多挑战。为了解决上述问题,提出一种退化趋势平滑约束下基于双向长...剩余寿命(Remaining Useful Life, RUL)预测对于维护工业设备的可靠性和安全性至关重要,但现有的RUL预测方法在处理高维传感器数据以及捕捉时间退化模式方面仍然面临诸多挑战。为了解决上述问题,提出一种退化趋势平滑约束下基于双向长短时记忆网络-变分自编码器(Bidirectional Long Short Term-Memory-Variational Auto Encoder, BLSTM-VAE)的RUL预测方法。该方法首先进行数据预处理,包括数据降噪、滑动窗口分段和标签修正等步骤。然后设计基于BLSTM的VAE型特征提取器,以有效提取时间序列数据中的非线性关系和长距离依赖关系。最后提出一种基于流形学习的退化趋势平滑约束模块,通过局部不变性假设来增强模型的稳健性和泛化能力。通过航空发动机数据集数据集进行验证,结果表明所提出的RUL预测方法在数据集上的表现优于现有的多种RUL预测方法,具有更低的预测误差和更高的稳定性。展开更多
文摘Missing data presents a crucial challenge in data analysis,especially in high-dimensional datasets,where missing data often leads to biased conclusions and degraded model performance.In this study,we present a novel autoencoder-based imputation framework that integrates a composite loss function to enhance robustness and precision.The proposed loss combines(i)a guided,masked mean squared error focusing on missing entries;(ii)a noise-aware regularization term to improve resilience against data corruption;and(iii)a variance penalty to encourage expressive yet stable reconstructions.We evaluate the proposed model across four missingness mechanisms,such as Missing Completely at Random,Missing at Random,Missing Not at Random,and Missing Not at Random with quantile censorship,under systematically varied feature counts,sample sizes,and missingness ratios ranging from 5%to 60%.Four publicly available real-world datasets(Stroke Prediction,Pima Indians Diabetes,Cardiovascular Disease,and Framingham Heart Study)were used,and the obtained results show that our proposed model consistently outperforms baseline methods,including traditional and deep learning-based techniques.An ablation study reveals the additive value of each component in the loss function.Additionally,we assessed the downstream utility of imputed data through classification tasks,where datasets imputed by the proposed method yielded the highest receiver operating characteristic area under the curve scores across all scenarios.The model demonstrates strong scalability and robustness,improving performance with larger datasets and higher feature counts.These results underscore the capacity of the proposed method to produce not only numerically accurate but also semantically useful imputations,making it a promising solution for robust data recovery in clinical applications.
基金supported in part by the National Natural Science Foundation of China(62573387)the Natural Science Foundation of Zhejiang province,China(LY24F030004)the Fundamental Research Funds of Zhejiang Sci-Tech University(25222139-Y).
文摘Partial least squares (PLS) model maximizes the covariance between process variables and quality variables,making it widely used in quality-related fault detection.However,traditional PLS methods focus primarily on linear processes,leading to poor performance in dynamic nonlinear processes.In this paper,a novel quality-related fault detection method,named DiCAE-PLS,is developed by combining dynamic-inner convolutional autoencoder with PLS.In the proposed DiCAE-PLS method,latent features are first extracted through dynamic-inner convolutional autoencoder (DiCAE) to capture process dynamics and nonlinearity from process variables.Then,a PLS model is established to build the relationship between the extracted latent features and the final product quality.To detect quality-related faults,Hotelling's T^(2) statistic is employed.The developed quality-related fault detection is applied to the widely used industrial benchmark of the Tennessee.
文摘In the field of intelligent air combat,real-time and accurate recognition of within-visual-range(WVR)maneuver actions serves as the foundational cornerstone for constructing autonomous decision-making systems.However,existing methods face two major challenges:traditional feature engineering suffers from insufficient effective dimensionality in the feature space due to kinematic coupling,making it difficult to distinguish essential differences between maneuvers,while end-to-end deep learning models lack controllability in implicit feature learning and fail to model high-order long-range temporal dependencies.This paper proposes a trajectory feature pre-extraction method based on a Long-range Masked Autoencoder(LMAE),incorporating three key innovations:(1)Random Fragment High-ratio Masking(RFH-Mask),which enforces the model to learn long-range temporal correlations by masking 80%of trajectory data while retaining continuous fragments;(2)Kalman Filter-Guided Objective Function(KFG-OF),integrating trajectory continuity constraints to align the feature space with kinematic principles;and(3)Two-stage Decoupled Architecture,enabling efficient and controllable feature learning through unsupervised pre-training and frozen-feature transfer.Experimental results demonstrate that LMAE significantly improves the average recognition accuracy for 20-class maneuvers compared to traditional end-to-end models,while significantly accelerating convergence speed.The contributions of this work lie in:introducing high-masking-rate autoencoders into low-informationdensity trajectory analysis,proposing a feature engineering framework with enhanced controllability and efficiency,and providing a novel technical pathway for intelligent air combat decision-making systems.
文摘为提升高风电渗透率电网的频率稳定性与逐步惯性控制SIC参数自适应性,针对功率恢复阶段二次频率跌落的核心问题,采用变分自编码器与门控循环单元融合架构,提出一种结合灰狼优化算法样本生成、变分自编码器非线性特征提取与降噪、门控循环单元动态映射建模的风电调频优化控制方法。通过改进IEEE39节点系统构建仿真模型,验证所提方法的抑制效果与鲁棒性。结果表明:相较于传统智能方法,文中方法控制参数预测的关键误差指标最大降低53.3%,在5~25 d B噪声环境下综合性能指标稳定高于0.85,25%数据缺失场景下仍保持0.7以上评分,系统频率二次跌落幅度显著减小,为高比例风电并网系统提供了兼具自适应与抗干扰能力的调频解决方案。
文摘剩余寿命(Remaining Useful Life, RUL)预测对于维护工业设备的可靠性和安全性至关重要,但现有的RUL预测方法在处理高维传感器数据以及捕捉时间退化模式方面仍然面临诸多挑战。为了解决上述问题,提出一种退化趋势平滑约束下基于双向长短时记忆网络-变分自编码器(Bidirectional Long Short Term-Memory-Variational Auto Encoder, BLSTM-VAE)的RUL预测方法。该方法首先进行数据预处理,包括数据降噪、滑动窗口分段和标签修正等步骤。然后设计基于BLSTM的VAE型特征提取器,以有效提取时间序列数据中的非线性关系和长距离依赖关系。最后提出一种基于流形学习的退化趋势平滑约束模块,通过局部不变性假设来增强模型的稳健性和泛化能力。通过航空发动机数据集数据集进行验证,结果表明所提出的RUL预测方法在数据集上的表现优于现有的多种RUL预测方法,具有更低的预测误差和更高的稳定性。