Underground engineering projects such as deep tunnel excavation often encounter rockburst disasters accompanied by numerous microseismic events.Rapid interpretation of microseismic signals is crucial for the timely id...Underground engineering projects such as deep tunnel excavation often encounter rockburst disasters accompanied by numerous microseismic events.Rapid interpretation of microseismic signals is crucial for the timely identification of rockbursts.However,conventional processing encompasses multi-step workflows,including classification,denoising,picking,locating,and computational analysis,coupled with manual intervention,which collectively compromise the reliability of early warnings.To address these challenges,this study innovatively proposes the“microseismic stethoscope"-a multi-task machine learning and deep learning model designed for the automated processing of massive microseismic signals.This model efficiently extracts three key parameters that are necessary for recognizing rockburst disasters:rupture location,microseismic energy,and moment magnitude.Specifically,the model extracts raw waveform features from three dedicated sub-networks:a classifier for source zone classification,and two regressors for microseismic energy and moment magnitude estimation.This model demonstrates superior efficiency compared to traditional processing and semi-automated processing,reducing per-event processing time from 0.71 s to 0.49 s to merely 0.036 s.It concurrently achieves 98%accuracy in source zone classification,with microseismic energy and moment magnitude estimation errors of 0.13 and 0.05,respectively.This model has been well applied and validated in the Daxiagu Tunnel case in Sichuan,China.The application results indicate that the model is as accurate as traditional methods in determining source parameters,and thus can be used to identify potential geomechanical processes of rockburst disasters.By enhancing the signal processing reliability of microseismic events,the proposed model in this study presents a significant advancement in the identification of rockburst disasters.展开更多
Oxide dispersion strengthened(ODS)alloys are extensively used owing to high thermostability and creep strength contributed from uniformly dispersed fine oxides particles.However,the existence of these strengthening pa...Oxide dispersion strengthened(ODS)alloys are extensively used owing to high thermostability and creep strength contributed from uniformly dispersed fine oxides particles.However,the existence of these strengthening particles also deteriorates the processability and it is of great importance to establish accurate processing maps to guide the thermomechanical processes to enhance the formability.In this study,we performed particle swarm optimization-based back propagation artificial neural network model to predict the high temperature flow behavior of 0.25wt%Al2O3 particle-reinforced Cu alloys,and compared the accuracy with that of derived by Arrhenius-type constitutive model and back propagation artificial neural network model.To train these models,we obtained the raw data by fabricating ODS Cu alloys using the internal oxidation and reduction method,and conducting systematic hot compression tests between 400 and800℃with strain rates of 10^(-2)-10 S^(-1).At last,processing maps for ODS Cu alloys were proposed by combining processing parameters,mechanical behavior,microstructure characterization,and the modeling results achieved a coefficient of determination higher than>99%.展开更多
The key challenge in the preparation of perovskite solar cells is to enhance the reproducibility of PSC manufacturing,particularly by better controlling multiple high-dimensional process parameters.This study proposes...The key challenge in the preparation of perovskite solar cells is to enhance the reproducibility of PSC manufacturing,particularly by better controlling multiple high-dimensional process parameters.This study proposes a machine learning(ML)approach to efficiently predict and analyze perovskite film fabrication processes.By evaluating five classic ML algorithms on 130 experimental data sets from blade-coating parameters,the Random Forest(RF)model was identified as the most effective,enabling rapid prediction of over 100,000 parameter sets in just 10 min-equivalent to 3 years of manual experimentation.The RF model demonstrated strong predictive accuracy,with an R^(2) close to 0.8.This approach led to the identification of optimal process parameter combinations,significantly improving the reproducibility of PSCs and reducing performance variance by approximately threefold,thereby advancing the development of scalable manufacturing processes.展开更多
This study seeks to establish a novel,semi-automatic system that utilizes Industry 4.0 principles to effectively determine both acceptable and rejectable concrete cubes with regard to their failure modes,significantly...This study seeks to establish a novel,semi-automatic system that utilizes Industry 4.0 principles to effectively determine both acceptable and rejectable concrete cubes with regard to their failure modes,significantly contributing to the dependability of concrete quality evaluations.The study utilizes image processing and machine learning(ML)methods,namely object detectionmodels such as YOLOv8 and Convolutional Neural Networks(CNNs),to evaluate images of concrete cubes.These models are trained and validated on an extensive database of annotated images from real-world and laboratory conditions.Preliminary results indicate a good performance in the classification of concrete cube failure modes.The proposed system accurately identifies cracks,determines the severity of damage to structures,indicating the potential to minimize human errors and discrepancies that might occur through the current techniques to detect the failure mode of concrete cubes.Thedeveloped systemcould significantly improve the reliability of concrete cube assessments,reduce resource wastage,and contribute to more sustainable construction practices.By minimizing material costs and errors,this innovation supports the construction industry’s move towards sustainability.展开更多
Low-voltage direct current(DC)microgrids have recently emerged as a promising and viable alternative to traditional alternating cur-rent(AC)microgrids,offering numerous advantages.Consequently,researchers are explorin...Low-voltage direct current(DC)microgrids have recently emerged as a promising and viable alternative to traditional alternating cur-rent(AC)microgrids,offering numerous advantages.Consequently,researchers are exploring the potential of DC microgrids across var-ious configurations.However,despite the sustainability and accuracy offered by DC microgrids,they pose various challenges when integrated into modern power distribution systems.Among these challenges,fault diagnosis holds significant importance.Rapid fault detection in DC microgrids is essential to maintain stability and ensure an uninterrupted power supply to critical loads.A primary chal-lenge is the lack of standards and guidelines for the protection and safety of DC microgrids,including fault detection,location,and clear-ing procedures for both grid-connected and islanded modes.In response,this study presents a brief overview of various approaches for protecting DC microgrids.展开更多
Deep learning now underpins many state-of-the-art systems for biomedical image and signal processing,enabling automated lesion detection,physiological monitoring,and therapy planning with accuracy that rivals expert p...Deep learning now underpins many state-of-the-art systems for biomedical image and signal processing,enabling automated lesion detection,physiological monitoring,and therapy planning with accuracy that rivals expert performance.This survey reviews the principal model families as convolutional,recurrent,generative,reinforcement,autoencoder,and transfer-learning approaches as emphasising how their architectural choices map to tasks such as segmentation,classification,reconstruction,and anomaly detection.A dedicated treatment of multimodal fusion networks shows how imaging features can be integrated with genomic profiles and clinical records to yield more robust,context-aware predictions.To support clinical adoption,we outline post-hoc explainability techniques(Grad-CAM,SHAP,LIME)and describe emerging intrinsically interpretable designs that expose decision logic to end users.Regulatory guidance from the U.S.FDA,the European Medicines Agency,and the EU AI Act is summarised,linking transparency and lifecycle-monitoring requirements to concrete development practices.Remaining challenges as data imbalance,computational cost,privacy constraints,and cross-domain generalization are discussed alongside promising solutions such as federated learning,uncertainty quantification,and lightweight 3-D architectures.The article therefore offers researchers,clinicians,and policymakers a concise,practice-oriented roadmap for deploying trustworthy deep-learning systems in healthcare.展开更多
Covert timing channels(CTC)exploit network resources to establish hidden communication pathways,posing signi cant risks to data security and policy compliance.erefore,detecting such hidden and dangerous threats remain...Covert timing channels(CTC)exploit network resources to establish hidden communication pathways,posing signi cant risks to data security and policy compliance.erefore,detecting such hidden and dangerous threats remains one of the security challenges. is paper proposes LinguTimeX,a new framework that combines natural language processing with arti cial intelligence,along with explainable Arti cial Intelligence(AI)not only to detect CTC but also to provide insights into the decision process.LinguTimeX performs multidimensional feature extraction by fusing linguistic attributes with temporal network patterns to identify covert channels precisely.LinguTimeX demonstrates strong e ectiveness in detecting CTC across multiple languages;namely English,Arabic,and Chinese.Speci cally,the LSTM and RNN models achieved F1 scores of 90%on the English dataset,89%on the Arabic dataset,and 88%on the Chinese dataset,showcasing their superior performance and ability to generalize across multiple languages. is highlights their robustness in detecting CTCs within security systems,regardless of the language or cultural context of the data.In contrast,the DeepForest model produced F1-scores ranging from 86%to 87%across the same datasets,further con rming its e ectiveness in CTC detection.Although other algorithms also showed reasonable accuracy,the LSTM and RNN models consistently outperformed them in multilingual settings,suggesting that deep learning models might be better suited for this particular problem.展开更多
Human Activity Recognition(HAR)is a novel area for computer vision.It has a great impact on healthcare,smart environments,and surveillance while is able to automatically detect human behavior.It plays a vital role in ...Human Activity Recognition(HAR)is a novel area for computer vision.It has a great impact on healthcare,smart environments,and surveillance while is able to automatically detect human behavior.It plays a vital role in many applications,such as smart home,healthcare,human computer interaction,sports analysis,and especially,intelligent surveillance.In this paper,we propose a robust and efficient HAR system by leveraging deep learning paradigms,including pre-trained models,CNN architectures,and their average-weighted fusion.However,due to the diversity of human actions and various environmental influences,as well as a lack of data and resources,achieving high recognition accuracy remain elusive.In this work,a weighted average ensemble technique is employed to fuse three deep learning models:EfficientNet,ResNet50,and a custom CNN.The results of this study indicate that using a weighted average ensemble strategy for developing more effective HAR models may be a promising idea for detection and classification of human activities.Experiments by using the benchmark dataset proved that the proposed weighted ensemble approach outperformed existing approaches in terms of accuracy and other key performance measures.The combined average-weighted ensemble of pre-trained and CNN models obtained an accuracy of 98%,compared to 97%,96%,and 95%for the customized CNN,EfficientNet,and ResNet50 models,respectively.展开更多
To investigate the complex relationship between rolling process parameters and mechanical properties of AZ31 magnesium alloy rolled sheets,the Leave-One-Out Cross-Validation(LOOCV)and parameter tuning were applied to ...To investigate the complex relationship between rolling process parameters and mechanical properties of AZ31 magnesium alloy rolled sheets,the Leave-One-Out Cross-Validation(LOOCV)and parameter tuning were applied to optimizing hyper-parameters for the four(BPNN,SVR,RF,and KNN)machine learning models.An interpretable prediction model based on machine learning and SHapley Additive exPlanations(SHAP),as well as an analytical method combining the SHAP model and the Pearson Correlation Coefficient(PCC),were proposed.The results showed that among the four models,the SVR model was able to simultaneously and accurately predict the ultimate tensile strength(UTS)and elongation(EL).According to the combination analysis of PCC and the magnesium alloy rolling forming mechanism,it was found that strain rate and reduction displayed a negative and positive correlation with UTS,respectively,while rolling temperature and reduction illustrated a positive and negative correlation with EL,respectively.Through the SHAP method,which could interpret the output results of the SVR machine learning model,it was deduced that reduction and strain rate played an important role in the SVR model of the outputs of the UTS and EL,respectively.Combining SHAP with PCC,it was found that strain rate and reduction had a greater influence on the UTS than rolling temperature,whereas strain rate and rolling temperature had more influence on the EL compared to reduction.展开更多
The estimation of the Number of Sources(NoS)is a significant challenge in signal processing,particularly due to the impact of colored noise on the performance of NoS estimation.This paper proposes a Multidimensional F...The estimation of the Number of Sources(NoS)is a significant challenge in signal processing,particularly due to the impact of colored noise on the performance of NoS estimation.This paper proposes a Multidimensional Feature Network(MFNet)which is designed for NoS estimation by extracting features of the sampled received signals and Sampled Covariance Matrix(SCM).The MFNet treats the raw signal and the SCM as two different types of data,and is able to achieve NoS estimation under colored noise and imperfect array.MFNet employs the Gated Recurrent Unit(GRU)to capture sequential information from the original signal data and to construct the Pseudo Covariance Matrix(PCM).Subsequently,various dimensional features,including eigenvalues and the Gerschgorin disk radius,are extracted from both the PCM and SCM,which are then jointly input into the subsequent network.An overall accuracy of 82%can be achieved after network training.The ablation experimental results demonstrate the effectiveness of multiple inputs.And simulation results demonstrate that the proposed MFNet achieves higher estimation accuracy compared to existing algorithms and exhibits greater robustness against colored noise.展开更多
Recent advances in deep learning have significantly improved flood detection and segmentation from aerial and satellite imagery.However,conventional convolutional neural networks(CNNs)often struggle in complex flood s...Recent advances in deep learning have significantly improved flood detection and segmentation from aerial and satellite imagery.However,conventional convolutional neural networks(CNNs)often struggle in complex flood scenarios involving reflections,occlusions,or indistinct boundaries due to limited contextual modeling.To address these challenges,we propose a hybrid flood segmentation framework that integrates a Vision Transformer(ViT)encoder with a U-Net decoder,enhanced by a novel Flood-Aware Refinement Block(FARB).The FARB module improves boundary delineation and suppresses noise by combining residual smoothing with spatial-channel attention mechanisms.We evaluate our model on a UAV-acquired flood imagery dataset,demonstrating that the proposed ViTUNet+FARB architecture outperforms existing CNN and Transformer-based models in terms of accuracy and mean Intersection over Union(mIoU).Detailed ablation studies further validate the contribution of each component,confirming that the FARB design significantly enhances segmentation quality.To its better performance and computational efficiency,the proposed framework is well-suited for flood monitoring and disaster response applications,particularly in resource-constrained environments.展开更多
This paper proposes a novel framework based on the Stackelberg game and deep reinforcement learning for multi-microgrids(MGs)in achieving peer-to-peer(P2P)energy trading.A multi-leaders,multi-followers Stackelberg gam...This paper proposes a novel framework based on the Stackelberg game and deep reinforcement learning for multi-microgrids(MGs)in achieving peer-to-peer(P2P)energy trading.A multi-leaders,multi-followers Stackelberg game is utilized to model the P2P energy trading process.Stackelberg equilibrium(SE)is regarded as a P2P optimal trading strategy.A two-stage privacy protection solution technique combining data-driven and model-driven is developed to obtain the SE.Specifically,energy storage scheduling problem in MGs is formulated as a Markov decision process with discrete periods,and a multi-action single-observation deep deterministic policy gradient(MASO-DDPG)algorithm is proposed to tackle optimal scheduling of energy storage in the first stage.According to optimal scheduling of energy storage,the closed-form expression for SE based on model-driven is derived,and distributed SE solution technique(DSET)is developed to obtain SE in the second stage.Case studies involving a 4-Microgrid demonstrate the P2P electricity price obtained by the two-stage method,as a novel pricing mechanism,can reasonably regulate microgrid operation mode and improve microgrid income participating in the P2P market,which verifies effectiveness and superiority of the proposed P2P energy trading model and two-stage solution method.展开更多
In this study,an automated multimodal system for detecting,classifying,and dating fruit was developed using a two-stage YOLOv11 pipeline.In the first stage,the YOLOv11 detection model locates individual date fruits in...In this study,an automated multimodal system for detecting,classifying,and dating fruit was developed using a two-stage YOLOv11 pipeline.In the first stage,the YOLOv11 detection model locates individual date fruits in real time by drawing bounding boxes around them.These bounding boxes are subsequently passed to a YOLOv11 classification model,which analyzes cropped images and assigns class labels.An additional counting module automatically tallies the detected fruits,offering a near-instantaneous estimation of quantity.The experimental results suggest high precision and recall for detection,high classification accuracy(across 15 classes),and near-perfect counting in real time.This paper presents a multi-stage pipeline for date fruit detection,classification,and automated counting,employing YOLOv11-based models to achieve high accuracy while maintaining real-time throughput.The results demonstrated that the detection precision exceeded 90%,the classification accuracy approached 92%,and the counting module correlated closely with the manual tallies.These findings confirm the potential of reducing manual labour and enhancing operational efficiency in post-harvesting processes.Future studies will include dataset expansion,user-centric interfaces,and integration with harvesting robotics.展开更多
The malicious dissemination of hate speech via compromised accounts,automated bot networks and malware-driven social media campaigns has become a growing cybersecurity concern.Automatically detecting such content in S...The malicious dissemination of hate speech via compromised accounts,automated bot networks and malware-driven social media campaigns has become a growing cybersecurity concern.Automatically detecting such content in Spanish is challenging due to linguistic complexity and the scarcity of annotated resources.In this paper,we compare two predominant AI-based approaches for the forensic detection of malicious hate speech:(1)finetuning encoder-only models that have been trained in Spanish and(2)In-Context Learning techniques(Zero-and Few-Shot Learning)with large-scale language models.Our approach goes beyond binary classification,proposing a comprehensive,multidimensional evaluation that labels each text by:(1)type of speech,(2)recipient,(3)level of intensity(ordinal)and(4)targeted group(multi-label).Performance is evaluated using an annotated Spanish corpus,standard metrics such as precision,recall and F1-score and stability-oriented metrics to evaluate the stability of the transition from zero-shot to few-shot prompting(Zero-to-Few Shot Retention and Zero-to-Few Shot Gain)are applied.The results indicate that fine-tuned encoder-only models(notably MarIA and BETO variants)consistently deliver the strongest and most reliable performance:in our experiments their macro F1-scores lie roughly in the range of approximately 46%–66%depending on the task.Zero-shot approaches are much less stable and typically yield substantially lower performance(observed F1-scores range approximately 0%–39%),often producing invalid outputs in practice.Few-shot prompting(e.g.,Qwen 38B,Mistral 7B)generally improves stability and recall relative to pure zero-shot,bringing F1-scores into a moderate range of approximately 20%–51%but still falling short of fully fine-tuned models.These findings highlight the importance of supervised adaptation and discuss the potential of both paradigms as components in AI-powered cybersecurity and malware forensics systems designed to identify and mitigate coordinated online hate campaigns.展开更多
Underwater images frequently suffer from chromatic distortion,blurred details,and low contrast,posing significant challenges for enhancement.This paper introduces AquaTree,a novel underwater image enhancement(UIE)meth...Underwater images frequently suffer from chromatic distortion,blurred details,and low contrast,posing significant challenges for enhancement.This paper introduces AquaTree,a novel underwater image enhancement(UIE)method that reformulates the task as a Markov Decision Process(MDP)through the integration of Monte Carlo Tree Search(MCTS)and deep reinforcement learning(DRL).The framework employs an action space of 25 enhancement operators,strategically grouped for basic attribute adjustment,color component balance,correction,and deblurring.Exploration within MCTS is guided by a dual-branch convolutional network,enabling intelligent sequential operator selection.Our core contributions include:(1)a multimodal state representation combining CIELab color histograms with deep perceptual features,(2)a dual-objective reward mechanism optimizing chromatic fidelity and perceptual consistency,and(3)an alternating training strategy co-optimizing enhancement sequences and network parameters.We further propose two inference schemes:an MCTS-based approach prioritizing accuracy at higher computational cost,and an efficient network policy enabling real-time processing with minimal quality loss.Comprehensive evaluations on the UIEB Dataset and Color correction and haze removal comparisons on the U45 Dataset demonstrate AquaTree’s superiority,significantly outperforming nine state-of-the-art methods across five established underwater image quality metrics.展开更多
The wireless cloud robotic system(WCRS),which fully integrates sensing,communication,computing,and control capabilities as an intelligent agent,is a promising way to achieve intelligent manufacturing due to easy deplo...The wireless cloud robotic system(WCRS),which fully integrates sensing,communication,computing,and control capabilities as an intelligent agent,is a promising way to achieve intelligent manufacturing due to easy deployment and flexible expansion.However,the high-precision control of WCRS requires deterministic wireless communication,which is always challenging in the complex and dynamic radio space.This paper employs the reconfigurable intelligent surface(RIS)to establish a novel RIS-assisted WCRS architecture,where the radio channel is controlled to achieve ultra-reliable,low-delay,and low-jitter communication for high-precision closed-loop motion control.However,control and communication are strongly coupled and should be co-optimized.Fully considering the constraints of control input threshold,control delay deadline,beam phase,antenna power,and information distortion,we establish a stability maximization problem to jointly optimize control input compensation,RIS phase shift,and beamforming.Herein,a new jitter-oriented system stability objective with respect to control error and communication jitter is defined and the closed-form expression of control delay deadline is derived based on the Jensen Inequality and Lyapunov-Krasovskii functional.Due to the time-varying and partial observability of the channel and robot states,we model the problem as a partially observable Markov decision process(POMDP).To solve this complex problem,we propose a multi-agent transfer reinforcement learning algorithm named LSTM-PPO-MATRL,where the LSTM-enhanced proximal policy optimization(PPO)is designed to approximate an optimal solution and the option-guided policy transfer learning is proposed to facilitate the learning process.By centralized training and decentralized execution,LSTM-PPO-MATRL is validated by extensive experiments on MuJoCo tasks for both low-mobility and high-mobility robotic control scenarios.The results demonstrate that LSTM-PPO-MATRL not only realizes high learning efficiency,but also supports low-delay,low-jitter communication for low error control,where 71.9%control accuracy improvement and 68.7%delay jitter reduction are achieved compared to the PPO-MADRL baseline.展开更多
Robot interaction control with variable impedance parameters may conform to task requirements during continuous interaction with dynamic environments.Iterative learning(IL)is effective to learn desired impedance param...Robot interaction control with variable impedance parameters may conform to task requirements during continuous interaction with dynamic environments.Iterative learning(IL)is effective to learn desired impedance parameters for robots under unknown environments,and Gaussian process(GP)is a nonparametric Bayesian approach that models complicated functions with provable confidence using limited data.In this paper,we propose an impedance IL method enhanced by a sparse online Gaussian process(SOGP)to speed up learning convergence and improve generalization.The SOGP for variable impedance modeling is updated in the same iteration by removing similar data points from previous iterations while learning impedance parameters in multiple iterations.The proposed IL-SOGP method is verified by high-fidelity simulations of a collaborative robot with 7 degrees of freedom based on the admittance control framework.It is shown that the proposed method accelerates iterative convergence and improves generalization compared to the classical IL-based impedance learning method.展开更多
Accurate retrieval of casting 3D models is crucial for process reuse.Current methods primarily focus on shape similarity,neglecting process design features,which compromises reusability.In this study,a novel deep lear...Accurate retrieval of casting 3D models is crucial for process reuse.Current methods primarily focus on shape similarity,neglecting process design features,which compromises reusability.In this study,a novel deep learning retrieval method for process reuse was proposed,which integrates process design features into the retrieval of casting 3D models.This method leverages the comparative language-image pretraining(CLIP)model to extract shape features from the three views and sectional views of the casting model and combines them with process design features such as modulus,main wall thickness,symmetry,and length-to-height ratio to enhance process reusability.A database of 230 production casting models was established for model validation.Results indicate that incorporating process design features improves model accuracy by 6.09%,reaching 97.82%,and increases process similarity by 30.25%.The reusability of the process was further verified using the casting simulation software EasyCast.The results show that the process retrieved after integrating process design features produces the least shrinkage in the target model,demonstrating this method’s superior ability for process reuse.This approach does not require a large dataset for training and optimization,making it highly applicable to casting process design and related manufacturing processes.展开更多
The optimization of reaction processes is crucial for the green, efficient, and sustainable development of the chemical industry. However, how to address the problems posed by multiple variables, nonlinearities, and u...The optimization of reaction processes is crucial for the green, efficient, and sustainable development of the chemical industry. However, how to address the problems posed by multiple variables, nonlinearities, and uncertainties during optimization remains a formidable challenge. In this study, a strategy combining interpretable machine learning with metaheuristic optimization algorithms is employed to optimize the reaction process. First, experimental data from a biodiesel production process are collected to establish a database. These data are then used to construct a predictive model based on artificial neural network (ANN) models. Subsequently, interpretable machine learning techniques are applied for quantitative analysis and verification of the model. Finally, four metaheuristic optimization algorithms are coupled with the ANN model to achieve the desired optimization. The research results show that the methanol: palm fatty acid distillate (PFAD) molar ratio contributes the most to the reaction outcome, accounting for 41%. The ANN-simulated annealing (SA) hybrid method is more suitable for this optimization, and the optimal process parameters are a catalyst concentration of 3.00% (mass), a methanol: PFAD molar ratio of 8.67, and a reaction time of 30 min. This study provides deeper insights into reaction process optimization, which will facilitate future applications in various reaction optimization processes.展开更多
In the context of intelligent manufacturing,the modern hot strip mill process(HSMP)shows characteristics such as diversification of products,multi-specification batch production,and demand-oriented customization.These...In the context of intelligent manufacturing,the modern hot strip mill process(HSMP)shows characteristics such as diversification of products,multi-specification batch production,and demand-oriented customization.These characteristics pose significant challenges to ensuring process stability and consistency of product performance.Therefore,exploring the potential relationship between product performance and the production process,and developing a comprehensive performance evaluation method adapted to modern HSMP have become an urgent issue.A comprehensive performance evaluation method for HSMP by integrating multi-task learning and stacked performance-related autoencoder is proposed to solve the problems such as incomplete performance indicators(PIs)data,insufficient real-time acquisition requirements,and coupling of multiple PIs.First,according to the existing Chinese standards,a comprehensive performance evaluation grade strategy for strip steel is designed.The random forest model is established to predict and complete the parts of PIs data that could not be obtained in real-time.Second,a stacked performance-related autoencoder(SPAE)model is proposed to extract the deep features closely related to the product performance.Then,considering the correlation between PIs,the multi-task learning framework is introduced to output the subitem ratings and comprehensive product performance rating results of the strip steel online in real-time,where each task represents a subitem of comprehensive performance.Finally,the effectiveness of the method is verified on a real HSMP dataset,and the results show that the accuracy of the proposed method is as high as 94.8%,which is superior to the other comparative methods.展开更多
基金supported by the National Natural Science Foundation of China(Grant Nos.42130719 and 42177173)the Doctoral Direct Train Project of Chongqing Natural Science Foundation(Grant No.CSTB2023NSCQ-BSX0029).
文摘Underground engineering projects such as deep tunnel excavation often encounter rockburst disasters accompanied by numerous microseismic events.Rapid interpretation of microseismic signals is crucial for the timely identification of rockbursts.However,conventional processing encompasses multi-step workflows,including classification,denoising,picking,locating,and computational analysis,coupled with manual intervention,which collectively compromise the reliability of early warnings.To address these challenges,this study innovatively proposes the“microseismic stethoscope"-a multi-task machine learning and deep learning model designed for the automated processing of massive microseismic signals.This model efficiently extracts three key parameters that are necessary for recognizing rockburst disasters:rupture location,microseismic energy,and moment magnitude.Specifically,the model extracts raw waveform features from three dedicated sub-networks:a classifier for source zone classification,and two regressors for microseismic energy and moment magnitude estimation.This model demonstrates superior efficiency compared to traditional processing and semi-automated processing,reducing per-event processing time from 0.71 s to 0.49 s to merely 0.036 s.It concurrently achieves 98%accuracy in source zone classification,with microseismic energy and moment magnitude estimation errors of 0.13 and 0.05,respectively.This model has been well applied and validated in the Daxiagu Tunnel case in Sichuan,China.The application results indicate that the model is as accurate as traditional methods in determining source parameters,and thus can be used to identify potential geomechanical processes of rockburst disasters.By enhancing the signal processing reliability of microseismic events,the proposed model in this study presents a significant advancement in the identification of rockburst disasters.
基金financial support of the National Natural Science Foundation of China(No.52371103)the Fundamental Research Funds for the Central Universities,China(No.2242023K40028)+1 种基金the Open Research Fund of Jiangsu Key Laboratory for Advanced Metallic Materials,China(No.AMM2023B01).financial support of the Research Fund of Shihezi Key Laboratory of AluminumBased Advanced Materials,China(No.2023PT02)financial support of Guangdong Province Science and Technology Major Project,China(No.2021B0301030005)。
文摘Oxide dispersion strengthened(ODS)alloys are extensively used owing to high thermostability and creep strength contributed from uniformly dispersed fine oxides particles.However,the existence of these strengthening particles also deteriorates the processability and it is of great importance to establish accurate processing maps to guide the thermomechanical processes to enhance the formability.In this study,we performed particle swarm optimization-based back propagation artificial neural network model to predict the high temperature flow behavior of 0.25wt%Al2O3 particle-reinforced Cu alloys,and compared the accuracy with that of derived by Arrhenius-type constitutive model and back propagation artificial neural network model.To train these models,we obtained the raw data by fabricating ODS Cu alloys using the internal oxidation and reduction method,and conducting systematic hot compression tests between 400 and800℃with strain rates of 10^(-2)-10 S^(-1).At last,processing maps for ODS Cu alloys were proposed by combining processing parameters,mechanical behavior,microstructure characterization,and the modeling results achieved a coefficient of determination higher than>99%.
基金Key Research and Development Program of Hubei Province,China(Grant No.2022BAA096)Zhejiang Provincial Natural Science Foundation of China(This material is based upon work funded by Zhejiang Provincial Natural Science Foundation of China under Grant No.LR25A020002)support of the Center for Materials Analysis and Characterization,Material Characterization Lab,and Nanofabrication Lab at Hubei University。
文摘The key challenge in the preparation of perovskite solar cells is to enhance the reproducibility of PSC manufacturing,particularly by better controlling multiple high-dimensional process parameters.This study proposes a machine learning(ML)approach to efficiently predict and analyze perovskite film fabrication processes.By evaluating five classic ML algorithms on 130 experimental data sets from blade-coating parameters,the Random Forest(RF)model was identified as the most effective,enabling rapid prediction of over 100,000 parameter sets in just 10 min-equivalent to 3 years of manual experimentation.The RF model demonstrated strong predictive accuracy,with an R^(2) close to 0.8.This approach led to the identification of optimal process parameter combinations,significantly improving the reproducibility of PSCs and reducing performance variance by approximately threefold,thereby advancing the development of scalable manufacturing processes.
文摘This study seeks to establish a novel,semi-automatic system that utilizes Industry 4.0 principles to effectively determine both acceptable and rejectable concrete cubes with regard to their failure modes,significantly contributing to the dependability of concrete quality evaluations.The study utilizes image processing and machine learning(ML)methods,namely object detectionmodels such as YOLOv8 and Convolutional Neural Networks(CNNs),to evaluate images of concrete cubes.These models are trained and validated on an extensive database of annotated images from real-world and laboratory conditions.Preliminary results indicate a good performance in the classification of concrete cube failure modes.The proposed system accurately identifies cracks,determines the severity of damage to structures,indicating the potential to minimize human errors and discrepancies that might occur through the current techniques to detect the failure mode of concrete cubes.Thedeveloped systemcould significantly improve the reliability of concrete cube assessments,reduce resource wastage,and contribute to more sustainable construction practices.By minimizing material costs and errors,this innovation supports the construction industry’s move towards sustainability.
文摘Low-voltage direct current(DC)microgrids have recently emerged as a promising and viable alternative to traditional alternating cur-rent(AC)microgrids,offering numerous advantages.Consequently,researchers are exploring the potential of DC microgrids across var-ious configurations.However,despite the sustainability and accuracy offered by DC microgrids,they pose various challenges when integrated into modern power distribution systems.Among these challenges,fault diagnosis holds significant importance.Rapid fault detection in DC microgrids is essential to maintain stability and ensure an uninterrupted power supply to critical loads.A primary chal-lenge is the lack of standards and guidelines for the protection and safety of DC microgrids,including fault detection,location,and clear-ing procedures for both grid-connected and islanded modes.In response,this study presents a brief overview of various approaches for protecting DC microgrids.
基金supported by the Science Committee of the Ministry of Higher Education and Science of the Republic of Kazakhstan within the framework of grant AP23489899“Applying Deep Learning and Neuroimaging Methods for Brain Stroke Diagnosis”.
文摘Deep learning now underpins many state-of-the-art systems for biomedical image and signal processing,enabling automated lesion detection,physiological monitoring,and therapy planning with accuracy that rivals expert performance.This survey reviews the principal model families as convolutional,recurrent,generative,reinforcement,autoencoder,and transfer-learning approaches as emphasising how their architectural choices map to tasks such as segmentation,classification,reconstruction,and anomaly detection.A dedicated treatment of multimodal fusion networks shows how imaging features can be integrated with genomic profiles and clinical records to yield more robust,context-aware predictions.To support clinical adoption,we outline post-hoc explainability techniques(Grad-CAM,SHAP,LIME)and describe emerging intrinsically interpretable designs that expose decision logic to end users.Regulatory guidance from the U.S.FDA,the European Medicines Agency,and the EU AI Act is summarised,linking transparency and lifecycle-monitoring requirements to concrete development practices.Remaining challenges as data imbalance,computational cost,privacy constraints,and cross-domain generalization are discussed alongside promising solutions such as federated learning,uncertainty quantification,and lightweight 3-D architectures.The article therefore offers researchers,clinicians,and policymakers a concise,practice-oriented roadmap for deploying trustworthy deep-learning systems in healthcare.
基金This study is financed by the European Union-NextGenerationEU,through the National Recovery and Resilience Plan of the Republic of Bulgaria,Project No.BG-RRP-2.013-0001.
文摘Covert timing channels(CTC)exploit network resources to establish hidden communication pathways,posing signi cant risks to data security and policy compliance.erefore,detecting such hidden and dangerous threats remains one of the security challenges. is paper proposes LinguTimeX,a new framework that combines natural language processing with arti cial intelligence,along with explainable Arti cial Intelligence(AI)not only to detect CTC but also to provide insights into the decision process.LinguTimeX performs multidimensional feature extraction by fusing linguistic attributes with temporal network patterns to identify covert channels precisely.LinguTimeX demonstrates strong e ectiveness in detecting CTC across multiple languages;namely English,Arabic,and Chinese.Speci cally,the LSTM and RNN models achieved F1 scores of 90%on the English dataset,89%on the Arabic dataset,and 88%on the Chinese dataset,showcasing their superior performance and ability to generalize across multiple languages. is highlights their robustness in detecting CTCs within security systems,regardless of the language or cultural context of the data.In contrast,the DeepForest model produced F1-scores ranging from 86%to 87%across the same datasets,further con rming its e ectiveness in CTC detection.Although other algorithms also showed reasonable accuracy,the LSTM and RNN models consistently outperformed them in multilingual settings,suggesting that deep learning models might be better suited for this particular problem.
基金supported by Princess Nourah bint Abdulrahman University Researchers Supporting Project number(PNURSP2026R765),Princess Nourah bint Abdulrahman University,Riyadh,Saudi Arabia.
文摘Human Activity Recognition(HAR)is a novel area for computer vision.It has a great impact on healthcare,smart environments,and surveillance while is able to automatically detect human behavior.It plays a vital role in many applications,such as smart home,healthcare,human computer interaction,sports analysis,and especially,intelligent surveillance.In this paper,we propose a robust and efficient HAR system by leveraging deep learning paradigms,including pre-trained models,CNN architectures,and their average-weighted fusion.However,due to the diversity of human actions and various environmental influences,as well as a lack of data and resources,achieving high recognition accuracy remain elusive.In this work,a weighted average ensemble technique is employed to fuse three deep learning models:EfficientNet,ResNet50,and a custom CNN.The results of this study indicate that using a weighted average ensemble strategy for developing more effective HAR models may be a promising idea for detection and classification of human activities.Experiments by using the benchmark dataset proved that the proposed weighted ensemble approach outperformed existing approaches in terms of accuracy and other key performance measures.The combined average-weighted ensemble of pre-trained and CNN models obtained an accuracy of 98%,compared to 97%,96%,and 95%for the customized CNN,EfficientNet,and ResNet50 models,respectively.
基金supported by the National Natural Science Foundation of China(Nos.52471132,52475356,52071139,U21A20130)the National Social Science Fund of China(No.21BJL075)+1 种基金the Natural Science Foundation of Fujian Province for Distinguished Young Scholars,China(No.2024J010031)the Natural Science Foundation of Chongqing,China(No.CSTB2023NSCQ-MSX0886)。
文摘To investigate the complex relationship between rolling process parameters and mechanical properties of AZ31 magnesium alloy rolled sheets,the Leave-One-Out Cross-Validation(LOOCV)and parameter tuning were applied to optimizing hyper-parameters for the four(BPNN,SVR,RF,and KNN)machine learning models.An interpretable prediction model based on machine learning and SHapley Additive exPlanations(SHAP),as well as an analytical method combining the SHAP model and the Pearson Correlation Coefficient(PCC),were proposed.The results showed that among the four models,the SVR model was able to simultaneously and accurately predict the ultimate tensile strength(UTS)and elongation(EL).According to the combination analysis of PCC and the magnesium alloy rolling forming mechanism,it was found that strain rate and reduction displayed a negative and positive correlation with UTS,respectively,while rolling temperature and reduction illustrated a positive and negative correlation with EL,respectively.Through the SHAP method,which could interpret the output results of the SVR machine learning model,it was deduced that reduction and strain rate played an important role in the SVR model of the outputs of the UTS and EL,respectively.Combining SHAP with PCC,it was found that strain rate and reduction had a greater influence on the UTS than rolling temperature,whereas strain rate and rolling temperature had more influence on the EL compared to reduction.
基金supported by the National Natural Science Foundation of China(Nos.62171469,62071029)。
文摘The estimation of the Number of Sources(NoS)is a significant challenge in signal processing,particularly due to the impact of colored noise on the performance of NoS estimation.This paper proposes a Multidimensional Feature Network(MFNet)which is designed for NoS estimation by extracting features of the sampled received signals and Sampled Covariance Matrix(SCM).The MFNet treats the raw signal and the SCM as two different types of data,and is able to achieve NoS estimation under colored noise and imperfect array.MFNet employs the Gated Recurrent Unit(GRU)to capture sequential information from the original signal data and to construct the Pseudo Covariance Matrix(PCM).Subsequently,various dimensional features,including eigenvalues and the Gerschgorin disk radius,are extracted from both the PCM and SCM,which are then jointly input into the subsequent network.An overall accuracy of 82%can be achieved after network training.The ablation experimental results demonstrate the effectiveness of multiple inputs.And simulation results demonstrate that the proposed MFNet achieves higher estimation accuracy compared to existing algorithms and exhibits greater robustness against colored noise.
基金supported by the National Research Foundation of Korea(NRF)grant funded by theKorea government(MSIT)(No.RS-2024-00405278)partially supported by the Jeju Industry-University Convergence District Project for Promoting Industry-Campus Cooperationfunded by the Ministry of Trade,Industry and Energy(MOTIE,Korea)[Project Name:Jeju Industry-University Convergence District Project for Promoting Industry-Campus Cooperation/Project Number:P0029950].
文摘Recent advances in deep learning have significantly improved flood detection and segmentation from aerial and satellite imagery.However,conventional convolutional neural networks(CNNs)often struggle in complex flood scenarios involving reflections,occlusions,or indistinct boundaries due to limited contextual modeling.To address these challenges,we propose a hybrid flood segmentation framework that integrates a Vision Transformer(ViT)encoder with a U-Net decoder,enhanced by a novel Flood-Aware Refinement Block(FARB).The FARB module improves boundary delineation and suppresses noise by combining residual smoothing with spatial-channel attention mechanisms.We evaluate our model on a UAV-acquired flood imagery dataset,demonstrating that the proposed ViTUNet+FARB architecture outperforms existing CNN and Transformer-based models in terms of accuracy and mean Intersection over Union(mIoU).Detailed ablation studies further validate the contribution of each component,confirming that the FARB design significantly enhances segmentation quality.To its better performance and computational efficiency,the proposed framework is well-suited for flood monitoring and disaster response applications,particularly in resource-constrained environments.
基金supported in part by the Fundamental Research Funds for the Central Universities(No.2020YJS162).
文摘This paper proposes a novel framework based on the Stackelberg game and deep reinforcement learning for multi-microgrids(MGs)in achieving peer-to-peer(P2P)energy trading.A multi-leaders,multi-followers Stackelberg game is utilized to model the P2P energy trading process.Stackelberg equilibrium(SE)is regarded as a P2P optimal trading strategy.A two-stage privacy protection solution technique combining data-driven and model-driven is developed to obtain the SE.Specifically,energy storage scheduling problem in MGs is formulated as a Markov decision process with discrete periods,and a multi-action single-observation deep deterministic policy gradient(MASO-DDPG)algorithm is proposed to tackle optimal scheduling of energy storage in the first stage.According to optimal scheduling of energy storage,the closed-form expression for SE based on model-driven is derived,and distributed SE solution technique(DSET)is developed to obtain SE in the second stage.Case studies involving a 4-Microgrid demonstrate the P2P electricity price obtained by the two-stage method,as a novel pricing mechanism,can reasonably regulate microgrid operation mode and improve microgrid income participating in the P2P market,which verifies effectiveness and superiority of the proposed P2P energy trading model and two-stage solution method.
基金supported by the Deanship of Scientific Research,Vice Presidency for Graduate Studies and Scientific Research,King Faisal University,Saudi Arabia,Grant No.KFU250098.
文摘In this study,an automated multimodal system for detecting,classifying,and dating fruit was developed using a two-stage YOLOv11 pipeline.In the first stage,the YOLOv11 detection model locates individual date fruits in real time by drawing bounding boxes around them.These bounding boxes are subsequently passed to a YOLOv11 classification model,which analyzes cropped images and assigns class labels.An additional counting module automatically tallies the detected fruits,offering a near-instantaneous estimation of quantity.The experimental results suggest high precision and recall for detection,high classification accuracy(across 15 classes),and near-perfect counting in real time.This paper presents a multi-stage pipeline for date fruit detection,classification,and automated counting,employing YOLOv11-based models to achieve high accuracy while maintaining real-time throughput.The results demonstrated that the detection precision exceeded 90%,the classification accuracy approached 92%,and the counting module correlated closely with the manual tallies.These findings confirm the potential of reducing manual labour and enhancing operational efficiency in post-harvesting processes.Future studies will include dataset expansion,user-centric interfaces,and integration with harvesting robotics.
基金the research project LaTe4PoliticES(PID2022-138099OB-I00)funded by MCIN/AEI/10.13039/501100011033 and the European Fund for Regional Development(ERDF)-a way to make Europe.Tomás Bernal-Beltrán is supported by University of Murcia through the predoctoral programme.
文摘The malicious dissemination of hate speech via compromised accounts,automated bot networks and malware-driven social media campaigns has become a growing cybersecurity concern.Automatically detecting such content in Spanish is challenging due to linguistic complexity and the scarcity of annotated resources.In this paper,we compare two predominant AI-based approaches for the forensic detection of malicious hate speech:(1)finetuning encoder-only models that have been trained in Spanish and(2)In-Context Learning techniques(Zero-and Few-Shot Learning)with large-scale language models.Our approach goes beyond binary classification,proposing a comprehensive,multidimensional evaluation that labels each text by:(1)type of speech,(2)recipient,(3)level of intensity(ordinal)and(4)targeted group(multi-label).Performance is evaluated using an annotated Spanish corpus,standard metrics such as precision,recall and F1-score and stability-oriented metrics to evaluate the stability of the transition from zero-shot to few-shot prompting(Zero-to-Few Shot Retention and Zero-to-Few Shot Gain)are applied.The results indicate that fine-tuned encoder-only models(notably MarIA and BETO variants)consistently deliver the strongest and most reliable performance:in our experiments their macro F1-scores lie roughly in the range of approximately 46%–66%depending on the task.Zero-shot approaches are much less stable and typically yield substantially lower performance(observed F1-scores range approximately 0%–39%),often producing invalid outputs in practice.Few-shot prompting(e.g.,Qwen 38B,Mistral 7B)generally improves stability and recall relative to pure zero-shot,bringing F1-scores into a moderate range of approximately 20%–51%but still falling short of fully fine-tuned models.These findings highlight the importance of supervised adaptation and discuss the potential of both paradigms as components in AI-powered cybersecurity and malware forensics systems designed to identify and mitigate coordinated online hate campaigns.
基金supported by theHubei Provincial Technology Innovation Special Project and the Natural Science Foundation of Hubei Province under Grants 2023BEB024,2024AFC066,respectively.
文摘Underwater images frequently suffer from chromatic distortion,blurred details,and low contrast,posing significant challenges for enhancement.This paper introduces AquaTree,a novel underwater image enhancement(UIE)method that reformulates the task as a Markov Decision Process(MDP)through the integration of Monte Carlo Tree Search(MCTS)and deep reinforcement learning(DRL).The framework employs an action space of 25 enhancement operators,strategically grouped for basic attribute adjustment,color component balance,correction,and deblurring.Exploration within MCTS is guided by a dual-branch convolutional network,enabling intelligent sequential operator selection.Our core contributions include:(1)a multimodal state representation combining CIELab color histograms with deep perceptual features,(2)a dual-objective reward mechanism optimizing chromatic fidelity and perceptual consistency,and(3)an alternating training strategy co-optimizing enhancement sequences and network parameters.We further propose two inference schemes:an MCTS-based approach prioritizing accuracy at higher computational cost,and an efficient network policy enabling real-time processing with minimal quality loss.Comprehensive evaluations on the UIEB Dataset and Color correction and haze removal comparisons on the U45 Dataset demonstrate AquaTree’s superiority,significantly outperforming nine state-of-the-art methods across five established underwater image quality metrics.
基金supported in part by the National Natural Science Foundation of China(62522320,92267108,62173322)Liaoning Revitalization Talents Program(XLYC2403062)the Science and Technology Program of Liaoning Province(2023JH3/10200004,2022JH25/10100005)。
文摘The wireless cloud robotic system(WCRS),which fully integrates sensing,communication,computing,and control capabilities as an intelligent agent,is a promising way to achieve intelligent manufacturing due to easy deployment and flexible expansion.However,the high-precision control of WCRS requires deterministic wireless communication,which is always challenging in the complex and dynamic radio space.This paper employs the reconfigurable intelligent surface(RIS)to establish a novel RIS-assisted WCRS architecture,where the radio channel is controlled to achieve ultra-reliable,low-delay,and low-jitter communication for high-precision closed-loop motion control.However,control and communication are strongly coupled and should be co-optimized.Fully considering the constraints of control input threshold,control delay deadline,beam phase,antenna power,and information distortion,we establish a stability maximization problem to jointly optimize control input compensation,RIS phase shift,and beamforming.Herein,a new jitter-oriented system stability objective with respect to control error and communication jitter is defined and the closed-form expression of control delay deadline is derived based on the Jensen Inequality and Lyapunov-Krasovskii functional.Due to the time-varying and partial observability of the channel and robot states,we model the problem as a partially observable Markov decision process(POMDP).To solve this complex problem,we propose a multi-agent transfer reinforcement learning algorithm named LSTM-PPO-MATRL,where the LSTM-enhanced proximal policy optimization(PPO)is designed to approximate an optimal solution and the option-guided policy transfer learning is proposed to facilitate the learning process.By centralized training and decentralized execution,LSTM-PPO-MATRL is validated by extensive experiments on MuJoCo tasks for both low-mobility and high-mobility robotic control scenarios.The results demonstrate that LSTM-PPO-MATRL not only realizes high learning efficiency,but also supports low-delay,low-jitter communication for low error control,where 71.9%control accuracy improvement and 68.7%delay jitter reduction are achieved compared to the PPO-MADRL baseline.
基金supported in part by the National Research Foundation of Korea(NRF)Grant Funded by the Korea Government(MSIT)(RS-2025-00555064).Recommended by Associate Editor Zengguang Hou.
文摘Robot interaction control with variable impedance parameters may conform to task requirements during continuous interaction with dynamic environments.Iterative learning(IL)is effective to learn desired impedance parameters for robots under unknown environments,and Gaussian process(GP)is a nonparametric Bayesian approach that models complicated functions with provable confidence using limited data.In this paper,we propose an impedance IL method enhanced by a sparse online Gaussian process(SOGP)to speed up learning convergence and improve generalization.The SOGP for variable impedance modeling is updated in the same iteration by removing similar data points from previous iterations while learning impedance parameters in multiple iterations.The proposed IL-SOGP method is verified by high-fidelity simulations of a collaborative robot with 7 degrees of freedom based on the admittance control framework.It is shown that the proposed method accelerates iterative convergence and improves generalization compared to the classical IL-based impedance learning method.
基金supported by the National Natural Science Foundation of China(Nos.52074246,52275390,52375394)the National Defense Basic Scientific Research Program of China(No.JCKY2020408B002)the Key R&D Program of Shanxi Province(No.202102050201011).
文摘Accurate retrieval of casting 3D models is crucial for process reuse.Current methods primarily focus on shape similarity,neglecting process design features,which compromises reusability.In this study,a novel deep learning retrieval method for process reuse was proposed,which integrates process design features into the retrieval of casting 3D models.This method leverages the comparative language-image pretraining(CLIP)model to extract shape features from the three views and sectional views of the casting model and combines them with process design features such as modulus,main wall thickness,symmetry,and length-to-height ratio to enhance process reusability.A database of 230 production casting models was established for model validation.Results indicate that incorporating process design features improves model accuracy by 6.09%,reaching 97.82%,and increases process similarity by 30.25%.The reusability of the process was further verified using the casting simulation software EasyCast.The results show that the process retrieved after integrating process design features produces the least shrinkage in the target model,demonstrating this method’s superior ability for process reuse.This approach does not require a large dataset for training and optimization,making it highly applicable to casting process design and related manufacturing processes.
基金supported by the National Natural Science Foundation of China(22408227,22238005)the Postdoctoral Research Foundation of China(GZC20231576).
文摘The optimization of reaction processes is crucial for the green, efficient, and sustainable development of the chemical industry. However, how to address the problems posed by multiple variables, nonlinearities, and uncertainties during optimization remains a formidable challenge. In this study, a strategy combining interpretable machine learning with metaheuristic optimization algorithms is employed to optimize the reaction process. First, experimental data from a biodiesel production process are collected to establish a database. These data are then used to construct a predictive model based on artificial neural network (ANN) models. Subsequently, interpretable machine learning techniques are applied for quantitative analysis and verification of the model. Finally, four metaheuristic optimization algorithms are coupled with the ANN model to achieve the desired optimization. The research results show that the methanol: palm fatty acid distillate (PFAD) molar ratio contributes the most to the reaction outcome, accounting for 41%. The ANN-simulated annealing (SA) hybrid method is more suitable for this optimization, and the optimal process parameters are a catalyst concentration of 3.00% (mass), a methanol: PFAD molar ratio of 8.67, and a reaction time of 30 min. This study provides deeper insights into reaction process optimization, which will facilitate future applications in various reaction optimization processes.
基金supported by the National Natural Science Foundation of China(NSFC)under Grants(Nos.U21A20483,62373040 and 62273031).
文摘In the context of intelligent manufacturing,the modern hot strip mill process(HSMP)shows characteristics such as diversification of products,multi-specification batch production,and demand-oriented customization.These characteristics pose significant challenges to ensuring process stability and consistency of product performance.Therefore,exploring the potential relationship between product performance and the production process,and developing a comprehensive performance evaluation method adapted to modern HSMP have become an urgent issue.A comprehensive performance evaluation method for HSMP by integrating multi-task learning and stacked performance-related autoencoder is proposed to solve the problems such as incomplete performance indicators(PIs)data,insufficient real-time acquisition requirements,and coupling of multiple PIs.First,according to the existing Chinese standards,a comprehensive performance evaluation grade strategy for strip steel is designed.The random forest model is established to predict and complete the parts of PIs data that could not be obtained in real-time.Second,a stacked performance-related autoencoder(SPAE)model is proposed to extract the deep features closely related to the product performance.Then,considering the correlation between PIs,the multi-task learning framework is introduced to output the subitem ratings and comprehensive product performance rating results of the strip steel online in real-time,where each task represents a subitem of comprehensive performance.Finally,the effectiveness of the method is verified on a real HSMP dataset,and the results show that the accuracy of the proposed method is as high as 94.8%,which is superior to the other comparative methods.