期刊文献+
共找到13,941篇文章
< 1 2 250 >
每页显示 20 50 100
A Survey of Cooperative Multi-agent Reinforcement Learning for Multi-task Scenarios 被引量:1
1
作者 Jiajun CHAI Zijie ZHAO +1 位作者 Yuanheng ZHU Dongbin ZHAO 《Artificial Intelligence Science and Engineering》 2025年第2期98-121,共24页
Cooperative multi-agent reinforcement learning(MARL)is a key technology for enabling cooperation in complex multi-agent systems.It has achieved remarkable progress in areas such as gaming,autonomous driving,and multi-... Cooperative multi-agent reinforcement learning(MARL)is a key technology for enabling cooperation in complex multi-agent systems.It has achieved remarkable progress in areas such as gaming,autonomous driving,and multi-robot control.Empowering cooperative MARL with multi-task decision-making capabilities is expected to further broaden its application scope.In multi-task scenarios,cooperative MARL algorithms need to address 3 types of multi-task problems:reward-related multi-task,arising from different reward functions;multi-domain multi-task,caused by differences in state and action spaces,state transition functions;and scalability-related multi-task,resulting from the dynamic variation in the number of agents.Most existing studies focus on scalability-related multitask problems.However,with the increasing integration between large language models(LLMs)and multi-agent systems,a growing number of LLM-based multi-agent systems have emerged,enabling more complex multi-task cooperation.This paper provides a comprehensive review of the latest advances in this field.By combining multi-task reinforcement learning with cooperative MARL,we categorize and analyze the 3 major types of multi-task problems under multi-agent settings,offering more fine-grained classifications and summarizing key insights for each.In addition,we summarize commonly used benchmarks and discuss future directions of research in this area,which hold promise for further enhancing the multi-task cooperation capabilities of multi-agent systems and expanding their practical applications in the real world. 展开更多
关键词 multi-task multi-agent reinforcement learning large language models
在线阅读 下载PDF
Microseismic signal processing and rockburst disaster identification:A multi-task deep learning and machine learning approach
2
作者 Chunchi Ma Weihao Xu +3 位作者 Xuefeng Ran Tianbin Li Hang Zhang Dongwei Xing 《Journal of Rock Mechanics and Geotechnical Engineering》 2026年第1期441-456,共16页
Underground engineering projects such as deep tunnel excavation often encounter rockburst disasters accompanied by numerous microseismic events.Rapid interpretation of microseismic signals is crucial for the timely id... Underground engineering projects such as deep tunnel excavation often encounter rockburst disasters accompanied by numerous microseismic events.Rapid interpretation of microseismic signals is crucial for the timely identification of rockbursts.However,conventional processing encompasses multi-step workflows,including classification,denoising,picking,locating,and computational analysis,coupled with manual intervention,which collectively compromise the reliability of early warnings.To address these challenges,this study innovatively proposes the“microseismic stethoscope"-a multi-task machine learning and deep learning model designed for the automated processing of massive microseismic signals.This model efficiently extracts three key parameters that are necessary for recognizing rockburst disasters:rupture location,microseismic energy,and moment magnitude.Specifically,the model extracts raw waveform features from three dedicated sub-networks:a classifier for source zone classification,and two regressors for microseismic energy and moment magnitude estimation.This model demonstrates superior efficiency compared to traditional processing and semi-automated processing,reducing per-event processing time from 0.71 s to 0.49 s to merely 0.036 s.It concurrently achieves 98%accuracy in source zone classification,with microseismic energy and moment magnitude estimation errors of 0.13 and 0.05,respectively.This model has been well applied and validated in the Daxiagu Tunnel case in Sichuan,China.The application results indicate that the model is as accurate as traditional methods in determining source parameters,and thus can be used to identify potential geomechanical processes of rockburst disasters.By enhancing the signal processing reliability of microseismic events,the proposed model in this study presents a significant advancement in the identification of rockburst disasters. 展开更多
关键词 Underground engineering Microseismic signal processing Deep learning multi-task Rockburst identification
在线阅读 下载PDF
Task-Structured Curriculum Learning for Multi-Task Distillation:Enhancing Step-by-Step Knowledge Transfer in Language Models
3
作者 Ahmet Ezgi Aytug Onan 《Computers, Materials & Continua》 2026年第3期1647-1673,共27页
Knowledge distillation has become a standard technique for compressing large language models into efficient student models,but existing methods often struggle to balance prediction accuracy with explanation quality.Re... Knowledge distillation has become a standard technique for compressing large language models into efficient student models,but existing methods often struggle to balance prediction accuracy with explanation quality.Recent approaches such as Distilling Step-by-Step(DSbS)introduce explanation supervision,yet they apply it in a uniform manner that may not fully exploit the different learning dynamics of prediction and explanation.In this work,we propose a task-structured curriculum learning(TSCL)framework that structures training into three sequential phases:(i)prediction-only,to establish stable feature representations;(ii)joint prediction-explanation,to align task outputs with rationale generation;and(iii)explanation-only,to refine the quality of rationales.This design provides a simple but effective modification to DSbS,requiring no architectural changes and adding negligible training cost.We justify the phase scheduling with ablation studies and convergence analysis,showing that an initial prediction-heavy stage followed by a balanced joint phase improves both stability and explanation alignment.Extensive experiments on five datasets(e-SNLI,ANLI,CommonsenseQA,SVAMP,and MedNLI)demonstrate that TSCL consistently outperforms strong baselines,achieving gains of+1.7-2.6 points in accuracy and 0.8-1.2 in ROUGE-L,corresponding to relative error reductions of up to 21%.Beyond lexical metrics,human evaluation and ERASERstyle faithfulness diagnostics confirm that TSCL produces more faithful and informative explanations.Comparative training curves further reveal faster convergence and lower variance across seeds.Efficiency analysis shows less than 3%overhead in wall-clock training time and no additional inference cost,making the approach practical for realworld deployment.This study demonstrates that a simple task-structured curriculum can significantly improve the effectiveness of knowledge distillation.By separating and sequencing objectives,TSCL achieves a better balance between accuracy,stability,and explanation quality.The framework generalizes across domains,including medical NLI,and offers a principled recipe for future applications in multimodal reasoning and reinforcement learning. 展开更多
关键词 Knowledge distillation curriculum learning language models multi-task learning step-by-step learning
在线阅读 下载PDF
A surface emphasized multi-task learning framework for surface property predictions:A case study of magnesium intermetallics
4
作者 Gaoning Shi Yaowei Wang +3 位作者 Kun Yang Yuan Qiu Hong Zhu Xiaoqin Zeng 《Journal of Magnesium and Alloys》 2026年第1期216-227,共12页
Surface properties of crystals are critical in many fields,including electrochemistry and photoelectronics,the efficient prediction of which can expedite the design and optimization of catalysts,batteries,alloys etc.H... Surface properties of crystals are critical in many fields,including electrochemistry and photoelectronics,the efficient prediction of which can expedite the design and optimization of catalysts,batteries,alloys etc.However,we are still far from realizing this vision due to the rarity of surface property-related databases,especially for multicomponent compounds,due to the large sample spaces and limited computing resources.In this work,we present a surface emphasized multi-task crystal graph convolutional neural network(SEM-CGCNN)to predict multiple surface properties simultaneously from crystal structures.The model is evaluated on a dataset of 3526 surface energies and work functions of binary magnesium intermetallics obtained through first-principles calculations,and obvious improvements are observed both in efficiency and accuracy over the original CGCNN model.By transferring the pre-trained model to the datasets of pure metals and other intermetallics,the fine-tuned SEM-CGCNN outperforms learning from scratch and can be further applied to other surface properties and materials systems.This study could be a paradigm for the end-to-end mapping of atomic structures to anisotropic surface properties of crystals,which provides an efficient framework to understand and screen materials with desired surface characteristics. 展开更多
关键词 Graph neural networks multi-task learning Surface energy Work function Intermetallic compounds Mg alloy
在线阅读 下载PDF
Life cycle environmental impacts and emission reduction pathways of wind power in western China:A scenario-based assessment
5
作者 Ning Su Xiaobing Li +3 位作者 Xin Lyu Dongliang Dang Siyu Liu Chenhao Zhang 《Geography and Sustainability》 2026年第1期54-65,共12页
Compared with traditional energy sources,wind power has a lower environmental impact.However,emissions are still generated across the life cycle of wind turbines,from production to recycling.As wind power rapidly deve... Compared with traditional energy sources,wind power has a lower environmental impact.However,emissions are still generated across the life cycle of wind turbines,from production to recycling.As wind power rapidly develops and deployment increases,these impacts are becoming increasingly evident.A comprehensive understanding of these impacts is crucial for sustainable development.Based on the harmonization of previous detailed life cycle assessment(LCA)studies,this study develops a simplified LCA model that estimates the life cycle environmental impacts of wind turbines based on their nominal power.Using this simplified LCA model,we assess the global warming potential(GWP),acidification potential(AP),and cumulative energy demand(CED)of wind power at the regional scale for 2022 and under three future scenarios(high-power wind turbine promotion,reduced wind curtailment,and a comprehensive development scenario).The results indicate that in 2022,the life cycle GWP,AP,and CED of wind power in western China were 10.76 g CO_(2) eq/kWh,0.177 g SO_(2) eq/kWh,and 17.6 kJ/kWh,respectively.Scenario simulations suggest that reducing wind curtailment is the most effective approach for reducing emissions in Inner Mongolia,Gansu,Qinghai,Ningxia,and Xinjiang,producing average decreases of 8.64%in GWP,8.39%in AP,and 9.26%in CED.In contrast,for Guangxi,Chongqing,Sichuan,Guizhou,Yunnan,Xizang,and Shaanxi,the promotion of high-power wind turbines provides greater environmental benefits than reducing curtailment,producing average decreases of 3.45%,3.09%,and 4.29%in GWP,AP,and CED,respectively.These findings help clarify the environmental impact of wind power across its life cycle at the regional scale and provide theoretical references for the direction of future wind power development and the formulation of related policies. 展开更多
关键词 Wind energy Life cycle assessment Environmental impact scenario simulation Western China
在线阅读 下载PDF
Cooperative Beam Selection for RIS-Aided Terahertz MIMO Networks via Multi-Task Learning
6
作者 Ma Xinying Chen Gong Wang Xiaofei 《China Communications》 2026年第2期211-227,共17页
Reconfigurable intelligent surface(RIS)have been cast as a promising alternative to alleviate blockage vulnerability and enhance coverage capability for terahertz(THz)communications.Owing to large-scale array elements... Reconfigurable intelligent surface(RIS)have been cast as a promising alternative to alleviate blockage vulnerability and enhance coverage capability for terahertz(THz)communications.Owing to large-scale array elements at transceivers and RIS,the codebook based beamforming can be utilized in a computationally efficient manner.However,the codeword selection for analog beamforming is an intractable combinatorial optimization(CO)problem.To this end,by taking the CO problem as a classification problem,a multi-task learning based analog beam selection(MTL-ABS)framework is developed to implement cooperative beam selection concurrently at transceivers and RIS.In addition,residual network and self-attention mechanism are used to combat the network degradation and mine intrinsic THz channel features.Finally,the network convergence is analyzed from a blockwise perspective,and numerical results demonstrate that the MTL-ABS framework greatly decreases the beam selection overhead and achieves near optimal sum-rate compared with heuristic search based counterparts. 展开更多
关键词 beam selection multi-task learning reconfigurable intelligent surface(RIS) terahertz(THz)communications
在线阅读 下载PDF
Channel Characteristics Analysis in Semi-Basement Scenarios for Smart Meter Communication Systems
7
作者 Wang Qing Zhang Zhaolei +1 位作者 Liu Yu Ren Yi 《China Communications》 2026年第1期92-106,共15页
The smart meter communication system has substantial application value for the construction and upgrading of the entire power system.The deployment of the transmitter(Tx)of the smart meter system in the residential sc... The smart meter communication system has substantial application value for the construction and upgrading of the entire power system.The deployment of the transmitter(Tx)of the smart meter system in the residential scenarios is vexed by the need for more theoretical support.This paper mainly studies the communication channel between the Tx at semibasement and receiver(Rx)at outdoor.The design of an effective communication system relies on an accurate understanding of channel characteristics.Channel measurements and ray-tracing channel modeling are conducted to obtain channel data.The influence of different positions at same semi-basement is studied.Typical channel characteristics are analyzed,such as power delay profile(PDP),power angular profile(PAP),root-mean-square(RMS)delay spread(DS),channel capacity,received power,and path loss.The influence of different semi-basement placements and different floor heights is also compared.Besides,the channel measurements and simulation data fit well,which can illustrate the validity and reliability of the acquired channel data.This paper can provide theoretical support for the design and optimization of smart meter communication systems in semi-basement scenarios. 展开更多
关键词 channel characteristics channel measurements ray-tracing method semi-basement scenarios smart meter communication
在线阅读 下载PDF
Optimal Dispatch of Urban Distribution Networks Considering Virtual Power Plant Coordination under Extreme Scenarios
8
作者 Yong Li Yuxuan Chen +4 位作者 Jiahui He Guowei He Chenxi Dai Jingjing Tong Wenting Lei 《Energy Engineering》 2026年第1期204-220,共17页
Ensuring reliable power supply in urban distribution networks is a complex and critical task.To address the increased demand during extreme scenarios,this paper proposes an optimal dispatch strategy that considers the... Ensuring reliable power supply in urban distribution networks is a complex and critical task.To address the increased demand during extreme scenarios,this paper proposes an optimal dispatch strategy that considers the coordination with virtual power plants(VPPs).The proposed strategy improves systemflexibility and responsiveness by optimizing the power adjustment of flexible resources.In the proposed strategy,theGaussian Process Regression(GPR)is firstly employed to determine the adjustable range of aggregated power within the VPP,facilitating an assessment of its potential contribution to power supply support.Then,an optimal dispatch model based on a leader-follower game is developed to maximize the benefits of the VPP and flexible resources while guaranteeing the power balance at the same time.To solve the proposed optimal dispatch model efficiently,the constraints of the problem are reformulated and resolved using the Karush-Kuhn-Tucker(KKT)optimality conditions and linear programming duality theorem.The effectiveness of the strategy is illustrated through a detailed case study. 展开更多
关键词 Urban distribution network virtual power plant power supply support leader-follower optimization game extreme weather scenarios
在线阅读 下载PDF
Pedestrian Attributes Recognition in Surveillance Scenarios with Hierarchical Multi-Task CNN Models 被引量:2
9
作者 Wenhua Fang Jun Chen Ruimin Hu 《China Communications》 SCIE CSCD 2018年第12期208-219,共12页
Pedestrian attributes recognition is a very important problem in video surveillance and video forensics. Traditional methods assume the pedestrian attributes are independent and design handcraft features for each one.... Pedestrian attributes recognition is a very important problem in video surveillance and video forensics. Traditional methods assume the pedestrian attributes are independent and design handcraft features for each one. In this paper, we propose a joint hierarchical multi-task learning algorithm to learn the relationships among attributes for better recognizing the pedestrian attributes in still images using convolutional neural networks(CNN). We divide the attributes into local and global ones according to spatial and semantic relations, and then consider learning semantic attributes through a hierarchical multi-task CNN model where each CNN in the first layer will predict each group of such local attributes and CNN in the second layer will predict the global attributes. Our multi-task learning framework allows each CNN model to simultaneously share visual knowledge among different groups of attribute categories. Extensive experiments are conducted on two popular and challenging benchmarks in surveillance scenarios, namely, the PETA and RAP pedestrian attributes datasets. On both benchmarks, our framework achieves superior results over the state-of-theart methods by 88.2% on PETA and 83.25% on RAP, respectively. 展开更多
关键词 attributes RECOGNITION CNN multi-task learning
在线阅读 下载PDF
DEEP NEURAL NETWORKS COMBINING MULTI-TASK LEARNING FOR SOLVING DELAY INTEGRO-DIFFERENTIAL EQUATIONS 被引量:1
10
作者 WANG Chen-yao SHI Feng 《数学杂志》 2025年第1期13-38,共26页
Deep neural networks(DNNs)are effective in solving both forward and inverse problems for nonlinear partial differential equations(PDEs).However,conventional DNNs are not effective in handling problems such as delay di... Deep neural networks(DNNs)are effective in solving both forward and inverse problems for nonlinear partial differential equations(PDEs).However,conventional DNNs are not effective in handling problems such as delay differential equations(DDEs)and delay integrodifferential equations(DIDEs)with constant delays,primarily due to their low regularity at delayinduced breaking points.In this paper,a DNN method that combines multi-task learning(MTL)which is proposed to solve both the forward and inverse problems of DIDEs.The core idea of this approach is to divide the original equation into multiple tasks based on the delay,using auxiliary outputs to represent the integral terms,followed by the use of MTL to seamlessly incorporate the properties at the breaking points into the loss function.Furthermore,given the increased training dificulty associated with multiple tasks and outputs,we employ a sequential training scheme to reduce training complexity and provide reference solutions for subsequent tasks.This approach significantly enhances the approximation accuracy of solving DIDEs with DNNs,as demonstrated by comparisons with traditional DNN methods.We validate the effectiveness of this method through several numerical experiments,test various parameter sharing structures in MTL and compare the testing results of these structures.Finally,this method is implemented to solve the inverse problem of nonlinear DIDE and the results show that the unknown parameters of DIDE can be discovered with sparse or noisy data. 展开更多
关键词 Delay integro-differential equation multi-task learning parameter sharing structure deep neural network sequential training scheme
在线阅读 下载PDF
MolP-PC:a multi-view fusion and multi-task learning framework for drug ADMET property prediction 被引量:1
11
作者 Sishu Li Jing Fan +2 位作者 Haiyang He Ruifeng Zhou Jun Liao 《Chinese Journal of Natural Medicines》 2025年第11期1293-1300,共8页
The accurate prediction of drug absorption,distribution,metabolism,excretion,and toxicity(ADMET)properties represents a crucial step in early drug development for reducing failure risk.Current deep learning approaches... The accurate prediction of drug absorption,distribution,metabolism,excretion,and toxicity(ADMET)properties represents a crucial step in early drug development for reducing failure risk.Current deep learning approaches face challenges with data sparsity and information loss due to single-molecule representation limitations and isolated predictive tasks.This research proposes molecular properties prediction with parallel-view and collaborative learning(MolP-PC),a multi-view fusion and multi-task deep learning framework that integrates 1D molecular fingerprints(MFs),2D molecular graphs,and 3D geometric representations,incorporating an attention-gated fusion mechanism and multi-task adaptive learning strategy for precise ADMET property predictions.Experimental results demonstrate that MolP-PC achieves optimal performance in 27 of 54 tasks,with its multi-task learning(MTL)mechanism significantly enhancing predictive performance on small-scale datasets and surpassing single-task models in 41 of 54 tasks.Additional ablation studies and interpretability analyses confirm the significance of multi-view fusion in capturing multi-dimensional molecular information and enhancing model generalization.A case study examining the anticancer compound Oroxylin A demonstrates MolP-PC’s effective generalization in predicting key pharmacokinetic parameters such as half-life(T0.5)and clearance(CL),indicating its practical utility in drug modeling.However,the model exhibits a tendency to underestimate volume of distribution(VD),indicating potential for improvement in analyzing compounds with high tissue distribution.This study presents an efficient and interpretable approach for ADMET property prediction,establishing a novel framework for molecular optimization and risk assessment in drug development. 展开更多
关键词 Molecular ADMET prediction Multi-view fusion Attention mechanism multi-task deep learning
原文传递
China Can Achieve Carbon Neutrality in Line with the Paris Agreement's 2℃Target:Navigating Global Emissions Scenarios,Warming Levels,and Extreme Event Projections 被引量:1
12
作者 Xiaoye Zhang Junting Zhong +4 位作者 Xiliang Zhang Da Zhang Changhong Miao Deying Wang Lifeng Guo 《Engineering》 2025年第1期207-214,共8页
This paper proposes that China,under the challenge of balancing its development and security,can aim for the Paris Agreement's goal to limit global warming to no more than 2℃by actively seeking carbonpeak and car... This paper proposes that China,under the challenge of balancing its development and security,can aim for the Paris Agreement's goal to limit global warming to no more than 2℃by actively seeking carbonpeak and carbon-neutrality pathways that align with China's national conditions,rather than following the idealized path toward the 1.5℃target by initially relying on extensive negative-emission technologies such as direct air carbon capture and storage(DACCS).This work suggests that pursuing a 1.5℃target is increasingly less feasible for China,as it would potentially incur 3–4 times the cost of pursuing the 2℃target.With China being likely to achieve a peak in its emissions around 2028,at about 12.8 billion tonnes of anthropogenic carbon dioxide(CO_(2)),and become carbon neutral,projected global warming levels may be less severe after the 2050s than previously estimated.This could reduce the risk potential of climate tipping points and extreme events,especially considering that the other two major carbon emitters in the world(Europe and North America)have already passed their carbon peaks.While natural carbon sinks will contribute to China's carbon neutrality efforts,they are not expected to be decisive in the transition stages.This research also addresses the growing focus on climate overshoot,tipping points,extreme events,loss and damage,and methane reductions in international climate cooperation,emphasizing the need to balance these issues with China's development,security,and fairness considerations.China's pursuit of carbon neutrality will have significant implications for global emissions scenarios,warming levels,and extreme event projections,as well as for climate change hotspots of international concern,such as climate tipping points,the climate crisis,and the notion that the world has moved from a warming to a boiling era.Possible research recommendations for global emissions scenarios based on China's 2℃target pathway are also summarized. 展开更多
关键词 Climate change 2℃target Carbon neutrality Emission scenarios Balanced mitigation
在线阅读 下载PDF
Piezo-actuated smart mechatronic systems for extreme scenarios 被引量:1
13
作者 Zhongxiang Yuan Shuliu Zhou +7 位作者 Cailin Hong Ziyu Xiao Zhengguang Zhang Xuedong Chen Lizhan Zeng Jiulin Wu Yunlong Wang Xiaoqing Li 《International Journal of Extreme Manufacturing》 2025年第2期72-119,共48页
Precision actuation is a foundational technology in high-end equipment domains,where stroke,velocity,and accuracy are critical for processing and/or detection quality,precision in spacecraft flight trajectories,and ac... Precision actuation is a foundational technology in high-end equipment domains,where stroke,velocity,and accuracy are critical for processing and/or detection quality,precision in spacecraft flight trajectories,and accuracy in weapon system strikes.Piezoelectric actuators(PEAs),known for their nanometer-level precision,flexible stroke,resistance to electromagnetic interference,and scalable structure,have been widely adopted across various fields.Therefore,this study focuses on extreme scenarios involving ultra-high precision(micrometer and beyond),minuscule scales,and highly complex operational conditions.It provides a comprehensive overview of the types,working principles,advantages,and disadvantages of PEAs,along with their potential applications in piezo-actuated smart mechatronic systems(PSMSs).To address the demands of extreme scenarios in high-end equipment fields,we have identified five representative application areas:positioning and alignment,biomedical device configuration,advanced manufacturing and processing,vibration mitigation,micro robot system.Each area is further divided into specific subcategories,where we explore the underlying relationships,mechanisms,representative schemes,and characteristics.Finally,we discuss the challenges and future development trends related to PEAs and PSMSs.This work aims to showcase the latest advancements in the application of PEAs and provide valuable guidance for researchers in this field. 展开更多
关键词 piezoelectric actuator nanopositioning system high-end equipment extreme scenarios piezo-actuated smart mechatronic system
在线阅读 下载PDF
Research on the Construction of Immersive Education Systems for Fire Safety in University Laboratories Using VR/AR in Hazardous Chemical Scenarios 被引量:1
14
作者 Xuezheng Wu 《Journal of Contemporary Educational Research》 2025年第10期357-362,共6页
With the rapid development of virtual reality(VR)and augmented reality(AR)technologies,their application potential in the field of education has become increasingly significant.For a long time,fire safety education in... With the rapid development of virtual reality(VR)and augmented reality(AR)technologies,their application potential in the field of education has become increasingly significant.For a long time,fire safety education in university laboratories has faced numerous challenges,and traditional teaching methods have been insufficiently effective,with high-risk scenarios difficult to realistically recreate.Especially in special scenarios involving hazardous chemicals,conventional training methods struggle to enable learners to achieve deep understanding and behavioral formation.This study systematically integrates immersive technology theory with safety education needs,providing a replicable technical solution for safety education in high-risk environments.Its modular design approach has reference value for expansion into other professional fields,offering practical evidence for innovation in safety education models in the digital age. 展开更多
关键词 VR/AR Hazardous chemicals scenarios University laboratories Fire safety Immersive education
在线阅读 下载PDF
Short-Term Rolling Prediction of Tropical Cyclone Intensity Based on Multi-Task Learning with Fusion of Deviation-Angle Variance and Satellite Imagery
15
作者 Wei TIAN Ping SONG +5 位作者 Yuanyuan CHEN Yonghong ZHANG Liguang WU Haikun ZHAO Kenny Thiam Choy LIM KAM SIAN Chunyi XIANG 《Advances in Atmospheric Sciences》 2025年第1期111-128,共18页
Tropical cyclones(TCs)are one of the most serious types of natural disasters,and accurate TC activity predictions are key to disaster prevention and mitigation.Recently,TC track predictions have made significant progr... Tropical cyclones(TCs)are one of the most serious types of natural disasters,and accurate TC activity predictions are key to disaster prevention and mitigation.Recently,TC track predictions have made significant progress,but the ability to predict their intensity is obviously lagging behind.At present,research on TC intensity prediction takes atmospheric reanalysis data as the research object and mines the relationship between TC-related environmental factors and intensity through deep learning.However,reanalysis data are non-real-time in nature,which does not meet the requirements for operational forecasting applications.Therefore,a TC intensity prediction model named TC-Rolling is proposed,which can simultaneously extract the degree of symmetry for strong TC convective cloud and convection intensity,and fuse the deviation-angle variance with satellite images to construct the correlation between TC convection structure and intensity.For TCs'complex dynamic processes,a convolutional neural network(CNN)is used to learn their temporal and spatial features.For real-time intensity estimation,multi-task learning acts as an implicit time-series enhancement.The model is designed with a rolling strategy that aims to moderate the long-term dependent decay problem and improve accuracy for short-term intensity predictions.Since multiple tasks are correlated,the loss function of 12 h and 24 h are corrected.After testing on a sample of TCs in the Northwest Pacific,with a 4.48 kt root-mean-square error(RMSE)of 6 h intensity prediction,5.78 kt for 12 h,and 13.94 kt for 24 h,TC records from official agencies are used to assess the validity of TC-Rolling. 展开更多
关键词 tropical cyclone INTENSITY structure rolling prediction multi-task
在线阅读 下载PDF
Explainable AI Based Multi-Task Learning Method for Stroke Prognosis
16
作者 Nan Ding Xingyu Zeng +1 位作者 Jianping Wu Liutao Zhao 《Computers, Materials & Continua》 2025年第9期5299-5315,共17页
Predicting the health status of stroke patients at different stages of the disease is a critical clinical task.The onset and development of stroke are affected by an array of factors,encompassing genetic predispositio... Predicting the health status of stroke patients at different stages of the disease is a critical clinical task.The onset and development of stroke are affected by an array of factors,encompassing genetic predisposition,environmental exposure,unhealthy lifestyle habits,and existing medical conditions.Although existing machine learning-based methods for predicting stroke patients’health status have made significant progress,limitations remain in terms of prediction accuracy,model explainability,and system optimization.This paper proposes a multi-task learning approach based on Explainable Artificial Intelligence(XAI)for predicting the health status of stroke patients.First,we design a comprehensive multi-task learning framework that utilizes the task correlation of predicting various health status indicators in patients,enabling the parallel prediction of multiple health indicators.Second,we develop a multi-task Area Under Curve(AUC)optimization algorithm based on adaptive low-rank representation,which removes irrelevant information from the model structure to enhance the performance of multi-task AUC optimization.Additionally,the model’s explainability is analyzed through the stability analysis of SHAP values.Experimental results demonstrate that our approach outperforms comparison algorithms in key prognostic metrics F1 score and Efficiency. 展开更多
关键词 Explainable AI stroke prognosis multi-task learning AUC optimization
在线阅读 下载PDF
MAMGBR: Group-Buying Recommendation Model Based on Multi-Head Attention Mechanism and Multi-Task Learning
17
作者 Zongzhe Xu Ming Yu 《Computers, Materials & Continua》 2025年第8期2805-2826,共22页
As the group-buying model shows significant progress in attracting new users,enhancing user engagement,and increasing platform profitability,providing personalized recommendations for group-buying users has emerged as... As the group-buying model shows significant progress in attracting new users,enhancing user engagement,and increasing platform profitability,providing personalized recommendations for group-buying users has emerged as a new challenge in the field of recommendation systems.This paper introduces a group-buying recommendation model based on multi-head attention mechanisms and multi-task learning,termed the Multi-head Attention Mechanisms and Multi-task Learning Group-Buying Recommendation(MAMGBR)model,specifically designed to optimize group-buying recommendations on e-commerce platforms.The core dataset of this study comes from the Chinese maternal and infant e-commerce platform“Beibei,”encompassing approximately 430,000 successful groupbuying actions and over 120,000 users.Themodel focuses on twomain tasks:recommending items for group organizers(Task Ⅰ)and recommending participants for a given group-buying event(Task Ⅱ).In model evaluation,MAMGBR achieves an MRR@10 of 0.7696 for Task I,marking a 20.23%improvement over baseline models.Furthermore,in Task II,where complex interaction patterns prevail,MAMGBR utilizes auxiliary loss functions to effectively model the multifaceted roles of users,items,and participants,leading to a 24.08%increase in MRR@100 under a 1:99 sample ratio.Experimental results show that compared to benchmark models,such as NGCF and EATNN,MAMGBR’s integration ofmulti-head attentionmechanisms,expert networks,and gating mechanisms enables more accurate modeling of user preferences and social associations within group-buying scenarios,significantly enhancing recommendation accuracy and platform group-buying success rates. 展开更多
关键词 Group-buying recommendation multi-head attention mechanism multi-task learning
在线阅读 下载PDF
Joint Retrieval of PM_(2.5) Concentration and Aerosol Optical Depth over China Using Multi-Task Learning on FY-4A AGRI
18
作者 Bo LI Disong FU +4 位作者 Ling YANG Xuehua FAN Dazhi YANG Hongrong SHI Xiang’ao XIA 《Advances in Atmospheric Sciences》 2025年第1期94-110,共17页
Aerosol optical depth(AOD)and fine particulate matter with a diameter of less than or equal to 2.5μm(PM_(2.5))play crucial roles in air quality,human health,and climate change.However,the complex correlation of AOD–... Aerosol optical depth(AOD)and fine particulate matter with a diameter of less than or equal to 2.5μm(PM_(2.5))play crucial roles in air quality,human health,and climate change.However,the complex correlation of AOD–PM_(2.5)and the limitations of existing algorithms pose a significant challenge in realizing the accurate joint retrieval of these two parameters at the same location.On this point,a multi-task learning(MTL)model,which enables the joint retrieval of PM_(2.5)concentration and AOD,is proposed and applied on the top-of-the-atmosphere reflectance data gathered by the Fengyun-4A Advanced Geosynchronous Radiation Imager(FY-4A AGRI),and compared to that of two single-task learning models—namely,Random Forest(RF)and Deep Neural Network(DNN).Specifically,MTL achieves a coefficient of determination(R^(2))of 0.88 and a root-mean-square error(RMSE)of 0.10 in AOD retrieval.In comparison to RF,the R^(2)increases by 0.04,the RMSE decreases by 0.02,and the percentage of retrieval results falling within the expected error range(Within-EE)rises by 5.55%.The R^(2)and RMSE of PM_(2.5)retrieval by MTL are 0.84 and 13.76μg m~(-3)respectively.Compared with RF,the R^(2)increases by 0.06,the RMSE decreases by 4.55μg m~(-3),and the Within-EE increases by 7.28%.Additionally,compared to DNN,MTL shows an increase of 0.01 in R^(2)and a decrease of 0.02 in RMSE in AOD retrieval,with a corresponding increase of 2.89%in Within-EE.For PM_(2.5)retrieval,MTL exhibits an increase of 0.05 in R^(2),a decrease of 1.76μg m~(-3)in RMSE,and an increase of 6.83%in Within-EE.The evaluation suggests that MTL is able to provide simultaneously improved AOD and PM_(2.5)retrievals,demonstrating a significant advantage in efficiently capturing the spatial distribution of PM_(2.5)concentration and AOD. 展开更多
关键词 AOD PM_(2.5) FY-4A multi-task learning joint retrieval
在线阅读 下载PDF
Skillful bias correction of offshore near-surface wind field forecasting based on a multi-task machine learning model
19
作者 Qiyang Liu Anboyu Guo +5 位作者 Fengxue Qiao Xinjian Ma Yan-An Liu Yong Huang Rui Wang Chunyan Sheng 《Atmospheric and Oceanic Science Letters》 2025年第5期28-35,共8页
Accurate short-term forecast of offshore wind fields is still challenging for numerical weather prediction models.Based on three years of 48-hour forecast data from the European Centre for Medium-Range Weather Forecas... Accurate short-term forecast of offshore wind fields is still challenging for numerical weather prediction models.Based on three years of 48-hour forecast data from the European Centre for Medium-Range Weather Forecasts Integrated Forecasting System global model(ECMWF-IFS)over 14 offshore weather stations along the coast of Shandong Province,this study introduces a multi-task learning(MTL)model(TabNet-MTL),which significantly improves the forecast bias of near-surface wind direction and speed simultaneously.TabNet-MTL adopts the feature engineering method,utilizes mean square error as the loss function,and employs the 5-fold cross validation method to ensure the generalization ability of the trained model.It demonstrates superior skills in wind field correction across different forecast lead times over all stations compared to its single-task version(TabNet-STL)and three other popular single-task learning models(Random Forest,LightGBM,and XGBoost).Results show that it significantly reduces root mean square error of the ECMWF-IFS wind speed forecast from 2.20 to 1.25 m s−1,and increases the forecast accuracy of wind direction from 50%to 65%.As an explainable deep learning model,the weather stations and long-term temporal statistics of near-surface wind speed are identified as the most influential variables for TabNet-MTL in constructing its feature engineering. 展开更多
关键词 Forecast bias correction Wind field multi-task learning Feature engineering Explainable AI
在线阅读 下载PDF
DKP-ADS:Domain knowledge prompt combined with multi-task learning for assessment of foliar disease severity in staple crops
20
作者 Yujiao Dan Xingcai Wu +5 位作者 Ya Yu Ziang Zou R.D.S.M Gunarathna Peijia Yu Yuanyuan Xiao Qi Wang 《The Crop Journal》 2025年第6期1939-1954,共16页
Staple crops are the cornerstone of the food supply but are frequently threatened by plant diseases.Effective disease management,including disease identification and severity assessment,helps to better address these c... Staple crops are the cornerstone of the food supply but are frequently threatened by plant diseases.Effective disease management,including disease identification and severity assessment,helps to better address these challenges.Currently,methods for disease severity assessment typically rely on calculating the area proportion of disease segmentation regions or using classification networks for severity assessment.However,these methods require large amounts of labeled data and fail to quantify lesion proportions when using classification networks,leading to inaccurate evaluations.To address these issues,we propose an automated framework for disease severity assessment that combines multi-task learning and knowledge-driven large-model segmentation techniques.This framework includes an image information processor,a lesion and leaf segmentation module,and a disease severity assessment module.First,the image information processor utilizes a multi-task learning strategy to analyze input images comprehensively,ensuring a deep understanding of disease characteristics.Second,the lesion and leaf segmentation module employ prompt-driven large-model technology to accurately segment diseased areas and entire leaves,providing detailed visual analysis.Finally,the disease severity assessment module objectively evaluates the severity of the disease based on professional grading standards by calculating lesion area proportions.Additionally,we have developed a comprehensive database of diseased leaf images from major crops,including several task-specific datasets.Experimental results demonstrate that our framework can accurately identify and assess the types and severity of crop diseases,even without extensive labeled data.Codes and data are available at http://dkp-ads.samlab.cn/. 展开更多
关键词 Domain knowledge Prompt-driven multi-task learning Staple crop Assessment of disease severity
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部