期刊文献+
共找到791篇文章
< 1 2 40 >
每页显示 20 50 100
探索非零位置约束:算法-硬件协同设计的DNN稀疏训练方法
1
作者 王淼 张盛兵 张萌 《西北工业大学学报》 北大核心 2025年第1期119-127,共9页
设备上的学习使得边缘设备能连续适应人工智能应用的新数据。利用稀疏性消除训练过程中的冗余计算和存储占用是提高边缘深度神经网络(deep neural network,DNN)学习效率的关键途径。然而由于缺乏对非零位置的假设,往往需要昂贵的代价用... 设备上的学习使得边缘设备能连续适应人工智能应用的新数据。利用稀疏性消除训练过程中的冗余计算和存储占用是提高边缘深度神经网络(deep neural network,DNN)学习效率的关键途径。然而由于缺乏对非零位置的假设,往往需要昂贵的代价用于实时地识别和分配零的位置以及对不规则计算的负载均衡,这使得现有稀疏训练工作难以接近理想加速比。如果能提前预知训练过程中操作数的非零位置约束规则,就可以跳过这些处理开销,从而提升稀疏训练性能和能效比。针对稀疏训练过程,面向边缘场景中典型的3类激活函数探索操作数之间的位置约束规则,提出:①一个硬件友好的稀疏训练算法以减少3个阶段的计算量和存储压力;②一个高能效的稀疏训练加速器,能预估非零位置使得实时处理代价被并行执行掩盖。实验表明所提出的方法比密集加速器和2个其他稀疏训练工作的能效比分别提升了2.2倍,1.38倍和1.46倍。 展开更多
关键词 稀疏训练 非零位置约束 dnn 稀疏加速器
在线阅读 下载PDF
基于加权聚类和DNN的KR法脱硫剂加入量预报模型 被引量:1
2
作者 李威 熊凌 +3 位作者 罗钟邱 吴经纬 万诗斐 但斌斌 《炼钢》 北大核心 2025年第1期12-18,44,共8页
为了准确预测铁水KR脱硫工序中的脱硫剂加入量,提出了一种基于密度的聚类方法(DBSCAN)和深度神经网络(DNN)相结合的建模方法。首先计算Spearman相关系数筛选出与脱硫剂加入量相关性较强的6个输入特征;基于筛选后的特征,利用DNN对数据集... 为了准确预测铁水KR脱硫工序中的脱硫剂加入量,提出了一种基于密度的聚类方法(DBSCAN)和深度神经网络(DNN)相结合的建模方法。首先计算Spearman相关系数筛选出与脱硫剂加入量相关性较强的6个输入特征;基于筛选后的特征,利用DNN对数据集建立脱硫剂加入量预测模型;通过SHapley Additive exPlanations(SHAP)方法解释DNN模型,计算出各个特征对模型输出的贡献程度,根据得到的权重代入DBSCAN聚类算法中对某炼钢厂的脱硫实际生产数据进行聚类,保留清洗后的数据集;最后,通过五折交叉验证的方法对比了数据清洗前后的支持向量回归(SVR)、随机森林(RF)、极限梯度提升(XGBoost)、BP神经网络、深度神经网络(DNN)的预测模型性能。试验结果表明,使用清洗后的数据集建立的脱硫剂加入量预测模型的均方根误差(RMSE)、平均绝对误差(MAE)、平均绝对百分比误差(MAPE)、决定系数(R^(2))较原数据集平均提高了33.6%、15.5%、12.9%、6.9%。 展开更多
关键词 KR脱硫 SHAP DBSCAN聚类 dnn 预测模型
原文传递
基于DNN超参数优化的5G-R宽带集群通信故障识别
3
作者 乔婉淇 丁建文 +2 位作者 郭强亮 孙斌 王玮 《铁道科学与工程学报》 北大核心 2025年第10期4749-4760,共12页
MCX系统(MCPTT、MCData及MCVideo的统称)是5G-R宽带集群通信的核心组成部分,其结构复杂、功能繁多。目前,MCX系统通信故障时采用传统的人工故障排查方法,效率低下,缺乏智能化的识别与分析手段,对此,提出一种基于DNN(deep neural network... MCX系统(MCPTT、MCData及MCVideo的统称)是5G-R宽带集群通信的核心组成部分,其结构复杂、功能繁多。目前,MCX系统通信故障时采用传统的人工故障排查方法,效率低下,缺乏智能化的识别与分析手段,对此,提出一种基于DNN(deep neural network,DNN)超参数优化的5G-R宽带集群通信故障识别方法。首先,通过多维特征融合提取特征,构建用于故障分类模型训练和测试的样本数据集;其次,针对大规模且分布不均的MCX系统通信故障数据,提出反馈驱动的自适应超参数优化(feedback-driven adaptive hyperparameter optimization,FDAHO)算法,优化数据采样和处理方法,改进超参数优化算法;最后,利用DNN结合FDAHO算法构建故障分类模型,采用公共数据集和全实物平台测试处理后所得的实测数据集,将所构建的模型分别与CNN(convolutional neural networks,CNN)结合FDAHO算法的模型、传统贝叶斯优化算法下的模型进行对比。实验结果表明:FDAHO算法在提高分类准确率和F1分数的同时,显著减少了超参数优化时间,提升了模型在资源受限环境下的实用性,且与DNN网络的结合具有更高的优越性和稳定性;实测数据集下所提模型在分类准确率、F1分数上相较于无超参数优化算法的模型分别提高了10.734%、11.328%;相较于DNN结合传统贝叶斯优化模型分别提高了0.342%、0.365%,且超参数优化时间减少了2152 s,可实现对MCX系统通信故障的高准确率和高效分类。研究结果为5G-R宽带集群通信的故障智能检测及监测提供参考。 展开更多
关键词 5G-R MCX 超参数优化 dnn 故障分类
在线阅读 下载PDF
基于POD-DNN降阶模型的油浸式变压器绕组稳态温升快速计算方法 被引量:1
4
作者 赵庆贤 刘云鹏 +3 位作者 刘刚 傅榕韵 邹莹 武卫革 《中国电机工程学报》 北大核心 2025年第6期2423-2436,I0033,共15页
为解决油浸式变压器绕组稳态温升计算耗时久的问题,该文提出一种基于POD-DNN降阶模型的快速计算方法。首先,通过绕组稳态温升全阶模型构建快照矩阵,并基于本征正交分解(proper orthogonal decomposition,POD)获得物理系统的模态及模态... 为解决油浸式变压器绕组稳态温升计算耗时久的问题,该文提出一种基于POD-DNN降阶模型的快速计算方法。首先,通过绕组稳态温升全阶模型构建快照矩阵,并基于本征正交分解(proper orthogonal decomposition,POD)获得物理系统的模态及模态系数。然后,建立工况参数与模态系数间的深度神经网络(deep neural networks,DNN)代理模型,解决POD方法中非线性项求解效率低和控制方程依赖强的局限,同时设计网络正则化策略,避免小样本下模型过拟合。最后,将DNN代理模型预测的模态系数与对应的POD模态线性加权,重构绕组温度场。经验证,POD-DNN求解的绕组温升结果与Fluent仿真和试验测量高度一致,计算效率相较于全阶模型和Fluent仿真分别提升了247478倍和23056倍,该算法能够为变压器的在线监测、运行维护和绝缘设计提供技术支撑。 展开更多
关键词 本征正交分解 深度神经网络 绕组稳态温升 快速计算 降阶模型
原文传递
NoC加速器中的高效DNN动态切片与智能映射算法
5
作者 齐芸 欧阳一鸣 《电信科学》 北大核心 2025年第10期151-160,共10页
针对深度神经网络(deep neural network,DNN)模型在传统切片与映射方法中存在的资源调度和数据传输瓶颈问题,提出了一种基于片上网络(network on chip,NoC)加速器的高效DNN动态切片与智能映射优化算法。该算法通过动态切片技术灵活划分... 针对深度神经网络(deep neural network,DNN)模型在传统切片与映射方法中存在的资源调度和数据传输瓶颈问题,提出了一种基于片上网络(network on chip,NoC)加速器的高效DNN动态切片与智能映射优化算法。该算法通过动态切片技术灵活划分DNN模型的计算任务,并结合智能映射策略优化NoC架构中的任务分配与数据流管理。实验结果表明,与传统方法相比,该算法在计算吞吐量、NoC传输时延、外部内存访问次数和计算能效等方面均显著提升,尤其在复杂模型上表现突出。 展开更多
关键词 NoC加速器 dnn切片 智能映射
在线阅读 下载PDF
基于改进MFCC特征提取和DNN网络的机器人语音识别方法研究 被引量:4
6
作者 秦垲忻 王炜昕 王砚生 《计算机测量与控制》 2025年第2期246-253,共8页
为了实现机器人语音控制,并避免环境噪音的干扰,研究提出了基于改进MFCC特征提取和深度神经网络的机器人语音控制指令识别方法;该方法利用线性判别分析、最大似然线性变换和说话人自适应变换对MFCC特征进行处理,获得了新的语音特征;同... 为了实现机器人语音控制,并避免环境噪音的干扰,研究提出了基于改进MFCC特征提取和深度神经网络的机器人语音控制指令识别方法;该方法利用线性判别分析、最大似然线性变换和说话人自适应变换对MFCC特征进行处理,获得了新的语音特征;同时通过深度玻尔兹曼机对声学模型进行了改进,并利用深度神经网络和谐波增强技术构建了语音增强方法;实验结果显示,研究提出的基于改进Mel频率倒谱系数特征能显著降低语音识别的字错误率,通过辅以改进深度神经网络-隐马尔科夫模型能进一步降低字错误率;在20 dB条件下,该特征和改进深度神经网络-隐马尔科夫模型的平均字错误率分别为24.9%和22.1%,均低于其他方法;上述结果表明,研究提出的语音识别方法能实现带噪声语音的准确识别,提高机器人的语音控制指令识别能力。 展开更多
关键词 语音识别 语音增强 声学模型 MFCC特征 dnn
在线阅读 下载PDF
DEEP NEURAL NETWORKS COMBINING MULTI-TASK LEARNING FOR SOLVING DELAY INTEGRO-DIFFERENTIAL EQUATIONS 被引量:1
7
作者 WANG Chen-yao SHI Feng 《数学杂志》 2025年第1期13-38,共26页
Deep neural networks(DNNs)are effective in solving both forward and inverse problems for nonlinear partial differential equations(PDEs).However,conventional DNNs are not effective in handling problems such as delay di... Deep neural networks(DNNs)are effective in solving both forward and inverse problems for nonlinear partial differential equations(PDEs).However,conventional DNNs are not effective in handling problems such as delay differential equations(DDEs)and delay integrodifferential equations(DIDEs)with constant delays,primarily due to their low regularity at delayinduced breaking points.In this paper,a DNN method that combines multi-task learning(MTL)which is proposed to solve both the forward and inverse problems of DIDEs.The core idea of this approach is to divide the original equation into multiple tasks based on the delay,using auxiliary outputs to represent the integral terms,followed by the use of MTL to seamlessly incorporate the properties at the breaking points into the loss function.Furthermore,given the increased training dificulty associated with multiple tasks and outputs,we employ a sequential training scheme to reduce training complexity and provide reference solutions for subsequent tasks.This approach significantly enhances the approximation accuracy of solving DIDEs with DNNs,as demonstrated by comparisons with traditional DNN methods.We validate the effectiveness of this method through several numerical experiments,test various parameter sharing structures in MTL and compare the testing results of these structures.Finally,this method is implemented to solve the inverse problem of nonlinear DIDE and the results show that the unknown parameters of DIDE can be discovered with sparse or noisy data. 展开更多
关键词 Delay integro-differential equation multi-task learning parameter sharing structure deep neural network sequential training scheme
在线阅读 下载PDF
5G定制DNN就近接入实现方案研究
8
作者 林朝辉 张小勇 +1 位作者 张欣 平军磊 《邮电设计技术》 2025年第4期77-80,共4页
随着5G技术的快速发展和广泛应用,网络服务的个性化和定制化需求日益增长,车企对定制DNN的就近接入需求已成为5G行业应用的重要研究方向之一。基于5G核心网络技术特性,结合当前网络设备实际部署情况,对5G网络下定制DNN就近接入技术的实... 随着5G技术的快速发展和广泛应用,网络服务的个性化和定制化需求日益增长,车企对定制DNN的就近接入需求已成为5G行业应用的重要研究方向之一。基于5G核心网络技术特性,结合当前网络设备实际部署情况,对5G网络下定制DNN就近接入技术的实现方案进行研究。 展开更多
关键词 5G dnn 就近接入 AM-PCF
在线阅读 下载PDF
A Survey of Cooperative Multi-agent Reinforcement Learning for Multi-task Scenarios 被引量:1
9
作者 Jiajun CHAI Zijie ZHAO +1 位作者 Yuanheng ZHU Dongbin ZHAO 《Artificial Intelligence Science and Engineering》 2025年第2期98-121,共24页
Cooperative multi-agent reinforcement learning(MARL)is a key technology for enabling cooperation in complex multi-agent systems.It has achieved remarkable progress in areas such as gaming,autonomous driving,and multi-... Cooperative multi-agent reinforcement learning(MARL)is a key technology for enabling cooperation in complex multi-agent systems.It has achieved remarkable progress in areas such as gaming,autonomous driving,and multi-robot control.Empowering cooperative MARL with multi-task decision-making capabilities is expected to further broaden its application scope.In multi-task scenarios,cooperative MARL algorithms need to address 3 types of multi-task problems:reward-related multi-task,arising from different reward functions;multi-domain multi-task,caused by differences in state and action spaces,state transition functions;and scalability-related multi-task,resulting from the dynamic variation in the number of agents.Most existing studies focus on scalability-related multitask problems.However,with the increasing integration between large language models(LLMs)and multi-agent systems,a growing number of LLM-based multi-agent systems have emerged,enabling more complex multi-task cooperation.This paper provides a comprehensive review of the latest advances in this field.By combining multi-task reinforcement learning with cooperative MARL,we categorize and analyze the 3 major types of multi-task problems under multi-agent settings,offering more fine-grained classifications and summarizing key insights for each.In addition,we summarize commonly used benchmarks and discuss future directions of research in this area,which hold promise for further enhancing the multi-task cooperation capabilities of multi-agent systems and expanding their practical applications in the real world. 展开更多
关键词 multi-task multi-agent reinforcement learning large language models
在线阅读 下载PDF
MolP-PC:a multi-view fusion and multi-task learning framework for drug ADMET property prediction 被引量:1
10
作者 Sishu Li Jing Fan +2 位作者 Haiyang He Ruifeng Zhou Jun Liao 《Chinese Journal of Natural Medicines》 2025年第11期1293-1300,共8页
The accurate prediction of drug absorption,distribution,metabolism,excretion,and toxicity(ADMET)properties represents a crucial step in early drug development for reducing failure risk.Current deep learning approaches... The accurate prediction of drug absorption,distribution,metabolism,excretion,and toxicity(ADMET)properties represents a crucial step in early drug development for reducing failure risk.Current deep learning approaches face challenges with data sparsity and information loss due to single-molecule representation limitations and isolated predictive tasks.This research proposes molecular properties prediction with parallel-view and collaborative learning(MolP-PC),a multi-view fusion and multi-task deep learning framework that integrates 1D molecular fingerprints(MFs),2D molecular graphs,and 3D geometric representations,incorporating an attention-gated fusion mechanism and multi-task adaptive learning strategy for precise ADMET property predictions.Experimental results demonstrate that MolP-PC achieves optimal performance in 27 of 54 tasks,with its multi-task learning(MTL)mechanism significantly enhancing predictive performance on small-scale datasets and surpassing single-task models in 41 of 54 tasks.Additional ablation studies and interpretability analyses confirm the significance of multi-view fusion in capturing multi-dimensional molecular information and enhancing model generalization.A case study examining the anticancer compound Oroxylin A demonstrates MolP-PC’s effective generalization in predicting key pharmacokinetic parameters such as half-life(T0.5)and clearance(CL),indicating its practical utility in drug modeling.However,the model exhibits a tendency to underestimate volume of distribution(VD),indicating potential for improvement in analyzing compounds with high tissue distribution.This study presents an efficient and interpretable approach for ADMET property prediction,establishing a novel framework for molecular optimization and risk assessment in drug development. 展开更多
关键词 Molecular ADMET prediction Multi-view fusion Attention mechanism multi-task deep learning
原文传递
改进DDPG的端边DNN协同推理策略
11
作者 和涛 栗娟 《计算机工程与应用》 北大核心 2025年第2期304-315,共12页
当前基于端边的深度神经网络(deep neural network,DNN)协同推理策略仅关注于优化时延敏感型任务的推理时延,而未考虑能耗敏感型任务的推理能耗成本,以及DNN划分后在异构边缘服务器之间的高效卸载问题。基于此,提出一种改进深度确定性... 当前基于端边的深度神经网络(deep neural network,DNN)协同推理策略仅关注于优化时延敏感型任务的推理时延,而未考虑能耗敏感型任务的推理能耗成本,以及DNN划分后在异构边缘服务器之间的高效卸载问题。基于此,提出一种改进深度确定性策略梯度(deep deterministic policy gradients,DDPG)的端边DNN协同推理策略,综合考虑任务对时延与能耗的敏感度,进而对推理成本进行综合优化。该策略将DNN划分与计算卸载问题分离,对不同协同设备建立预测模型,去预测出协同推理DNN的最优划分点与推理综合成本;根据预测的推理综合成本建立奖励函数,使用DDPG算法制定每个DNN推理任务的卸载策略,进而进行协同推理。实验结果证明,相比其他DNN协同推理策略,该策略在复杂的DNN协同推理环境下决策更高效,推理时延平均减少了46%,推理能耗平均减少了44%,推理综合成本平均降低了46%。 展开更多
关键词 边缘智能 深度神经网络(dnn) 协同推理 深度确定性策略梯度 任务卸载 能耗优化
在线阅读 下载PDF
基于DNN-GRU-SVM的深度学习组合模型的网络入侵检测方法
12
作者 刘虎鹏 颜辉 +5 位作者 于萍 许晓晴 龙蕴鑫 耿晓中 龙多 赵禺 《电脑与信息技术》 2025年第4期64-70,共7页
针对现代大数据环境中网络入侵检测系统(network intrusion detection system,NIDS)难以应对复杂网络攻击的问题,提出了一种基于深度神经网络(Deep Neural Network,DNN)-门控循环单元(Gated Recurrent Unit,GRU)-支持向量机(Support Vec... 针对现代大数据环境中网络入侵检测系统(network intrusion detection system,NIDS)难以应对复杂网络攻击的问题,提出了一种基于深度神经网络(Deep Neural Network,DNN)-门控循环单元(Gated Recurrent Unit,GRU)-支持向量机(Support Vector Machine,SVM)的组合模型DNN-GRU-SVM。该模型结合了DNN、GRU与SVM的优势,首先利用DNN提取网络数据特征,通过调整学习率与批量归一化来加速训练并减少过拟合;采用GRU捕捉序列数据中的时间依赖性;通过SVM实现精确分类。在KDD Cup'99数据集上的实验表明,DNNGRU-SVM组合模型取得了显著的性能提升,其检测准确率达94.53%,精确度为99.8%,召回率为92.8%,F1分数为96.2%,显著优于传统机器学习算法及单一的深度神经网络。实验结果表明,该模型能够有效提高网络入侵检测的准确率和适应性,为复杂网络环境下的入侵检测提供了可靠的解决方案。 展开更多
关键词 网络入侵检测 机器学习 深度学习 dnn-GRU-SVM
在线阅读 下载PDF
基于MAHAKIL与AM-DNN的煤层识别方法
13
作者 马晓易 段中钰 《北京信息科技大学学报(自然科学版)》 2025年第2期93-98,共6页
针对煤层识别中数据不平衡导致的精度下降问题,提出一种基于过采样算法MAHAKIL的融合注意力机制(attention mechanism,AM)的深度神经网络(deep neural network,DNN)模型MAHAKIL-AM-DNN。首先,使用改进的MAHAKIL算法生成具有多样性的合... 针对煤层识别中数据不平衡导致的精度下降问题,提出一种基于过采样算法MAHAKIL的融合注意力机制(attention mechanism,AM)的深度神经网络(deep neural network,DNN)模型MAHAKIL-AM-DNN。首先,使用改进的MAHAKIL算法生成具有多样性的合成样本;然后,使用注意力机制强化关键特征权重,优化深度神经网络的识别能力。实验结果表明,相较于不使用过采样技术的DNN方法以及使用合成少数类过采样技术(synthetic minority over-sampling technique,SMOTE)的SMOTE-DNN方法,该方法性能更优,F1值分别提高了58.5和4.8百分点,提升了煤层识别精度。 展开更多
关键词 煤层识别 遗传算法 过采样 注意力机制 深度神经网络
在线阅读 下载PDF
Short-Term Rolling Prediction of Tropical Cyclone Intensity Based on Multi-Task Learning with Fusion of Deviation-Angle Variance and Satellite Imagery
14
作者 Wei TIAN Ping SONG +5 位作者 Yuanyuan CHEN Yonghong ZHANG Liguang WU Haikun ZHAO Kenny Thiam Choy LIM KAM SIAN Chunyi XIANG 《Advances in Atmospheric Sciences》 2025年第1期111-128,共18页
Tropical cyclones(TCs)are one of the most serious types of natural disasters,and accurate TC activity predictions are key to disaster prevention and mitigation.Recently,TC track predictions have made significant progr... Tropical cyclones(TCs)are one of the most serious types of natural disasters,and accurate TC activity predictions are key to disaster prevention and mitigation.Recently,TC track predictions have made significant progress,but the ability to predict their intensity is obviously lagging behind.At present,research on TC intensity prediction takes atmospheric reanalysis data as the research object and mines the relationship between TC-related environmental factors and intensity through deep learning.However,reanalysis data are non-real-time in nature,which does not meet the requirements for operational forecasting applications.Therefore,a TC intensity prediction model named TC-Rolling is proposed,which can simultaneously extract the degree of symmetry for strong TC convective cloud and convection intensity,and fuse the deviation-angle variance with satellite images to construct the correlation between TC convection structure and intensity.For TCs'complex dynamic processes,a convolutional neural network(CNN)is used to learn their temporal and spatial features.For real-time intensity estimation,multi-task learning acts as an implicit time-series enhancement.The model is designed with a rolling strategy that aims to moderate the long-term dependent decay problem and improve accuracy for short-term intensity predictions.Since multiple tasks are correlated,the loss function of 12 h and 24 h are corrected.After testing on a sample of TCs in the Northwest Pacific,with a 4.48 kt root-mean-square error(RMSE)of 6 h intensity prediction,5.78 kt for 12 h,and 13.94 kt for 24 h,TC records from official agencies are used to assess the validity of TC-Rolling. 展开更多
关键词 tropical cyclone INTENSITY structure rolling prediction multi-task
在线阅读 下载PDF
Explainable AI Based Multi-Task Learning Method for Stroke Prognosis
15
作者 Nan Ding Xingyu Zeng +1 位作者 Jianping Wu Liutao Zhao 《Computers, Materials & Continua》 2025年第9期5299-5315,共17页
Predicting the health status of stroke patients at different stages of the disease is a critical clinical task.The onset and development of stroke are affected by an array of factors,encompassing genetic predispositio... Predicting the health status of stroke patients at different stages of the disease is a critical clinical task.The onset and development of stroke are affected by an array of factors,encompassing genetic predisposition,environmental exposure,unhealthy lifestyle habits,and existing medical conditions.Although existing machine learning-based methods for predicting stroke patients’health status have made significant progress,limitations remain in terms of prediction accuracy,model explainability,and system optimization.This paper proposes a multi-task learning approach based on Explainable Artificial Intelligence(XAI)for predicting the health status of stroke patients.First,we design a comprehensive multi-task learning framework that utilizes the task correlation of predicting various health status indicators in patients,enabling the parallel prediction of multiple health indicators.Second,we develop a multi-task Area Under Curve(AUC)optimization algorithm based on adaptive low-rank representation,which removes irrelevant information from the model structure to enhance the performance of multi-task AUC optimization.Additionally,the model’s explainability is analyzed through the stability analysis of SHAP values.Experimental results demonstrate that our approach outperforms comparison algorithms in key prognostic metrics F1 score and Efficiency. 展开更多
关键词 Explainable AI stroke prognosis multi-task learning AUC optimization
在线阅读 下载PDF
MAMGBR: Group-Buying Recommendation Model Based on Multi-Head Attention Mechanism and Multi-Task Learning
16
作者 Zongzhe Xu Ming Yu 《Computers, Materials & Continua》 2025年第8期2805-2826,共22页
As the group-buying model shows significant progress in attracting new users,enhancing user engagement,and increasing platform profitability,providing personalized recommendations for group-buying users has emerged as... As the group-buying model shows significant progress in attracting new users,enhancing user engagement,and increasing platform profitability,providing personalized recommendations for group-buying users has emerged as a new challenge in the field of recommendation systems.This paper introduces a group-buying recommendation model based on multi-head attention mechanisms and multi-task learning,termed the Multi-head Attention Mechanisms and Multi-task Learning Group-Buying Recommendation(MAMGBR)model,specifically designed to optimize group-buying recommendations on e-commerce platforms.The core dataset of this study comes from the Chinese maternal and infant e-commerce platform“Beibei,”encompassing approximately 430,000 successful groupbuying actions and over 120,000 users.Themodel focuses on twomain tasks:recommending items for group organizers(Task Ⅰ)and recommending participants for a given group-buying event(Task Ⅱ).In model evaluation,MAMGBR achieves an MRR@10 of 0.7696 for Task I,marking a 20.23%improvement over baseline models.Furthermore,in Task II,where complex interaction patterns prevail,MAMGBR utilizes auxiliary loss functions to effectively model the multifaceted roles of users,items,and participants,leading to a 24.08%increase in MRR@100 under a 1:99 sample ratio.Experimental results show that compared to benchmark models,such as NGCF and EATNN,MAMGBR’s integration ofmulti-head attentionmechanisms,expert networks,and gating mechanisms enables more accurate modeling of user preferences and social associations within group-buying scenarios,significantly enhancing recommendation accuracy and platform group-buying success rates. 展开更多
关键词 Group-buying recommendation multi-head attention mechanism multi-task learning
在线阅读 下载PDF
Joint Retrieval of PM_(2.5) Concentration and Aerosol Optical Depth over China Using Multi-Task Learning on FY-4A AGRI
17
作者 Bo LI Disong FU +4 位作者 Ling YANG Xuehua FAN Dazhi YANG Hongrong SHI Xiang’ao XIA 《Advances in Atmospheric Sciences》 2025年第1期94-110,共17页
Aerosol optical depth(AOD)and fine particulate matter with a diameter of less than or equal to 2.5μm(PM_(2.5))play crucial roles in air quality,human health,and climate change.However,the complex correlation of AOD–... Aerosol optical depth(AOD)and fine particulate matter with a diameter of less than or equal to 2.5μm(PM_(2.5))play crucial roles in air quality,human health,and climate change.However,the complex correlation of AOD–PM_(2.5)and the limitations of existing algorithms pose a significant challenge in realizing the accurate joint retrieval of these two parameters at the same location.On this point,a multi-task learning(MTL)model,which enables the joint retrieval of PM_(2.5)concentration and AOD,is proposed and applied on the top-of-the-atmosphere reflectance data gathered by the Fengyun-4A Advanced Geosynchronous Radiation Imager(FY-4A AGRI),and compared to that of two single-task learning models—namely,Random Forest(RF)and Deep Neural Network(DNN).Specifically,MTL achieves a coefficient of determination(R^(2))of 0.88 and a root-mean-square error(RMSE)of 0.10 in AOD retrieval.In comparison to RF,the R^(2)increases by 0.04,the RMSE decreases by 0.02,and the percentage of retrieval results falling within the expected error range(Within-EE)rises by 5.55%.The R^(2)and RMSE of PM_(2.5)retrieval by MTL are 0.84 and 13.76μg m~(-3)respectively.Compared with RF,the R^(2)increases by 0.06,the RMSE decreases by 4.55μg m~(-3),and the Within-EE increases by 7.28%.Additionally,compared to DNN,MTL shows an increase of 0.01 in R^(2)and a decrease of 0.02 in RMSE in AOD retrieval,with a corresponding increase of 2.89%in Within-EE.For PM_(2.5)retrieval,MTL exhibits an increase of 0.05 in R^(2),a decrease of 1.76μg m~(-3)in RMSE,and an increase of 6.83%in Within-EE.The evaluation suggests that MTL is able to provide simultaneously improved AOD and PM_(2.5)retrievals,demonstrating a significant advantage in efficiently capturing the spatial distribution of PM_(2.5)concentration and AOD. 展开更多
关键词 AOD PM_(2.5) FY-4A multi-task learning joint retrieval
在线阅读 下载PDF
Skillful bias correction of offshore near-surface wind field forecasting based on a multi-task machine learning model
18
作者 Qiyang Liu Anboyu Guo +5 位作者 Fengxue Qiao Xinjian Ma Yan-An Liu Yong Huang Rui Wang Chunyan Sheng 《Atmospheric and Oceanic Science Letters》 2025年第5期28-35,共8页
Accurate short-term forecast of offshore wind fields is still challenging for numerical weather prediction models.Based on three years of 48-hour forecast data from the European Centre for Medium-Range Weather Forecas... Accurate short-term forecast of offshore wind fields is still challenging for numerical weather prediction models.Based on three years of 48-hour forecast data from the European Centre for Medium-Range Weather Forecasts Integrated Forecasting System global model(ECMWF-IFS)over 14 offshore weather stations along the coast of Shandong Province,this study introduces a multi-task learning(MTL)model(TabNet-MTL),which significantly improves the forecast bias of near-surface wind direction and speed simultaneously.TabNet-MTL adopts the feature engineering method,utilizes mean square error as the loss function,and employs the 5-fold cross validation method to ensure the generalization ability of the trained model.It demonstrates superior skills in wind field correction across different forecast lead times over all stations compared to its single-task version(TabNet-STL)and three other popular single-task learning models(Random Forest,LightGBM,and XGBoost).Results show that it significantly reduces root mean square error of the ECMWF-IFS wind speed forecast from 2.20 to 1.25 m s−1,and increases the forecast accuracy of wind direction from 50%to 65%.As an explainable deep learning model,the weather stations and long-term temporal statistics of near-surface wind speed are identified as the most influential variables for TabNet-MTL in constructing its feature engineering. 展开更多
关键词 Forecast bias correction Wind field multi-task learning Feature engineering Explainable AI
在线阅读 下载PDF
DKP-ADS:Domain knowledge prompt combined with multi-task learning for assessment of foliar disease severity in staple crops
19
作者 Yujiao Dan Xingcai Wu +5 位作者 Ya Yu Ziang Zou R.D.S.M Gunarathna Peijia Yu Yuanyuan Xiao Qi Wang 《The Crop Journal》 2025年第6期1939-1954,共16页
Staple crops are the cornerstone of the food supply but are frequently threatened by plant diseases.Effective disease management,including disease identification and severity assessment,helps to better address these c... Staple crops are the cornerstone of the food supply but are frequently threatened by plant diseases.Effective disease management,including disease identification and severity assessment,helps to better address these challenges.Currently,methods for disease severity assessment typically rely on calculating the area proportion of disease segmentation regions or using classification networks for severity assessment.However,these methods require large amounts of labeled data and fail to quantify lesion proportions when using classification networks,leading to inaccurate evaluations.To address these issues,we propose an automated framework for disease severity assessment that combines multi-task learning and knowledge-driven large-model segmentation techniques.This framework includes an image information processor,a lesion and leaf segmentation module,and a disease severity assessment module.First,the image information processor utilizes a multi-task learning strategy to analyze input images comprehensively,ensuring a deep understanding of disease characteristics.Second,the lesion and leaf segmentation module employ prompt-driven large-model technology to accurately segment diseased areas and entire leaves,providing detailed visual analysis.Finally,the disease severity assessment module objectively evaluates the severity of the disease based on professional grading standards by calculating lesion area proportions.Additionally,we have developed a comprehensive database of diseased leaf images from major crops,including several task-specific datasets.Experimental results demonstrate that our framework can accurately identify and assess the types and severity of crop diseases,even without extensive labeled data.Codes and data are available at http://dkp-ads.samlab.cn/. 展开更多
关键词 Domain knowledge Prompt-driven multi-task learning Staple crop Assessment of disease severity
在线阅读 下载PDF
AS-SOMTF:A novel multi-task learning model for water level prediction by satellite remoting
20
作者 Xin Su Zijian Qin +3 位作者 Weikang Feng Ziyang Gong Christian Esposito Sokjoon Lee 《Digital Communications and Networks》 2025年第5期1554-1566,共13页
Satellite communication technology has emerged as a key solution to address the challenges of data transmission in remote areas.By overcoming the limitations of traditional terrestrial communication networks,it enable... Satellite communication technology has emerged as a key solution to address the challenges of data transmission in remote areas.By overcoming the limitations of traditional terrestrial communication networks,it enables long-distance data transmission anytime and anywhere,ensuring the timely and accurate delivery of water level data,which is particularly crucial for fishway water level monitoring.To enhance the effectiveness of fishway water level monitoring,this study proposes a multi-task learning model,AS-SOMTF,designed for real-time and comprehensive prediction.The model integrates auxiliary sequences with primary input sequences to capture complex relationships and dependencies,thereby improving representational capacity.In addition,a novel timeseries embedding algorithm,AS-SOM,is introduced,which combines generative inference and pooling operations to optimize prediction efficiency for long sequences.This innovation not only ensures the timely transmission of water level data but also enhances the accuracy of real-time monitoring.Compared with traditional models such as Transformer and Long Short-Term Memory(LSTM)networks,the proposed model achieves improvements of 3.8%and 1.4%in prediction accuracy,respectively.These advancements provide more precise technical support for water level forecasting and resource management in the Diqing Tibetan Autonomous Prefecture of the Lancang River,contributing to ecosystem protection and improved operational safety. 展开更多
关键词 Fish passages Water-level prediction Time series forecasting multi-task learning Hierarchical clustering Satellite communication
在线阅读 下载PDF
上一页 1 2 40 下一页 到第
使用帮助 返回顶部