Journal literature: 38,507 articles found
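1. Ultra-Short-Term Photovoltaic Power Generation Forecasting Based on a TCN-BiLSTM-Attention Model
Authors: 刘凯伦, 孙广玲, 陆小锋. 《工业控制计算机》, 2026, No. 1, pp. 122-124 (3 pages)
As photovoltaic (PV) generation takes a growing share of the global energy system, ultra-short-term forecasting of PV output is critical to power system dispatch and safe operation. PV output, however, depends on many factors and is markedly random and volatile. This paper therefore proposes an ultra-short-term PV generation forecasting method based on a TCN-BiLSTM-Attention model. First, Pearson correlation analysis selects the key features, the isolation forest algorithm detects outliers, and linear interpolation together with standardization completes the preprocessing. A Temporal Convolutional Network (TCN) then extracts temporal features, a Bidirectional Long Short-Term Memory (BiLSTM) network captures forward and backward temporal dependencies, and an attention mechanism at the output focuses on the features of key time steps. Finally, comparative experiments on the Desert Knowledge Australia Solar Centre (DKASC) dataset show that the proposed TCN-BiLSTM-Attention model has some advantage over conventional LSTM and BiLSTM models in prediction accuracy and stability.
Keywords: TCN; BiLSTM; attention; ultra-short-term power generation forecasting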
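The preprocessing chain named in the abstract of item 1 (Pearson screening, gap filling by linear interpolation, standardization) can be illustrated with a minimal standard-library sketch. It is not the authors' code: the toy data, the 0.5 correlation threshold, and the function names are assumptions, and the isolation forest step is represented only by a value that has already been flagged and set to `None`.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def interpolate_gaps(series):
    """Fill interior None values by linear interpolation between known neighbors."""
    out = list(series)
    known = [i for i, v in enumerate(out) if v is not None]
    for left, right in zip(known, known[1:]):
        step = (out[right] - out[left]) / (right - left)
        for i in range(left + 1, right):
            out[i] = out[left] + step * (i - left)
    return out

def standardize(series):
    """Scale to zero mean and unit (population) standard deviation."""
    n = len(series)
    mean = sum(series) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in series) / n)
    return [(v - mean) / std for v in series]

# Hypothetical toy data: one irradiance reading was flagged as an outlier
# (for example by an isolation forest) and replaced with None.
irradiance = [100.0, 200.0, None, 400.0, 500.0]
power = [1.1, 2.0, 2.9, 4.2, 5.0]

filled = interpolate_gaps(irradiance)
r = pearson(filled, power)
keep_feature = abs(r) >= 0.5  # the threshold is an assumption, not from the paper
scaled = standardize(filled)
```

Interpolating before computing the correlation keeps the two series aligned; the order of these steps in the paper itself is not stated in the abstract.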
2. SwinHCAD: A Robust Multi-Modality Segmentation Model for Brain Tumors Using Transformer and Channel-Wise Attention
Authors: Seyong Jin, Muhammad Fayaz, L. Minh Dang, Hyoung-Kyu Song, Hyeonjoon Moon. 《Computers, Materials & Continua》, 2026, No. 1, pp. 511-533 (23 pages)
Brain tumors require precise segmentation for diagnosis and treatment planning because of their complex morphology and heterogeneous characteristics. While MRI-based automatic brain tumor segmentation reduces the burden on medical staff and provides quantitative information, existing methods and recent models still struggle to capture and classify the fine boundaries and diverse morphologies of tumors accurately. To address these challenges, this research introduces a novel SwinUNETR-based model that integrates a new decoder block, the Hierarchical Channel-wise Attention Decoder (HCAD), with a SwinUNETR encoder. The HCAD decoder block uses hierarchical features and channel-specific attention mechanisms to fuse information at different scales transmitted from the encoder and to preserve spatial details throughout the reconstruction phase. Evaluations on the recent BraTS GLI datasets show that the proposed SwinHCAD model achieves better segmentation accuracy than baseline models on both the Dice score and HD95 metrics across all tumor subregions (WT, TC, and ET). Ablation studies verify the effectiveness of the proposed HCAD decoder block and clarify the rationale and contribution of the model design. The results are expected to improve the efficiency of clinical diagnosis and treatment planning by increasing the precision of automated brain tumor segmentation.
Keywords: attention mechanism; brain tumor segmentation; channel-wise attention decoder; deep learning; medical imaging; MRI; Transformer; U-Net
3. GFL-SAR: Graph Federated Collaborative Learning Framework Based on Structural Amplification and Attention Refinement
Authors: Hefei Wang, Ruichun Gu, Jingyu Wang, Xiaolin Zhang, Hui Wei. 《Computers, Materials & Continua》, 2026, No. 1, pp. 1683-1702 (20 pages)
Graph Federated Learning (GFL) has shown great potential in privacy protection and distributed intelligence through distributed collaborative training on graph-structured data without sharing raw information. However, existing GFL approaches often lack comprehensive feature extraction and adaptive optimization, particularly in non-independent and identically distributed (non-IID) scenarios, where balancing global structural understanding and local node-level detail remains a challenge. To this end, this paper proposes GFL-SAR (Graph Federated Collaborative Learning Framework Based on Structural Amplification and Attention Refinement), which strengthens representation learning on graph data through a dual-branch collaborative design. Specifically, the Structural Insight Amplifier (SIA) uses an improved Graph Convolutional Network (GCN) to strengthen structural awareness and improve the modeling of topological patterns. In parallel, the Attentive Relational Refiner (ARR) employs an enhanced Graph Attention Network (GAT) for fine-grained modeling of node relationships and neighborhood features, which improves the expressiveness of local interactions and preserves critical contextual information. GFL-SAR integrates multi-scale features from both branches via feature fusion and federated optimization, addressing the limitations of existing GFL methods in structural modeling and feature representation. Experiments on standard benchmark datasets including Cora, Citeseer, Polblogs, and Cora_ML show that GFL-SAR outperforms existing methods in classification accuracy, convergence speed, and robustness, confirming its effectiveness and generalizability in GFL tasks.
Keywords: graph federated learning; GCN; GNNs; attention mechanism
4. DAUNet: Unsupervised Neural Network Based on Dual Attention for Clock Synchronization in Multi-Agent Wireless Ad Hoc Networks
Authors: Haihao He, Xianzhou Dong, Shuangshuang Wang, Chengzhang Zhu, Xiaotong Zhao. 《Computers, Materials & Continua》, 2026, No. 1, pp. 847-869 (23 pages)
Clock synchronization has important applications in multi-agent collaboration (such as drone light shows, intelligent transportation systems, and game AI), group decision-making, and emergency rescue operations. Synchronization based on pulse-coupled oscillators (PCOs) is an effective solution for clock synchronization in wireless networks. However, existing clock synchronization algorithms for multi-agent ad hoc networks struggle to meet the precision and stability that group cooperation requires. This paper therefore constructs a network model, DAUNet (unsupervised neural network based on dual attention), to improve clock synchronization accuracy in multi-agent wireless ad hoc networks. Specifically, an unsupervised distributed neural network framework, built on classical PCO-based synchronization methods, serves as the backbone. This framework resolves issues such as prolonged exchange of time synchronization messages between nodes, difficulties in centralized node coordination, and challenges in distributed training. A dual-attention mechanism forms the core module of DAUNet. By integrating a Multi-Head Attention module and a Gated Attention module, the model significantly improves information extraction while reducing computational complexity, mitigating synchronization inaccuracy and instability in multi-agent ad hoc networks. To evaluate the model, comparative experiments and ablation studies were conducted against classical methods and existing deep learning models. The results show that, compared with deep learning networks based on DASA and LSTM, DAUNet reduces the mean normalized phase difference (NPD) by 1 to 2 orders of magnitude. Compared with attention models based on additive attention and self-attention, DAUNet improves performance by more than a factor of ten. This study demonstrates DAUNet's potential for advancing multi-agent ad hoc networking technologies.
Keywords: clock synchronization; deep learning; dual attention mechanism; pulse-coupled oscillator
5. Syntax-Aware Hierarchical Attention Networks for Code Vulnerability Detection
Authors: Yongbo Jiang, Shengnan Huang, Tao Feng, Baofeng Duan. 《Computers, Materials & Continua》, 2026, No. 1, pp. 2252-2273 (22 pages)
Modern software development is marked by growing complexity and compressed development cycles. In this setting, traditional static vulnerability detection methods suffer from high false positive rates and miss complex logic flaws because they rely too heavily on rule templates. This paper proposes the Syntax-Aware Hierarchical Attention Network (SAHAN) model, which achieves high-precision vulnerability detection through grammar-rule-driven multi-granularity code slicing and hierarchical semantic fusion. SAHAN first generates Syntax Independent Units (SIUs) by slicing the code according to its Abstract Syntax Tree (AST) and predefined grammar rules, retaining vulnerability-sensitive context. Then, through a hierarchical attention mechanism, the local syntax-aware layer encodes fine-grained patterns within SIUs, while the global semantic correlation layer captures vulnerability chains across SIUs, modeling syntax and semantics jointly. Experiments show that on benchmark datasets such as QEMU, SAHAN improves detection performance by 4.8% to 13.1% on average compared with baseline models such as Devign and VulDeePecker.
Keywords: vulnerability detection; abstract syntax tree; syntax rule slicing; hierarchical attention mechanism; deep learning
6. Interactive Dynamic Graph Convolution with Temporal Attention for Traffic Flow Forecasting
Authors: Zitong Zhao, Zixuan Zhang, Zhenxing Niu. 《Computers, Materials & Continua》, 2026, No. 1, pp. 1049-1064 (16 pages)
Reliable traffic flow prediction is crucial for mitigating urban congestion. This paper proposes the Attention-based spatiotemporal Interactive Dynamic Graph Convolutional Network (AIDGCN), a novel architecture that integrates an Interactive Dynamic Graph Convolution Network (IDGCN) with Temporal Multi-Head Trend-Aware Attention. Its core innovations are IDGCN, which splits sequences into symmetric intervals for interactive feature sharing via dynamic graphs, and a novel attention mechanism that incorporates convolutional operations to capture essential local traffic trends, addressing a critical gap in standard attention for continuous data. For 15- and 60-min forecasting on METR-LA, AIDGCN achieves MAEs of 0.75% and 0.39%, and RMSEs of 1.32% and 0.14%, respectively. In 60-min long-term forecasting on the PEMS-BAY dataset, AIDGCN outperforms the MRA-BGCN method by 6.28%, 4.93%, and 7.17% in terms of MAE, RMSE, and MAPE, respectively. Experimental results demonstrate the superiority of the proposed model over state-of-the-art methods.
Keywords: traffic flow prediction; interactive dynamic graph convolution; graph convolution; temporal multi-head trend-aware attention; self-attention mechanism
7. Research on multi-view collaborative detection system for UAV swarms based on Pix2Pix framework and BAM attention mechanism
Authors: Yan Ding, Qingxin Cao, Bozhi Zhang, Peilin Li, Zhongjiao Shi. 《Defence Technology(防务技术)》, 2025, No. 4, pp. 213-226 (14 pages)
Drone swarm systems, equipped with photoelectric imaging and intelligent target perception, are essential for reconnaissance and strike missions in complex, high-risk environments. They excel in information sharing, anti-jamming capability, and combat performance, making them critical for future warfare. However, the varied perspectives in collaborative combat scenarios challenge object detection, hindering traditional detection algorithms and reducing accuracy. Limited angle-prior data and sparse samples complicate detection further. This paper presents the Multi-View Collaborative Detection System, which tackles multi-view object detection in collaborative combat scenarios. The system improves multi-view image generation and detection algorithms, raising the accuracy and efficiency of object detection across varying perspectives. First, an observation model for three-dimensional targets is constructed through line-of-sight angle transformation, and a multi-view image generation algorithm based on the Pix2Pix network is designed. For object detection, YOLOX is used, and a deep feature extraction network, BA-RepCSPDarknet, is developed to handle small target scale and difficult feature extraction. In addition, a feature fusion network, NS-PAFPN, is developed to mitigate the loss of deep feature map information in UAV images. A visual attention module (BAM) manages appearance differences under varying angles, while a feature mapping module (DFM) prevents the loss of fine-grained features. These advances lead to BA-YOLOX, a multi-view object detection network model suited to drone platforms that improves accuracy and detects small objects effectively.
Keywords: drone swarm systems; reconnaissance and strike; image generation; multi-view detection; Pix2Pix framework; attention mechanism
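8. Bearing Fault Diagnosis Based on an MSCNN+Attention Model
Authors: 付志鹏, 么洪飞. 《齐齐哈尔大学学报(自然科学版)》, 2026, No. 1, pp. 9-16, 43 (9 pages)
To address the weak feature extraction and low diagnostic accuracy of traditional fault diagnosis methods, this paper proposes a bearing fault diagnosis model that combines channel attention with self-attention. The model extracts key features through multiple convolutional layers and attention, fuses global features with a self-attention module, adds a residual structure to strengthen feature representation, and identifies faults with a Softmax classifier. The choice of window length and optimizer is validated on the Case Western Reserve University bearing data; the model performs best with a window length of 1024 and the Adam optimizer (learning rate 0.001). Model performance is evaluated with accuracy, ROC curves, and confusion matrices. The fault recognition accuracy reaches 99.4% to 100%, clearly higher than the RF (96.8%), GRU (97.5%), and LSTM (92.3%) models; the accuracy gain is largest at a window length of 1024, and every AUC exceeds 0.99. Overall, the model's feature extraction ability and diagnostic accuracy improve markedly over traditional models.
Keywords: attention mechanism; rolling bearing; feature extraction; convolutional neural network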
9. Multi-View Seizure Classification Based on Attention-Based Adaptive Graph ProbSparse Hybrid Network
Authors: Changxu Dong, Yanqing Liu, Dengdi Sun. 《CAAI Transactions on Intelligence Technology》, 2025, No. 6, pp. 1783-1798 (16 pages)
Epilepsy is a neurological disorder characterised by recurrent seizures due to abnormal neuronal discharges. Seizure detection from EEG signals has progressed, but two main challenges remain. First, EEG data can be distorted by physiological factors and external variables, resulting in noisy brain networks; current mainstream methods typically use static adjacency matrices, which neglect the need for dynamic updates and feature refinement. Second, current methods rely strongly on long-range dependencies through self-attention, which can introduce redundant noise and increase computational complexity, especially for long-duration data. To address these challenges, the Attention-based Adaptive Graph ProbSparse Hybrid Network (AA-GPHN) is proposed. Brain network structures are dynamically optimised using variational inference and the information bottleneck principle, refining the adjacency matrix for improved epilepsy classification. A Linear Graph Convolutional Network (LGCN) is incorporated to focus on first-order neighbours, minimising the aggregation of distant information. Furthermore, a ProbSparse attention-based Informer (PAT) is introduced to filter long-range dependencies adaptively, improving efficiency. A joint optimisation loss function is applied to improve robustness in noisy environments. Experimental results on both patient-specific and cross-subject datasets show that AA-GPHN outperforms existing methods in seizure detection, with better effectiveness and generalisation.
Keywords: bioinformatics; deep learning; dynamically; EEG; electroencephalography; ProbSparse attention; seizure classification
10. MVLA-Net: A Multi-View Lesion Attention Network for Advanced Diagnosis and Grading of Diabetic Retinopathy
Authors: Tariq Mahmood, Tanzila Saba, Faten S. Alamri, Alishba Tahir, Noor Ayesha. 《Computers, Materials & Continua》, 2025, No. 4, pp. 1173-1193 (21 pages)
Advances in learning algorithms have made retinal vessel segmentation and automatic grading techniques crucial for the clinical diagnosis and prevention of diabetic retinopathy. Traditional methods struggle with accuracy and reliability because of multi-scale variations in retinal blood vessels and the complex pathological relationships in fundus images associated with diabetic retinopathy. While single-modal diabetic retinopathy grading networks address class imbalance and lesion representation in fundus image data, dual-modal grading methods offer superior performance. However, the scarcity of dual-modal data and the lack of effective feature fusion methods limit their potential in the presence of multi-scale variations. This paper addresses these issues by focusing on multi-scale retinal vessel segmentation, dual feature fusion, data augmentation, and attention-based grading. The proposed model aims to improve segmentation of retinal images with varying vessel thicknesses. It employs a dual-branch parallel architecture that integrates a transformer encoder with a convolutional neural network encoder to extract local and global information for synergistic saliency learning. In addition, the model uses residual structures and attention modules to extract critical lesions, improving the accuracy and reliability of diabetic retinopathy grading. To evaluate the approach, this study compared it with publicly available pre-trained models (ResNet152V2, ConvNeXt, EfficientNet, DenseNet, and Swin Transformer) under the same development parameters. All of these models achieved approximately 85% accuracy with the same image preparation method, whereas the proposed approach achieves accuracies of 99.17%, 99.04%, and 99.24% on the Kaggle APTOS19, IDRiD, and EyePACS datasets, respectively. These results support the model's utility in helping ophthalmologists diagnose diabetic retinopathy more rapidly and accurately.
Keywords: diabetic retinopathy grading; retinal vessel segmentation; dual-modal; deep learning; attention mechanism; health risks
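11. Solar Greenhouse Environment Prediction Model Based on SSA-LSTM-Attention (Cited by: 3)
Authors: 孟繁佳, 许瑞峰, 赵维娟, 宋文臻, 高艺璇, 李莉. 《农业工程学报》 (PKU Core), 2025, No. 11, pp. 256-263 (8 pages)
An accurate greenhouse environment prediction model helps regulate the greenhouse environment precisely and promotes crop growth and development. Because the greenhouse microclimate is time-dependent, nonlinear, and strongly coupled, this study proposes a solar greenhouse environment prediction model based on SSA-LSTM-Attention (sparrow search algorithm, long short-term memory, attention mechanism). First, indoor and outdoor environmental data are collected through a greenhouse Internet of Things data acquisition system. Second, Pearson correlation analysis selects the strongly correlated factors. Finally, a time-series matrix of environmental features is built and fed to the model for prediction. For four environmental factors of the solar greenhouse (indoor temperature, indoor humidity, light intensity, and soil moisture), the SSA-LSTM-Attention model reaches an average fitting index of 97.9%, which is 8.1, 4.1, 3.5, and 3.0 percentage points higher than the back propagation neural network (BP), gated recurrent unit (GRU), long short-term memory (LSTM), and LSTM-Attention models, respectively. Its mean absolute percentage error is 2.6%, which is 6.5, 3.2, 2.8, and 2.5 percentage points lower, respectively. The results show that using SSA to optimize the hyperparameters of the LSTM-Attention model automatically improves prediction accuracy and provides effective data support for anticipatory regulation of the solar greenhouse environment.
Keywords: solar greenhouse; sparrow search algorithm; long short-term memory network; attention mechanism; environment prediction model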
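12. Short-Term Power Load Forecasting Based on VMD-TCN-BiLSTM-Attention (Cited by: 1)
Authors: 刘义艳, 李国良, 代杰. 《智慧电力》 (PKU Core), 2025, No. 10, pp. 87-94 (8 pages)
To address the limited prediction accuracy caused by the nonlinearity and volatility of short-term power load data, this paper proposes a new forecasting model that combines variational mode decomposition (VMD), a temporal convolutional network (TCN), a bidirectional long short-term memory network (BiLSTM), and an attention mechanism. First, VMD decomposes the load data into several modal components of different frequencies, and the TCN extracts temporal features from those components. Second, the BiLSTM network further mines sequence dependencies. Finally, an attention mechanism weights the features output by the BiLSTM. Experimental results show that the proposed model is significantly more accurate than other conventional models and has high application value in short-term power load forecasting.
Keywords: short-term power load; variational mode decomposition; temporal convolutional network; bidirectional long short-term memory network; attention mechanism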
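The last step in the abstract of item 12, attention weighting of the BiLSTM outputs, is commonly realized as a softmax over per-time-step scores followed by a weighted sum. The sketch below assumes dot-product scoring against a single vector; the abstract does not say which scoring function the authors use, and the hidden states and `query` vector are made-up values.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention_pool(hidden_states, query):
    """Weight each time step by softmax(dot(h_t, query)) and sum the states."""
    scores = [sum(h_i * q_i for h_i, q_i in zip(h, query)) for h in hidden_states]
    weights = softmax(scores)
    dim = len(hidden_states[0])
    context = [sum(w * h[d] for w, h in zip(weights, hidden_states))
               for d in range(dim)]
    return weights, context

# Hypothetical BiLSTM outputs for three time steps (hidden size 2).
hidden_states = [[0.1, 0.0], [0.9, 0.2], [0.4, 0.8]]
query = [1.0, 1.0]  # stands in for a learned parameter vector (assumption)

weights, context = attention_pool(hidden_states, query)
```

The weights sum to one, so `context` is a convex combination of the time-step features; the time step with the highest score contributes most.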
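13. Assessment and Early Warning of Systemic Risk in China's Insurance Industry: An Analysis Based on the Attention-LSTM Model (Cited by: 2)
Authors: 师荣蓉, 杨娅. 《财经理论与实践》 (PKU Core), 2025, No. 2, pp. 26-34 (9 pages)
Building on a theoretical analysis of the transmission and early-warning mechanisms of systemic risk in the insurance industry, this paper assesses that risk with the CoVaR method and builds an Attention-LSTM model, drawing on micro-level insurer data and the macroeconomic environment, for early-warning analysis. The findings are as follows. When a major event strikes, the risk spillover from systemically important insurers to the insurance industry increases. Including a financial stress index in the early-warning system reduces the mean absolute error, root mean square error, and mean absolute percentage error of the forecasts by 8.59%, 7.27%, and 4.55%, respectively. The Attention-LSTM model captures the correlation and contagion among risks and outperforms traditional machine learning models in prediction accuracy, generalization, and temporal stability. Accordingly, a zoned risk management system should be established for the insurance industry, and deep learning models should be integrated to build a multidimensional early-warning mechanism for systemic risk.
Keywords: systemic risk in the insurance industry; assessment; early warning; Attention-LSTM model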
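14. Classification of Bass (鲈鱼) Feeding Intensity Based on Audio-Visual Information Fusion and a Self-Attention-DSC-CNN6 Network (Cited by: 4)
Authors: 李道亮, 李万超, 杜壮壮. 《农业机械学报》 (PKU Core), 2025, No. 1, pp. 16-24 (9 pages)
Recognizing and classifying feeding intensity is a key step toward precision feeding in aquaculture. Current feeding practice relies heavily on manual judgment, doses feed imprecisely, and wastes much of it. Classifying fish feeding intensity through multimodal fusion can combine different types of data (such as video, sound, and water quality parameters) and give a more complete and precise basis for feeding decisions. This paper therefore proposes a multimodal framework that fuses video and audio data to improve the classification of bass feeding intensity. Preprocessed Mel spectrograms and video frames are fed separately into the optimized Self-Attention-DSC-CNN6 (self-attention, depthwise separable convolution, CNN6) model for high-level feature extraction; the extracted features are concatenated, and a classifier labels the fused features. The optimized model improves on the CNN6 algorithm by replacing the conventional convolutional layers with depthwise separable convolution (DSC) to reduce computational complexity and by adding a self-attention mechanism to strengthen feature extraction. In the experiments, the proposed framework classifies bass feeding intensity with 90.24% accuracy. The model makes effective use of information from the different data sources, improves the understanding of fish behavior in complex environments, strengthens decision-making, and keeps the feeding strategy timely and accurate, which in turn reduces feed waste.
Keywords: bass; feeding intensity classification; multimodal fusion; Self-attention-DSC-CNN6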
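The abstract of item 14 credits depthwise separable convolution (DSC) with lower computational cost. A quick parameter count shows why. This is a generic illustration with assumed layer sizes (3x3 kernel, 64 input and 128 output channels), not the dimensions of CNN6, and it ignores bias terms.

```python
def conv_params(k, c_in, c_out):
    """Weights in a standard k x k convolution (bias ignored)."""
    return k * k * c_in * c_out

def dsc_params(k, c_in, c_out):
    """Weights in a depthwise separable convolution (bias ignored)."""
    depthwise = k * k * c_in   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 convolution that mixes channels
    return depthwise + pointwise

# Assumed layer sizes, chosen only for illustration.
standard = conv_params(3, 64, 128)
separable = dsc_params(3, 64, 128)
ratio = separable / standard   # equals 1/c_out + 1/k**2
```

For these sizes the separable layer needs 8,768 weights against 73,728 for the standard layer, roughly 12% of the original.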
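15. Discourse Element Identification in Argumentative Essays Based on BiLSTM-Attention (Cited by: 1)
Authors: 刘佳旭, 白再冉, 张艳菊. 《计算机系统应用》, 2025, No. 5, pp. 202-211 (10 pages)
The main task of discourse element identification is to identify discourse element units and classify them. To address the insufficient modeling of contextual dependencies in this task, this paper proposes a BiLSTM-Attention-based model that improves the accuracy of discourse element identification in argumentative essays. The model uses sentence structure and positional encoding to identify the relations among sentence components and a bidirectional long short-term memory (BiLSTM) network to obtain deeper context-related information. An attention mechanism refines the model's feature vectors and improves classification accuracy. Finally, inter-sentence multi-head self-attention captures the content and structural relations between sentences, which compensates for the weak dependency modeling between distant sentences. Compared with baselines such as HBiLSTM and BERT, under the same parameters and experimental conditions, accuracy improves by 1.3% on the Chinese dataset and 3.6% on the English dataset, which confirms the model's effectiveness for discourse element identification.
Keywords: bidirectional long short-term memory network; attention mechanism; positional encoding; discourse element identification; multi-head attention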
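16. Encrypted Traffic Classification Method Based on Attention-1DCNN-CE
Authors: 耿海军, 董赟, 胡治国, 池浩田, 杨静, 尹霞. 《计算机应用》 (PKU Core), 2025, No. 3, pp. 872-882 (11 pages)
To address the low multi-class accuracy, weak generalization, and privacy risks of traditional encrypted traffic identification methods, this paper proposes Attention-1DCNN-CE, a multi-class deep learning model that combines an attention mechanism (Attention) with a one-dimensional convolutional neural network (1DCNN). The model has three core parts: 1) in dataset preprocessing, the spatial relations among packets in the original data flow are preserved, and a cost-sensitive matrix is built from the sample distribution; 2) after preliminary extraction of encrypted traffic features, the Attention and 1DCNN models mine and compress the global and local features of the traffic; 3) to handle data imbalance, the cost-sensitive matrix is combined with the cross-entropy (CE) loss function, which markedly improves classification accuracy on minority classes and in turn the model's overall performance. Experimental results show that the model's overall identification accuracy exceeds 97% on the BOT-IOT and TON-IOT datasets. It also performs well on the public ISCX-VPN and USTC-TFC datasets, coming close to ET-BERT (Encrypted Traffic BERT) without pre-training. Compared with PERT (Payload Encoding Representation from Transformer), its F1 score for application-type detection on ISCX-VPN is 29.9 percentage points higher. These results confirm the model's effectiveness and offer a solution for encrypted traffic identification and malicious traffic detection.
Keywords: network security; encrypted traffic; attention mechanism; one-dimensional convolutional neural network; data imbalance; cost-sensitive matrix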
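Part 3 of the abstract of item 16 combines a cost-sensitive matrix with cross-entropy to counter class imbalance. The sketch below shows one simple way this idea can work. It is an assumption-laden simplification: the full cost matrix is reduced to one cost per class, the costs are inverse class frequencies, and the labels and probabilities are toy values. How the authors actually build their matrix is not given in the abstract.

```python
import math
from collections import Counter

def class_costs(labels):
    """Inverse-frequency cost per class; the costs average to 1 over the samples."""
    counts = Counter(labels)
    n, k = len(labels), len(counts)
    return {c: n / (k * cnt) for c, cnt in counts.items()}

def weighted_cross_entropy(probs, labels, costs):
    """Mean of -cost[y] * log(p[y]) over all samples."""
    total = 0.0
    for p, y in zip(probs, labels):
        total += -costs[y] * math.log(p[y])
    return total / len(labels)

# Toy imbalanced labels: class 1 is the minority (hypothetical data).
labels = [0, 0, 0, 1]
costs = class_costs(labels)

# Hypothetical model output: equal probability for both classes on every sample.
probs = [[0.5, 0.5]] * 4
loss = weighted_cross_entropy(probs, labels, costs)
```

With these labels the minority class receives a cost of 2.0 against about 0.67 for the majority class, so a mistake on a minority sample is penalized three times as heavily.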
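17. Forecasting Golden Week Tourist Flow with a CNN-LSTM-Attention Combined Model: A Case Study of Dali Prefecture (Cited by: 2)
Authors: 戢晓峰, 郭雅诗, 陈方, 黄志文, 李武. 《干旱区资源与环境》 (PKU Core), 2025, No. 3, pp. 200-208 (9 pages)
Forecasting tourist flow during Golden Week holidays is a major practical need in regional tourism management and can supply more precise data support for organizing Golden Week tourism. Using Baidu Migration data and Baidu Search Index data, and taking the convolutional neural network (CNN), long short-term memory network (LSTM), and attention mechanism (Attention) as building blocks, this paper builds a CNN-LSTM-Attention combined model, forecasts the daily number of Golden Week tourists in Dali Prefecture, and analyzes the influencing factors with the SHAP algorithm. The results show: 1) the combined model predicts more accurately than the RF, SVM, CNN, LSTM, and CNN-LSTM models; 2) after the Baidu Search Index features are introduced, the model achieves its best root mean square error (RMSE), mean absolute percentage error (MAPE), and coefficient of determination (R²), indicating that the search index improves prediction accuracy to some extent. The model offers a new approach to forecasting Golden Week tourist flow.
Keywords: tourist flow forecasting; Golden Week; convolutional neural network (CNN); long short-term memory network (LSTM); attention mechanism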
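The three evaluation metrics named in the abstract of item 17 (RMSE, MAPE, and R²) have standard definitions, sketched below with the standard library only. The visitor counts are invented for illustration and are not taken from the paper.

```python
import math

def rmse(y_true, y_pred):
    """Root mean square error."""
    n = len(y_true)
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n)

def mape(y_true, y_pred):
    """Mean absolute percentage error, in percent (true values must be nonzero)."""
    n = len(y_true)
    return 100.0 * sum(abs((t - p) / t) for t, p in zip(y_true, y_pred)) / n

def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

# Hypothetical daily visitor counts (thousands) and forecasts.
actual = [100.0, 200.0, 300.0, 400.0]
forecast = [110.0, 190.0, 310.0, 390.0]

scores = {
    "rmse": rmse(actual, forecast),
    "mape": mape(actual, forecast),
    "r2": r2(actual, forecast),
}
```

Lower RMSE and MAPE and a higher R² indicate a better fit, which is the sense in which the abstract calls the metrics "best" after the search index is added.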
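18. Financial Risk Prediction Model for Small and Medium-Sized Enterprises Based on Attention LSTM
Authors: 张文闻. 《中国市场》, 2025, No. 27, pp. 147-150 (4 pages)
This article proposes a financial risk prediction model for small and medium-sized enterprises (SMEs) based on Attention LSTM. The model combines a long short-term memory network (LSTM) with an attention mechanism (Attention) to interpret financial time-series data and to assess how much the data from each period matters for risk prediction. The empirical study shows that, for key risk factors such as solvency, operating stability, and profitability, the model is more accurate than traditional prediction methods. The model therefore gives SMEs an effective financial risk prediction tool that helps them detect and respond to potential financial risks in time and supports future decision-making.
Keywords: small and medium-sized enterprises; financial risk prediction; attention LSTM; model prediction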
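19. Ship Trajectory Prediction Model Based on GRU-Attention-BiLSTM
Authors: 袁志涛, 李泽伟, 刘克中, 陈默子, 袁航. 《中国航海》 (PKU Core), 2025, No. 4, pp. 132-140 (9 pages)
To address the low accuracy of ship trajectory prediction in complex navigable waters, this paper proposes a ship trajectory prediction model based on GRU-Attention-BiLSTM. The encoder uses a gated recurrent unit (GRU) to capture temporal features in the trajectory sequence; the decoder uses a bidirectional long short-term memory network (BiLSTM) with an attention mechanism that adjusts the weights of the data features. The base input features are the ship's historical longitude, latitude, speed, and course, and the ship density of the water area, smoothed by median filtering, is added as an extra feature. AIS data from March 2024 for the core port area of Ningbo-Zhoushan Port are used for training and validation, and the model is compared quantitatively and qualitatively with the GRU, LSTM, Seq2Seq-LSTM, Attention-BiLSTM, and Transformer models. The results show that the proposed model predicts better across different prediction horizons and navigation scenarios.
Keywords: complex navigable waters; ship trajectory prediction; attention mechanism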
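20. SOH Estimation Based on Battery Aging Trend Reconstruction and a TCN-GRU-Attention Network (Cited by: 1)
Authors: 李士哲, 张天宇, 谢家乐. 《电力科学与工程》, 2025, No. 3, pp. 38-45 (8 pages)
Noise interference makes it hard to extract key features during lithium battery aging. To address this, peak features that reflect the battery's aging pattern are first extracted from the incremental capacity curve, capturing key information on how battery performance changes over time. The features are then jointly denoised with improved complete ensemble empirical mode decomposition with adaptive noise and wavelet threshold denoising, which reconstructs a more accurate feature sequence. Finally, the feature sequence is fed to a temporal convolutional network to extract sequence features, a gated recurrent unit captures long-term dependencies, and a multi-head attention mechanism further strengthens the model's sensitivity to key features. Experimental results show that the method effectively improves the accuracy of lithium battery state-of-health estimation, with a root mean square error below 1.5% and a mean absolute error below 1%.
Keywords: lithium battery; battery state of health; complete ensemble empirical mode decomposition with adaptive noise; wavelet threshold denoising; temporal convolutional network; gated recurrent unit; multi-head attention mechanism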