期刊文献+
共找到11篇文章
< 1 >
每页显示 20 50 100
Secure Medical Image Retrieval Based on Multi-Attention Mechanism and Triplet Deep Hashing
1
作者 Shaozheng Zhang Qiuyu Zhang +1 位作者 Jiahui Tang Ruihua Xu 《Computers, Materials & Continua》 2025年第2期2137-2158,共22页
Medical institutions frequently utilize cloud servers for storing digital medical imaging data, aiming to lower both storage expenses and computational expenses. Nevertheless, the reliability of cloud servers as third... Medical institutions frequently utilize cloud servers for storing digital medical imaging data, aiming to lower both storage expenses and computational expenses. Nevertheless, the reliability of cloud servers as third-party providers is not always guaranteed. To safeguard against the exposure and misuse of personal privacy information, and achieve secure and efficient retrieval, a secure medical image retrieval based on a multi-attention mechanism and triplet deep hashing is proposed in this paper (abbreviated as MATDH). Specifically, this method first utilizes the contrast-limited adaptive histogram equalization method applicable to color images to enhance chest X-ray images. Next, a designed multi-attention mechanism focuses on important local features during the feature extraction stage. Moreover, a triplet loss function is utilized to learn discriminative hash codes to construct a compact and efficient triplet deep hashing. Finally, upsampling is used to restore the original resolution of the images during retrieval, thereby enabling more accurate matching. To ensure the security of medical image data, a lightweight image encryption method based on frequency domain encryption is designed to encrypt the chest X-ray images. The findings of the experiment indicate that, in comparison to various advanced image retrieval techniques, the suggested approach improves the precision of feature extraction and retrieval using the COVIDx dataset. Additionally, it offers enhanced protection for the confidentiality of medical images stored in cloud settings and demonstrates strong practicality. 展开更多
关键词 Secure medical image retrieval multi-attention mechanism triplet deep hashing image enhancement lightweight image encryption
在线阅读 下载PDF
MU-GAN:Facial Attribute Editing Based on Multi-Attention Mechanism 被引量:6
2
作者 Ke Zhang Yukun Su +2 位作者 Xiwang Guo Liang Qi Zhenbing Zhao 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2021年第9期1614-1626,共13页
Facial attribute editing has mainly two objectives:1)translating image from a source domain to a target one,and 2)only changing the facial regions related to a target attribute and preserving the attribute-excluding d... Facial attribute editing has mainly two objectives:1)translating image from a source domain to a target one,and 2)only changing the facial regions related to a target attribute and preserving the attribute-excluding details.In this work,we propose a multi-attention U-Net-based generative adversarial network(MU-GAN).First,we replace a classic convolutional encoder-decoder with a symmetric U-Net-like structure in a generator,and then apply an additive attention mechanism to build attention-based U-Net connections for adaptively transferring encoder representations to complement a decoder with attribute-excluding detail and enhance attribute editing ability.Second,a self-attention(SA)mechanism is incorporated into convolutional layers for modeling long-range and multi-level dependencies across image regions.Experimental results indicate that our method is capable of balancing attribute editing ability and details preservation ability,and can decouple the correlation among attributes.It outperforms the state-of-the-art methods in terms of attribute manipulation accuracy and image quality.Our code is available at https://github.com/SuSir1996/MU-GAN. 展开更多
关键词 Attention U-Net connection encoder-decoder archi-tecture facial attribute editing multi-attention mechanism
在线阅读 下载PDF
A multi-attention RNN-based relation linking approach for question answering over knowledge base 被引量:2
3
作者 Li Huiying Zhao Man Yu Wenqi 《Journal of Southeast University(English Edition)》 EI CAS 2020年第4期385-392,共8页
Aiming at the relation linking task for question answering over knowledge base,especially the multi relation linking task for complex questions,a relation linking approach based on the multi-attention recurrent neural... Aiming at the relation linking task for question answering over knowledge base,especially the multi relation linking task for complex questions,a relation linking approach based on the multi-attention recurrent neural network(RNN)model is proposed,which works for both simple and complex questions.First,the vector representations of questions are learned by the bidirectional long short-term memory(Bi-LSTM)model at the word and character levels,and named entities in questions are labeled by the conditional random field(CRF)model.Candidate entities are generated based on a dictionary,the disambiguation of candidate entities is realized based on predefined rules,and named entities mentioned in questions are linked to entities in knowledge base.Next,questions are classified into simple or complex questions by the machine learning method.Starting from the identified entities,for simple questions,one-hop relations are collected in the knowledge base as candidate relations;for complex questions,two-hop relations are collected as candidates.Finally,the multi-attention Bi-LSTM model is used to encode questions and candidate relations,compare their similarity,and return the candidate relation with the highest similarity as the result of relation linking.It is worth noting that the Bi-LSTM model with one attentions is adopted for simple questions,and the Bi-LSTM model with two attentions is adopted for complex questions.The experimental results show that,based on the effective entity linking method,the Bi-LSTM model with the attention mechanism improves the relation linking effectiveness of both simple and complex questions,which outperforms the existing relation linking methods based on graph algorithm or linguistics understanding. 展开更多
关键词 question answering over knowledge base(KBQA) entity linking relation linking multi-attention bidirectional long short-term memory(Bi-LSTM) large-scale complex question answering dataset(LC-QuAD)
在线阅读 下载PDF
Multi-attention fusion and weighted class representation for few-shot classification
4
作者 ZHAO Wencang QIN Wenqian LI Ming 《High Technology Letters》 EI CAS 2022年第3期295-306,共12页
The existing few-shot learning(FSL) approaches based on metric-learning usually lack attention to the distinction of feature contributions,and the importance of each sample is often ignored when obtaining the class re... The existing few-shot learning(FSL) approaches based on metric-learning usually lack attention to the distinction of feature contributions,and the importance of each sample is often ignored when obtaining the class representation,where the performance of the model is limited.Additionally,similarity metric method is also worthy of attention.Therefore,a few-shot learning approach called MWNet based on multi-attention fusion and weighted class representation(WCR) is proposed in this paper.Firstly,a multi-attention fusion module is introduced into the model to highlight the valuable part of the feature and reduce the interference of irrelevant content.Then,when obtaining the class representation,weight is given to each support set sample,and the weighted class representation is used to better express the class.Moreover,a mutual similarity metric method is used to obtain a more accurate similarity relationship through the mutual similarity for each representation.Experiments prove that the approach in this paper performs well in few-shot image classification,and also shows remarkable excellence and competitiveness compared with related advanced techniques. 展开更多
关键词 few-shot learning(FSL) image classification metric-learning multi-attention fusion
在线阅读 下载PDF
Image recognition for crop diseases using a novel multi-attention module
5
作者 Lei Chen Yuan Yuan Haiyan He 《International Journal of Agricultural and Biological Engineering》 2025年第1期238-244,共7页
Deep convolution neural networks constitute a breakthrough in computer vision.Based on this,the Convolutional Neural Network(CNN)models offer enormous potential for crop disease classification.However,significant trai... Deep convolution neural networks constitute a breakthrough in computer vision.Based on this,the Convolutional Neural Network(CNN)models offer enormous potential for crop disease classification.However,significant training data are required to realize their potential.In the case of crop disease image recognition,especially with complex backgrounds,it is sometimes difficult to acquire adequately labeled large datasets.This research proposed a solution to this problem that integrates multi-attention modules,i.e.,channel and position block(CPB)module.Given an intermediate feature map,the CPB module can infer attention maps in parallel with the channel and position.The attention maps can then be multiplied to form input feature maps for adaptive feature refinement.This provides a simple yet effective intermediate attention structure for CNNs.The module is also lightweight and produces little overhead.Some experiments on cucumber and rice image datasets with complex backgrounds were conducted to validate the effectiveness of the CPB module.The experiments included different module locations and class activation map display characteristics.The classification accuracy reached 96.67%on the cucumber disease image dataset and 95.29%on the rice disease image dataset.The results show that the CPB module can effectively classify crop disease images with complex backgrounds,even on small-scale datasets,which providing a reference for crop disease image recognition method under complex background conditions in the field. 展开更多
关键词 image recognition crop disease multi-attention module deep learning small sample
原文传递
M2ATNet: Multi-Scale Multi-Attention Denoising and Feature Fusion Transformer for Low-Light Image Enhancement
6
作者 Zhongliang Wei Jianlong An Chang Su 《Computers, Materials & Continua》 2026年第1期1819-1838,共20页
Images taken in dim environments frequently exhibit issues like insufficient brightness,noise,color shifts,and loss of detail.These problems pose significant challenges to dark image enhancement tasks.Current approach... Images taken in dim environments frequently exhibit issues like insufficient brightness,noise,color shifts,and loss of detail.These problems pose significant challenges to dark image enhancement tasks.Current approaches,while effective in global illumination modeling,often struggle to simultaneously suppress noise and preserve structural details,especially under heterogeneous lighting.Furthermore,misalignment between luminance and color channels introduces additional challenges to accurate enhancement.In response to the aforementioned difficulties,we introduce a single-stage framework,M2ATNet,using the multi-scale multi-attention and Transformer architecture.First,to address the problems of texture blurring and residual noise,we design a multi-scale multi-attention denoising module(MMAD),which is applied separately to the luminance and color channels to enhance the structural and texture modeling capabilities.Secondly,to solve the non-alignment problem of the luminance and color channels,we introduce the multi-channel feature fusion Transformer(CFFT)module,which effectively recovers the dark details and corrects the color shifts through cross-channel alignment and deep feature interaction.To guide the model to learn more stably and efficiently,we also fuse multiple types of loss functions to form a hybrid loss term.We extensively evaluate the proposed method on various standard datasets,including LOL-v1,LOL-v2,DICM,LIME,and NPE.Evaluation in terms of numerical metrics and visual quality demonstrate that M2ATNet consistently outperforms existing advanced approaches.Ablation studies further confirm the critical roles played by the MMAD and CFFT modules to detail preservation and visual fidelity under challenging illumination-deficient environments. 展开更多
关键词 Low-light image enhancement multi-scale multi-attention transformer
在线阅读 下载PDF
一种基于DWConvLSTM与局部敏感哈希注意力的视频摘要方案
7
作者 朱頔 滕晓宇 +4 位作者 王刚 许文丽 樊懿雯 何勇 张清 《信息技术与信息化》 2025年第9期64-68,共5页
针对现有视频摘要算法存在特征敏感性不足、特征提取不够细腻、算法复杂性高等问题,文章基于深度可分离卷积(DWConv)、卷积长短期记忆网络(ConvLSTM)、多头注意力(Multi-Attention)与局部敏感哈希(LSH),共同设计一种轻量级视频摘要方案(... 针对现有视频摘要算法存在特征敏感性不足、特征提取不够细腻、算法复杂性高等问题,文章基于深度可分离卷积(DWConv)、卷积长短期记忆网络(ConvLSTM)、多头注意力(Multi-Attention)与局部敏感哈希(LSH),共同设计一种轻量级视频摘要方案(DWCH-Attention)。在这一方案中,为降低算法复杂性,通过改进ConvLSTM并结合DWConv搭建DWConvLSTM,再将其与注意力机制结合以提取全局特征;为提升特征提取细腻程度及算法对特征的敏感程度,借助局部敏感哈希(LSH)与交叉注意力机制设计局部特征提取方法;为进一步强化特征,设计以查询为驱动的特征融合方法,实现全局与局部特征的融合。为验证方案的有效性与可行性,将方案在TVSum与SumMe两个数据集上开展实验验证,结果表明,该方案在交叉验证、消融实验及对比分析中均表现出较好的性能。 展开更多
关键词 DWConv ConvLSTM multi-attention 局部敏感哈希 视频摘要
在线阅读 下载PDF
A teacher-student based attention network for fine-grainedimage recognition
8
作者 Ang Li Xueyi Zhang +1 位作者 Peilin Li Bin Kang 《Digital Communications and Networks》 2025年第1期52-59,共8页
Fine-grained Image Recognition(FGIR)task is dedicated to distinguishing similar sub-categories that belong to the same super-category,such as bird species and car types.In order to highlight visual differences,existin... Fine-grained Image Recognition(FGIR)task is dedicated to distinguishing similar sub-categories that belong to the same super-category,such as bird species and car types.In order to highlight visual differences,existing FGIR works often follow two steps:discriminative sub-region localization and local feature representation.However,these works pay less attention on global context information.They neglect a fact that the subtle visual difference in challenging scenarios can be highlighted through exploiting the spatial relationship among different subregions from a global view point.Therefore,in this paper,we consider both global and local information for FGIR,and propose a collaborative teacher-student strategy to reinforce and unity the two types of information.Our framework is implemented mainly by convolutional neural network,referred to Teacher-Student Based Attention Convolutional Neural Network(T-S-ACNN).For fine-grained local information,we choose the classic Multi-Attention Network(MA-Net)as our baseline,and propose a type of boundary constraint to further reduce background noises in the local attention maps.In this way,the discriminative sub-regions tend to appear in the area occupied by fine-grained objects,leading to more accurate sub-region localization.For fine-grained global information,we design a graph convolution based Global Attention Network(GA-Net),which can combine extracted local attention maps from MA-Net with non-local techniques to explore spatial relationship among subregions.At last,we develop a collaborative teacher-student strategy to adaptively determine the attended roles and optimization modes,so as to enhance the cooperative reinforcement of MA-Net and GA-Net.Extensive experiments on CUB-200-2011,Stanford Cars and FGVC Aircraft datasets illustrate the promising performance of our framework. 展开更多
关键词 Fine-grained image recognition Collaborative teacher-student strategy multi-attention Global attention
在线阅读 下载PDF
Fault diagnosis of a marine power-generation diesel engine based on the Gramian angular field and a convolutional neural network 被引量:5
9
作者 Congyue LI Yihuai HU +1 位作者 Jiawei JIANG Dexin CUI 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2024年第6期470-482,共13页
Marine power-generation diesel engines operate in harsh environments.Their vibration signals are highly complex and the feature information exhibits a non-linear distribution.It is difficult to extract effective featu... Marine power-generation diesel engines operate in harsh environments.Their vibration signals are highly complex and the feature information exhibits a non-linear distribution.It is difficult to extract effective feature information from the network model,resulting in low fault-diagnosis accuracy.To address this problem,we propose a fault-diagnosis method that combines the Gramian angular field(GAF)with a convolutional neural network(CNN).Firstly,the vibration signals are transformed into 2D images by taking advantage of the GAF,which preserves the temporal correlation.The raw signals can be mapped to 2D image features such as texture and color.To integrate the feature information,the images of the Gramian angular summation field(GASF)and Gramian angular difference field(GADF)are fused by the weighted average fusion method.Secondly,the channel attention mechanism and temporal attention mechanism are introduced in the CNN model to optimize the CNN learning mechanism.Introducing the concept of residuals in the attention mechanism improves the feasibility of optimization.Finally,the weighted average fused images are fed into the CNN for feature extraction and fault diagnosis.The validity of the proposed method is verified by experiments with abnormal valve clearance.The average diagnostic accuracy is 98.40%.When−20 dB≤signal-to-noise ratio(SNR)≤20 dB,the diagnostic accuracy of the proposed method is higher than 94.00%.The proposed method has superior diagnostic performance.Moreover,it has a certain anti-noise capability and variable-load adaptive capability. 展开更多
关键词 multi-attention mechanisms(MAM) Convolutional neural network(CNN) Gramian angular field(GAF) Image fusion Marine power-generation diesel engine Fault diagnosis
原文传递
Relation Extraction Based on Dual Attention Mechanism
10
作者 Xue Li Yuan Rao +1 位作者 Long Sun Yi Lu 《国际计算机前沿大会会议论文集》 2019年第1期354-356,共3页
The traditional deep learning model has problems that the longdistance dependent information cannot be learned, and the correlation between the input and output of the model is not considered. And the information proc... The traditional deep learning model has problems that the longdistance dependent information cannot be learned, and the correlation between the input and output of the model is not considered. And the information processing on the sentence set is still insufficient. Aiming at the above problems, a relation extraction method combining bidirectional GRU network and multiattention mechanism is proposed. The word-level attention mechanism was used to extract the word-level features from the sentence, and the sentence-level attention mechanism was used to focus on the characteristics of sentence sets. The experimental verification in the NYT dataset was conducted. The experimental results show that the proposed method can effectively improve the F1 value of the relationship extraction. 展开更多
关键词 BIDIRECTIONAL GRU multi-attention RELATION extraction
在线阅读 下载PDF
Energy Optimization for Autonomous Mobile Robot Path Planning Based on Deep Reinforcement Learning
11
作者 Longfei Gao Weidong Wang Dieyun Ke 《Computers, Materials & Continua》 2026年第1期984-998,共15页
At present,energy consumption is one of the main bottlenecks in autonomous mobile robot development.To address the challenge of high energy consumption in path planning for autonomous mobile robots navigating unknown ... At present,energy consumption is one of the main bottlenecks in autonomous mobile robot development.To address the challenge of high energy consumption in path planning for autonomous mobile robots navigating unknown and complex environments,this paper proposes an Attention-Enhanced Dueling Deep Q-Network(ADDueling DQN),which integrates a multi-head attention mechanism and a prioritized experience replay strategy into a Dueling-DQN reinforcement learning framework.A multi-objective reward function,centered on energy efficiency,is designed to comprehensively consider path length,terrain slope,motion smoothness,and obstacle avoidance,enabling optimal low-energy trajectory generation in 3D space from the source.The incorporation of a multihead attention mechanism allows the model to dynamically focus on energy-critical state features—such as slope gradients and obstacle density—thereby significantly improving its ability to recognize and avoid energy-intensive paths.Additionally,the prioritized experience replay mechanism accelerates learning from key decision-making experiences,suppressing inefficient exploration and guiding the policy toward low-energy solutions more rapidly.The effectiveness of the proposed path planning algorithm is validated through simulation experiments conducted in multiple off-road scenarios.Results demonstrate that AD-Dueling DQN consistently achieves the lowest average energy consumption across all tested environments.Moreover,the proposed method exhibits faster convergence and greater training stability compared to baseline algorithms,highlighting its global optimization capability under energy-aware objectives in complex terrains.This study offers an efficient and scalable intelligent control strategy for the development of energy-conscious autonomous navigation systems. 展开更多
关键词 Autonomous mobile robot deep reinforcement learning energy optimization multi-attention mechanism prioritized experience replay dueling deep Q-Network
在线阅读 下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部