560 articles found
Augmented Deep Multi-Granularity Pose-Aware Feature Fusion Network for Visible-Infrared Person Re-Identification (Cited by: 3)
1
Authors: Zheng Shi, Wanru Song, Junhao Shan, Feng Liu. Computers, Materials & Continua (SCIE, EI), 2023, No. 12, pp. 3467-3488 (22 pages)
Visible-infrared Cross-modality Person Re-identification (VI-ReID) is a critical technology in smart public facilities such as cities, campuses and libraries. It aims to match pedestrians in visible light and infrared images for video surveillance, which poses a challenge in exploring cross-modal shared information accurately and efficiently. Therefore, multi-granularity feature learning methods have been applied in VI-ReID to extract potential multi-granularity semantic information related to pedestrian body structure attributes. However, existing research mainly uses traditional dual-stream fusion networks and overlooks the core of cross-modal learning networks, the fusion module. This paper introduces a novel network called the Augmented Deep Multi-Granularity Pose-Aware Feature Fusion Network (ADMPFF-Net), incorporating the Multi-Granularity Pose-Aware Feature Fusion (MPFF) module to generate discriminative representations. MPFF efficiently explores and learns global and local features with multi-level semantic information by inserting disentangling and duplicating blocks into the fusion module of the backbone network. ADMPFF-Net also provides a new perspective for designing multi-granularity learning networks. By incorporating the multi-granularity feature disentanglement (mGFD) and posture information segmentation (pIS) strategies, it extracts more representative features concerning body structure information. The Local Information Enhancement (LIE) module augments high-performance features in VI-ReID, and the multi-granularity joint loss supervises model training for objective feature learning. Experimental results on two public datasets show that ADMPFF-Net efficiently constructs pedestrian feature representations and enhances the accuracy of VI-ReID.
Keywords: Visible-infrared person re-identification multi-granularity feature learning modality
Feature-Based Aggregation and Deep Reinforcement Learning: A Survey and Some New Implementations (Cited by: 15)
2
Author: Dimitri P. Bertsekas. IEEE/CAA Journal of Automatica Sinica (EI, CSCD), 2019, No. 1, pp. 1-31 (31 pages)
In this paper we discuss policy iteration methods for approximate solution of a finite-state discounted Markov decision problem, with a focus on feature-based aggregation methods and their connection with deep reinforcement learning schemes. We introduce features of the states of the original problem, and we formulate a smaller "aggregate" Markov decision problem, whose states relate to the features. We discuss properties and possible implementations of this type of aggregation, including a new approach to approximate policy iteration. In this approach the policy improvement operation combines feature-based aggregation with feature construction using deep neural networks or other calculations. We argue that the cost function of a policy may be approximated much more accurately by the nonlinear function of the features provided by aggregation, than by the linear function of the features provided by neural network-based reinforcement learning, thereby potentially leading to more effective policy improvement.
Keywords: reinforcement learning; dynamic programming; Markovian decision problems; aggregation; feature-based architectures; policy iteration; deep neural networks; rollout algorithms
DMHFR: Decoder with Multi-Head Feature Receptors for Tract Image Segmentation
3
Authors: Jianuo Huang, Bohan Lai, Weiye Qiu, Caixu Xu, Jie He. Computers, Materials & Continua, 2025, No. 3, pp. 4841-4862 (22 pages)
The self-attention mechanism of Transformers, which captures long-range contextual information, has demonstrated significant potential in image segmentation. However, their ability to learn local, contextual relationships between pixels requires further improvement. Previous methods face challenges in efficiently managing multi-scale features of different granularities from the encoder backbone, leaving room for improvement in their global representation and feature extraction capabilities. To address these challenges, we propose a novel Decoder with Multi-Head Feature Receptors (DMHFR), which receives multi-scale features from the encoder backbone and organizes them into three feature groups with different granularities: coarse, fine-grained, and full set. These groups are subsequently processed by Multi-Head Feature Receptors (MHFRs) after feature capture and modeling operations. MHFRs include two Three-Head Feature Receptors (THFRs) and one Four-Head Feature Receptor (FHFR). Each group of features is passed through these MHFRs and then fed into axial transformers, which help the model capture long-range dependencies within the features. The three MHFRs produce three distinct feature outputs. The output from the FHFR serves as auxiliary features in the prediction head, and the prediction outputs and their losses are eventually aggregated. Experimental results show that the Transformer using DMHFR outperforms 15 state-of-the-art (SOTA) methods on five public datasets. Specifically, it achieved significant improvements in mean DICE scores over the classic Parallel Reverse Attention Network (PraNet) method, with gains of 4.1%, 2.2%, 1.4%, 8.9%, and 16.3% on the CVC-ClinicDB, Kvasir-SEG, CVC-T, CVC-ColonDB, and ETIS-LaribPolypDB datasets, respectively.
Keywords: Medical image segmentation; feature exploration; feature aggregation; deep learning; multi-head feature receptor
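The mean DICE score quoted in the abstract above is a standard overlap measure between a predicted mask and a ground-truth mask. The following is only an illustrative sketch, not code from the DMHFR paper: it assumes flat binary masks given as Python lists, and the function name `dice_score` and the convention that two empty masks score 1.0 are assumptions made here.

```python
def dice_score(pred, target):
    """Dice coefficient 2*|A ∩ B| / (|A| + |B|) for two flat binary masks."""
    if len(pred) != len(target):
        raise ValueError("masks must have the same length")
    # Pixels that are foreground in both masks.
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    # Total foreground pixels across the two masks.
    total = sum(1 for p in pred if p) + sum(1 for t in target if t)
    if total == 0:
        # Assumption: two empty masks count as a perfect match.
        return 1.0
    return 2.0 * intersection / total


print(dice_score([1, 1, 0, 0], [1, 0, 1, 0]))  # 0.5
```

A per-dataset mean would average this value over all test images; whether the paper averages per image or per dataset is not stated in the abstract.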
Implicit Feature Contrastive Learning for Few-Shot Object Detection
4
Authors: Gang Li, Zheng Zhou, Yang Zhang, Chuanyun Xu, Zihan Ruan, Pengfei Lv, Ru Wang, Xinyu Fan, Wei Tan. Computers, Materials & Continua, 2025, No. 7, pp. 1615-1632 (18 pages)
Although conventional object detection methods achieve high accuracy through extensively annotated datasets, acquiring such large-scale labeled data remains challenging and cost-prohibitive in numerous real-world applications. Few-shot object detection presents a new research idea that aims to localize and classify objects in images using only limited annotated examples. However, the inherent challenge in few-shot object detection lies in the insufficient sample diversity to fully characterize the sample feature distribution, which consequently impacts model performance. Inspired by contrastive learning principles, we propose an Implicit Feature Contrastive Learning (IFCL) module to address this limitation and augment feature diversity for more robust representational learning. This module generates augmented support sample features in a mixed feature space and implicitly contrasts them with query Region of Interest (RoI) features. This approach facilitates more comprehensive learning of both intra-class feature similarity and inter-class feature diversity, thereby enhancing the model's object classification and localization capabilities. Extensive experiments on PASCAL VOC show that our method achieves improvements of 3.2%, 1.8%, and 2.3%, respectively, in the 10-shot setting on the three Novel Sets compared to the baseline model FPD.
Keywords: Few-shot learning; object detection; implicit contrastive learning; feature mixing; feature aggregation
Full Perception Head: Bridging the Gap Between Local and Global Features
5
Authors: Jie Hua, Zhongyuan Wang, Xin Tian, Qin Zou, Jinsheng Xiao, Jiayi Ma. IEEE/CAA Journal of Automatica Sinica, 2025, No. 7, pp. 1391-1406 (16 pages)
Object detection is a fundamental task in computer vision that involves identifying and localizing objects within an image. Local features extracted by convolutions, etc., capture fine-grained details such as edges and textures, while global features extracted by fully connected layers, etc., represent the overall structure and long-range relationships within the image. These features are crucial for accurate object detection, yet most existing methods focus on aggregating local and global features, often overlooking the importance of medium-range dependencies. To address this gap, we propose a novel full perception module (FP-Module), a simple yet effective feature extraction module designed to simultaneously capture local details, medium-range dependencies, and long-range dependencies. Building on this, we construct a full perception head (FP-Head) by cascading multiple FP-Modules, enabling the prediction layer to leverage the most informative features. Experimental results on the MS COCO dataset demonstrate that our approach significantly enhances object recognition and localization, achieving 2.7-5.7 AP^val gains when integrated into standard object detectors. Notably, the FP-Module is a universal solution that can be seamlessly incorporated into existing detectors to boost performance. The code will be released at https://github.com/Idcogroup/FP-Head.
Keywords: feature aggregation; full perception module; medium-range dependencies; object detection
MSL-Net: a lightweight apple leaf disease detection model based on multi-scale feature fusion
6
Authors: YANG Kangyi, YAN Chunman. Optoelectronics Letters, 2025, No. 12, pp. 745-752 (8 pages)
Aiming at the problem of low detection accuracy due to the different scale sizes of apple leaf disease spots and their similarity to the background, this paper proposes a multi-scale lightweight network (MSL-Net). Firstly, a multiplexed aggregated feature extraction network is proposed using residual bottleneck block (RES-Bottleneck) and middle partial-convolution (MP-Conv) to capture multi-scale spatial features and enhance focus on disease features for better differentiation between disease targets and background information. Secondly, a lightweight feature fusion network is designed using scale-fuse concatenation (SF-Cat) and triple-scale sequence feature fusion (TSSF) module to merge multi-scale feature maps comprehensively. Depthwise convolution (DWConv) and GhostNet lighten the network, while the cross stage partial bottleneck with 3 convolutions ghost-normalization attention module (C3-GN) reduces missed detections by suppressing irrelevant background information. Finally, soft non-maximum suppression (Soft-NMS) is used in the post-processing stage to mitigate misdetection of dense disease sites. The results show that MSL-Net improves mean average precision at an intersection over union of 0.5 (mAP@0.5) by 2.0% over the baseline you only look once version 5s (YOLOv5s) and reduces parameters by 44% and computation by 27%, outperforming other state-of-the-art (SOTA) models overall. This method also shows excellent performance compared to the latest research.
Keywords: enhance focus disease features background i multi scale feature fusion apple leaf disease spots residual bottleneck block res bottleneck multiplexed aggregated feature extraction network lightweight network apple leaf disease detection
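The abstract above names Soft-NMS as its post-processing step. As a rough illustration of the general technique only (not the MSL-Net implementation), the sketch below uses the Gaussian-decay variant of Soft-NMS: instead of discarding boxes that overlap the current best box, it lowers their scores in proportion to the overlap. The function names, the `(x1, y1, x2, y2)` box format, and the default `sigma` and `score_threshold` values are all assumptions chosen for this example.

```python
import math


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def soft_nms(boxes, scores, sigma=0.5, score_threshold=0.001):
    """Gaussian Soft-NMS: returns (box, score) pairs in selection order."""
    remaining = list(zip(boxes, scores))
    kept = []
    while remaining:
        # Select the highest-scoring remaining box.
        best = max(range(len(remaining)), key=lambda i: remaining[i][1])
        best_box, best_score = remaining.pop(best)
        kept.append((best_box, best_score))
        # Decay the other scores by overlap instead of removing the boxes.
        decayed = []
        for box, score in remaining:
            score *= math.exp(-(iou(best_box, box) ** 2) / sigma)
            if score >= score_threshold:  # drop boxes whose score has vanished
                decayed.append((box, score))
        remaining = decayed
    return kept


boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
for box, score in soft_nms(boxes, [0.9, 0.8, 0.7]):
    print(box, round(score, 3))
```

In this toy input the second box overlaps the first heavily, so it is kept but with a reduced score and is selected last, whereas hard NMS with a typical threshold would have removed it. This is the behaviour that can help when targets are densely packed.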
Enhancing Classroom Behavior Recognition with Lightweight Multi-Scale Feature Fusion
7
Authors: Chuanchuan Wang, Ahmad Sufril Azlan Mohamed, Xiao Yang, Hao Zhang, Xiang Li, Mohd Halim Bin Mohd Noor. Computers, Materials & Continua, 2025, No. 10, pp. 855-874 (20 pages)
Classroom behavior recognition is a hot research topic that plays a vital role in assessing and improving the quality of classroom teaching. However, existing classroom behavior recognition methods struggle to achieve high recognition accuracy on datasets with problems such as blurred images and inconsistent objects. To address this challenge, we propose an effective, lightweight object detector called the RFNet model (YOLO-FR). Specifically, for efficient multi-scale feature extraction, an effective feature pyramid shared convolutional (FPSC) module is designed to improve feature extraction from the input image by leveraging convolutional layers with varying dilation rates in the backbone. Secondly, to address the problem of multi-scale variability in the scene, we design the Rep Ghost fusion Cross Stage Partial and Efficient Layer Aggregation Network (RGCSPELAN) to further improve network performance and reduce the amount of computation and the number of parameters. In addition, we conduct an experimental evaluation on the SCB dataset3 and the STBD-08 dataset. Experimental results indicate that, compared to the baseline model, the RFNet model increases mean average precision (mAP@50) from 69.6% to 71.0% on the SCB dataset3 and from 91.8% to 93.1% on the STBD-08 dataset. RFNet reaches a precision of 68.6%, surpassing the baseline method (YOLOv11) by 3.3%, and achieves the smallest model size (4.9 M) on the SCB dataset3. Finally, comparison with other algorithms shows that it accurately detects student behavior in complex classroom environments; the results confirm that RFNet is well suited to real-time, efficient recognition of classroom behaviors.
Keywords: Classroom action recognition; YOLO-FR; feature pyramid shared convolutional; rep ghost cross stage partial efficient layer aggregation network (RGCSPELAN)
Point Cloud Classification Using Content-Based Transformer via Clustering in Feature Space (Cited by: 11)
8
Authors: Yahui Liu, Bin Tian, Yisheng Lv, Lingxi Li, Fei-Yue Wang. IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2024, No. 1, pp. 231-239 (9 pages)
Recently, there have been some attempts to apply Transformers to 3D point cloud classification. In order to reduce computations, most existing methods focus on local spatial attention, but ignore their content and fail to establish relationships between distant but relevant points. To overcome the limitation of local spatial attention, we propose a point content-based Transformer architecture, called PointConT for short. It exploits the locality of points in the feature space (content-based), which clusters the sampled points with similar features into the same class and computes the self-attention within each class, thus enabling an effective trade-off between capturing long-range dependencies and computational complexity. We further introduce an inception feature aggregator for point cloud classification, which uses parallel structures to aggregate high-frequency and low-frequency information in each branch separately. Extensive experiments show that our PointConT model achieves a remarkable performance on point cloud shape classification. In particular, our method exhibits 90.3% Top-1 accuracy on the hardest setting of ScanObjectNN. Source code of this paper is available at https://github.com/yahuiliu99/PointConT.
Keywords: Content-based Transformer; deep learning; feature aggregator; local attention; point cloud classification
ST-SIGMA: Spatio-temporal semantics and interaction graph aggregation for multi-agent perception and trajectory forecasting (Cited by: 6)
9
Authors: Yang Fang, Bei Luo, Ting Zhao, Dong He, Bingbing Jiang, Qilie Liu. CAAI Transactions on Intelligence Technology (SCIE, EI), 2022, No. 4, pp. 744-757 (14 pages)
Scene perception and trajectory forecasting are two fundamental challenges that are crucial to a safe and reliable autonomous driving (AD) system. However, most proposed methods aim at addressing one of the two challenges mentioned above with a single model. To tackle this dilemma, this paper proposes spatio-temporal semantics and interaction graph aggregation for multi-agent perception and trajectory forecasting (ST-SIGMA), an efficient end-to-end method to jointly and accurately perceive the AD environment and forecast the trajectories of the surrounding traffic agents within a unified framework. ST-SIGMA adopts a trident encoder-decoder architecture to learn scene semantics and agent interaction information on bird's-eye view (BEV) maps simultaneously. Specifically, an iterative aggregation network is first employed as the scene semantic encoder (SSE) to learn diverse scene information. To preserve dynamic interactions of traffic agents, ST-SIGMA further exploits a spatio-temporal graph network as the graph interaction encoder. Meanwhile, a simple yet efficient feature fusion method is designed to fuse semantic and interaction features into a unified feature space as the input to a novel hierarchical aggregation decoder for downstream prediction tasks. Extensive experiments on the nuScenes data set have demonstrated that the proposed ST-SIGMA achieves significant improvements compared to the state-of-the-art (SOTA) methods in terms of scene perception and trajectory forecasting, respectively. The proposed approach also outperforms SOTA in terms of model generalisation and robustness and is therefore more feasible for deployment in real-world AD scenarios.
Keywords: feature fusion; graph interaction; hierarchical aggregation; scene perception; scene semantics; trajectory forecasting
MIA-UNet: Multi-Scale Iterative Aggregation U-Network for Retinal Vessel Segmentation (Cited by: 2)
10
Authors: Linfang Yu, Zhen Qin, Yi Ding, Zhiguang Qin. Computer Modeling in Engineering & Sciences (SCIE, EI), 2021, No. 11, pp. 805-828 (24 pages)
As an important part of the new generation of information technology, the Internet of Things (IoT) has attracted wide attention and is regarded as an enabling technology of the next generation of health care systems. Fundus photography equipment is connected to the cloud platform through the IoT, so as to realize the real-time uploading of fundus images and the rapid issuance of diagnostic suggestions by artificial intelligence. At the same time, important security and privacy issues have emerged. The data uploaded to the cloud platform involves more personal attributes, health status and medical application data of patients. Once leaked, abused or improperly disclosed, personal information security will be violated. Therefore, it is important to address the security and privacy issues of massive medical and healthcare equipment connecting to the infrastructure of IoT healthcare and health systems. To meet this challenge, we propose MIA-UNet, a multi-scale iterative aggregation U-network, which aims to achieve accurate and efficient retinal vessel segmentation for ophthalmic auxiliary diagnosis while ensuring that the network has low computational complexity to adapt to mobile terminals. In this way, users do not need to upload the data to the cloud platform, and can analyze and process the fundus images on their own mobile terminals, thus eliminating the leakage of personal information. Specifically, the interconnection between encoder and decoder, as well as the internal connection between decoder subnetworks in classic U-Net, are redefined and redesigned. Furthermore, we propose a hybrid loss function to smooth the gradient and deal with the imbalance between foreground and background. Compared with U-Net, the segmentation performance of the proposed network is significantly improved while the number of parameters increases by only 2%. When applied to three publicly available datasets, DRIVE, STARE and CHASE_DB1, the proposed network achieves accuracy/F1-score of 96.33%/84.34%, 97.12%/83.17% and 97.06%/84.10%, respectively. The experimental results show that MIA-UNet is superior to the state-of-the-art methods.
Keywords: Retinal vessel segmentation; security and privacy; redesigned skip connection; feature maps aggregation; hybrid loss function
Modelling the temporal-varied nonlinear velocity profile of debris flow using a stratification aggregation algorithm in 3D-HBP-SPH framework
11
Authors: HAN Zheng, XIE Wendu, ZENG Chuicheng, LI Yange, CHEN Guangqi, CHEN Ningsheng, HU Guisheng, WANG Weidong. Journal of Mountain Science (SCIE, CSCD), 2024, No. 12, pp. 3945-3960 (16 pages)
Estimation of the velocity profile within mud depth is a long-standing and essential problem in debris flow dynamics. Until now, various velocity profiles have been proposed based on the fitting analysis of experimental measurements, but these are often limited by the observation conditions, such as the number of configured sensors. Therefore, the resulting linear velocity profiles usually exhibit limitations in reproducing the temporal-varied and nonlinear behavior during the debris flow process. In this study, we present a novel approach to explore the debris flow velocity profile in detail upon our previous 3D-HBP-SPH numerical model, i.e., the three-dimensional Smoothed Particle Hydrodynamic model incorporating the Herschel-Bulkley-Papanastasiou rheology. Specifically, we propose a stratification aggregation algorithm for interpreting the details of SPH particles, which enables the recording of temporal velocities of debris flow at different mud depths. To analyze the velocity profile, we introduce a logarithmic-based nonlinear model with two key parameters: a, controlling the shape of the velocity profile, and b, concerning its temporal evolution. We verify the proposed velocity profile and explore its sensitivity using 34 sets of velocity data from three individual flume experiments in previous literature. Our results demonstrate that the proposed temporal-varied nonlinear velocity profile outperforms the previous linear profiles.
Keywords: Debris flow; Velocity profile; Temporal varied feature; Nonlinear; Stratification aggregation algorithm
Online identification and extraction method of regional large-scale adjustable load-aggregation characteristics
12
Authors: Siwei Li, Liang Yue, Xiangyu Kong, Chengshan Wang. Global Energy Interconnection (EI, CSCD), 2024, No. 3, pp. 313-323 (11 pages)
This article introduces the concept of load aggregation, which involves a comprehensive analysis of loads to acquire their external characteristics for the purpose of modeling and analyzing power systems. The online identification method is a computer-involved approach for data collection, processing, and system identification, commonly used for adaptive control and prediction. This paper proposes a method for dynamically aggregating large-scale adjustable loads to support high proportions of new energy integration, aiming to study the aggregation characteristics of regional large-scale adjustable loads using online identification techniques and feature extraction methods. The experiment selected 300 central air conditioners as the research subject and analyzed their regulation characteristics, economic efficiency, and comfort. The experimental results show that as the adjustment time of the air conditioner increases from 5 minutes to 35 minutes, the stable adjustment quantity during the adjustment period decreases from 28.46 to 3.57, indicating that air conditioning loads can be controlled over a long period and have better adjustment effects in the short term. Overall, the experimental results of this paper demonstrate that analyzing the aggregation characteristics of regional large-scale adjustable loads using online identification techniques and feature extraction algorithms is effective.
Keywords: Load aggregation; Regional large-scale; Online recognition; feature extraction method
Supervised Feature Learning for Offline Writer Identification Using VLAD and Double Power Normalization
13
Authors: Dawei Liang, Meng Wu, Yan Hu. Computers, Materials & Continua (SCIE, EI), 2023, No. 7, pp. 279-293 (15 pages)
As an indispensable part of identity authentication, offline writer identification plays a notable role in biology, forensics, and historical document analysis. However, identifying handwriting efficiently, stably, and quickly is still challenging due to the method of extracting and processing handwriting features. In this paper, we propose an efficient system to identify writers through handwritten images, which integrates local and global features from similar handwritten images. The local features are modeled by effective aggregate processing, and global features are extracted through transfer learning. Specifically, the proposed system employs a pre-trained Residual Network to mine the relationship between large image sets and specific handwritten images, while the vector of locally aggregated descriptors with double power normalization is employed in aggregating local and global features. Moreover, handwritten image segmentation, preprocessing, enhancement, optimization of neural network architecture, and normalization for local and global features are exploited, significantly improving system performance. The proposed system is evaluated on Computer Vision Lab (CVL) datasets and the International Conference on Document Analysis and Recognition (ICDAR) 2013 datasets. The results show that it exhibits good generalizability and achieves state-of-the-art performance. Furthermore, the system performs better when training complete handwriting patches with the normalization method. The experimental results indicate that it is important to segment handwriting reasonably while dealing with handwriting overlap, which reduces visual burstiness.
Keywords: Writer identification; power normalization; vector of locally aggregated descriptors; feature extraction
AMHF-TP: Multifunctional therapeutic peptides prediction based on multi-granularity hierarchical features
14
Authors: Shouheng Tuo, YanLing Zhu, Jiangkun Lin, Jiewei Jiang. Quantitative Biology, 2025, No. 1, pp. 127-141 (15 pages)
Multifunctional therapeutic peptides (MFTP) hold immense potential in diverse therapeutic contexts, yet their prediction and identification remain challenging due to the limitations of traditional methodologies, such as extensive training durations, limited sample sizes, and inadequate generalization capabilities. To address these issues, we present AMHF-TP, an advanced method for MFTP recognition that utilizes attention mechanisms and multi-granularity hierarchical features to enhance performance. AMHF-TP is composed of four key components: a migration learning module that leverages pretrained models to extract atomic compositional features of MFTP sequences; a convolutional neural network and self-attention module that refine feature extraction from amino acid sequences and their secondary structures; a hypergraph module that constructs a hypergraph for complex similarity representation between MFTP sequences; and a hierarchical feature extraction module that integrates multimodal peptide sequence features. Compared with leading methods, the proposed AMHF-TP demonstrates superior precision, accuracy, and coverage, underscoring its effectiveness and robustness in MFTP recognition. The comparative analysis of separate hierarchical models and the combined model, as well as with five contemporary models, reveals AMHF-TP's exceptional performance and stability in recognition tasks.
Keywords: deep learning; hypergraph; multifunctional therapeutic peptides; multi-granularity hierarchical features
MASA-Net: Multi-Aspect Channel-Spatial Attention Network With Cross-Layer Feature Aggregation for Accurate Fungi Species Identification
15
Authors: Indranil Bera, Rajesh Mukherjee, Bidesh Chakraborty. IET Cyber-Systems and Robotics, 2025, No. 4, pp. 81-94 (14 pages)
Accurate identification of fungal species is essential for effective diagnosis and treatment. Traditional microscopy-based methods are often subjective and time-consuming. Deep learning has emerged as a promising tool in this domain. However, existing deep learning models often struggle to generalise in the presence of class imbalance and subtle morphological differences, which are common in fungal image datasets. This study proposes MASA-Net, a deep learning framework that combines a fine-tuned DenseNet201 backbone with a multi-aspect channel-spatial attention (MASA) module. The attention mechanism refines spatial and channel-wise features by capturing multi-scale spatial patterns and adaptively emphasising informative channels. This enhances the network's ability to focus on diagnostically relevant fungal structures while suppressing irrelevant features. MASA-Net is evaluated on the DeFungi dataset and demonstrates superior performance in terms of accuracy, precision, recall and F1-score. It also outperforms established attention mechanisms such as squeeze-and-excitation networks (SE) and convolutional block attention module (CBAM) under identical conditions. These results highlight MASA-Net's robustness and effectiveness in addressing class imbalance and structural variability, offering a reliable solution for automated fungal species identification.
Keywords: attention; convolutional neural network (CNN); cross-layer feature aggregation; fungi classification; transfer learning
Microseismic source location method based on a fused denoising prior and multi-scale feature aggregation
16
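Authors: 黄建平, 王秋阳, 李媛媛, 黎国龙, 路依霖, 李三福, 段文胜, 雷刚林. Journal of China University of Petroleum (Edition of Natural Science) (《中国石油大学学报(自然科学版)》) (PKU Core), 2026, No. 1, pp. 65-75 (11 pages)
With the surge in the volume of data acquired in seismic exploration, traditional microseismic location methods struggle to meet the current demand for real-time location in hydraulic fracturing. In addition, environmental noise during acquisition leads to a low signal-to-noise ratio, which degrades source location accuracy. To address this, a deep-learning-based hybrid self-supervised and supervised learning method is proposed. First, a convolutional denoising autoencoder (CDAE) is pre-trained in a self-supervised manner, learning the waveform features in the data while denoising. Next, the encoder part of the CDAE is used as the feature extraction module and cascaded with a fully convolutional location module to build a joint location model. Finally, the network is fine-tuned with a small amount of labeled data to realize a nonlinear mapping from noisy microseismic data to the spatial distribution of sources. Tests are carried out on a linear model and the Marmousi-2 model, with comparisons against networks such as U-Net. The results show that the proposed joint location network still achieves high-precision source location when trained with only a small amount of labeled data.
Keywords: microseismic monitoring; denoising prior; feature aggregation; source location; deep learning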
Self-supervised monocular depth estimation network with feature bias and attention aggregation
17
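Authors: 贾瑞明, 余俊, 徐霄, 王涵, 郑瀚. Computer Engineering and Applications (《计算机工程与应用》) (PKU Core), 2026, No. 1, pp. 253-263 (11 pages)
A network model with feature bias and attention aggregation is proposed to address problems in monocular depth estimation such as depth discontinuity and large local deviations. The feature bias enhancement module adds random biases along the channel dimension of the network's internal features, which strengthens the network's handling of positional information and improves the model's generalization ability. In addition, the dual-path aggregation structure of convolution and self-attention is redesigned, using depthwise convolution and a non-compressed gating module to infer the depth of image points more accurately. Comparison experiments and generalization validation experiments were carried out on the public KITTI and Make3D datasets, respectively. Accuracy improved by 2.01% in the comparison experiments and by 5.23% in the cross-dataset generalization experiments.
Keywords: monocular depth estimation; self-supervised; aggregated attention; feature bias enhancement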
Structural characterization and functional properties of amyloid fibril aggregation of fava bean 11S protein
18
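Authors: 陈慧, 周梦园, 许喆, 林学镁, 赵文忠, 许志豪, 丁笑琦, 丁玉庭. Food and Fermentation Industries (《食品与发酵工业》) (PKU Core), 2026, No. 1, pp. 277-283 (7 pages)
Amyloid fibril aggregation of food-derived proteins has unique structural characteristics, and fava bean 11S protein (FP), as a sustainable protein resource, shows great potential. This study investigated the dynamic evolution of fibrotic aggregation of 11S protein in fava bean (FPF) during its formation, including its structural characterization and functional properties. FPF was obtained by acid-heat treatment (pH 2, 85 °C) of 6 g/100 mL FP for different times (0-24 h). The treated samples were structurally characterized by thioflavin T, fluorescence, dityrosine, transmission electron microscopy, Fourier transform infrared spectroscopy, and other methods. The results showed that FP was first hydrolyzed into polypeptides during the acid-heat process and then self-assembled into FPF rich in β-sheet structure (increasing from 34.44% at 0 h to 45.89% at 24 h). The functional properties of FPF were characterized in terms of foaming, emulsifying, and gelling properties. Compared with FP, FPF after 24 h of reaction had better foaming, emulsifying, and gelling properties. In addition, FPF showed no cytotoxicity in in vitro cell experiments. The results provide theoretical support for understanding how FPF forms.
Keywords: fava bean protein; 11S protein; amyloid fibril aggregation; structural characteristics; functional properties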
Cross-modality person re-identification method with feature aggregation driven by frequency-domain spatial information
19
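Authors: 金静, 朱传斌, 翟凤文. Application Research of Computers (《计算机应用研究》) (PKU Core), 2026, No. 1, pp. 298-304 (7 pages)
Cross-modality person re-identification aims to match pedestrian images across the visible and infrared modalities. The core challenge of the task is to mitigate the discrepancy between the visible and infrared modalities and to extract discriminative shared features. However, when minimizing the inter-modality discrepancy and extracting modality-shared features, existing methods fail to make full use of the modality information available after data augmentation and ignore the semantic correlation between features at different scales. To address this, a feature aggregation network based on frequency-domain spatial information (FDSIFA) is proposed. First, the designed multi-branch frequency-domain spatial perception module (MFSPM) fully extracts modality-specific information from the augmented and original images of each modality, while mining the consistency of cross-modality features in both the frequency and spatial dimensions, which effectively reduces the inter-modality discrepancy. Second, a multi-stage feature aggregation module (MFAM) is designed to adaptively aggregate features at different scales and mine the semantic correlation between low-level and high-level features, improving the semantic expressiveness and discriminability of the features. In the all-search mode of the SYSU-MM01 dataset, the network achieves rank-1 and mAP of 75.09% and 71.35%, respectively, outperforming the compared methods. The experimental results verify the effectiveness of the proposed method.
Keywords: cross-modality; person re-identification; data augmentation; frequency-domain spatial information; feature aggregation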
VIFusion: a complementary fusion model for visible and infrared images in low-light scenes
20
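Authors: 张晓滨, 牛燕皓, 陈金广. Journal of Xi'an Polytechnic University (《西安工程大学学报》), 2026, No. 1, pp. 126-135 (10 pages)
To address the loss of temporal information, channel redundancy in feature maps, and blurred details in visible and infrared image fusion algorithms for low-light scenes, this paper proposes VIFusion, a complementary fusion model for visible and infrared images in low-light scenes, based on the Vision Transformer framework. The model improves the quality and information expressiveness of fused images through its dual temporal feature aggregation (DTFA) module, feature refinement feedforward network (FRFN) module, and spatial channel attention (SCA) module. The DTFA module uses grouped convolution to preserve the spatial integrity of features and then performs temporal alignment and fusion to enhance temporal consistency and reduce information loss. The FRFN module optimizes the extracted features layer by layer to reduce channel redundancy. The SCA module adaptively models the spatial and channel relationships of the image to highlight key features, improve information expressiveness, and enhance details such as edges and textures. Experimental results show that on the LLVIP dataset, VIFusion outperforms traditional methods and deep learning models (such as GTF, TarDAL, and DenseFuse) on objective metrics (AG, CC, EN, SF, SSIM, VIF, MI). In generalization experiments on the TNO dataset, the generated fused images also perform better in detail preservation and target highlighting. VIFusion provides an efficient and practical solution for multimodal image fusion in low-light scenes.
Keywords: dual temporal feature aggregation; feature refinement feedforward network; spatial channel attention; image fusion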