期刊文献+
共找到13,691篇文章
< 1 2 250 >
每页显示 20 50 100
A Fine-Grained RecognitionModel based on Discriminative Region Localization and Efficient Second-Order Feature Encoding
1
作者 Xiaorui Zhang Yingying Wang +3 位作者 Wei Sun Shiyu Zhou Haoming Zhang Pengpai Wang 《Computers, Materials & Continua》 2026年第4期946-965,共20页
Discriminative region localization and efficient feature encoding are crucial for fine-grained object recognition.However,existing data augmentation methods struggle to accurately locate discriminative regions in comp... Discriminative region localization and efficient feature encoding are crucial for fine-grained object recognition.However,existing data augmentation methods struggle to accurately locate discriminative regions in complex backgrounds,small target objects,and limited training data,leading to poor recognition.Fine-grained images exhibit“small inter-class differences,”and while second-order feature encoding enhances discrimination,it often requires dual Convolutional Neural Networks(CNN),increasing training time and complexity.This study proposes a model integrating discriminative region localization and efficient second-order feature encoding.By ranking feature map channels via a fully connected layer,it selects high-importance channels to generate an enhanced map,accurately locating discriminative regions.Cropping and erasing augmentations further refine recognition.To improve efficiency,a novel second-order feature encoding module generates an attention map from the fourth convolutional group of Residual Network 50 layers(ResNet-50)and multiplies it with features from the fifth group,producing second-order features while reducing dimensionality and training time.Experiments on Caltech-University of California,San Diego Birds-200-2011(CUB-200-2011),Stanford Car,and Fine-Grained Visual Classification of Aircraft(FGVC Aircraft)datasets show state-of-the-art accuracy of 88.9%,94.7%,and 93.3%,respectively. 展开更多
关键词 Fine-grained recognition feature encoding data augmentation second-order feature discriminative regions
在线阅读 下载PDF
Joint Feature Encoding and Task Alignment Mechanism for Emotion-Cause Pair Extraction
2
作者 Shi Li Didi Sun 《Computers, Materials & Continua》 SCIE EI 2025年第1期1069-1086,共18页
With the rapid expansion of social media,analyzing emotions and their causes in texts has gained significant importance.Emotion-cause pair extraction enables the identification of causal relationships between emotions... With the rapid expansion of social media,analyzing emotions and their causes in texts has gained significant importance.Emotion-cause pair extraction enables the identification of causal relationships between emotions and their triggers within a text,facilitating a deeper understanding of expressed sentiments and their underlying reasons.This comprehension is crucial for making informed strategic decisions in various business and societal contexts.However,recent research approaches employing multi-task learning frameworks for modeling often face challenges such as the inability to simultaneouslymodel extracted features and their interactions,or inconsistencies in label prediction between emotion-cause pair extraction and independent assistant tasks like emotion and cause extraction.To address these issues,this study proposes an emotion-cause pair extraction methodology that incorporates joint feature encoding and task alignment mechanisms.The model consists of two primary components:First,joint feature encoding simultaneously generates features for emotion-cause pairs and clauses,enhancing feature interactions between emotion clauses,cause clauses,and emotion-cause pairs.Second,the task alignment technique is applied to reduce the labeling distance between emotion-cause pair extraction and the two assistant tasks,capturing deep semantic information interactions among tasks.The proposed method is evaluated on a Chinese benchmark corpus using 10-fold cross-validation,assessing key performance metrics such as precision,recall,and F1 score.Experimental results demonstrate that the model achieves an F1 score of 76.05%,surpassing the state-of-the-art by 1.03%.The proposed model exhibits significant improvements in emotion-cause pair extraction(ECPE)and cause extraction(CE)compared to existing methods,validating its effectiveness.This research introduces a novel approach based on joint feature encoding and task alignment mechanisms,contributing to advancements in emotion-cause pair extraction.However,the study’s limitation lies in the data sources,potentially restricting the generalizability of the findings. 展开更多
关键词 Emotion-cause pair extraction interactive information enhancement joint feature encoding label consistency task alignment mechanisms
在线阅读 下载PDF
Dual encoding feature filtering generalized attention UNET for retinal vessel segmentation
3
作者 ISLAM Md Tauhidul WU Da-Wen +6 位作者 TANG Qing-Qing ZHAO Kai-Yang YIN Teng LI Yan-Fei SHANG Wen-Yi LIU Jing-Yu ZHANG Hai-Xian 《四川大学学报(自然科学版)》 北大核心 2025年第1期79-95,共17页
Retinal blood vessel segmentation is crucial for diagnosing ocular and cardiovascular diseases.Although the introduction of U-Net in 2015 by Olaf Ronneberger significantly advanced this field,yet issues like limited t... Retinal blood vessel segmentation is crucial for diagnosing ocular and cardiovascular diseases.Although the introduction of U-Net in 2015 by Olaf Ronneberger significantly advanced this field,yet issues like limited training data,imbalance data distribution,and inadequate feature extraction persist,hindering both the segmentation performance and optimal model generalization.Addressing these critical issues,the DEFFA-Unet is proposed featuring an additional encoder to process domain-invariant pre-processed inputs,thereby improving both richer feature encoding and enhanced model generalization.A feature filtering fusion module is developed to ensure the precise feature filtering and robust hybrid feature fusion.In response to the task-specific need for higher precision where false positives are very costly,traditional skip connections are replaced with the attention-guided feature reconstructing fusion module.Additionally,innovative data augmentation and balancing methods are proposed to counter data scarcity and distribution imbalance,further boosting the robustness and generalization of the model.With a comprehensive suite of evaluation metrics,extensive validations on four benchmark datasets(DRIVE,CHASEDB1,STARE,and HRF)and an SLO dataset(IOSTAR),demonstrate the proposed method’s superiority over both baseline and state-of-the-art models.Particularly the proposed method significantly outperforms the compared methods in cross-validation model generalization. 展开更多
关键词 Vessel segmentation Data balancing Data augmentation Dual encoder Attention Mechanism Model generalization
在线阅读 下载PDF
Encoding converters for quantum communication networks
4
作者 Hua-Xing Xu Shao-Hua Wang +2 位作者 Ya-Qi Song Ping Zhang Chang-Lei Wang 《Chinese Physics B》 2025年第5期64-69,共6页
Quantum communication networks,such as quantum key distribution(QKD)networks,typically employ the measurement-resend mechanism between two users using quantum communication devices based on different quantum encoding ... Quantum communication networks,such as quantum key distribution(QKD)networks,typically employ the measurement-resend mechanism between two users using quantum communication devices based on different quantum encoding types.To achieve direct communication between the devices with different quantum encoding types,in this paper,we propose encoding conversion schemes between the polarization bases(rectilinear,diagonal and circular bases)and the time-bin phase bases(two phase bases and time-bin basis)and design the quantum encoding converters.The theoretical analysis of the encoding conversion schemes is given in detail,and the basis correspondence of encoding conversion and the property of bit flip are revealed.The conversion relationship between polarization bases and time-bin phase bases can be easily selected by controlling a phase shifter.Since no optical switches are used in our scheme,the converter can be operated with high speed.The converters can also be modularized,which may be utilized to realize miniaturization in the future. 展开更多
关键词 quantum communication networks encoding conversion polarization encoding time-bin phase encoding
原文传递
A Blockchain-Based Covert Communication Model Based on Dynamic Base-K Encoding
5
作者 Wang Zhujun Zhang Lejun +7 位作者 Li Xueqing Tian Zhihong Su Shen Qiu Jing Chen Huiling Qiu Tie Sergey Gataullin Guo Ran 《China Communications》 2025年第6期319-333,共15页
Blockchain,as a distributed ledger,inherently possesses tamper-resistant capabilities,creating a natural channel for covert communication.However,the immutable nature of data storage might introduce challenges to comm... Blockchain,as a distributed ledger,inherently possesses tamper-resistant capabilities,creating a natural channel for covert communication.However,the immutable nature of data storage might introduce challenges to communication security.This study introduces a blockchain-based covert communication model utilizing dynamic Base-K encoding.The proposed encoding scheme utilizes the input address sequence to determine K to encode the secret message and determines the order of transactions based on K,thus ensuring effective concealment of the message.The dynamic encoding parameters enhance flexibility and address issues related to identical transaction amounts for the same secret message.Experimental results demonstrate that the proposed method maintains smooth communication and low susceptibility to tampering,achieving commendable concealment and embedding rates. 展开更多
关键词 base-K encoding blockchain CONCEALMENT covert communication
在线阅读 下载PDF
Image encoding-based bearing fault diagnosis:Review and challenges for high-speed trains
6
作者 Huimin Li Lingfeng Li +1 位作者 Bin Liu Ge Xin 《High-Speed Railway》 2025年第3期251-259,共9页
High-Speed Trains (HSTs) have emerged as a mainstream mode of transportation in China, owing to their exceptional safety and efficiency. Ensuring the reliable operation of HSTs is of paramount economic and societal im... High-Speed Trains (HSTs) have emerged as a mainstream mode of transportation in China, owing to their exceptional safety and efficiency. Ensuring the reliable operation of HSTs is of paramount economic and societal importance. As critical rotating mechanical components of the transmission system, bearings make their fault diagnosis a topic of extensive attention. This paper provides a systematic review of image encoding-based bearing fault diagnosis methods tailored to the condition monitoring of HSTs. First, it categorizes the image encoding techniques applied in the field of bearing fault diagnosis. Then, a review of state-of-the-art studies has been presented, encompassing both monomodal image conversion and multimodal image fusion approaches. Finally, it highlights current challenges and proposes future research directions to advance intelligent fault diagnosis in HSTs, aiming to provide a valuable reference for researchers and engineers in the field of intelligent operation and maintenance. 展开更多
关键词 High-speed trains Image encoding Fault diagnosis Rotating machinery Condition monitoring
在线阅读 下载PDF
Validity of the Gaussian phase distribution approximation for analysis of isotropic diffusion encoding applied to restricted diffusion in a cylinder
7
作者 Daniel Topgaard 《Magnetic Resonance Letters》 2025年第4期20-27,共8页
The Gaussian phase distribution approximation enables analysis of restricted diffusion encoded by general gradient waveforms but fails to account for the diffraction-like features that may occur for simple pore geomet... The Gaussian phase distribution approximation enables analysis of restricted diffusion encoded by general gradient waveforms but fails to account for the diffraction-like features that may occur for simple pore geometries.We investigate the range of validity of the approximation by random walk simulations of restricted diffusion in a cylinder using isotropic diffusion encoding sequences as well as conventional single gradient pulse pairs and oscillating gradient waveforms.The results show that clear deviations from the approximation may be observed at relative signal attenuations below 0.1 for onedimensional sequences with few oscillation periods.Increasing the encoding dimensionality and/or number of oscillations while extending the total duration of the waveform diminishes the non-Gaussian effects while preserving the low apparent diffusivities characteristic of restriction. 展开更多
关键词 NMR DIFFUSION Porous media Pulsed gradient spin echo Tensor-valued encoding
在线阅读 下载PDF
Enhanced Multimodal Sentiment Analysis via Integrated Spatial Position Encoding and Fusion Embedding
8
作者 Chenquan Gan Xu Liu +3 位作者 Yu Tang Xianrong Yu Qingyi Zhu Deepak Kumar Jain 《Computers, Materials & Continua》 2025年第12期5399-5421,共23页
Multimodal sentiment analysis aims to understand emotions from text,speech,and video data.However,current methods often overlook the dominant role of text and suffer from feature loss during integration.Given the vary... Multimodal sentiment analysis aims to understand emotions from text,speech,and video data.However,current methods often overlook the dominant role of text and suffer from feature loss during integration.Given the varying importance of each modality across different contexts,a central and pressing challenge in multimodal sentiment analysis lies in maximizing the use of rich intra-modal features while minimizing information loss during the fusion process.In response to these critical limitations,we propose a novel framework that integrates spatial position encoding and fusion embedding modules to address these issues.In our model,text is treated as the core modality,while speech and video features are selectively incorporated through a unique position-aware fusion process.The spatial position encoding strategy preserves the internal structural information of speech and visual modalities,enabling the model to capture localized intra-modal dependencies that are often overlooked.This design enhances the richness and discriminative power of the fused representation,enabling more accurate and context-aware sentiment prediction.Finally,we conduct comprehensive evaluations on two widely recognized standard datasets in the field—CMU-MOSI and CMU-MOSEI to validate the performance of the proposed model.The experimental results demonstrate that our model exhibits good performance and effectiveness for sentiment analysis tasks. 展开更多
关键词 Multimodal sentiment analysis spatial position encoding fusion embedding feature loss reduction
在线阅读 下载PDF
Autonomous inverse encoding guides 4D nanoprinting for highly programmable shape morphing
9
作者 Shuaiqi Ren Zhiang Zhang +6 位作者 Ruokun He Jiahao Fan Guangming Wang Hesheng Wang Bing Han Yong-Lai Zhang Zhuo-Chen Ma 《International Journal of Extreme Manufacturing》 2025年第3期467-482,共16页
Highly programmable shape morphing of 4D-printed micro/nanostructures is urgently desired for applications in robotics and intelligent systems.However,due to the lack of autonomous holistic strategies throughout the t... Highly programmable shape morphing of 4D-printed micro/nanostructures is urgently desired for applications in robotics and intelligent systems.However,due to the lack of autonomous holistic strategies throughout the target shape input,optimal material distribution generation,and fabrication program output,4D nanoprinting that permits arbitrary shape morphing remains a challenging task for manual design.In this study,we report an autonomous inverse encoding strategy to decipher the genetic code for material property distributions that can guide the encoded modeling toward arbitrarily pre-programmed 4D shape morphing.By tuning the laser power of each voxel at the nanoscale,the genetic code can be spatially programmed and controllable shape morphing can be realized through the inverse encoding process.Using this strategy,the 4D-printed structures can be designed and accurately shift to the target morphing of arbitrarily hand-drawn lines under stimulation.Furthermore,as a proof-of-concept,a flexible fiber micromanipulator that can approach the target region through pre-programmed shape morphing is autonomously inversely encoded according to the localized spatial environment.This strategy may contribute to the modeling and arbitrary shape morphing of micro/nanostructures fabricated via 4D nanoprinting,leading to cutting-edge applications in microfluidics,micro-robotics,minimally invasive robotic surgery,and tissue engineering. 展开更多
关键词 femtosecond laser fabrication 4D printing two-photon polymerization autonomous inverse encoding stimuli-responsive materials
在线阅读 下载PDF
Enhancing the genomic prediction accuracy of swine agricultural economic traits using an expanded one-hot encoding in CNN models
10
作者 Zishuai Wang Wangchang Li Zhonglin Tang 《Journal of Integrative Agriculture》 2025年第9期3574-3582,共9页
Deep learning(DL)methods like multilayer perceptrons(MLPs)and convolutional neural networks(CNNs)have been applied to predict the complex traits in animal and plant breeding.However,improving the genomic prediction ac... Deep learning(DL)methods like multilayer perceptrons(MLPs)and convolutional neural networks(CNNs)have been applied to predict the complex traits in animal and plant breeding.However,improving the genomic prediction accuracy still presents signifcant challenges.In this study,we applied CNNs to predict swine traits using previously published data.Specifcally,we extensively evaluated the CNN model's performance by employing various sets of single nucleotide polymorphisms(SNPs)and concluded that the CNN model achieved optimal performance when utilizing SNP sets comprising 1,000 SNPs.Furthermore,we adopted a novel approach using the one-hot encoding method that transforms the 16 different genotypes into sets of eight binary variables.This innovative encoding method signifcantly enhanced the CNN's prediction accuracy for swine traits,outperforming the traditional one-hot encoding techniques.Our fndings suggest that the expanded one-hot encoding method can improve the accuracy of DL methods in the genomic prediction of swine agricultural economic traits.This discovery has significant implications for swine breeding programs,where genomic prediction is pivotal in improving breeding strategies.Furthermore,future research endeavors can explore additional enhancements to DL methods by incorporating advanced data pre-processing techniques. 展开更多
关键词 SWINE agricultural economic traits genomic prediction deep learning one-hot encoding convolutional neural networks(CNNs)
在线阅读 下载PDF
Improved Sensitivity Encoding Parallel Magnetic Resonance Imaging Reconstruction Algorithm Based on Efficient Sum of Outer Products Dictionary Learning
11
作者 DUAN Jizhong SU Yan 《Journal of Shanghai Jiaotong university(Science)》 2025年第3期561-571,共11页
Sensitivity encoding(SENSE)is a parallel magnetic resonance imaging(MRI)reconstruction model by utilizing the sensitivity information of receiver coils to achieve image reconstruction.The existing SENSE-based reconstr... Sensitivity encoding(SENSE)is a parallel magnetic resonance imaging(MRI)reconstruction model by utilizing the sensitivity information of receiver coils to achieve image reconstruction.The existing SENSE-based reconstruction algorithms usually used nonadaptive sparsifying transforms,resulting in a limited reconstruction accuracy.Therefore,we proposed a new model for accurate parallel MRI reconstruction by combining the L0 norm regularization term based on the efficient sum of outer products dictionary learning(SOUPDIL)with the SENSE model,called SOUPDIL-SENSE.The SOUPDIL-SENSE model is mainly solved by utilizing the variable splitting and alternating direction method of multipliers techniques.The experimental results on four human datasets show that the proposed algorithm effectively promotes the image sparsity,eliminates the noise and artifacts of the reconstructed images,and improves the reconstruction accuracy. 展开更多
关键词 parallel magnetic resonance imaging(MRI) sensitivity encoding(SENSE) efficient sum of outer products dictionary learning(SOUPDIL) alternating direction method of multipliers
原文传递
基于双向时序窗口Transformer的网络入侵检测方法
12
作者 王长浩 王明阳 +1 位作者 丁磊 刘凯 《计算机应用研究》 北大核心 2026年第1期271-279,共9页
近年来,网络攻击的高度动态化、隐蔽化给互联网的安全和稳定带来了极大的威胁。针对现有网络入侵检测方法在局部时序建模精度不足及多分类下少数类识别能力不佳等问题,提出了一种基于双向时间滑动窗口Transformer的网络异常流量检测方... 近年来,网络攻击的高度动态化、隐蔽化给互联网的安全和稳定带来了极大的威胁。针对现有网络入侵检测方法在局部时序建模精度不足及多分类下少数类识别能力不佳等问题,提出了一种基于双向时间滑动窗口Transformer的网络异常流量检测方法。该方法将网络流量数据转换为突出时序关系的三维序列数据,引入可学习的嵌入编码及上下文位置编码,以增强序列特征的表现能力,提升了异常流量检测的准确率和稳定性,并在UNSW-NB15、CIC-IDS-2017公开数据集上进行了验证。实验结果表明,所提方法均表现出较好的性能优势,在二分类任务中检测准确率分别为99.79%、99.77%;在多分类任务中,准确率分别达到98.48%、99.76%,性能均显著高于其他先进深度学习模型。综上,该方法有效提升了网络异常流量检测的准确性和对少数类攻击的识别能力,为网络安全防护提供了新的技术手段。 展开更多
关键词 入侵检测 网络流量 双向时间窗口 上下文位置编码
在线阅读 下载PDF
基于多方位感知深度融合检测头的目标检测算法
13
作者 包晓安 彭书友 +3 位作者 张娜 涂小妹 张庆琪 吴彪 《浙江大学学报(工学版)》 北大核心 2026年第1期32-42,共11页
针对传统目标检测头难以有效捕捉全局信息的问题,提出基于多方位感知深度融合检测头的目标检测算法.通过在检测头部分设计高效双轴窗口注意力编码器(EDWE)模块,使网络能够深度融合捕获到的全局信息与局部信息;在特征金字塔结构之后使用... 针对传统目标检测头难以有效捕捉全局信息的问题,提出基于多方位感知深度融合检测头的目标检测算法.通过在检测头部分设计高效双轴窗口注意力编码器(EDWE)模块,使网络能够深度融合捕获到的全局信息与局部信息;在特征金字塔结构之后使用重参化大核卷积(RLK)模块,减小来自主干网络的特征空间差异,增强网络对中小型数据集的适应性;引入编码器选择保留模块(ESM),选择性地累积来自EDWE模块的输出,优化反向传播.实验结果表明,在规模较大的MS-COCO2017数据集上,所提算法应用于常见模型RetinaNet、FCOS、ATSS时使AP分别提升了2.9、2.6、3.4个百分点;在规模较小的PASCAL VOC2007数据集上,所提算法使3种模型的AP分别实现了1.3、1.0和1.1个百分点的提升.通过EDWE、RLK和ESM模块的协同作用,所提算法有效提升了目标检测精度,在不同规模的数据集上均展现了显著的性能优势. 展开更多
关键词 检测头 目标检测 Transformer编码器 深度融合 大核卷积
在线阅读 下载PDF
基于文本引导的轻量异构编码多模态图像融合
14
作者 王传云 周明奇 +3 位作者 孙冬冬 王田 高骞 李照奎 《工程科学学报》 北大核心 2026年第2期346-359,共14页
针对资源受限的无人机平台对红外与可见光图像的融合效率与感知性能需求,本文提出一种基于文本引导的轻量异构编码多模态图像融合网络.该网络设计了一种面向红外与可见光图像信息表达功能互补的轻量化双分支异构编码,红外图像编码分支... 针对资源受限的无人机平台对红外与可见光图像的融合效率与感知性能需求,本文提出一种基于文本引导的轻量异构编码多模态图像融合网络.该网络设计了一种面向红外与可见光图像信息表达功能互补的轻量化双分支异构编码,红外图像编码分支强调热目标与边缘响应,可见光图像编码分支侧重于纹理与细节信息建模,从而有效避免同构编码器带来的特征冗余与性能瓶颈.同时,引入轻量级跨模态特征融合模块,增强多模信息之间的互补性与融合表达能力.进一步,通过预训练视觉语言模型结合语义文本特征对融合过程进行引导与调控,提升融合图像的语义一致性与环境适应性.在三个公开多模态图像数据集TNO、LLVIP与M3FD上,本文方法与九种代表性图像融合算法进行了系统对比实验与综合评估,结果显示本文网络在互信息、结构相似性等多个主流评价指标上均表现优越,融合图像在细节清晰度、边缘结构一致性与目标可辨性方面优于现有方法.同时,消融实验表明所提出模型的推理时间相较基线方法减少约50%,且在不显著牺牲性能的前提下实现了更高的效率.除定量评估外,本文还开展了基于文本指令的定性实验,结果显示模型可根据不同语义指令灵活调整红外与可见光特征融合策略,适应低光、过曝、低对比、噪声等多种任务场景.在保证语义一致性的同时,有效增强了热源感知、结构清晰度与抗干扰能力,展现出传统无引导方法难以实现的语义可控性与内容适应性. 展开更多
关键词 多模态图像融合 双分支异构编码 文本引导 轻量化网络 注意力机制
在线阅读 下载PDF
利用编码器-解码器的温室温湿度长序列预测
15
作者 盖荣丽 王鹏飞 +1 位作者 郭志斌 段立明 《小型微型计算机系统》 北大核心 2026年第1期89-96,共8页
针对现有温湿度预测模型难以充分考虑温室温湿度数据本身的复杂非线性特征和长期依赖关系,导致模型在实际应用中预测精度不足问题,本文提出了一种基于编码器-解码器架构的多层结构温湿度预测模型.模型通过卷积运算对数据进行多尺度转换... 针对现有温湿度预测模型难以充分考虑温室温湿度数据本身的复杂非线性特征和长期依赖关系,导致模型在实际应用中预测精度不足问题,本文提出了一种基于编码器-解码器架构的多层结构温湿度预测模型.模型通过卷积运算对数据进行多尺度转换和特征提取,并使用改进的双向限制性耦合长短期记忆网络(Bidirectional Restrictive Coupled Long-Short Term Memory,BiRCLSTM)优化了信息传递机制,同时运用多头注意力机制从不同的表示子空间中捕捉信息,最终实现了长序列多变量温室温湿度数据的精确预测.在自建温湿度数据集中,该模型的预测误差明显优于基线模型,并且该模型还在3个公共数据集上进行了不同时间分辨率的预测实验,综合实验结果表明,本文模型在温室温湿度预测中具有更高的精度和良好的泛化性能. 展开更多
关键词 温湿度预测 长时间序列 多变量特征 编码器-解码器 长短期记忆网络
在线阅读 下载PDF
基于小波去噪超图深度聚类网络的多传感器故障识别方法
16
作者 王刚 俞云龙 卢明凤 《计算机集成制造系统》 北大核心 2026年第2期686-705,共20页
针对实际工业场景中标签数据不足,以及多传感器数据间的复杂高阶异质关系所带来的挑战,提出一种基于小波去噪超图深度聚类网络的多传感器故障识别方法。首先,该方法利用K近邻算法为每个由多传感器数据构成的样本构建超图,以建模传感器... 针对实际工业场景中标签数据不足,以及多传感器数据间的复杂高阶异质关系所带来的挑战,提出一种基于小波去噪超图深度聚类网络的多传感器故障识别方法。首先,该方法利用K近邻算法为每个由多传感器数据构成的样本构建超图,以建模传感器间的高阶异质关系;然后,设计基于离散超图小波框架的小波去噪超图卷积编码器,以提取并融合多尺度下的高频细节分量和低频近似分量;最后,通过联合优化聚类损失与重构损失,迭代更新深度故障特征与故障簇的中心表示,实现深度故障模式聚类。为验证该方法的有效性,在两个公开数据集上进行了充分的实验。实验结果表明,相较于基准方法,所提方法在无监督故障识别任务上表现出显著优越性,且具有良好的抗噪性能。 展开更多
关键词 深度聚类 小波去噪超图卷积编码器 多传感器故障识别 无监督学习
在线阅读 下载PDF
基于可变形卷积和注意力机制的路面裂缝检测
17
作者 谢永华 方育才 彭银佳 《计算机工程与设计》 北大核心 2026年第1期279-285,共7页
为解决路面裂缝检测中图像边缘特征难以学习和背景噪声干扰的问题,提出一个基于可变形卷积和注意力机制的可端到端训练的路面裂缝检测网络。该网络基于U-Net结构设计,在特征融合部分添加边缘感知模块来增强裂缝边缘的检测能力;在编码器... 为解决路面裂缝检测中图像边缘特征难以学习和背景噪声干扰的问题,提出一个基于可变形卷积和注意力机制的可端到端训练的路面裂缝检测网络。该网络基于U-Net结构设计,在特征融合部分添加边缘感知模块来增强裂缝边缘的检测能力;在编码器部分使用空洞残差模块扩大感受野并保留更多细节信息;在解码器部分添加注意力机制提高对裂缝特征的关注度,抑制背景噪声。实验结果表明,该网络在MPA、mIoU和F1值这3项指标上均优于其它对比网络,验证了该网络的有效性。 展开更多
关键词 裂缝检测 语义分割 编码解码 可变形卷积 空洞卷积 残差连接 注意力机制
在线阅读 下载PDF
基于Diff-Cascade的低资源命名实体识别方法
18
作者 邱云飞 董丽波 张文文 《计算机科学与探索》 北大核心 2026年第2期533-545,共13页
在低资源命名实体识别(NER)任务中,目前许多基于迁移学习的方法虽然能够缓解数据稀缺问题,但往往会导致句子中部分正确信息的遗漏或识别错误,从而影响模型在低资源环境中的效果。针对此问题,提出了一种基于多模块协同的NER模型Diff-Casc... 在低资源命名实体识别(NER)任务中,目前许多基于迁移学习的方法虽然能够缓解数据稀缺问题,但往往会导致句子中部分正确信息的遗漏或识别错误,从而影响模型在低资源环境中的效果。针对此问题,提出了一种基于多模块协同的NER模型Diff-Cascade-NER。利用变分自编码器(VAE)在潜在空间中学习数据表示,并生成多样化的样本;将上下文信息、句法分析和VAE重构数据作为条件输入到条件编码器(CE)进行编码;将编码后的数据传递给级联扩散模型(CDM),通过多阶段的去噪和生成过程产生高质量样本;通过对抗学习阶段(AL)优化生成样本的质量和多样性。实验结果表明,对比现有模型,Diff-Cascade-NER在8个低资源数据集上表现优越,特别是在BC2GM和WNUT-16数据集上,F1值分别达到85.44%和56.38%,验证了各模块协同作用在低资源NER任务中的有效性。 展开更多
关键词 低资源命名实体识别 变分自编码器 条件编码器 级联扩散模型 对抗学习
在线阅读 下载PDF
双路径编码与自适应感受野驱动的医学图像分割
19
作者 彭晏飞 孙伟强 《中国图象图形学报》 北大核心 2026年第1期320-334,共15页
目的受限于局部感受野,卷积神经网络难以有效建模长程依赖。现有研究尝试将Transformer模块引入编码器、解码器或跳跃连接以增强全局信息建模能力,但此类局部式嵌入仍不足以捕获器官在尺度与形态高度可变情况下所呈现的复杂依赖关系。此... 目的受限于局部感受野,卷积神经网络难以有效建模长程依赖。现有研究尝试将Transformer模块引入编码器、解码器或跳跃连接以增强全局信息建模能力,但此类局部式嵌入仍不足以捕获器官在尺度与形态高度可变情况下所呈现的复杂依赖关系。此外,传统卷积在训练后趋于静态,难以适应器官的几何形变,从而在一定程度上限制了模型对动态形变结构的表征能力。方法针对上述问题,提出一种端到端的医学图像分割框架,通过双路径编码与自适应感受野机制的协同设计,增强模型对全局—局部特征融合能力。具体而言,首先,设计了双路径编码结构,在多个网络层级融合卷积神经网络与Transformer特征,实现局部细节与全局上下文的渐进式融合;其次,构建编码器多层次融合机制,通过跨尺度信息交互整合浅层纹理与深层语义特征,增强模型对目标结构的多分辨率解析能力;最后,提出自适应感受野机制,基于像素级语义差距动态调整卷积核感知范围,突破静态卷积在形变组织表征中的瓶颈。结果实验在两个公开数据集上与最新的方法进行比较,在Synapse数据集中,本文方法较次优模型在DSC(Dice similarity coefficient)和HD95(95%Hausdorff distance)评价指标上分别提升0.54%和0.44;在ACDC(auto⁃mated cardiac diagnosis challenge)数据集上的DSC值提高0.34%;消融实验进一步验证了双路径编码与自适应感受野机制的协同有效性。结论本文方法通过深度融合卷积神经网络局部感知与Transformer全局建模的各自优势,结合自适应感受野机制,有效解决了当前医学图像分割模型中全局—局部特征融合不足及卷积核参数静态固化的问题,实现了SOTA(state-of-the-art)级别的分割精度,为复杂医学图像分割任务提供了新的方案。代码已开源:https://github.com/Swq308/DPAR-Net。 展开更多
关键词 卷积神经网络(CNN) TRANSFORMER 双路径编码 自适应感受野 多层次融合
原文传递
基于线性注意和类别关联特征学习的在线动作检测
20
作者 詹永照 孙慧敏 +1 位作者 夏惠芬 任晓鹏 《江苏大学学报(自然科学版)》 北大核心 2026年第1期39-47,63,共10页
为了在在线动作检测中充分合理利用动作的上下文特征、与类别关联的特征和预测的未来特征快速检测相应动作,提出基于线性注意和类别关联特征学习的在线动作检测方法.该方法改进了Transformer构架,采用哈达玛积的轻型线性自注意实现Trans... 为了在在线动作检测中充分合理利用动作的上下文特征、与类别关联的特征和预测的未来特征快速检测相应动作,提出基于线性注意和类别关联特征学习的在线动作检测方法.该方法改进了Transformer构架,采用哈达玛积的轻型线性自注意实现Transformer视频上下文特征学习,以减少计算开销;其次对训练样本动作特征进行聚类,将视频序列上下文特征与动作类别特征进行关联学习,有效获得与类别关联的特征表达;最后融合动作的上下文特征、与类别关联的特征和预测的未来特征检测相应时刻动作,以提升动作鉴别性.在典型数据集上进行性能试验,完成了超参取值分析,对比了不同方法的工作精度和运行效率.给出了消融试验和可视化分析.结果表明:在Thumos14(TSN-Anet)、Thumos14(TSN-Kinetics)和HDD数据集上,所提出方法的mAP比Colar方法分别提高了0.2、0.5、0.2百分点,可见新方法优于目前较先进的Colar方法. 展开更多
关键词 在线动作检测 深度学习 注意力机制 编码 上下文特征 TRANSFORMER 类别关联特征学习
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部