期刊文献+
共找到13,782篇文章
< 1 2 250 >
每页显示 20 50 100
A Fine-Grained RecognitionModel based on Discriminative Region Localization and Efficient Second-Order Feature Encoding
1
作者 Xiaorui Zhang Yingying Wang +3 位作者 Wei Sun Shiyu Zhou Haoming Zhang Pengpai Wang 《Computers, Materials & Continua》 2026年第4期946-965,共20页
Discriminative region localization and efficient feature encoding are crucial for fine-grained object recognition.However,existing data augmentation methods struggle to accurately locate discriminative regions in comp... Discriminative region localization and efficient feature encoding are crucial for fine-grained object recognition.However,existing data augmentation methods struggle to accurately locate discriminative regions in complex backgrounds,small target objects,and limited training data,leading to poor recognition.Fine-grained images exhibit“small inter-class differences,”and while second-order feature encoding enhances discrimination,it often requires dual Convolutional Neural Networks(CNN),increasing training time and complexity.This study proposes a model integrating discriminative region localization and efficient second-order feature encoding.By ranking feature map channels via a fully connected layer,it selects high-importance channels to generate an enhanced map,accurately locating discriminative regions.Cropping and erasing augmentations further refine recognition.To improve efficiency,a novel second-order feature encoding module generates an attention map from the fourth convolutional group of Residual Network 50 layers(ResNet-50)and multiplies it with features from the fifth group,producing second-order features while reducing dimensionality and training time.Experiments on Caltech-University of California,San Diego Birds-200-2011(CUB-200-2011),Stanford Car,and Fine-Grained Visual Classification of Aircraft(FGVC Aircraft)datasets show state-of-the-art accuracy of 88.9%,94.7%,and 93.3%,respectively. 展开更多
关键词 Fine-grained recognition feature encoding data augmentation second-order feature discriminative regions
在线阅读 下载PDF
布鲁氏菌分泌蛋白BPE043的功能及其对细胞凋亡基因表达的影响 被引量:1
2
作者 王书利 李梦含 +5 位作者 高萌萌 王倩 郑好 张荷婷 李志强 李启峰 《中国动物传染病学报》 北大核心 2025年第1期21-28,共8页
本研究旨在分析布鲁氏菌Ⅳ型分泌系统(T4SS)效应蛋白BPE043在布鲁氏菌感染过程和调控细胞凋亡基因表达中的作用。以流产布鲁氏菌2308(S2308)为亲本株,通过同源重组的方法构建布鲁氏菌BPE 043基因缺失株(2308ΔBPE 043)和回补株(2308ΔBP... 本研究旨在分析布鲁氏菌Ⅳ型分泌系统(T4SS)效应蛋白BPE043在布鲁氏菌感染过程和调控细胞凋亡基因表达中的作用。以流产布鲁氏菌2308(S2308)为亲本株,通过同源重组的方法构建布鲁氏菌BPE 043基因缺失株(2308ΔBPE 043)和回补株(2308ΔBPE 043-C)。将亲本株、缺失株和回补株在相同起始浓度下培养,观察它们的生长变化。亲本株、缺失株和回补株感染小鼠,测定小鼠脾脏的荷菌量。亲本株、缺失株和回补株侵染小鼠巨噬细胞RAW 264.7,检测细菌的胞内存活能力。利用实时荧光定量PCR(RT-qPCR)和Western blot技术检测布鲁氏菌侵染后细胞凋亡基因Caspase-3、Caspase-8、Bax和Bcl-2的表达水平。结果显示,亲本株、缺失株和回补株在体外培养的生长趋势基本相同。2308ΔBPE 043感染小鼠脾脏的荷菌量显著低于S2303感染组,且在感染2周后,缺失株感染组脾脏荷菌量的下降趋势较亲本株感染组明显。布鲁氏菌侵染宿主细胞8 h后,2308ΔBPE 043的胞内存活能力显著低于S2308。布鲁氏菌侵染宿主细胞24 h后,2308ΔBPE 043侵染组凋亡基因Caspase-3、Caspase-8和Bax表达水平显著高于S2308侵染组,Bcl-2表达水平显著低于S2308侵染组。2308ΔBPE 043侵染组Caspase-3和Caspase-8的蛋白表达水平,以及Caspase-3和Caspase-8的活性显著高于S2308侵染组。本研究发现,T4SS效应蛋白BPE043在布鲁氏菌感染和胞内存活过程中发挥重要作用,是布鲁氏菌重要的毒力因子,为进一步探究BPE043在布鲁氏菌持续性感染中的作用奠定了基础。 展开更多
关键词 布鲁氏菌 bpe043 胞内存活 毒力 凋亡
在线阅读 下载PDF
Joint Feature Encoding and Task Alignment Mechanism for Emotion-Cause Pair Extraction
3
作者 Shi Li Didi Sun 《Computers, Materials & Continua》 SCIE EI 2025年第1期1069-1086,共18页
With the rapid expansion of social media,analyzing emotions and their causes in texts has gained significant importance.Emotion-cause pair extraction enables the identification of causal relationships between emotions... With the rapid expansion of social media,analyzing emotions and their causes in texts has gained significant importance.Emotion-cause pair extraction enables the identification of causal relationships between emotions and their triggers within a text,facilitating a deeper understanding of expressed sentiments and their underlying reasons.This comprehension is crucial for making informed strategic decisions in various business and societal contexts.However,recent research approaches employing multi-task learning frameworks for modeling often face challenges such as the inability to simultaneouslymodel extracted features and their interactions,or inconsistencies in label prediction between emotion-cause pair extraction and independent assistant tasks like emotion and cause extraction.To address these issues,this study proposes an emotion-cause pair extraction methodology that incorporates joint feature encoding and task alignment mechanisms.The model consists of two primary components:First,joint feature encoding simultaneously generates features for emotion-cause pairs and clauses,enhancing feature interactions between emotion clauses,cause clauses,and emotion-cause pairs.Second,the task alignment technique is applied to reduce the labeling distance between emotion-cause pair extraction and the two assistant tasks,capturing deep semantic information interactions among tasks.The proposed method is evaluated on a Chinese benchmark corpus using 10-fold cross-validation,assessing key performance metrics such as precision,recall,and F1 score.Experimental results demonstrate that the model achieves an F1 score of 76.05%,surpassing the state-of-the-art by 1.03%.The proposed model exhibits significant improvements in emotion-cause pair extraction(ECPE)and cause extraction(CE)compared to existing methods,validating its effectiveness.This research introduces a novel approach based on joint feature encoding and task alignment mechanisms,contributing to advancements in emotion-cause pair extraction.However,the study’s limitation lies in the data sources,potentially restricting the generalizability of the findings. 展开更多
关键词 Emotion-cause pair extraction interactive information enhancement joint feature encoding label consistency task alignment mechanisms
在线阅读 下载PDF
Dual encoding feature filtering generalized attention UNET for retinal vessel segmentation
4
作者 ISLAM Md Tauhidul WU Da-Wen +6 位作者 TANG Qing-Qing ZHAO Kai-Yang YIN Teng LI Yan-Fei SHANG Wen-Yi LIU Jing-Yu ZHANG Hai-Xian 《四川大学学报(自然科学版)》 北大核心 2025年第1期79-95,共17页
Retinal blood vessel segmentation is crucial for diagnosing ocular and cardiovascular diseases.Although the introduction of U-Net in 2015 by Olaf Ronneberger significantly advanced this field,yet issues like limited t... Retinal blood vessel segmentation is crucial for diagnosing ocular and cardiovascular diseases.Although the introduction of U-Net in 2015 by Olaf Ronneberger significantly advanced this field,yet issues like limited training data,imbalance data distribution,and inadequate feature extraction persist,hindering both the segmentation performance and optimal model generalization.Addressing these critical issues,the DEFFA-Unet is proposed featuring an additional encoder to process domain-invariant pre-processed inputs,thereby improving both richer feature encoding and enhanced model generalization.A feature filtering fusion module is developed to ensure the precise feature filtering and robust hybrid feature fusion.In response to the task-specific need for higher precision where false positives are very costly,traditional skip connections are replaced with the attention-guided feature reconstructing fusion module.Additionally,innovative data augmentation and balancing methods are proposed to counter data scarcity and distribution imbalance,further boosting the robustness and generalization of the model.With a comprehensive suite of evaluation metrics,extensive validations on four benchmark datasets(DRIVE,CHASEDB1,STARE,and HRF)and an SLO dataset(IOSTAR),demonstrate the proposed method’s superiority over both baseline and state-of-the-art models.Particularly the proposed method significantly outperforms the compared methods in cross-validation model generalization. 展开更多
关键词 Vessel segmentation Data balancing Data augmentation Dual encoder Attention Mechanism Model generalization
在线阅读 下载PDF
Encoding converters for quantum communication networks
5
作者 Hua-Xing Xu Shao-Hua Wang +2 位作者 Ya-Qi Song Ping Zhang Chang-Lei Wang 《Chinese Physics B》 2025年第5期64-69,共6页
Quantum communication networks,such as quantum key distribution(QKD)networks,typically employ the measurement-resend mechanism between two users using quantum communication devices based on different quantum encoding ... Quantum communication networks,such as quantum key distribution(QKD)networks,typically employ the measurement-resend mechanism between two users using quantum communication devices based on different quantum encoding types.To achieve direct communication between the devices with different quantum encoding types,in this paper,we propose encoding conversion schemes between the polarization bases(rectilinear,diagonal and circular bases)and the time-bin phase bases(two phase bases and time-bin basis)and design the quantum encoding converters.The theoretical analysis of the encoding conversion schemes is given in detail,and the basis correspondence of encoding conversion and the property of bit flip are revealed.The conversion relationship between polarization bases and time-bin phase bases can be easily selected by controlling a phase shifter.Since no optical switches are used in our scheme,the converter can be operated with high speed.The converters can also be modularized,which may be utilized to realize miniaturization in the future. 展开更多
关键词 quantum communication networks encoding conversion polarization encoding time-bin phase encoding
原文传递
A Blockchain-Based Covert Communication Model Based on Dynamic Base-K Encoding
6
作者 Wang Zhujun Zhang Lejun +7 位作者 Li Xueqing Tian Zhihong Su Shen Qiu Jing Chen Huiling Qiu Tie Sergey Gataullin Guo Ran 《China Communications》 2025年第6期319-333,共15页
Blockchain,as a distributed ledger,inherently possesses tamper-resistant capabilities,creating a natural channel for covert communication.However,the immutable nature of data storage might introduce challenges to comm... Blockchain,as a distributed ledger,inherently possesses tamper-resistant capabilities,creating a natural channel for covert communication.However,the immutable nature of data storage might introduce challenges to communication security.This study introduces a blockchain-based covert communication model utilizing dynamic Base-K encoding.The proposed encoding scheme utilizes the input address sequence to determine K to encode the secret message and determines the order of transactions based on K,thus ensuring effective concealment of the message.The dynamic encoding parameters enhance flexibility and address issues related to identical transaction amounts for the same secret message.Experimental results demonstrate that the proposed method maintains smooth communication and low susceptibility to tampering,achieving commendable concealment and embedding rates. 展开更多
关键词 base-K encoding blockchain CONCEALMENT covert communication
在线阅读 下载PDF
Image encoding-based bearing fault diagnosis:Review and challenges for high-speed trains
7
作者 Huimin Li Lingfeng Li +1 位作者 Bin Liu Ge Xin 《High-Speed Railway》 2025年第3期251-259,共9页
High-Speed Trains (HSTs) have emerged as a mainstream mode of transportation in China, owing to their exceptional safety and efficiency. Ensuring the reliable operation of HSTs is of paramount economic and societal im... High-Speed Trains (HSTs) have emerged as a mainstream mode of transportation in China, owing to their exceptional safety and efficiency. Ensuring the reliable operation of HSTs is of paramount economic and societal importance. As critical rotating mechanical components of the transmission system, bearings make their fault diagnosis a topic of extensive attention. This paper provides a systematic review of image encoding-based bearing fault diagnosis methods tailored to the condition monitoring of HSTs. First, it categorizes the image encoding techniques applied in the field of bearing fault diagnosis. Then, a review of state-of-the-art studies has been presented, encompassing both monomodal image conversion and multimodal image fusion approaches. Finally, it highlights current challenges and proposes future research directions to advance intelligent fault diagnosis in HSTs, aiming to provide a valuable reference for researchers and engineers in the field of intelligent operation and maintenance. 展开更多
关键词 High-speed trains Image encoding Fault diagnosis Rotating machinery Condition monitoring
在线阅读 下载PDF
Validity of the Gaussian phase distribution approximation for analysis of isotropic diffusion encoding applied to restricted diffusion in a cylinder
8
作者 Daniel Topgaard 《Magnetic Resonance Letters》 2025年第4期20-27,共8页
The Gaussian phase distribution approximation enables analysis of restricted diffusion encoded by general gradient waveforms but fails to account for the diffraction-like features that may occur for simple pore geomet... The Gaussian phase distribution approximation enables analysis of restricted diffusion encoded by general gradient waveforms but fails to account for the diffraction-like features that may occur for simple pore geometries.We investigate the range of validity of the approximation by random walk simulations of restricted diffusion in a cylinder using isotropic diffusion encoding sequences as well as conventional single gradient pulse pairs and oscillating gradient waveforms.The results show that clear deviations from the approximation may be observed at relative signal attenuations below 0.1 for onedimensional sequences with few oscillation periods.Increasing the encoding dimensionality and/or number of oscillations while extending the total duration of the waveform diminishes the non-Gaussian effects while preserving the low apparent diffusivities characteristic of restriction. 展开更多
关键词 NMR DIFFUSION Porous media Pulsed gradient spin echo Tensor-valued encoding
在线阅读 下载PDF
Enhanced Multimodal Sentiment Analysis via Integrated Spatial Position Encoding and Fusion Embedding
9
作者 Chenquan Gan Xu Liu +3 位作者 Yu Tang Xianrong Yu Qingyi Zhu Deepak Kumar Jain 《Computers, Materials & Continua》 2025年第12期5399-5421,共23页
Multimodal sentiment analysis aims to understand emotions from text,speech,and video data.However,current methods often overlook the dominant role of text and suffer from feature loss during integration.Given the vary... Multimodal sentiment analysis aims to understand emotions from text,speech,and video data.However,current methods often overlook the dominant role of text and suffer from feature loss during integration.Given the varying importance of each modality across different contexts,a central and pressing challenge in multimodal sentiment analysis lies in maximizing the use of rich intra-modal features while minimizing information loss during the fusion process.In response to these critical limitations,we propose a novel framework that integrates spatial position encoding and fusion embedding modules to address these issues.In our model,text is treated as the core modality,while speech and video features are selectively incorporated through a unique position-aware fusion process.The spatial position encoding strategy preserves the internal structural information of speech and visual modalities,enabling the model to capture localized intra-modal dependencies that are often overlooked.This design enhances the richness and discriminative power of the fused representation,enabling more accurate and context-aware sentiment prediction.Finally,we conduct comprehensive evaluations on two widely recognized standard datasets in the field—CMU-MOSI and CMU-MOSEI to validate the performance of the proposed model.The experimental results demonstrate that our model exhibits good performance and effectiveness for sentiment analysis tasks. 展开更多
关键词 Multimodal sentiment analysis spatial position encoding fusion embedding feature loss reduction
在线阅读 下载PDF
Autonomous inverse encoding guides 4D nanoprinting for highly programmable shape morphing
10
作者 Shuaiqi Ren Zhiang Zhang +6 位作者 Ruokun He Jiahao Fan Guangming Wang Hesheng Wang Bing Han Yong-Lai Zhang Zhuo-Chen Ma 《International Journal of Extreme Manufacturing》 2025年第3期467-482,共16页
Highly programmable shape morphing of 4D-printed micro/nanostructures is urgently desired for applications in robotics and intelligent systems.However,due to the lack of autonomous holistic strategies throughout the t... Highly programmable shape morphing of 4D-printed micro/nanostructures is urgently desired for applications in robotics and intelligent systems.However,due to the lack of autonomous holistic strategies throughout the target shape input,optimal material distribution generation,and fabrication program output,4D nanoprinting that permits arbitrary shape morphing remains a challenging task for manual design.In this study,we report an autonomous inverse encoding strategy to decipher the genetic code for material property distributions that can guide the encoded modeling toward arbitrarily pre-programmed 4D shape morphing.By tuning the laser power of each voxel at the nanoscale,the genetic code can be spatially programmed and controllable shape morphing can be realized through the inverse encoding process.Using this strategy,the 4D-printed structures can be designed and accurately shift to the target morphing of arbitrarily hand-drawn lines under stimulation.Furthermore,as a proof-of-concept,a flexible fiber micromanipulator that can approach the target region through pre-programmed shape morphing is autonomously inversely encoded according to the localized spatial environment.This strategy may contribute to the modeling and arbitrary shape morphing of micro/nanostructures fabricated via 4D nanoprinting,leading to cutting-edge applications in microfluidics,micro-robotics,minimally invasive robotic surgery,and tissue engineering. 展开更多
关键词 femtosecond laser fabrication 4D printing two-photon polymerization autonomous inverse encoding stimuli-responsive materials
在线阅读 下载PDF
Enhancing the genomic prediction accuracy of swine agricultural economic traits using an expanded one-hot encoding in CNN models
11
作者 Zishuai Wang Wangchang Li Zhonglin Tang 《Journal of Integrative Agriculture》 2025年第9期3574-3582,共9页
Deep learning(DL)methods like multilayer perceptrons(MLPs)and convolutional neural networks(CNNs)have been applied to predict the complex traits in animal and plant breeding.However,improving the genomic prediction ac... Deep learning(DL)methods like multilayer perceptrons(MLPs)and convolutional neural networks(CNNs)have been applied to predict the complex traits in animal and plant breeding.However,improving the genomic prediction accuracy still presents signifcant challenges.In this study,we applied CNNs to predict swine traits using previously published data.Specifcally,we extensively evaluated the CNN model's performance by employing various sets of single nucleotide polymorphisms(SNPs)and concluded that the CNN model achieved optimal performance when utilizing SNP sets comprising 1,000 SNPs.Furthermore,we adopted a novel approach using the one-hot encoding method that transforms the 16 different genotypes into sets of eight binary variables.This innovative encoding method signifcantly enhanced the CNN's prediction accuracy for swine traits,outperforming the traditional one-hot encoding techniques.Our fndings suggest that the expanded one-hot encoding method can improve the accuracy of DL methods in the genomic prediction of swine agricultural economic traits.This discovery has significant implications for swine breeding programs,where genomic prediction is pivotal in improving breeding strategies.Furthermore,future research endeavors can explore additional enhancements to DL methods by incorporating advanced data pre-processing techniques. 展开更多
关键词 SWINE agricultural economic traits genomic prediction deep learning one-hot encoding convolutional neural networks(CNNs)
在线阅读 下载PDF
Improved Sensitivity Encoding Parallel Magnetic Resonance Imaging Reconstruction Algorithm Based on Efficient Sum of Outer Products Dictionary Learning
12
作者 DUAN Jizhong SU Yan 《Journal of Shanghai Jiaotong university(Science)》 2025年第3期561-571,共11页
Sensitivity encoding(SENSE)is a parallel magnetic resonance imaging(MRI)reconstruction model by utilizing the sensitivity information of receiver coils to achieve image reconstruction.The existing SENSE-based reconstr... Sensitivity encoding(SENSE)is a parallel magnetic resonance imaging(MRI)reconstruction model by utilizing the sensitivity information of receiver coils to achieve image reconstruction.The existing SENSE-based reconstruction algorithms usually used nonadaptive sparsifying transforms,resulting in a limited reconstruction accuracy.Therefore,we proposed a new model for accurate parallel MRI reconstruction by combining the L0 norm regularization term based on the efficient sum of outer products dictionary learning(SOUPDIL)with the SENSE model,called SOUPDIL-SENSE.The SOUPDIL-SENSE model is mainly solved by utilizing the variable splitting and alternating direction method of multipliers techniques.The experimental results on four human datasets show that the proposed algorithm effectively promotes the image sparsity,eliminates the noise and artifacts of the reconstructed images,and improves the reconstruction accuracy. 展开更多
关键词 parallel magnetic resonance imaging(MRI) sensitivity encoding(SENSE) efficient sum of outer products dictionary learning(SOUPDIL) alternating direction method of multipliers
原文传递
基于双向时序窗口Transformer的网络入侵检测方法
13
作者 王长浩 王明阳 +1 位作者 丁磊 刘凯 《计算机应用研究》 北大核心 2026年第1期271-279,共9页
近年来,网络攻击的高度动态化、隐蔽化给互联网的安全和稳定带来了极大的威胁。针对现有网络入侵检测方法在局部时序建模精度不足及多分类下少数类识别能力不佳等问题,提出了一种基于双向时间滑动窗口Transformer的网络异常流量检测方... 近年来,网络攻击的高度动态化、隐蔽化给互联网的安全和稳定带来了极大的威胁。针对现有网络入侵检测方法在局部时序建模精度不足及多分类下少数类识别能力不佳等问题,提出了一种基于双向时间滑动窗口Transformer的网络异常流量检测方法。该方法将网络流量数据转换为突出时序关系的三维序列数据,引入可学习的嵌入编码及上下文位置编码,以增强序列特征的表现能力,提升了异常流量检测的准确率和稳定性,并在UNSW-NB15、CIC-IDS-2017公开数据集上进行了验证。实验结果表明,所提方法均表现出较好的性能优势,在二分类任务中检测准确率分别为99.79%、99.77%;在多分类任务中,准确率分别达到98.48%、99.76%,性能均显著高于其他先进深度学习模型。综上,该方法有效提升了网络异常流量检测的准确性和对少数类攻击的识别能力,为网络安全防护提供了新的技术手段。 展开更多
关键词 入侵检测 网络流量 双向时间窗口 上下文位置编码
在线阅读 下载PDF
基于多方位感知深度融合检测头的目标检测算法
14
作者 包晓安 彭书友 +3 位作者 张娜 涂小妹 张庆琪 吴彪 《浙江大学学报(工学版)》 北大核心 2026年第1期32-42,共11页
针对传统目标检测头难以有效捕捉全局信息的问题,提出基于多方位感知深度融合检测头的目标检测算法.通过在检测头部分设计高效双轴窗口注意力编码器(EDWE)模块,使网络能够深度融合捕获到的全局信息与局部信息;在特征金字塔结构之后使用... 针对传统目标检测头难以有效捕捉全局信息的问题,提出基于多方位感知深度融合检测头的目标检测算法.通过在检测头部分设计高效双轴窗口注意力编码器(EDWE)模块,使网络能够深度融合捕获到的全局信息与局部信息;在特征金字塔结构之后使用重参化大核卷积(RLK)模块,减小来自主干网络的特征空间差异,增强网络对中小型数据集的适应性;引入编码器选择保留模块(ESM),选择性地累积来自EDWE模块的输出,优化反向传播.实验结果表明,在规模较大的MS-COCO2017数据集上,所提算法应用于常见模型RetinaNet、FCOS、ATSS时使AP分别提升了2.9、2.6、3.4个百分点;在规模较小的PASCAL VOC2007数据集上,所提算法使3种模型的AP分别实现了1.3、1.0和1.1个百分点的提升.通过EDWE、RLK和ESM模块的协同作用,所提算法有效提升了目标检测精度,在不同规模的数据集上均展现了显著的性能优势. 展开更多
关键词 检测头 目标检测 Transformer编码器 深度融合 大核卷积
在线阅读 下载PDF
基于CNN-Transformer架构的电磁传播损耗预测算法
15
作者 万勇 李骏杰 +1 位作者 孙伟峰 戴永寿 《现代电子技术》 北大核心 2026年第6期43-48,共6页
为了解决传统经验传播损耗模型预测精度不足的问题,提出一种基于CNN-Transformer架构的电磁传播损耗预测算法,通过构建回归模型进行精准的传播损耗预测。通过斯皮尔曼系数法提取有效特征,利用CNN提取与传播损耗预测高度相关的浅层特征,... 为了解决传统经验传播损耗模型预测精度不足的问题,提出一种基于CNN-Transformer架构的电磁传播损耗预测算法,通过构建回归模型进行精准的传播损耗预测。通过斯皮尔曼系数法提取有效特征,利用CNN提取与传播损耗预测高度相关的浅层特征,将从卫星图像中获取的传播路径上地物特征序列进行位置编码,增强对传播路径中不同地物特征顺序对传播损耗影响的理解。最后将CNN提取的浅层特征与位置编码后的地物特征输入到Transformer模型,通过多头自注意力机制捕捉特征间的全局关联性,从而有效校正传播损耗的预测结果。实验结果表明,所提出的CNN-Transformer方法显著降低了传播损耗预测的均方根误差(RMSE),达到了3.3745 dB,同时保持了0.8956的较高确定性系数(R^(2))。所提的电磁传播损耗预测算法为无线通信传播特性研究领域提供了参考,具有一定的应用价值。 展开更多
关键词 电磁传播 损耗预测 TRANSFORMER CNN 斯皮尔曼系数法 地物类型 位置编码
在线阅读 下载PDF
基于文本引导的轻量异构编码多模态图像融合
16
作者 王传云 周明奇 +3 位作者 孙冬冬 王田 高骞 李照奎 《工程科学学报》 北大核心 2026年第2期346-359,共14页
针对资源受限的无人机平台对红外与可见光图像的融合效率与感知性能需求,本文提出一种基于文本引导的轻量异构编码多模态图像融合网络.该网络设计了一种面向红外与可见光图像信息表达功能互补的轻量化双分支异构编码,红外图像编码分支... 针对资源受限的无人机平台对红外与可见光图像的融合效率与感知性能需求,本文提出一种基于文本引导的轻量异构编码多模态图像融合网络.该网络设计了一种面向红外与可见光图像信息表达功能互补的轻量化双分支异构编码,红外图像编码分支强调热目标与边缘响应,可见光图像编码分支侧重于纹理与细节信息建模,从而有效避免同构编码器带来的特征冗余与性能瓶颈.同时,引入轻量级跨模态特征融合模块,增强多模信息之间的互补性与融合表达能力.进一步,通过预训练视觉语言模型结合语义文本特征对融合过程进行引导与调控,提升融合图像的语义一致性与环境适应性.在三个公开多模态图像数据集TNO、LLVIP与M3FD上,本文方法与九种代表性图像融合算法进行了系统对比实验与综合评估,结果显示本文网络在互信息、结构相似性等多个主流评价指标上均表现优越,融合图像在细节清晰度、边缘结构一致性与目标可辨性方面优于现有方法.同时,消融实验表明所提出模型的推理时间相较基线方法减少约50%,且在不显著牺牲性能的前提下实现了更高的效率.除定量评估外,本文还开展了基于文本指令的定性实验,结果显示模型可根据不同语义指令灵活调整红外与可见光特征融合策略,适应低光、过曝、低对比、噪声等多种任务场景.在保证语义一致性的同时,有效增强了热源感知、结构清晰度与抗干扰能力,展现出传统无引导方法难以实现的语义可控性与内容适应性. 展开更多
关键词 多模态图像融合 双分支异构编码 文本引导 轻量化网络 注意力机制
在线阅读 下载PDF
利用编码器-解码器的温室温湿度长序列预测
17
作者 盖荣丽 王鹏飞 +1 位作者 郭志斌 段立明 《小型微型计算机系统》 北大核心 2026年第1期89-96,共8页
针对现有温湿度预测模型难以充分考虑温室温湿度数据本身的复杂非线性特征和长期依赖关系,导致模型在实际应用中预测精度不足问题,本文提出了一种基于编码器-解码器架构的多层结构温湿度预测模型.模型通过卷积运算对数据进行多尺度转换... 针对现有温湿度预测模型难以充分考虑温室温湿度数据本身的复杂非线性特征和长期依赖关系,导致模型在实际应用中预测精度不足问题,本文提出了一种基于编码器-解码器架构的多层结构温湿度预测模型.模型通过卷积运算对数据进行多尺度转换和特征提取,并使用改进的双向限制性耦合长短期记忆网络(Bidirectional Restrictive Coupled Long-Short Term Memory,BiRCLSTM)优化了信息传递机制,同时运用多头注意力机制从不同的表示子空间中捕捉信息,最终实现了长序列多变量温室温湿度数据的精确预测.在自建温湿度数据集中,该模型的预测误差明显优于基线模型,并且该模型还在3个公共数据集上进行了不同时间分辨率的预测实验,综合实验结果表明,本文模型在温室温湿度预测中具有更高的精度和良好的泛化性能. 展开更多
关键词 温湿度预测 长时间序列 多变量特征 编码器-解码器 长短期记忆网络
在线阅读 下载PDF
基于小波去噪超图深度聚类网络的多传感器故障识别方法
18
作者 王刚 俞云龙 卢明凤 《计算机集成制造系统》 北大核心 2026年第2期686-705,共20页
针对实际工业场景中标签数据不足,以及多传感器数据间的复杂高阶异质关系所带来的挑战,提出一种基于小波去噪超图深度聚类网络的多传感器故障识别方法。首先,该方法利用K近邻算法为每个由多传感器数据构成的样本构建超图,以建模传感器... 针对实际工业场景中标签数据不足,以及多传感器数据间的复杂高阶异质关系所带来的挑战,提出一种基于小波去噪超图深度聚类网络的多传感器故障识别方法。首先,该方法利用K近邻算法为每个由多传感器数据构成的样本构建超图,以建模传感器间的高阶异质关系;然后,设计基于离散超图小波框架的小波去噪超图卷积编码器,以提取并融合多尺度下的高频细节分量和低频近似分量;最后,通过联合优化聚类损失与重构损失,迭代更新深度故障特征与故障簇的中心表示,实现深度故障模式聚类。为验证该方法的有效性,在两个公开数据集上进行了充分的实验。实验结果表明,相较于基准方法,所提方法在无监督故障识别任务上表现出显著优越性,且具有良好的抗噪性能。 展开更多
关键词 深度聚类 小波去噪超图卷积编码器 多传感器故障识别 无监督学习
在线阅读 下载PDF
基于可变形卷积和注意力机制的路面裂缝检测
19
作者 谢永华 方育才 彭银佳 《计算机工程与设计》 北大核心 2026年第1期279-285,共7页
为解决路面裂缝检测中图像边缘特征难以学习和背景噪声干扰的问题,提出一个基于可变形卷积和注意力机制的可端到端训练的路面裂缝检测网络。该网络基于U-Net结构设计,在特征融合部分添加边缘感知模块来增强裂缝边缘的检测能力;在编码器... 为解决路面裂缝检测中图像边缘特征难以学习和背景噪声干扰的问题,提出一个基于可变形卷积和注意力机制的可端到端训练的路面裂缝检测网络。该网络基于U-Net结构设计,在特征融合部分添加边缘感知模块来增强裂缝边缘的检测能力;在编码器部分使用空洞残差模块扩大感受野并保留更多细节信息;在解码器部分添加注意力机制提高对裂缝特征的关注度,抑制背景噪声。实验结果表明,该网络在MPA、mIoU和F1值这3项指标上均优于其它对比网络,验证了该网络的有效性。 展开更多
关键词 裂缝检测 语义分割 编码解码 可变形卷积 空洞卷积 残差连接 注意力机制
在线阅读 下载PDF
基于Diff-Cascade的低资源命名实体识别方法
20
作者 邱云飞 董丽波 张文文 《计算机科学与探索》 北大核心 2026年第2期533-545,共13页
在低资源命名实体识别(NER)任务中,目前许多基于迁移学习的方法虽然能够缓解数据稀缺问题,但往往会导致句子中部分正确信息的遗漏或识别错误,从而影响模型在低资源环境中的效果。针对此问题,提出了一种基于多模块协同的NER模型Diff-Casc... 在低资源命名实体识别(NER)任务中,目前许多基于迁移学习的方法虽然能够缓解数据稀缺问题,但往往会导致句子中部分正确信息的遗漏或识别错误,从而影响模型在低资源环境中的效果。针对此问题,提出了一种基于多模块协同的NER模型Diff-Cascade-NER。利用变分自编码器(VAE)在潜在空间中学习数据表示,并生成多样化的样本;将上下文信息、句法分析和VAE重构数据作为条件输入到条件编码器(CE)进行编码;将编码后的数据传递给级联扩散模型(CDM),通过多阶段的去噪和生成过程产生高质量样本;通过对抗学习阶段(AL)优化生成样本的质量和多样性。实验结果表明,对比现有模型,Diff-Cascade-NER在8个低资源数据集上表现优越,特别是在BC2GM和WNUT-16数据集上,F1值分别达到85.44%和56.38%,验证了各模块协同作用在低资源NER任务中的有效性。 展开更多
关键词 低资源命名实体识别 变分自编码器 条件编码器 级联扩散模型 对抗学习
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部