期刊文献+
共找到124篇文章
< 1 2 7 >
每页显示 20 50 100
Adaptive target and jamming recognition for the pulse doppler radar fuze based on a time-frequency joint feature and an online-updated naive bayesian classifier with minimal risk 被引量:9
1
作者 Jian Dai Xin-hong Hao +2 位作者 Ze Li Ping Li Xiao-peng Yan 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2022年第3期457-466,共10页
This paper considers the problem of target and jamming recognition for the pulse Doppler radar fuze(PDRF).To solve the problem,the matched filter outputs of the PDRF under the action of target and jamming are analyzed... This paper considers the problem of target and jamming recognition for the pulse Doppler radar fuze(PDRF).To solve the problem,the matched filter outputs of the PDRF under the action of target and jamming are analyzed.Then,the frequency entropy and peak-to-peak ratio are extracted from the matched filter output of the PDRF,and the time-frequency joint feature is constructed.Based on the time-frequency joint feature,the naive Bayesian classifier(NBC)with minimal risk is established for target and jamming recognition.To improve the adaptability of the proposed method in complex environments,an online update process that adaptively modifies the classifier in the duration of the work of the PDRF is proposed.The experiments show that the PDRF can maintain high recognition accuracy when the signal-to-noise ratio(SNR)decreases and the jamming-to-signal ratio(JSR)increases.Moreover,the applicable analysis shows that he ONBCMR method has low computational complexity and can fully meet the real-time requirements of PDRF. 展开更多
关键词 Pulse Doppler radar fuze(PDRF) Target and jamming recognition time-frequency joint feature Online-update naive Bayesian classifier minimal risk(ONBCMR)
在线阅读 下载PDF
Digital modulation classification using multi-layer perceptron and time-frequency features
2
作者 Yuan Ye Mei Wenbo 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2007年第2期249-254,共6页
Considering that real communication signals corrupted by noise are generally nonstationary, and timefrequency distributions are especially suitable for the analysis of nonstationary signals, time-frequency distributio... Considering that real communication signals corrupted by noise are generally nonstationary, and timefrequency distributions are especially suitable for the analysis of nonstationary signals, time-frequency distributions are introduced for the modulation classification of communication signals: The extracted time-frequency features have good classification information, and they are insensitive to signal to noise ratio (SNR) variation. According to good classification by the correct rate of a neural network classifier, a multilayer perceptron (MLP) classifier with better generalization, as well as, addition of time-frequency features set for classifying six different modulation types has been proposed. Computer simulations show that the MLP classifier outperforms the decision-theoretic classifier at low SNRs, and the classification experiments for real MPSK signals verify engineering significance of the MLP classifier. 展开更多
关键词 Digital modulation classification time-frequency feature time-frequency distribution Multi-layer perceptron.
在线阅读 下载PDF
IMTNet:Improved Multi-Task Copy-Move Forgery Detection Network with Feature Decoupling and Multi-Feature Pyramid
3
作者 Huan Wang Hong Wang +2 位作者 Zhongyuan Jiang Qing Qian Yong Long 《Computers, Materials & Continua》 SCIE EI 2024年第9期4603-4620,共18页
Copy-Move Forgery Detection(CMFD)is a technique that is designed to identify image tampering and locate suspicious areas.However,the practicality of the CMFD is impeded by the scarcity of datasets,inadequate quality a... Copy-Move Forgery Detection(CMFD)is a technique that is designed to identify image tampering and locate suspicious areas.However,the practicality of the CMFD is impeded by the scarcity of datasets,inadequate quality and quantity,and a narrow range of applicable tasks.These limitations significantly restrict the capacity and applicability of CMFD.To overcome the limitations of existing methods,a novel solution called IMTNet is proposed for CMFD by employing a feature decoupling approach.Firstly,this study formulates the objective task and network relationship as an optimization problem using transfer learning.Furthermore,it thoroughly discusses and analyzes the relationship between CMFD and deep network architecture by employing ResNet-50 during the optimization solving phase.Secondly,a quantitative comparison between fine-tuning and feature decoupling is conducted to evaluate the degree of similarity between the image classification and CMFD domains by the enhanced ResNet-50.Finally,suspicious regions are localized using a feature pyramid network with bottom-up path augmentation.Experimental results demonstrate that IMTNet achieves faster convergence,shorter training times,and favorable generalization performance compared to existingmethods.Moreover,it is shown that IMTNet significantly outperforms fine-tuning based approaches in terms of accuracy and F_(1). 展开更多
关键词 Image copy-move detection feature decoupling multi-scale feature pyramids passive forensics
在线阅读 下载PDF
A reliability-oriented genetic algorithm-levenberg marquardt model for leak risk assessment based on time-frequency features
4
作者 Ying-Ying Wang Hai-Bo Sun +4 位作者 Jin Yang Shi-De Wu Wen-Ming Wang Yu-Qi Li Ze-Qing Lin 《Petroleum Science》 SCIE EI CSCD 2023年第5期3194-3209,共16页
Since leaks in high-pressure pipelines transporting crude oil can cause severe economic losses,a reliable leak risk assessment can assist in developing an effective pipeline maintenance plan and avoiding unexpected in... Since leaks in high-pressure pipelines transporting crude oil can cause severe economic losses,a reliable leak risk assessment can assist in developing an effective pipeline maintenance plan and avoiding unexpected incidents.The fast and accurate leak detection methods are essential for maintaining pipeline safety in pipeline reliability engineering.Current oil pipeline leakage signals are insufficient for feature extraction,while the training time for traditional leakage prediction models is too long.A new leak detection method is proposed based on time-frequency features and the Genetic Algorithm-Levenberg Marquardt(GA-LM)classification model for predicting the leakage status of oil pipelines.The signal that has been processed is transformed to the time and frequency domain,allowing full expression of the original signal.The traditional Back Propagation(BP)neural network is optimized by the Genetic Algorithm(GA)and Levenberg Marquardt(LM)algorithms.The results show that the recognition effect of a combined feature parameter is superior to that of a single feature parameter.The Accuracy,Precision,Recall,and F1score of the GA-LM model is 95%,93.5%,96.7%,and 95.1%,respectively,which proves that the GA-LM model has a good predictive effect and excellent stability for positive and negative samples.The proposed GA-LM model can obviously reduce training time and improve recognition efficiency.In addition,considering that a large number of samples are required for model training,a wavelet threshold method is proposed to generate sample data with higher reliability.The research results can provide an effective theoretical and technical reference for the leakage risk assessment of the actual oil pipelines. 展开更多
关键词 Leak risk assessment Oil pipeline GA-LM model Data derivation time-frequency features
原文传递
Unsupervised subdomain contrastive adaptation for elevator fault diagnosis based on time-frequency feature attention mechanism segmentation
5
作者 Chenyu FENG Hao SUN +6 位作者 Pengcheng XIA Chengjin QIN Zhinan ZHANG Cheng HE Bin ZHENG Jiacheng JIANG Chengliang LIU 《Science China(Technological Sciences)》 2026年第2期302-323,共22页
Existing elevator fault diagnosis algorithms have limited engineering applicability due to variations in working conditions and differences in equipment structures.To address this limitation,this study proposes an uns... Existing elevator fault diagnosis algorithms have limited engineering applicability due to variations in working conditions and differences in equipment structures.To address this limitation,this study proposes an unsupervised subdomain adaptation method based on a time-frequency feature attention mechanism,LMMD-based subdomain alignment,and contrastive local alignment.This enables the application of the diagnosis model across different working conditions and equipment types.First,a novel time-frequency feature attention mechanism assigns weights to vibration signals of varying dimensions.Second,the time series is transformed to obtain a three-channel time-frequency diagram.This diagram is input into the proposed dimension-segmentation cross-channel multihead self-attention framework to extract high-dimensional frequencydomain fault features.These features are concatenated with the time-domain features to obtain a global feature representation.Then,the extracted high-dimensional features are sent to the classification module to obtain the predicted labels for the source and target domains.Finally,after confidence filtering,the true labels from the source domain and the prediction labels from the target domain are fed into a dynamically weighted multilevel feature alignment module to promote proximity between similar fault features across domains while enhancing separation among different fault types.The validity and superiority of the proposed method were demonstrated through simulation experiments conducted on two types of manned escalator systems under multiple working conditions.For the most challenging transfer task,the proposed method achieved higher accuracy on the target domain test set than DANN,ADDA,C-CLCN,TFA-CCN,and TFA-LCN by 26.87%,24.72%,11.44%,28.94%,and 16.85%,respectively. 展开更多
关键词 time-frequency feature attention mechanism unsupervised domain adaptation fault diagnosis transfer learning passenger elevator
原文传递
基于Decoupled-FR Net的伪造图像检测模型
6
作者 杨桃 张乾 +1 位作者 文露露 彭杉 《湖北民族大学学报(自然科学版)》 2026年第1期69-74,共6页
针对伪造图像检测中细粒度特征感知与跨区域依赖建模的挑战,提出解耦频率细化网络(decoupled frequency refinement network, Decoupled-FR Net)检测模型。该模型设计了空间-通道解耦注意力机制,有效避免了传统混合注意力中的特征耦合问... 针对伪造图像检测中细粒度特征感知与跨区域依赖建模的挑战,提出解耦频率细化网络(decoupled frequency refinement network, Decoupled-FR Net)检测模型。该模型设计了空间-通道解耦注意力机制,有效避免了传统混合注意力中的特征耦合问题,提升了特征表示的独立性与判别力;引入了特征细化模块,通过层级特征校准与融合功能增强对细微篡改的感知能力;结合上下文感知机制,捕捉跨区域的长距离依赖关系,从而提升整体检测性能。结果表明,Decoupled-FR Net模型在伪造合成(forensic synthetics, ForenSynths)数据集上的准确率、平均精确率分别比块间依赖网络(inter-patch dependency network, IPD-Net)模型提高了2.4、0.5个百分点,在生成式对抗网络(generative adversarial network, GAN)的生成图像检测(GAN generation detection, GANGen-Detection)数据集上的平均精确率比频域网络(frequency domain network, FreqNet)模型提高了0.1个百分点。该模型为细粒度伪造图像检测提供了新的解决方案,在多媒体取证领域具有重要的应用价值。 展开更多
关键词 空间-通道注意力解耦 特征细化 频域增强 上下文感知 跨模型伪造图像检测 生成式对抗网络
在线阅读 下载PDF
Feature Extraction and Recognition for Rolling Element Bearing Fault Utilizing Short-Time Fourier Transform and Non-negative Matrix Factorization 被引量:31
7
作者 GAO Huizhong LIANG Lin +1 位作者 CHEN Xiaoguang XU Guanghua 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2015年第1期96-105,共10页
Due to the non-stationary characteristics of vibration signals acquired from rolling element bearing fault, thc time-frequency analysis is often applied to describe the local information of these unstable signals smar... Due to the non-stationary characteristics of vibration signals acquired from rolling element bearing fault, thc time-frequency analysis is often applied to describe the local information of these unstable signals smartly. However, it is difficult to classitythe high dimensional feature matrix directly because of too large dimensions for many classifiers. This paper combines the concepts of time-frequency distribution(TFD) with non-negative matrix factorization(NMF), and proposes a novel TFD matrix factorization method to enhance representation and identification of bearing fault. Throughout this method, the TFD of a vibration signal is firstly accomplished to describe the localized faults with short-time Fourier transform(STFT). Then, the supervised NMF mapping is adopted to extract the fault features from TFD. Meanwhile, the fault samples can be clustered and recognized automatically by using the clustering property of NMF. The proposed method takes advantages of the NMF in the parts-based representation and the adaptive clustering. The localized fault features of interest can be extracted as well. To evaluate the performance of the proposed method, the 9 kinds of the bearing fault on a test bench is performed. The proposed method can effectively identify the fault severity and different fault types. Moreover, in comparison with the artificial neural network(ANN), NMF yields 99.3% mean accuracy which is much superior to ANN. This research presents a simple and practical resolution for the fault diagnosis problem of rolling element bearing in high dimensional feature space. 展开更多
关键词 time-frequency distribution non-negative matrix factorization rolling element bearing feature extraction
在线阅读 下载PDF
Radar Signal Intra-Pulse Feature Extraction Based on Improved Wavelet Transform Algorithm 被引量:2
8
作者 Wenxu Zhang Fuli Sun Bing Wang 《International Journal of Communications, Network and System Sciences》 2017年第8期118-127,共10页
With the new system radar put into practical use, the characteristics of complex radar signals are changing and developing. The traditional analysis method of one-dimensional transformation domain is no longer applica... With the new system radar put into practical use, the characteristics of complex radar signals are changing and developing. The traditional analysis method of one-dimensional transformation domain is no longer applicable to the modern radar signal processing, and it is necessary to seek new methods in the two-dimensional transformation domain. The time-frequency analysis method is the most widely used method in the two-dimensional transformation domain. In this paper, two typical time-frequency analysis methods of short-time Fourier transform and Wigner-Ville distribution are studied by analyzing the time-frequency transform of typical radar reconnaissance linear frequency modulation signal, aiming at the problem of low accuracy and sen-sitivity to the signal noise of common methods, the improved wavelet transform algorithm was proposed. 展开更多
关键词 Intra-Pulse feature Extraction time-frequency Analysis Short-Time FOURIER TRANSFORM Wigner-Ville Distribution WAVELET TRANSFORM
在线阅读 下载PDF
Jamming Recognition Based on Feature Fusion and Convolutional Neural Network
9
作者 Sitian Liu Chunli Zhu 《Journal of Beijing Institute of Technology》 EI CAS 2022年第2期169-177,共9页
The complicated electromagnetic environment of the BeiDou satellites introduces vari-ous types of external jamming to communication links,in which recognition of jamming signals with uncertainties is essential.In this... The complicated electromagnetic environment of the BeiDou satellites introduces vari-ous types of external jamming to communication links,in which recognition of jamming signals with uncertainties is essential.In this work,the jamming recognition framework proposed consists of fea-ture fusion and a convolutional neural network(CNN).Firstly,the recognition inputs are obtained by prepossessing procedure,in which the 1-D power spectrum and 2-D time-frequency image are ac-cessed through the Welch algorithm and short-time Fourier transform(STFT),respectively.Then,the 1D-CNN and residual neural network(ResNet)are introduced to extract the deep features of the two prepossessing inputs,respectively.Finally,the two deep features are concatenated for the following three fully connected layers and output the jamming signal classification results through the softmax layer.Results show the proposed method could reduce the impacts of potential feature loss,therefore improving the generalization ability on dealing with uncertainties. 展开更多
关键词 time-frequency image feature power spectrum feature convolutional neural network feature fusion jamming recognition
在线阅读 下载PDF
基于时频特征解耦的水声目标被动识别
10
作者 潘光麟 任佳威 +1 位作者 王文超 赵庆卫 《声学学报》 北大核心 2026年第1期132-144,共13页
为充分挖掘舰船辐射噪声中的判别性信息,提升水声目标被动识别的鲁棒性,提出了一种基于时频特征解耦的水声目标识别方法。首先使用基于多尺度策略的两阶段线谱调制分解算法,在时频域实现接收信号中线谱与调制信息的高效分离;然后利用基... 为充分挖掘舰船辐射噪声中的判别性信息,提升水声目标被动识别的鲁棒性,提出了一种基于时频特征解耦的水声目标识别方法。首先使用基于多尺度策略的两阶段线谱调制分解算法,在时频域实现接收信号中线谱与调制信息的高效分离;然后利用基于双分支网络结构的时频解耦融合模块对分离信号的时频特征进行自适应融合,以增强时频特征的判别性。所提方法在信号预处理与神经网络设计过程中对舰船辐射噪声的时频特征分布特性进行建模,降低了复杂海洋环境干扰对识别模型的影响。在公开数据集DeepShip和黄海海域实测数据上采用多种时频特征提取方式进行了系统性实验验证。实验结果表明,所提方法在两个数据集上分别取得了80.02%和97.81%的高识别准确率,较现有方法有显著提升,验证了所提方法在实际应用场景中的有效性。 展开更多
关键词 水声目标识别 舰船辐射噪声 自适应特征融合 时频特征解耦
原文传递
LFSepNet:融合Transformer的照明和面部特征解耦人脸识别方法
11
作者 黎克迅 高治军 《计算机工程与应用》 北大核心 2026年第4期201-209,共9页
在低光环境下,人脸识别面临图像质量低、特征模糊等诸多挑战,导致现有方法难以提取鲁棒且辨识度高的特征,从而严重影响识别性能。为应对这一问题,提出了一种新颖的非成对低光人脸识别模型LFSepNet(low-light face separation network)... 在低光环境下,人脸识别面临图像质量低、特征模糊等诸多挑战,导致现有方法难以提取鲁棒且辨识度高的特征,从而严重影响识别性能。为应对这一问题,提出了一种新颖的非成对低光人脸识别模型LFSepNet(low-light face separation network)。与传统基于卷积神经网络(convolutional neural network,CNN)架构的训练方法不同,LFSepNet采用Transformer架构,更有效地捕捉长距离依赖关系,从而克服卷积神经网络在局部感受野上的限制。由于低光环境下的人脸图像往往整体偏暗,仅有少数区域可能包含较丰富的照明信息,传统CNN在特征提取时容易受限于局部区域,难以充分利用这些关键信息。相比之下,Transformer通过自注意力机制实现全局信息建模,使网络能够更全面地整合亮度不均的人脸图像信息,从而提升特征解耦的效果和低光人脸识别的准确性。LFSepNet模型包含自适应亮度分离模块和自适应照明间隙损失,通过动态分离人脸与光照特征,减少光照干扰,同时进一步优化特征分离效果,使模型能够提取更加精确和鲁棒的特征。实验结果表明,LFSepNet在多个低光人脸数据集上的性能均优于现有方法,特别是在极端低光条件下,其识别精度显著提升。该研究为低光人脸识别提供了基于非成对设置的有效解决方案,并在实际应用中展现了良好的潜力。 展开更多
关键词 低光人脸识别 深度学习 TRANSFORMER 特征解耦 卷积神经网络(CNN) LFSepNet
在线阅读 下载PDF
条件感知跨模态相似度模型的设计与分析
12
作者 钱新桥 《现代信息科技》 2026年第4期39-43,共5页
跨模态相似度计算是多模态学习的核心任务之一。在多模态图像相似度计算方面,现有方法普遍采用文本与图像静态映射的机制,难以根据用户指定的语义条件动态调整特征相似度权重。针对这一难题,文章提出了一种条件感知跨模态相似度模型(Con... 跨模态相似度计算是多模态学习的核心任务之一。在多模态图像相似度计算方面,现有方法普遍采用文本与图像静态映射的机制,难以根据用户指定的语义条件动态调整特征相似度权重。针对这一难题,文章提出了一种条件感知跨模态相似度模型(Condition-Aware Cross-Modal Similarity,CACMS),该模型能够根据不同语义动态调整图像特征值,从而实现条件控制的图像相似度计算。该模型的核心创新点有两点。首先,设计了动态门控融合模块,将条件解耦为共享属性(如“姿态”)与独有属性(如“类别”),进而生成基于条件的特征门控向量;其次,提出了对抗性解耦对比学习策略,进一步优化特征空间的动态重组。 展开更多
关键词 跨模态学习 条件感知 动态图像相似度 特征解耦 对比学习
在线阅读 下载PDF
Underground Pipeline Surveillance with an Algorithm Based on Statistical Time-Frequency Acoustic Features
13
作者 Tianlei Wang Jiuwen Cao +1 位作者 Ru Xu Jianzhong Wang 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2022年第2期358-371,共14页
Underground pipeline networks suffer from severe damage by earth-moving devices due to rapid urbanization.Thus,designing a round-the-clock intelligent surveillance system has become crucial and urgent.In this study,we... Underground pipeline networks suffer from severe damage by earth-moving devices due to rapid urbanization.Thus,designing a round-the-clock intelligent surveillance system has become crucial and urgent.In this study,we develop an acoustic signal-based excavation device recognition system for underground pipeline protection.The front-end hardware system is equipped with an acoustic sensor array,an Analog-to-Digital Converter(ADC)module(ADS1274),and an industrial processor Advanced RISC Machine(ARM)cortex-A8 for signal collection and algorithm implementation.Then,a novel Statistical Time-Frequency acoustic Feature(STFF)is proposed,and a fast Extreme Learning Machine(ELM)is adopted as the classifier.Experiments on real recorded data show that the proposed STFF achieves better discriminative capability than the conventional acoustic cepstrum features.In addition,the surveillance platform is applicable for encountering big data owing to the fast learning speed of ELM. 展开更多
关键词 underground pipeline surveillance time-frequency feature excavation device recognition Extreme Learning Machine(ELM)
原文传递
Spatiotemporal emotion recognition based on 3D time-frequency domain feature matrix
14
作者 Chao Hao Lian Weifang Liu Yongli 《The Journal of China Universities of Posts and Telecommunications》 EI CSCD 2022年第5期62-72,共11页
The research of emotion recognition based on electroencephalogram(EEG)signals often ignores the related information between the brain electrode channels and the contextual emotional information existing in EEG signals... The research of emotion recognition based on electroencephalogram(EEG)signals often ignores the related information between the brain electrode channels and the contextual emotional information existing in EEG signals,which may contain important characteristics related to emotional states.Aiming at the above defects,a spatiotemporal emotion recognition method based on a 3-dimensional(3 D)time-frequency domain feature matrix was proposed.Specifically,the extracted time-frequency domain EEG features are first expressed as a 3 D matrix format according to the actual position of the cerebral cortex.Then,the input 3 D matrix is processed successively by multivariate convolutional neural network(MVCNN)and long short-term memory(LSTM)to classify the emotional state.Spatiotemporal emotion recognition method is evaluated on the DEAP data set,and achieved accuracy of 87.58%and 88.50%on arousal and valence dimensions respectively in binary classification tasks,as well as obtained accuracy of 84.58%in four class classification tasks.The experimental results show that 3 D matrix representation can represent emotional information more reasonably than two-dimensional(2 D).In addition,MVCNN and LSTM can utilize the spatial information of the electrode channels and the temporal context information of the EEG signal respectively. 展开更多
关键词 spatiotemporal emotion recognition model 3-dimensinal(3D)feature matrix time-frequency features multivariate convolutional neural network(MVCNN) long short-term memory(LSTM)
原文传递
基于跨模态特征重构与解耦网络的多模态抑郁症检测方法 被引量:1
15
作者 赵小明 谌自强 张石清 《计算机应用研究》 北大核心 2025年第1期236-241,共6页
抑郁症是一种广泛而严重的心理健康障碍,需要早期检测以便进行有效的干预。因为跨模态之间存在的信息冗余和模态间的异质性,集成音频和文本模态的自动化抑郁症检测是一个具有挑战性但重要的问题,先前的研究通常未能充分地明确学习音频-... 抑郁症是一种广泛而严重的心理健康障碍,需要早期检测以便进行有效的干预。因为跨模态之间存在的信息冗余和模态间的异质性,集成音频和文本模态的自动化抑郁症检测是一个具有挑战性但重要的问题,先前的研究通常未能充分地明确学习音频-文本模态的相互作用以用于抑郁症检测。为了解决这些问题,提出了基于跨模态特征重构与解耦网络的多模态抑郁症检测方法(CFRDN)。该方法以文本作为核心模态,引导模型重构音频特征用于跨模态特征解耦任务。该框架旨在从文本引导重构的音频特征中解离共享和私有特征,以供后续的多模态融合使用。在DAIC-WoZ和E-DAIC数据集上进行了充分的实验,结果显示所提方法在多模态抑郁症检测任务上优于现有技术。 展开更多
关键词 多模态 抑郁症检测 特征重构 特征解耦 特征融合
在线阅读 下载PDF
任务自适应增强的人机特征解耦可分级压缩 被引量:1
16
作者 安平 沙莉娅 +2 位作者 吴颖 杨超 黄新彭 《信号处理》 北大核心 2025年第2期399-408,共10页
图像压缩作为一项关键技术,旨在传输过程中保留尽可能少的关键信息,同时使得压缩后的图像保持较好的质量。而随着计算机视觉的发展,图像的主要消费者不仅仅是人类而更多的是机器,因此探索一种能够同时面向人类视觉和机器视觉的图像压缩... 图像压缩作为一项关键技术,旨在传输过程中保留尽可能少的关键信息,同时使得压缩后的图像保持较好的质量。而随着计算机视觉的发展,图像的主要消费者不仅仅是人类而更多的是机器,因此探索一种能够同时面向人类视觉和机器视觉的图像压缩方法十分具有意义。然而,现有的基于学习的图像编码技术虽然已经在人眼感知质量上取得了显著性的进步,但由于信号保真度及语义保真度的方法在驱动目标上存在分歧,无法同时满足机器视觉和人眼的需求。因此,本文提出了任务自适应增强的特征解耦可分级压缩方法,旨在利用单一比特流来支持多种视觉任务,并根据需求进行图像的选择性重建或完全重建。具体而言,本方法将图像特征解耦为目标特征和背景特征分别进行压缩和重建,所得到的目标图像用于后续目标检测和语义分割任务,而高质量完整重建的图像供人眼观看。这样不仅在实现视觉任务时避免了重建完整图像,提高压缩效率,还能够满足人眼的不同需求。此外,为了解决因目标区域重要性差异而引起的任务性能不平衡问题,本方法还设计了可插拔的任务自适应单元,并将其嵌入在目标特征解码器中,从而可以根据具体任务需求调整特征以增强重建目标图像的分析性能,而无须重新训练整个网络。实验结果证明,该方法与其他编解码器相比,展现出了更优的任务性能和速率失真(RateDistortion)性能。 展开更多
关键词 图像压缩 人机协同 特征解耦 任务自适应增强
在线阅读 下载PDF
多域时空层次图神经网络的空气质量预测 被引量:8
17
作者 马汉达 吴亚东 《计算机应用》 北大核心 2025年第2期444-452,共9页
在协同融合气象、空间和时间三大信息的时空混合模型中,时间变化建模通常在一维空间中完成。针对一维序列局限于滑动窗口和缺乏对多尺度特征的灵活提取的问题,提出一种多域时空层次图神经网络(MST-HGNN)模型。首先,构建城市全局尺度和... 在协同融合气象、空间和时间三大信息的时空混合模型中,时间变化建模通常在一维空间中完成。针对一维序列局限于滑动窗口和缺乏对多尺度特征的灵活提取的问题,提出一种多域时空层次图神经网络(MST-HGNN)模型。首先,构建城市全局尺度和站点局部尺度的两级层次图,从而进行空间关系学习;其次,将一维空气质量序列转换为一组基于多个周期的二维张量,并在二维空间上通过多尺度卷积进行周期解耦以捕获频域特征;同时,在一维空间中利用长短期记忆(LSTM)网络拟合时域特征;最后,为避免聚合冗余信息,设计一种门控机制融合模块用于频域和时域特征的多域特征融合。在Urban-Air数据集和长三角城市群数据集上的实验结果表明,相较于多视图多任务时空图卷积网络模型(M2),所提模型在预测第1 h、3 h、6 h、12 h空气质量的平均绝对误差(MAE)和均方根误差(RMSE)均低于对比模型。可见,MST-HGNN能在频域上解耦复杂时间模式,利用频域信息弥补时域特征建模的局限性,并结合时域信息更全面地预测空气质量变化。 展开更多
关键词 空气质量预测 多域特征融合 时空特征 周期解耦 门控机制融合 图神经网络
在线阅读 下载PDF
面向长尾异构数据的个性化联邦学习框架 被引量:1
18
作者 吴家皋 易婧 +1 位作者 周泽辉 刘林峰 《计算机科学》 北大核心 2025年第9期232-240,共9页
针对数据长尾分布和异构性引起的联邦学习模型性能下降的问题,提出了一种新的个性化联邦学习框架——平衡的个性化联邦学习(Balanced Personalized Federated Learning,BPFed),将整个联邦学习过程分为基于个性化联邦学习的表示学习和基... 针对数据长尾分布和异构性引起的联邦学习模型性能下降的问题,提出了一种新的个性化联邦学习框架——平衡的个性化联邦学习(Balanced Personalized Federated Learning,BPFed),将整个联邦学习过程分为基于个性化联邦学习的表示学习和基于全局特征增强的个性化分类器再训练两个阶段。在第一阶段,首先采用Mixup策略进行数据增强,然后提出基于参数解耦的个性化联邦学习特征提取器训练方法,在优化特征提取器性能的同时减少通信开销;在第二阶段,首先提出新的基于全局协方差矩阵的类级特征增强方法,然后提出基于样本权重的标签平滑损失函数对客户端分类器进行平衡的个性化再训练,以纠正头类置信过度并提高尾类的泛化能力。大量的实验结果表明,在不同的数据长尾分布和异构性设置下,BPFed模型的准确度相比其他代表性相关算法均有明显提升。此外,消融和超参数影响实验也进一步验证了所提方法和优化策略的有效性。 展开更多
关键词 个性化联邦学习 长尾分布 数据异构性 参数解耦 特征增强 优化策略
在线阅读 下载PDF
基于TLF-YOLOv8的堆叠垃圾实例分割算法 被引量:1
19
作者 李利 梁晶 +2 位作者 陈旭东 潘红光 寇发荣 《科学技术与工程》 北大核心 2025年第5期2009-2018,共10页
相较于一般场景下的图像实例分割,复杂堆叠场景下的实例分割受到严重遮挡、同类别待测物体堆叠等复杂情况的影响,使得其实例分割具有更大的难度。针对具有复杂堆叠场景下的垃圾实例分割问题,提出了一种融合YOLOv8与双层特征网络策略的... 相较于一般场景下的图像实例分割,复杂堆叠场景下的实例分割受到严重遮挡、同类别待测物体堆叠等复杂情况的影响,使得其实例分割具有更大的难度。针对具有复杂堆叠场景下的垃圾实例分割问题,提出了一种融合YOLOv8与双层特征网络策略的实例分割算法。首先,在数据预处理部分进行特征数据分层,并通过双层图卷积网络(graph convolutions network,GCN)实现双分支特征融合,减弱堆叠情况对被遮挡物体特征的影响,从而解决复杂堆叠遮挡下的实例分割问题。同时,为了解决同类待测物体易混淆的问题,融入了软阈值化非极大值抑制算法和新的交并比算法。最后,根据应用场景和数据集的复杂性,优化了主干网络部分的特征提取模块,并在主干网络部分引入了多尺度注意力机制,有效提高了模型的检测性能。实验使用遮挡垃圾分类实例分割数据集,实验结果表明该方法的平均准确率、交并比阈值为0.5时的平均准确率(AP_(50))、交并比为0.5~0.95时的平均准确率(AP_(50~95))等指标较之前的其他方法更优。相较于原YOLOv8算法,检测AP_(50)提高了7.9%,分割AP_(50)提高了5.4%,具有更好的检测和分割效果。 展开更多
关键词 垃圾堆叠 双层特征解耦融合 YOLOv8算法 软阈值化非极大值抑制 动态非单调聚焦机制 期望最大化注意力
在线阅读 下载PDF
分层蒸馏解耦网络的低分辨率人脸识别算法 被引量:1
20
作者 钟锐 宋亚锋 周晓康 《计算机应用研究》 北大核心 2025年第6期1900-1908,共9页
低分辨率人脸图像中大量面部细节特征丢失,使得目前许多具有较好性能的经典人脸识别模型的识别率急剧降低。针对该问题,提出了一种分层蒸馏解耦网络(hierarchical knowledge distillation decoupling,HKDD)。首先,为了提升学生网络对低... 低分辨率人脸图像中大量面部细节特征丢失,使得目前许多具有较好性能的经典人脸识别模型的识别率急剧降低。针对该问题,提出了一种分层蒸馏解耦网络(hierarchical knowledge distillation decoupling,HKDD)。首先,为了提升学生网络对低分辨率样本的特征描述能力,在教师网络与学生网络的卷积层之间进行分层特征蒸馏,使学生网络各中间层所提取的低分辨率人脸特征尽可能接近教师网络中间层所提取的高分辨率人脸特征,从而将教师网络各中间层强大的特征描述能力蒸馏至学生网络。随后,在教师网络与学生网络的softmax层之间进行解耦蒸馏,把softmax层的蒸馏损失解耦为目标类蒸馏损失和非目标类蒸馏损失,以发挥出被抑制的非目标类蒸馏损失对学生网络训练的指导作用,使学生网络在教师网络指导下学习到通用性面部特征的分类能力,从而确保学生网络能够在非限制性应用场景中具有较强的分类能力。最后,在TinyFace和QMUL-SurvFace等多个低分辨率人脸数据集中进行了效果验证,HKDD网络的识别率与实时性都优于其他代表性的低分辨率人脸识别模型,实验结果验证了该模型在低分辨率人脸识别任务中的有效性。 展开更多
关键词 低分辨率人脸识别 分层蒸馏解耦网络 分层特征蒸馏 解耦蒸馏 非限制性场景
在线阅读 下载PDF
上一页 1 2 7 下一页 到第
使用帮助 返回顶部