期刊文献+
共找到93篇文章
< 1 2 5 >
每页显示 20 50 100
Unsupervised vehicle re-identification via meta-type generalization
1
作者 HUANG Chengti ZHANG Xiaoxiang +1 位作者 ZHAO Qianqian ZHU Jianqing 《High Technology Letters》 2025年第1期32-40,共9页
Unsupervised vehicle re-identification(Re-ID)methods have garnered widespread attention due to their potential in real-world traffic monitoring.However,existing unsupervised domain adaptation techniques often rely on ... Unsupervised vehicle re-identification(Re-ID)methods have garnered widespread attention due to their potential in real-world traffic monitoring.However,existing unsupervised domain adaptation techniques often rely on pseudo-labels generated from the source domain,which struggle to effectively address the diversity and dynamic nature of real-world scenarios.Given the limited variety of common vehicle types,enhancing the model’s generalization capability across these types is crucial.To this end,an innovative approach called meta-type generalization(MTG)is proposed.By dividing the training data into meta-train and meta-test sets based on vehicle type information,a novel gradient interaction computation strategy is designed to enhance the model’s ability to learn typeinvariant features.Integrated into the ResNet50 backbone,the MTG model achieves improvements of 4.50%and 12.04%on the Veri-776 and VRAI datasets,respectively,compared with traditional unsupervised algorithms,and surpasses current state-of-the-art methods.This achievement holds promise for application in intelligent traffic systems,enabling more efficient urban traffic solutions. 展开更多
关键词 deep learning unsupervised vehicle re-identification(re-id) META-LEARNING
在线阅读 下载PDF
Exploring Frontier Technologies in Video-Based Person Re-Identification:A Survey on Deep Learning Approach
2
作者 Jiahe Wang Xizhan Gao +1 位作者 Fa Zhu Xingchi Chen 《Computers, Materials & Continua》 SCIE EI 2024年第10期25-51,共27页
Video-based person re-identification(Re-ID),a subset of retrieval tasks,faces challenges like uncoordinated sample capturing,viewpoint variations,occlusions,cluttered backgrounds,and sequence uncertainties.Recent adva... Video-based person re-identification(Re-ID),a subset of retrieval tasks,faces challenges like uncoordinated sample capturing,viewpoint variations,occlusions,cluttered backgrounds,and sequence uncertainties.Recent advancements in deep learning have significantly improved video-based person Re-ID,laying a solid foundation for further progress in the field.In order to enrich researchers’insights into the latest research findings and prospective developments,we offer an extensive overview and meticulous analysis of contemporary video-based person ReID methodologies,with a specific emphasis on network architecture design and loss function design.Firstly,we introduce methods based on network architecture design and loss function design from multiple perspectives,and analyzes the advantages and disadvantages of these methods.Furthermore,we provide a synthesis of prevalent datasets and key evaluation metrics utilized within this field to assist researchers in assessing methodological efficacy and establishing benchmarks for performance evaluation.Lastly,through a critical evaluation of the experimental outcomes derived from various methodologies across four prominent public datasets,we identify promising research avenues and offer valuable insights to steer future exploration and innovation in this vibrant and evolving field of video-based person Re-ID.This comprehensive analysis aims to equip researchers with the necessary knowledge and strategic foresight to navigate the complexities of video-based person Re-ID,fostering continued progress and breakthroughs in this challenging yet promising research domain. 展开更多
关键词 Video-based person re-id deep learning survey of video re-id loss function
在线阅读 下载PDF
Learning Deep RGBT Representations for Robust Person Re-identification 被引量:2
3
作者 Ai-Hua Zheng Zi-Han Chen +2 位作者 Cheng-Long Li Jin Tang Bin Luo 《International Journal of Automation and computing》 EI CSCD 2021年第3期443-456,共14页
Person re-identification(Re-ID)is the scientific task of finding specific person images of a person in a non-overlapping camera networks,and has achieved many breakthroughs recently.However,it remains very challenging... Person re-identification(Re-ID)is the scientific task of finding specific person images of a person in a non-overlapping camera networks,and has achieved many breakthroughs recently.However,it remains very challenging in adverse environmental conditions,especially in dark areas or at nighttime due to the imaging limitations of a single visible light source.To handle this problem,we propose a novel deep red green blue(RGB)-thermal(RGBT)representation learning framework for a single modality RGB person ReID.Due to the lack of thermal data in prevalent RGB Re-ID datasets,we propose to use the generative adversarial network to translate labeled RGB images of person to thermal infrared ones,trained on existing RGBT datasets.The labeled RGB images and the synthetic thermal images make up a labeled RGBT training set,and we propose a cross-modal attention network to learn effective RGBT representations for person Re-ID in day and night by leveraging the complementary advantages of RGB and thermal modalities.Extensive experiments on Market1501,CUHK03 and Duke MTMC-re ID datasets demonstrate the effectiveness of our method,which achieves stateof-the-art performance on all above person Re-ID datasets. 展开更多
关键词 Person re-identification(re-id) thermal infrared generative networks ATTENTION deep learning
原文传递
Attributes-based person re-identification via CNNs with coupled clusters loss 被引量:1
4
作者 SUN Rui HUANG Qiheng +1 位作者 FANGWei ZHANG Xudong 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2020年第1期45-55,共11页
Person re-identification(re-id)involves matching a person across nonoverlapping views,with different poses,illuminations and conditions.Visual attributes are understandable semantic information to help improve the iss... Person re-identification(re-id)involves matching a person across nonoverlapping views,with different poses,illuminations and conditions.Visual attributes are understandable semantic information to help improve the issues including illumination changes,viewpoint variations and occlusions.This paper proposes an end-to-end framework of deep learning for attribute-based person re-id.In the feature representation stage of framework,the improved convolutional neural network(CNN)model is designed to leverage the information contained in automatically detected attributes and learned low-dimensional CNN features.Moreover,an attribute classifier is trained on separate data and includes its responses into the training process of our person re-id model.The coupled clusters loss function is used in the training stage of the framework,which enhances the discriminability of both types of features.The combined features are mapped into the Euclidean space.The L2 distance can be used to calculate the distance between any two pedestrians to determine whether they are the same.Extensive experiments validate the superiority and advantages of our proposed framework over state-of-the-art competitors on contemporary challenging person re-id datasets. 展开更多
关键词 person re-identification(re-id) convolutions neural network(CNN) attributes coupled clusters loss(CCL)
在线阅读 下载PDF
Integrating Coarse Granularity Part-Level Features with Supervised Global-Level Features for Person Re-Identification 被引量:1
5
作者 CAO Jiahao MAO Xiaofei +2 位作者 LI Dongfang ZHENG Qingfang JIA Xia 《ZTE Communications》 2021年第1期72-81,共10页
Person re-identification(Re-ID)has achieved great progress in recent years.However,person Re-ID methods are still suffering from body part missing and occlusion problems,which makes the learned representations less re... Person re-identification(Re-ID)has achieved great progress in recent years.However,person Re-ID methods are still suffering from body part missing and occlusion problems,which makes the learned representations less reliable.In this paper,we pro⁃pose a robust coarse granularity part-level network(CGPN)for person Re-ID,which ex⁃tracts robust regional features and integrates supervised global features for pedestrian im⁃ages.CGPN gains two-fold benefit toward higher accuracy for person Re-ID.On one hand,CGPN learns to extract effective regional features for pedestrian images.On the other hand,compared with extracting global features directly by backbone network,CGPN learns to extract more accurate global features with a supervision strategy.The single mod⁃el trained on three Re-ID datasets achieves state-of-the-art performances.Especially on CUHK03,the most challenging Re-ID dataset,we obtain a top result of Rank-1/mean av⁃erage precision(mAP)=87.1%/83.6%without re-ranking. 展开更多
关键词 person re-id SUPERVISION coarse granularity
在线阅读 下载PDF
Feature mapping space and sample determination for person re-identification
6
作者 HOU Wei HU Zhentao +1 位作者 LIU Xianxing SHI Changsen 《High Technology Letters》 EI CAS 2022年第3期237-246,共10页
Person re-identification(Re-ID) is integral to intelligent monitoring systems.However,due to the variability in viewing angles and illumination,it is easy to cause visual ambiguities,affecting the accuracy of person r... Person re-identification(Re-ID) is integral to intelligent monitoring systems.However,due to the variability in viewing angles and illumination,it is easy to cause visual ambiguities,affecting the accuracy of person re-identification.An approach for person re-identification based on feature mapping space and sample determination is proposed.At first,a weight fusion model,including mean and maximum value of the horizontal occurrence in local features,is introduced into the mapping space to optimize local features.Then,the Gaussian distribution model with hierarchical mean and covariance of pixel features is introduced to enhance feature expression.Finally,considering the influence of the size of samples on metric learning performance,the appropriate metric learning is selected by sample determination method to further improve the performance of person re-identification.Experimental results on the VIPeR,PRID450 S and CUHK01 datasets demonstrate that the proposed method is better than the traditional methods. 展开更多
关键词 person re-identification(re-id) mapping space feature optimization sample determination
在线阅读 下载PDF
融合多元注意力与高效特征聚合的行人重识别方法
7
作者 程杰 卞长智 +1 位作者 张婧 李小霞 《计算机工程与应用》 北大核心 2026年第2期220-231,共12页
针对行人重识别任务中视角、光照以及姿态变化等导致行人特征提取不充分的问题,设计了一种融合多元注意力与高效特征聚合的行人重识别网络。在ResNet50的瓶颈模块中嵌入无参通道注意力,利用均值与方差信息对通道权重进行调整,以增强对... 针对行人重识别任务中视角、光照以及姿态变化等导致行人特征提取不充分的问题,设计了一种融合多元注意力与高效特征聚合的行人重识别网络。在ResNet50的瓶颈模块中嵌入无参通道注意力,利用均值与方差信息对通道权重进行调整,以增强对关键通道的作用。仅在ResNet50的深层网络中引入自注意力机制,通过计算特征点之间的相关性来提高捕捉全局信息的能力,同时设置不同下采样率的双分支,以捕获不同分辨率下的行人特征信息。设计高效特征聚合金字塔,通过自上而下和自下而上的双路径实现深层语义和浅层的细节信息快速融合,同时在金字塔横向连接中加入通道与空间注意力残差融合模块以突出行人关键特征。使用双重批量归一化特征分类模块优化训练,提高网络稳定性。在Market1501、DukeMTMC-ReID和CUHK03数据集上,所提方法的mAP和Rank-1指标分别达到了89.5%和95.6%、80.1%和90.5%、79.7%和84.9%,实验结果表明,所提方法能够有效提高受视角、光照、姿态变化影响的行人重识别性能。 展开更多
关键词 行人重识别(re-id) ResNet50 注意力机制 高效特征聚合 金字塔横向连接
在线阅读 下载PDF
轻量级深度特征交互融合的车辆重识别网络研究
8
作者 徐岩 刘国荣 +3 位作者 张晓迪 崔海青 薛威海 朱国生 《计算机工程与应用》 北大核心 2026年第5期314-325,共12页
车辆重识别要求模型既关注车辆的整体轮廓,又关注车辆在不同阶段的微妙局部细节,在更深层次上提取区别特征。为解决上述问题,构建了一个具有轻量级大感受野的金字塔分支,在仅引入少于0.84×10^(6)个额外参数的同时,显著提高了骨干... 车辆重识别要求模型既关注车辆的整体轮廓,又关注车辆在不同阶段的微妙局部细节,在更深层次上提取区别特征。为解决上述问题,构建了一个具有轻量级大感受野的金字塔分支,在仅引入少于0.84×10^(6)个额外参数的同时,显著提高了骨干网络的性能,可使模型专注于网络深层的全局纹理。为了使金字塔分支学习有效的特征表示,提出了骨干引导融合(backbone guided fusion,BGF)模块,可将金字塔分支特征与骨干特征进行自适应融合,以帮助金字塔分支学习到有效信息。此外,采用了图像去模糊(image deblurring,ID)技术对输入特征进行预处理,并结合并行注意力机制来加强对特征细节的关注。在Veri-776和VehicleID数据集上进行的实验表明,所提出的轻量化方法有效提高了车辆重识别的准确性和泛化能力。 展开更多
关键词 车辆重识别(Vehicle re-id) 图像修复 轻量级特征金字塔分支 分支融合
在线阅读 下载PDF
融合空频信息的多粒度师生网络无监督行人重识别
9
作者 陈玉敏 车进 杨莹莹 《计算机工程》 北大核心 2026年第1期217-227,共11页
无监督行人重识别旨在挖掘无标注数据的判别性表示用于行人检索任务。基于伪标签进行训练的无监督行人重识别方法目前已经取得了瞩目的进展。然而在训练过程中引入的噪声和信息利用不完全问题限制了该任务的进一步发展。提出一种融合浅... 无监督行人重识别旨在挖掘无标注数据的判别性表示用于行人检索任务。基于伪标签进行训练的无监督行人重识别方法目前已经取得了瞩目的进展。然而在训练过程中引入的噪声和信息利用不完全问题限制了该任务的进一步发展。提出一种融合浅层空频信息的多粒度师生网络。首先,同时考虑全局和局部特征并将其集成到聚类对比学习中,丰富特征表示,利用训练好的教师模型指导学生模型快速收敛,减少噪声伪标签的干扰;其次,设计一个新颖的空频信息交互模块,利用网络加深过程中丢失的浅层空间域、频域有用信息;此外,在学生网络的训练过程中采用一种重利用策略,将以往方法中直接丢弃的部分未聚类实例作为难样本重新利用。在Market1501、DukeMTMC-reID和MSMT173个大型数据集上的均值平均精度(mAP)结果分别达到87.5%、74.8%和41.9%,证明了该方法的优越性。 展开更多
关键词 无监督行人重识别 伪标签噪声 多粒度特征 师生网络 空频信息
在线阅读 下载PDF
Person Re-Identification with Effectively Designed Parts 被引量:2
10
作者 Yali Zhao Yali Li Shengjin Wang 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2020年第3期415-424,共10页
Person re-IDentification(re-ID) is an important research topic in the computer vision community, with significance for a range of applications. Pedestrians are well-structured objects that can be partitioned, although... Person re-IDentification(re-ID) is an important research topic in the computer vision community, with significance for a range of applications. Pedestrians are well-structured objects that can be partitioned, although detection errors cause slightly misaligned bounding boxes, which lead to mismatches. In this paper, we study the person re-identification performance of using variously designed pedestrian parts instead of the horizontal partitioning routine typically applied in previous hand-crafted part works, and thereby obtain more effective feature descriptors. Specifically, we benchmark the accuracy of individual part matching with discriminatively trained Convolutional Neural Network(CNN) descriptors on the Market-1501 dataset. We also investigate the complementarity among different parts using combination and ablation studies, and provide novel insights into this issue. Compared with the state-of-the-art, our method yields a competitive accuracy rate when the best part combination is used on two large-scale datasets(Market-1501 and CUHK03) and one small-scale dataset(VIPeR). 展开更多
关键词 person re-identification(re-id) Convolutional Neural Network(CNN) part model
原文传递
SAM-drivenMAE pre-training and background-awaremeta-learning for unsupervised vehicle re-identification 被引量:1
11
作者 Dong Wang Qi Wang +4 位作者 Weidong Min Di Gai Qing Han Longfei Li Yuhan Geng 《Computational Visual Media》 SCIE EI CSCD 2024年第4期771-789,共19页
Distinguishing identity-unrelated background information from discriminative identity information poses a challenge in unsupervised vehicle re-identification(Re-ID).Re-ID models suffer from varying degrees of backgrou... Distinguishing identity-unrelated background information from discriminative identity information poses a challenge in unsupervised vehicle re-identification(Re-ID).Re-ID models suffer from varying degrees of background interference caused by continuous scene variations.The recently proposed segment anything model(SAM)has demonstrated exceptional performance in zero-shot segmentation tasks.The combination of SAM and vehicle Re-ID models can achieve efficient separation of vehicle identity and background information.This paper proposes a method that combines SAM-driven mask autoencoder(MAE)pre-training and backgroundaware meta-learning for unsupervised vehicle Re-ID.The method consists of three sub-modules.First,the segmentation capacity of SAM is utilized to separate the vehicle identity region from the background.SAM cannot be robustly employed in exceptional situations,such as those with ambiguity or occlusion.Thus,in vehicle Re-ID downstream tasks,a spatiallyconstrained vehicle background segmentation method is presented to obtain accurate background segmentation results.Second,SAM-driven MAE pre-training utilizes the aforementioned segmentation results to select patches belonging to the vehicle and to mask other patches,allowing MAE to learn identity-sensitive features in a self-supervised manner.Finally,we present a background-aware meta-learning method to fit varying degrees of background interference in different scenarios by combining different background region ratios.Our experiments demonstrate that the proposed method has state-of-the-art performance in reducing background interference variations. 展开更多
关键词 UNSUPERVISED re-identification(re-id) vehicles segmentation autoencoder META-LEARNING
原文传递
An efficient deep learning-assisted person re-identification solution for intelligent video surveillance in smart cities 被引量:1
12
作者 Muazzam MAQSOOD Sadaf YASMIN +3 位作者 Saira GILLANI Maryam BUKHARI Seungmin RHO Sang-Soo YEO 《Frontiers of Computer Science》 SCIE EI CSCD 2023年第4期83-96,共14页
Innovations on the Internet of Everything(IoE)enabled systems are driving a change in the settings where we interact in smart units,recognized globally as smart city environments.However,intelligent video-surveillance... Innovations on the Internet of Everything(IoE)enabled systems are driving a change in the settings where we interact in smart units,recognized globally as smart city environments.However,intelligent video-surveillance systems are critical to increasing the security of these smart cities.More precisely,in today’s world of smart video surveillance,person re-identification(Re-ID)has gained increased consideration by researchers.Various researchers have designed deep learningbased algorithms for person Re-ID because they have achieved substantial breakthroughs in computer vision problems.In this line of research,we designed an adaptive feature refinementbased deep learning architecture to conduct person Re-ID.In the proposed architecture,the inter-channel and inter-spatial relationship of features between the images of the same individual taken from nonidentical camera viewpoints are focused on learning spatial and channel attention.In addition,the spatial pyramid pooling layer is inserted to extract the multiscale and fixed-dimension feature vectors irrespective of the size of the feature maps.Furthermore,the model’s effectiveness is validated on the CUHK01 and CUHK02 datasets.When compared with existing approaches,the approach presented in this paper achieves encouraging Rank 1 and 5 scores of 24.6% and 54.8%,respectively. 展开更多
关键词 Internet of Everything(IoE) visual surveillance systems big data security systems person re-identification(re-id) deep learning
原文传递
Learning convolutional multi-level transformers for image-based person re-identification 被引量:2
13
作者 Peilei Yan Xuehu Liu +1 位作者 Pingping Zhang Huchuan Lu 《Visual Intelligence》 2023年第1期84-95,共12页
As a vital vision task,person re-identification(Re-ID)aims to retrieve the same person under non-overlapping cameras.It is a very challenging task due to the presence of complex backgrounds,diverse illuminations and d... As a vital vision task,person re-identification(Re-ID)aims to retrieve the same person under non-overlapping cameras.It is a very challenging task due to the presence of complex backgrounds,diverse illuminations and different perspectives.In this work,we integrate the advantages of convolutional neural networks(CNNs)and transformers,and propose a novel learning framework named convolutional multi-level transformer(CMT)for image-based person Re-ID.More specifically,wefirst propose a scale-aware feature enhancement(SFE)module to extract multi-scale local features from a pre-trained CNN backbone.Then,we introduce a part-aware transformer encoder(PTE)to further mine discriminative local information guided by global semantics.Finally,a deeply-supervised learning(DSL)technique is adopted to optimize the proposed CMT and improve its training efficiency.Extensive experiments on four large-scale Re-ID benchmarks demonstrate that our method performs favorably against several state-of-the-art methods. 展开更多
关键词 Person re-identification(re-id) Vision transformer Global-local features Deeply-supervised learning(DSL)
在线阅读 下载PDF
Joint training with local soft attention and dual cross-neighbor label smoothing for unsupervised person re-identification
14
作者 Qing Han Longfei Li +4 位作者 Weidong Min Qi Wang Qingpeng Zeng Shimiao Cui Jiongjin Chen 《Computational Visual Media》 SCIE EI CSCD 2024年第3期543-558,共16页
Existing unsupervised person re-identification approaches fail to fully capture thefine-grained features of local regions,which can result in people with similar appearances and different identities being assigned the... Existing unsupervised person re-identification approaches fail to fully capture thefine-grained features of local regions,which can result in people with similar appearances and different identities being assigned the same label after clustering.The identity-independent information contained in different local regions leads to different levels of local noise.To address these challenges,joint training with local soft attention and dual cross-neighbor label smoothing(DCLS)is proposed in this study.First,the joint training is divided into global and local parts,whereby a soft attention mechanism is proposed for the local branch to accurately capture the subtle differences in local regions,which improves the ability of the re-identification model in identifying a person’s local significant features.Second,DCLS is designed to progressively mitigate label noise in different local regions.The DCLS uses global and local similarity metrics to semantically align the global and local regions of the person and further determines the proximity association between local regions through the cross information of neighboring regions,thereby achieving label smoothing of the global and local regions throughout the training process.In extensive experiments,the proposed method outperformed existing methods under unsupervised settings on several standard person re-identification datasets. 展开更多
关键词 person re-identification(re-id) unsupervised learning(USL) local soft attention joint training dual cross-neighbor label smoothing(DCLS)
原文传递
Visible-infrared person re-identification via specific and shared representations learning
15
作者 Aihua Zheng Juncong Liu +3 位作者 Zi Wang Lili Huang Chenglong Li Bing Yin 《Visual Intelligence》 2023年第1期28-39,共12页
The primary goal of visible-infrared person re-identification(VI-ReID)is to match pedestrian photos obtained during the day and night.The majority of existing methods simply generate auxiliary modalities to reduce the... The primary goal of visible-infrared person re-identification(VI-ReID)is to match pedestrian photos obtained during the day and night.The majority of existing methods simply generate auxiliary modalities to reduce the modality discrepancy for cross-modality matching.They capture modality-invariant representations but ignore the extraction of modality-specific representations that can aid in distinguishing among various identities of the same modality.To alleviate these issues,this work provides a novel specific and shared representations learning(SSRL)model for VI-ReID to learn modality-specific and modality-shared representations.We design a shared branch in SSRL to bridge the image-level gap and learn modality-shared representations,while a specific branch retains the discriminative information of visible images to learn modality-specific representations.In addition,we propose intra-class aggregation and inter-class separation learning strategies to optimize the distribution of feature embeddings at afine-grained level.Extensive experimental results on two challenging benchmark datasets,SYSU-MM01 and RegDB,demonstrate the superior performance of SSRL over state-of-the-art methods. 展开更多
关键词 Person re-identification(re-id) Cross-modality Specific Representations Shared Representations
在线阅读 下载PDF
双路注意力机制行人重识别方法
16
作者 张媛媛 宋存利 张雪松 《计算机辅助设计与图形学学报》 北大核心 2025年第5期877-886,共10页
为解决目前Re-ID方法中对非显著可辨别特征关注不足,以及提取的行人关键特征表达不充分的问题,提出一种基于双路注意力机制特征提取网络,由双路注意力主干网络和增强注意特征融合模块组成.其中,双路注意力网络使模型关注到不同显著程度... 为解决目前Re-ID方法中对非显著可辨别特征关注不足,以及提取的行人关键特征表达不充分的问题,提出一种基于双路注意力机制特征提取网络,由双路注意力主干网络和增强注意特征融合模块组成.其中,双路注意力网络使模型关注到不同显著程度的有效特征区域,可分别用于挖掘显著和潜在非显著可辨别特征,强调潜在关键特征的重要性;增强注意特征融合模块用于完成特征信息互补,同时采用反事实干预强化习得注意力特征图的质量和有效性,从而得到更具有判别性的最终特征表示.在Market1501, DukeMTMC-reID和MSMT17数据集上进行了广泛实验,结果表明, mAP值分别达到了89.3%, 80.0%, 58.4%;Rank-1值分别达到了95.7%, 89.8%, 80.7%,充分证明了该方法的优越性. 展开更多
关键词 re-id 深度学习 注意力 非显著特征 反事实干预
在线阅读 下载PDF
多模态行人重识别研究综述 被引量:2
17
作者 石瑞鑫 智敏 殷雁君 《计算机应用研究》 北大核心 2025年第7期1921-1929,共9页
在现代安全监控系统中,行人重识别技术扮演着至关重要的角色。面对行人图像的光照变化、视角差异和遮挡等问题,传统的行人重识别系统(person re-identification,RE-ID)准确性和可靠性受限。为应对这些挑战,研究者将多模态学习方法引入RE... 在现代安全监控系统中,行人重识别技术扮演着至关重要的角色。面对行人图像的光照变化、视角差异和遮挡等问题,传统的行人重识别系统(person re-identification,RE-ID)准确性和可靠性受限。为应对这些挑战,研究者将多模态学习方法引入RE-ID领域,希望有效融合多种数据模态,如深度图像、红外图像和文本信息,以期提高RE-ID的性能。综述了多模态RE-ID技术在现代安全监控系统中的应用及其研究进展。首先介绍了多模态技术的基本概念和多模态RE-ID任务,接着概述该领域的关键数据集和评估协议。核心部分详细讨论了多模态RE-ID中的融合策略,包括特征层次融合和模型层次融合两种方法。最后,探讨了多模态RE-ID的研究挑战与未来研究方向,以进一步推动多模态行人重识别发展。 展开更多
关键词 re-id 多模态融合 数据融合 特征融合 模型融合
在线阅读 下载PDF
基于渐进式混合对比学习的无监督领域自适应行人再识别 被引量:1
18
作者 赵宇 舒巧媛 《电子学报》 北大核心 2025年第6期1829-1846,共18页
无监督领域自适应(Unsupervised Domain Adaptation,UDA)行人再识别(person Re-IDentification,Re-ID)旨在利用有标注的源域数据来解决无标注目标域数据的无监督Re-ID任务.近期,对比学习在该领域引起关注,但现有方法存在正样本对差异较... 无监督领域自适应(Unsupervised Domain Adaptation,UDA)行人再识别(person Re-IDentification,Re-ID)旨在利用有标注的源域数据来解决无标注目标域数据的无监督Re-ID任务.近期,对比学习在该领域引起关注,但现有方法存在正样本对差异较小以及忽略负代理采样偏差的问题.为解决这些问题,本文提出一种渐进式混合对比学习(Progressive Hybrid Contrastive Learning,PHCL)方法.在每个训练轮次,PHCL方法通过聚类和渐进细化两个步骤,将无标签数据集划分为带伪标签的聚类样本和未聚类的独立实例.基于聚类划分结果,PHCL方法在两个层次实施对比学习:通过将同一聚类(目标域)或同一身份标签(源域)中的相似样本拉近,指导模型学习类内相似性,同时通过在未聚类的实例间施加排斥作用,挖掘实例间差异性.此外,PHCL方法通过最近邻挖掘为未聚类的实例生成正代理,增大正样本对的差异性,学习更丰富的语义信息.同时,PHCL方法在负代理采样过程中去偏差,减轻假负代理对训练的不利影响.实验结果表明:PHCL方法在Market-1501和MSMT17数据集上的平均精度均值(mean Average Precision,mAP)分别为85.9%与42.3%,比基线模型分别提高4.3个百分点和13.5个百分点.上述实验结果验证了PHCL方法在UDA Re-ID任务中的有效性. 展开更多
关键词 无监督领域自适应(UDA)行人再识别(re-id) 对比学习 伪标签 最近邻挖掘 去偏差
在线阅读 下载PDF
基于端到端的轻量化人体姿态跟踪算法研究
19
作者 裴涛 王向阳 +1 位作者 谢慧志 赵佳辉 《工业控制计算机》 2025年第3期28-29,31,共3页
为了解决视频中实时的姿态跟踪问题,提出了一个端到端的轻量化人体姿态跟踪算法,该算法能同时执行行人检测、多人姿态估计和跟踪。针对多人姿态跟踪过程中行人尺度变化问题,设计了尺度归一化图像特征金字塔网络来提高性能和速度,同时采... 为了解决视频中实时的姿态跟踪问题,提出了一个端到端的轻量化人体姿态跟踪算法,该算法能同时执行行人检测、多人姿态估计和跟踪。针对多人姿态跟踪过程中行人尺度变化问题,设计了尺度归一化图像特征金字塔网络来提高性能和速度,同时采用基于姿态加权的Re-ID特征匹配算法来实现前后帧行人姿态的跟踪。实验表明,提出的算法在姿态跟踪数据集PoseTrack 2017和PoseTrack 2018上,MOTA指标分别达到71.8%和65.6%,帧率为12.6 fps,实现了较好的实时跟踪性能。 展开更多
关键词 姿态跟踪 姿态估计 re-id特征
在线阅读 下载PDF
二值化图像与双流网络在跨模态行人重识别的应用
20
作者 崔洪刚 曹钢钢 苏荻翔 《计算机应用与软件》 北大核心 2025年第2期216-226,共11页
在现有的跨模态行人重识别方法中,很少有方法会利用图像中人的姿态信息进行网络的学习。考虑到姿态信息在行人重识别网络学习中的重要性,提出一种融合局部阈值二值化图像特征的端到端的行人重识别方法。该方法使用ResNet50作为骨干网络... 在现有的跨模态行人重识别方法中,很少有方法会利用图像中人的姿态信息进行网络的学习。考虑到姿态信息在行人重识别网络学习中的重要性,提出一种融合局部阈值二值化图像特征的端到端的行人重识别方法。该方法使用ResNet50作为骨干网络对三种模态图像进行特征提取和特征融合,使用交叉熵损失和改进的难样本三元组损失进行网络训练。在使用简单网络结构的同时使用姿态信息。实验结果表明,在跨模态行人重识别网络中融合局部阈值二值化图像信息,能提高网络对行人重识别的准确率,显著提升最难样本的挖掘能力。 展开更多
关键词 跨模态行人重识别 卷积神经网络 局部阈值二值化
在线阅读 下载PDF
上一页 1 2 5 下一页 到第
使用帮助 返回顶部