期刊文献+
共找到6篇文章
< 1 >
每页显示 20 50 100
VTAN: A Novel Video Transformer Attention-Based Network for Dynamic Sign Language Recognition
1
作者 Ziyang Deng weidong min +2 位作者 Qing Han Mengxue Liu Longfei Li 《Computers, Materials & Continua》 2025年第2期2793-2812,共20页
Dynamic sign language recognition holds significant importance, particularly with the application of deep learning to address its complexity. However, existing methods face several challenges. Firstly, recognizing dyn... Dynamic sign language recognition holds significant importance, particularly with the application of deep learning to address its complexity. However, existing methods face several challenges. Firstly, recognizing dynamic sign language requires identifying keyframes that best represent the signs, and missing these keyframes reduces accuracy. Secondly, some methods do not focus enough on hand regions, which are small within the overall frame, leading to information loss. To address these challenges, we propose a novel Video Transformer Attention-based Network (VTAN) for dynamic sign language recognition. Our approach prioritizes informative frames and hand regions effectively. To tackle the first issue, we designed a keyframe extraction module enhanced by a convolutional autoencoder, which focuses on selecting information-rich frames and eliminating redundant ones from the video sequences. For the second issue, we developed a soft attention-based transformer module that emphasizes extracting features from hand regions, ensuring that the network pays more attention to hand information within sequences. This dual-focus approach improves effective dynamic sign language recognition by addressing the key challenges of identifying critical frames and emphasizing hand regions. Experimental results on two public benchmark datasets demonstrate the effectiveness of our network, outperforming most of the typical methods in sign language recognition tasks. 展开更多
关键词 Dynamic sign language recognition TRANSFORMER soft attention attention-based visual feature aggregation
在线阅读 下载PDF
Pseudo Label Purification with Dual Contrastive Learning for Unsupervised Vehicle Re-Identification
2
作者 Jiyang Xu Qi Wang +4 位作者 Xin Xiong weidong min Jiang Luo Di Gai Qing Han 《Computers, Materials & Continua》 2025年第3期3921-3941,共21页
The unsupervised vehicle re-identification task aims at identifying specific vehicles in surveillance videos without utilizing annotation information.Due to the higher similarity in appearance between vehicles compare... The unsupervised vehicle re-identification task aims at identifying specific vehicles in surveillance videos without utilizing annotation information.Due to the higher similarity in appearance between vehicles compared to pedestrians,pseudo-labels generated through clustering are ineffective in mitigating the impact of noise,and the feature distance between inter-class and intra-class has not been adequately improved.To address the aforementioned issues,we design a dual contrastive learning method based on knowledge distillation.During each iteration,we utilize a teacher model to randomly partition the entire dataset into two sub-domains based on clustering pseudo-label categories.By conducting contrastive learning between the two student models,we extract more discernible vehicle identity cues to improve the problem of imbalanced data distribution.Subsequently,we propose a context-aware pseudo label refinement strategy that leverages contextual features by progressively associating granularity information from different bottleneck blocks.To produce more trustworthy pseudo-labels and lessen noise interference during the clustering process,the context-aware scores are obtained by calculating the similarity between global features and contextual ones,which are subsequently added to the pseudo-label encoding process.The proposed method has achieved excellent performance in overcoming label noise and optimizing data distribution through extensive experimental results on publicly available datasets. 展开更多
关键词 Unsupervised vehicle re-identification dual contrastive learning pseudo label refinement knowledge distillation
在线阅读 下载PDF
SAM-drivenMAE pre-training and background-awaremeta-learning for unsupervised vehicle re-identification 被引量:1
3
作者 Dong Wang Qi Wang +4 位作者 weidong min Di Gai Qing Han Longfei Li Yuhan Geng 《Computational Visual Media》 SCIE EI CSCD 2024年第4期771-789,共19页
Distinguishing identity-unrelated background information from discriminative identity information poses a challenge in unsupervised vehicle re-identification(Re-ID).Re-ID models suffer from varying degrees of backgrou... Distinguishing identity-unrelated background information from discriminative identity information poses a challenge in unsupervised vehicle re-identification(Re-ID).Re-ID models suffer from varying degrees of background interference caused by continuous scene variations.The recently proposed segment anything model(SAM)has demonstrated exceptional performance in zero-shot segmentation tasks.The combination of SAM and vehicle Re-ID models can achieve efficient separation of vehicle identity and background information.This paper proposes a method that combines SAM-driven mask autoencoder(MAE)pre-training and backgroundaware meta-learning for unsupervised vehicle Re-ID.The method consists of three sub-modules.First,the segmentation capacity of SAM is utilized to separate the vehicle identity region from the background.SAM cannot be robustly employed in exceptional situations,such as those with ambiguity or occlusion.Thus,in vehicle Re-ID downstream tasks,a spatiallyconstrained vehicle background segmentation method is presented to obtain accurate background segmentation results.Second,SAM-driven MAE pre-training utilizes the aforementioned segmentation results to select patches belonging to the vehicle and to mask other patches,allowing MAE to learn identity-sensitive features in a self-supervised manner.Finally,we present a background-aware meta-learning method to fit varying degrees of background interference in different scenarios by combining different background region ratios.Our experiments demonstrate that the proposed method has state-of-the-art performance in reducing background interference variations. 展开更多
关键词 UNSUPERVISED re-identification(Re-ID) vehicles segmentation autoencoder META-LEARNING
原文传递
Joint training with local soft attention and dual cross-neighbor label smoothing for unsupervised person re-identification
4
作者 Qing Han Longfei Li +4 位作者 weidong min Qi Wang Qingpeng Zeng Shimiao Cui Jiongjin Chen 《Computational Visual Media》 SCIE EI CSCD 2024年第3期543-558,共16页
Existing unsupervised person re-identification approaches fail to fully capture thefine-grained features of local regions,which can result in people with similar appearances and different identities being assigned the... Existing unsupervised person re-identification approaches fail to fully capture thefine-grained features of local regions,which can result in people with similar appearances and different identities being assigned the same label after clustering.The identity-independent information contained in different local regions leads to different levels of local noise.To address these challenges,joint training with local soft attention and dual cross-neighbor label smoothing(DCLS)is proposed in this study.First,the joint training is divided into global and local parts,whereby a soft attention mechanism is proposed for the local branch to accurately capture the subtle differences in local regions,which improves the ability of the re-identification model in identifying a person’s local significant features.Second,DCLS is designed to progressively mitigate label noise in different local regions.The DCLS uses global and local similarity metrics to semantically align the global and local regions of the person and further determines the proximity association between local regions through the cross information of neighboring regions,thereby achieving label smoothing of the global and local regions throughout the training process.In extensive experiments,the proposed method outperformed existing methods under unsupervised settings on several standard person re-identification datasets. 展开更多
关键词 person re-identification(Re-ID) unsupervised learning(USL) local soft attention joint training dual cross-neighbor label smoothing(DCLS)
原文传递
Scene-adaptive crowd counting method based on meta learning with dual-input network DMNet
5
作者 Haoyu ZHAO weidong min +3 位作者 Jianqiang XU Qi WANG Yi ZOU Qiyan FU 《Frontiers of Computer Science》 SCIE EI CSCD 2023年第1期91-100,共10页
Crowd counting is recently becoming a hot research topic, which aims to count the number of the people in different crowded scenes. Existing methods are mainly based on training-testing pattern and rely on large data ... Crowd counting is recently becoming a hot research topic, which aims to count the number of the people in different crowded scenes. Existing methods are mainly based on training-testing pattern and rely on large data training, which fails to accurately count the crowd in real-world scenes because of the limitation of model’s generalization capability. To alleviate this issue, a scene-adaptive crowd counting method based on meta-learning with Dual-illumination Merging Network (DMNet) is proposed in this paper. The proposed method based on learning-to-learn and few-shot learning is able to adapt different scenes which only contain a few labeled images. To generate high quality density map and count the crowd in low-lighting scene, the DMNet is proposed, which contains Multi-scale Feature Extraction module and Element-wise Fusion Module. The Multi-scale Feature Extraction module is used to extract the image feature by multi-scale convolutions, which helps to improve network accuracy. The Element-wise Fusion module fuses the low-lighting feature and illumination-enhanced feature, which supplements the missing illumination in low-lighting environments. Experimental results on benchmarks, WorldExpo’10, DISCO, USCD, and Mall, show that the proposed method outperforms the existing state-of-the-art methods in accuracy and gets satisfied results. 展开更多
关键词 crowd counting META-LEARNING scene-adaptive Dual-illumination Merging Network
原文传递
Mechanically scanned leaky-wave antenna based on a topological one-way waveguide
6
作者 Qian Shen Yun You +6 位作者 Jie Xu Yun Shen Xiaohua Deng Zhuoyuan Wang weidong min Linfang Shen Sanshui Xiao 《Frontiers of physics》 SCIE CSCD 2020年第3期97-103,共7页
We propose a uniform backfire-to-endfire leaky-wave antenna(LWA)based on a topological one-way waveguide under external bias magnetic field.We systematically analyze the dispersion,showing that the proposed structure ... We propose a uniform backfire-to-endfire leaky-wave antenna(LWA)based on a topological one-way waveguide under external bias magnetic field.We systematically analyze the dispersion,showing that the proposed structure supports leaky mode arisen from total internal reflection.By means of tuning frequency or magnetic field,we obtain fixed-bias frequency and fixed-frequency bias LWA with continuous beam scanning from backward,broadside to forward direction.More importantly,we,for the first time,demonstrate that this proposed LWA shows mechanical tunability,allowing us to manipulate the radiation direction from backward,broadside to forward direction by mechanically tuning the air layer thickness.The simulated results show that our system exhibits super low 3dB beam width,high radiation efficiency as well as high antenna gain.Being provided such multiple controlled(especially mechanically)beam scanning manners,the present LWA paves an advanced approach for continuous beam scanning,holding a great potential for applications in modern communication and radar system. 展开更多
关键词 leaky-wave antenna one-way waveguide magneto-optic materials
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部