期刊文献+
共找到171篇文章
< 1 2 9 >
每页显示 20 50 100
ICA-Net:improving class activation for weakly supervised semantic segmentation via joint contrastive and simulation learning
1
作者 YE Zhuang LIU Ruyu SUN Bo 《Optoelectronics Letters》 2025年第3期188-192,共5页
In the field of optoelectronics,certain types of data may be difficult to accurately annotate,such as high-resolution optoelectronic imaging or imaging in certain special spectral ranges.Weakly supervised learning can... In the field of optoelectronics,certain types of data may be difficult to accurately annotate,such as high-resolution optoelectronic imaging or imaging in certain special spectral ranges.Weakly supervised learning can provide a more reliable approach in these situations.Current popular approaches mainly adopt the classification-based class activation maps(CAM)as initial pseudo labels to solve the task. 展开更多
关键词 high resolution imaging supervised learning class activation maps joint contrastive simulation learning special spectral ranges weakly supervised learning OPTOELECTRONICS
原文传递
CPEWS:Contextual Prototype-Based End-to-End Weakly Supervised Semantic Segmentation
2
作者 Xiaoyan Shao Jiaqi Han +2 位作者 Lingling Li Xuezhuan Zhao Jingjing Yan 《Computers, Materials & Continua》 2025年第4期595-617,共23页
The primary challenge in weakly supervised semantic segmentation is effectively leveraging weak annotations while minimizing the performance gap compared to fully supervised methods.End-to-end model designs have gaine... The primary challenge in weakly supervised semantic segmentation is effectively leveraging weak annotations while minimizing the performance gap compared to fully supervised methods.End-to-end model designs have gained significant attention for improving training efficiency.Most current algorithms rely on Convolutional Neural Networks(CNNs)for feature extraction.Although CNNs are proficient at capturing local features,they often struggle with global context,leading to incomplete and false Class Activation Mapping(CAM).To address these limitations,this work proposes a Contextual Prototype-Based End-to-End Weakly Supervised Semantic Segmentation(CPEWS)model,which improves feature extraction by utilizing the Vision Transformer(ViT).By incorporating its intermediate feature layers to preserve semantic information,this work introduces the Intermediate Supervised Module(ISM)to supervise the final layer’s output,reducing boundary ambiguity and mitigating issues related to incomplete activation.Additionally,the Contextual Prototype Module(CPM)generates class-specific prototypes,while the proposed Prototype Discrimination Loss and Superclass Suppression Loss guide the network’s training,(LPDL)(LSSL)effectively addressing false activation without the need for extra supervision.The CPEWS model proposed in this paper achieves state-of-the-art performance in end-to-end weakly supervised semantic segmentation without additional supervision.The validation set and test set Mean Intersection over Union(MIoU)of PASCAL VOC 2012 dataset achieved 69.8%and 72.6%,respectively.Compared with ToCo(pre trained weight ImageNet-1k),MIoU on the test set is 2.1%higher.In addition,MIoU reached 41.4%on the validation set of the MS COCO 2014 dataset. 展开更多
关键词 End-to-end weakly supervised semantic segmentation vision transformer contextual prototype class activation map
在线阅读 下载PDF
Weakly Supervised Network with Scribble-Supervised and Edge-Mask for Road Extraction from High-Resolution Remote Sensing Images
3
作者 Supeng Yu Fen Huang Chengcheng Fan 《Computers, Materials & Continua》 SCIE EI 2024年第4期549-562,共14页
Significant advancements have been achieved in road surface extraction based on high-resolution remote sensingimage processing. Most current methods rely on fully supervised learning, which necessitates enormous human... Significant advancements have been achieved in road surface extraction based on high-resolution remote sensingimage processing. Most current methods rely on fully supervised learning, which necessitates enormous humaneffort to label the image. Within this field, other research endeavors utilize weakly supervised methods. Theseapproaches aim to reduce the expenses associated with annotation by leveraging sparsely annotated data, such asscribbles. This paper presents a novel technique called a weakly supervised network using scribble-supervised andedge-mask (WSSE-net). This network is a three-branch network architecture, whereby each branch is equippedwith a distinct decoder module dedicated to road extraction tasks. One of the branches is dedicated to generatingedge masks using edge detection algorithms and optimizing road edge details. The other two branches supervise themodel’s training by employing scribble labels and spreading scribble information throughout the image. To addressthe historical flaw that created pseudo-labels that are not updated with network training, we use mixup to blendprediction results dynamically and continually update new pseudo-labels to steer network training. Our solutiondemonstrates efficient operation by simultaneously considering both edge-mask aid and dynamic pseudo-labelsupport. The studies are conducted on three separate road datasets, which consist primarily of high-resolutionremote-sensing satellite photos and drone images. The experimental findings suggest that our methodologyperforms better than advanced scribble-supervised approaches and specific traditional fully supervised methods. 展开更多
关键词 Semantic segmentation road extraction weakly supervised learning scribble supervision remote sensing image
在线阅读 下载PDF
Local saliency consistency-based label inference for weakly supervised salient object detection using scribble annotations
4
作者 Shuo Zhao Peng Cui +1 位作者 Jing Shen Haibo Liu 《CAAI Transactions on Intelligence Technology》 SCIE EI 2024年第1期239-249,共11页
Recently,weak supervision has received growing attention in the field of salient object detection due to the convenience of labelling.However,there is a large performance gap between weakly supervised and fully superv... Recently,weak supervision has received growing attention in the field of salient object detection due to the convenience of labelling.However,there is a large performance gap between weakly supervised and fully supervised salient object detectors because the scribble annotation can only provide very limited foreground/background information.Therefore,an intuitive idea is to infer annotations that cover more complete object and background regions for training.To this end,a label inference strategy is proposed based on the assumption that pixels with similar colours and close positions should have consistent labels.Specifically,k-means clustering algorithm was first performed on both colours and coordinates of original annotations,and then assigned the same labels to points having similar colours with colour cluster centres and near coordinate cluster centres.Next,the same annotations for pixels with similar colours within each kernel neighbourhood was set further.Extensive experiments on six benchmarks demonstrate that our method can significantly improve the performance and achieve the state-of-the-art results. 展开更多
关键词 label inference salient object detection weak supervision
在线阅读 下载PDF
Deep Learning Models Based on Weakly Supervised Learning and Clustering Visualization for Disease Diagnosis
5
作者 Jingyao Liu Qinghe Feng +4 位作者 Jiashi Zhao Yu Miao Wei He Weili Shi Zhengang Jiang 《Computers, Materials & Continua》 SCIE EI 2023年第9期2649-2665,共17页
The coronavirus disease 2019(COVID-19)has severely disrupted both human life and the health care system.Timely diagnosis and treatment have become increasingly important;however,the distribution and size of lesions va... The coronavirus disease 2019(COVID-19)has severely disrupted both human life and the health care system.Timely diagnosis and treatment have become increasingly important;however,the distribution and size of lesions vary widely among individuals,making it challenging to accurately diagnose the disease.This study proposed a deep-learning disease diagnosismodel based onweakly supervised learning and clustering visualization(W_CVNet)that fused classification with segmentation.First,the data were preprocessed.An optimizable weakly supervised segmentation preprocessing method(O-WSSPM)was used to remove redundant data and solve the category imbalance problem.Second,a deep-learning fusion method was used for feature extraction and classification recognition.A dual asymmetric complementary bilinear feature extraction method(D-CBM)was used to fully extract complementary features,which solved the problem of insufficient feature extraction by a single deep learning network.Third,an unsupervised learning method based on Fuzzy C-Means(FCM)clustering was used to segment and visualize COVID-19 lesions enabling physicians to accurately assess lesion distribution and disease severity.In this study,5-fold cross-validation methods were used,and the results showed that the network had an average classification accuracy of 85.8%,outperforming six recent advanced classification models.W_CVNet can effectively help physicians with automated aid in diagnosis to determine if the disease is present and,in the case of COVID-19 patients,to further predict the area of the lesion. 展开更多
关键词 CLASSIFICATION COVID-19 deep learning SEGMENTATION unsupervised learning weakly supervised
在线阅读 下载PDF
Weakly Supervised Abstractive Summarization with Enhancing Factual Consistency for Chinese Complaint Reports
6
作者 Ren Tao Chen Shuang 《Computers, Materials & Continua》 SCIE EI 2023年第6期6201-6217,共17页
A large variety of complaint reports reflect subjective information expressed by citizens.A key challenge of text summarization for complaint reports is to ensure the factual consistency of generated summary.Therefore... A large variety of complaint reports reflect subjective information expressed by citizens.A key challenge of text summarization for complaint reports is to ensure the factual consistency of generated summary.Therefore,in this paper,a simple and weakly supervised framework considering factual consistency is proposed to generate a summary of city-based complaint reports without pre-labeled sentences/words.Furthermore,it considers the importance of entity in complaint reports to ensure factual consistency of summary.Experimental results on the customer review datasets(Yelp and Amazon)and complaint report dataset(complaint reports of Shenyang in China)show that the proposed framework outperforms state-of-the-art approaches in ROUGE scores and human evaluation.It unveils the effectiveness of our approach to helping in dealing with complaint reports. 展开更多
关键词 Automatic summarization abstractive summarization weakly supervised training entity recognition
在线阅读 下载PDF
Weakly Supervised Instance Action Recognition
7
作者 Haomin Yan Ruize Han +2 位作者 Wei Feng Jiewen Zhao Song Wang 《Computational Visual Media》 2025年第3期603-618,共16页
We study the novel problem of weakly supervised instance action recognition(WSiAR)in multi-person(crowd)scenes.We specifically aim to recognize the action of each subject in the crowd,for which we propose the use of a... We study the novel problem of weakly supervised instance action recognition(WSiAR)in multi-person(crowd)scenes.We specifically aim to recognize the action of each subject in the crowd,for which we propose the use of a weakly supervised method,considering the expense of large-scale annotations for training.This problem is of great practical value for video surveillance and sports scene analysis.To this end,we investigated and designed a series of weak annotations for the supervision of weakly supervised instance action recognition(WSiAR).We propose two categories of weak label settings,bag labels and sparse labels,to significantly reduce the number of labels.Based on the former,we propose a novel sub-block-aware multi-instance learning(MIL)loss to obtain more effective information from weak labels during training.With respect to the latter,we propose a pseudo label generation strategy for extending sparse labels.This enables our method to achieve results comparable to those of fully supervised methods but with significantly fewer annotations.The experimental results on two benchmarks verified the rationality of the problem definition and effectiveness of the proposed weakly supervised training method in solving our problem. 展开更多
关键词 weak supervision instance action recognition crowd multi-person scene human activity
原文传递
A weakly-supervised deep learning model for end-to-end detection of airfield pavement distress
8
作者 Zefeng Tao Hongren Gong +2 位作者 Liming Liu Lin Cong Haimei Liang 《International Journal of Transportation Science and Technology》 2025年第1期67-78,共12页
Accurate and timely surveying of airfield pavement distress is crucial for cost-effective air-port maintenance.Deep learning(DL)approaches,leveraging advancements in computer science and image acquisition techniques,h... Accurate and timely surveying of airfield pavement distress is crucial for cost-effective air-port maintenance.Deep learning(DL)approaches,leveraging advancements in computer science and image acquisition techniques,have become the mainstream for automated air-field pavement distress detection.However,fully-supervised DL methods require a large number of manually annotated ground truth labels to achieve high accuracy.To address the challenge of limited high-quality manual annotations,we propose a novel end-to-end distress detection model called class activation map informed weakly-supervised dis-tress detection(WSDD-CAM).Based on YOLOv5,WSDD-CAM consists of an efficient back-bone,a classification branch,and a localization network.By utilizing class activation map(CAM)information,our model significantly reduces the need for manual annotations,auto-matically generating pseudo bounding boxes with a 71%overlap with the ground truth.To evaluate WSDD-CAM,we tested it on a self-made dataset and compared it with other weakly-supervised and fully-supervised models.The results show that our model achieves 49.2%mean average precision(mAP),outperforming other weakly-supervised methods and even approaching state-of-the-art fully-supervised methods.Additionally,ablation experiments confirm the effectiveness of our architecture design.In conclusion,our WSDD-CAM model offers a promising solution for airfield pavement distress detection,reducing manual annotation time while maintaining high accuracy.This efficient and effec-tive approach can significantly contribute to cost-effective airport maintenance management. 展开更多
关键词 Airfield pavement Distress detection Deep learning(DL) Object detection weakly supervised algorithms
暂未订购
Pancancer outcome prediction via a unified weakly supervised deep learning model
9
作者 Wei Yuan Yijiang Chen +34 位作者 Biyue Zhu Sen Yang Jiayu Zhang Ning Mao Jinxi Xiang Yuchen Li Yuanfeng Ji Xiangde Luo Kangning Zhang Xiaohan Xing Shuo Kang Dongyuan Xiao Fang Wang Jinkun Wu Haiyan Zhang Hongping Tang Himanshu Maurya German Corredor Cristian Barrera Yufei Zhou Krunal Pandav Junhan Zhao Prantesh Jain Luke Delasos Junzhou Huang Kailin Yang Theodoros N.Teknos James Lewis Jr Shlomo Koyfman Nathan A.Pennell Kun-Hsing Yu Xiao Han Jing Zhang Xiyue Wang Anant Madabhushi 《Signal Transduction and Targeted Therapy》 2025年第10期5454-5464,共11页
Accurate prognosis prediction is essential for guiding cancer treatment and improving patient outcomes.While recent studies have demonstrated the potential of histopathological images in survival analysis,existing mod... Accurate prognosis prediction is essential for guiding cancer treatment and improving patient outcomes.While recent studies have demonstrated the potential of histopathological images in survival analysis,existing models are typically developed in a cancerspecific manner,lack extensive external validation,and often rely on molecular data that are not routinely available in clinical practice.To address these limitations,we present PROGPATH,a unified model capable of integrating histopathological image features with routinely collected clinical variables to achieve pancancer prognosis prediction.PROGPATH employs a weakly supervised deep learning architecture built upon the foundation model for image encoding.Morphological features are aggregated through an attention-guided multiple instance learning module and fused with clinical information via a cross-attention transformer.A router-based classification strategy further refines the prediction performance.PROGPATH was trained on 7999 whole-slide images(WSIs)from 6,670 patients across 15 cancer types,and extensively validated on 17 external cohorts with a total of 7374 WSIs from 4441 patients,covering 12 cancer types from 8 consortia and institutions across three continents.PROGPATH achieved consistently superior performance compared with state-of-the-art multimodal prognosis prediction models.It demonstrated strong generalizability across cancer types and robustness in stratified subgroups,including early-and advancedstage patients,treatment cohorts(radiotherapy and pharmaceutical therapy),and biomarker-defined subsets.We further provide model interpretability by identifying pathological patterns critical to PROGPATH’s risk predictions,such as the degree of cell differentiation and extent of necrosis.Together,these results highlight the potential of PROGPATH to support pancancer outcome prediction and inform personalized cancer management strategies. 展开更多
关键词 pancancer prognosis integrating histopathological image features molecular data accurate prognosis prediction unified model histopathological images weakly supervised deep learning survival analysisexisting
原文传递
Scribble-Supervised Video Object Segmentation 被引量:3
10
作者 Peiliang Huang Junwei Han +2 位作者 Nian Liu Jun Ren Dingwen Zhang 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2022年第2期339-353,共15页
Recently,video object segmentation has received great attention in the computer vision community.Most of the existing methods heavily rely on the pixel-wise human annotations,which are expensive and time-consuming to ... Recently,video object segmentation has received great attention in the computer vision community.Most of the existing methods heavily rely on the pixel-wise human annotations,which are expensive and time-consuming to obtain.To tackle this problem,we make an early attempt to achieve video object segmentation with scribble-level supervision,which can alleviate large amounts of human labor for collecting the manual annotation.However,using conventional network architectures and learning objective functions under this scenario cannot work well as the supervision information is highly sparse and incomplete.To address this issue,this paper introduces two novel elements to learn the video object segmentation model.The first one is the scribble attention module,which captures more accurate context information and learns an effective attention map to enhance the contrast between foreground and background.The other one is the scribble-supervised loss,which can optimize the unlabeled pixels and dynamically correct inaccurate segmented areas during the training stage.To evaluate the proposed method,we implement experiments on two video object segmentation benchmark datasets,You Tube-video object segmentation(VOS),and densely annotated video segmentation(DAVIS)-2017.We first generate the scribble annotations from the original per-pixel annotations.Then,we train our model and compare its test performance with the baseline models and other existing works.Extensive experiments demonstrate that the proposed method can work effectively and approach to the methods requiring the dense per-pixel annotations. 展开更多
关键词 Convolutional neural networks(CNNs) SCRIBBLE self-attention video object segmentation weakly supervised
在线阅读 下载PDF
Human-pose estimation based on weak supervision 被引量:1
11
作者 Xiaoyan HU Xizhao BAO +1 位作者 Guoli WEI Zhaoyu LI 《Virtual Reality & Intelligent Hardware》 EI 2023年第4期366-377,共12页
Background In computer vision,simultaneously estimating human pose,shape,and clothing is a practical issue in real life,but remains a challenging task owing to the variety of clothing,complexity of de-formation,shorta... Background In computer vision,simultaneously estimating human pose,shape,and clothing is a practical issue in real life,but remains a challenging task owing to the variety of clothing,complexity of de-formation,shortage of large-scale datasets,and difficulty in estimating clothing style.Methods We propose a multistage weakly supervised method that makes full use of data with less labeled information for learning to estimate human body shape,pose,and clothing deformation.In the first stage,the SMPL human-body model parameters were regressed using the multi-view 2D key points of the human body.Using multi-view information as weakly supervised information can avoid the deep ambiguity problem of a single view,obtain a more accurate human posture,and access supervisory information easily.In the second stage,clothing is represented by a PCA-based model that uses two-dimensional key points of clothing as supervised information to regress the parameters.In the third stage,we predefine an embedding graph for each type of clothing to describe the deformation.Then,the mask information of the clothing is used to further adjust the deformation of the clothing.To facilitate training,we constructed a multi-view synthetic dataset that included BCNet and SURREAL.Results The Experiments show that the accuracy of our method reaches the same level as that of SOTA methods using strong supervision information while only using weakly supervised information.Because this study uses only weakly supervised information,which is much easier to obtain,it has the advantage of utilizing existing data as training data.Experiments on the DeepFashion2 dataset show that our method can make full use of the existing weak supervision information for fine-tuning on a dataset with little supervision information,compared with the strong supervision information that cannot be trained or adjusted owing to the lack of exact annotation information.Conclusions Our weak supervision method can accurately estimate human body size,pose,and several common types of clothing and overcome the issues of the current shortage of clothing data. 展开更多
关键词 Human pose estimation Clothing estimation weak supervision
在线阅读 下载PDF
Active self-training for weakly supervised 3D scene semantic segmentation
12
作者 Gengxin Liu Oliver van Kaick +1 位作者 Hui Huang Ruizhen Hu 《Computational Visual Media》 SCIE EI CSCD 2024年第3期425-438,共14页
Since the preparation of labeled datafor training semantic segmentation networks of pointclouds is a time-consuming process, weakly supervisedapproaches have been introduced to learn fromonly a small fraction of data.... Since the preparation of labeled datafor training semantic segmentation networks of pointclouds is a time-consuming process, weakly supervisedapproaches have been introduced to learn fromonly a small fraction of data. These methods aretypically based on learning with contrastive losses whileautomatically deriving per-point pseudo-labels from asparse set of user-annotated labels. In this paper, ourkey observation is that the selection of which samplesto annotate is as important as how these samplesare used for training. Thus, we introduce a methodfor weakly supervised segmentation of 3D scenes thatcombines self-training with active learning. Activelearning selects points for annotation that are likelyto result in improvements to the trained model, whileself-training makes efficient use of the user-providedlabels for learning the model. We demonstrate thatour approach leads to an effective method that providesimprovements in scene segmentation over previouswork and baselines, while requiring only a few userannotations. 展开更多
关键词 semantic segmentation weakly supervised SELF-TRAINING active learning
原文传递
Enhancing action discrimination via category-specific frame clustering for weakly-supervised temporal action localization
13
作者 Huifen XIA Yongzhao ZHAN +1 位作者 Honglin LIU Xiaopeng REN 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2024年第6期809-823,共15页
Temporal action localization (TAL) is a task of detecting the start and end timestamps of action instances and classifying them in an untrimmed video. As the number of action categories per video increases, existing w... Temporal action localization (TAL) is a task of detecting the start and end timestamps of action instances and classifying them in an untrimmed video. As the number of action categories per video increases, existing weakly-supervised TAL (W-TAL) methods with only video-level labels cannot provide sufficient supervision. Single-frame supervision has attracted the interest of researchers. Existing paradigms model single-frame annotations from the perspective of video snippet sequences, neglect action discrimination of annotated frames, and do not pay sufficient attention to their correlations in the same category. Considering a category, the annotated frames exhibit distinctive appearance characteristics or clear action patterns.Thus, a novel method to enhance action discrimination via category-specific frame clustering for W-TAL is proposed. Specifically,the K-means clustering algorithm is employed to aggregate the annotated discriminative frames of the same category, which are regarded as exemplars to exhibit the characteristics of the action category. Then, the class activation scores are obtained by calculating the similarities between a frame and exemplars of various categories. Category-specific representation modeling can provide complimentary guidance to snippet sequence modeling in the mainline. As a result, a convex combination fusion mechanism is presented for annotated frames and snippet sequences to enhance the consistency properties of action discrimination,which can generate a robust class activation sequence for precise action classification and localization. Due to the supplementary guidance of action discriminative enhancement for video snippet sequences, our method outperforms existing single-frame annotation based methods. Experiments conducted on three datasets (THUMOS14, GTEA, and BEOID) show that our method achieves high localization performance compared with state-of-the-art methods. 展开更多
关键词 weakly supervised Temporal action localization Single-frame annotation Category-specific Action discrimination
原文传递
Weakly Supervised Object Localization with Background Suppression Erasing for Art Authentication and Copyright Protection
14
作者 Chaojie Wu Mingyang Li +3 位作者 Ying Gao Xinyan Xie Wing W.Y.Ng Ahmad Musyafa 《Machine Intelligence Research》 EI CSCD 2024年第1期89-103,共15页
The problem of art forgery and infringement is becoming increasingly prominent,since diverse self-media contents with all kinds of art pieces are released on the Internet every day.For art paintings,object detection a... The problem of art forgery and infringement is becoming increasingly prominent,since diverse self-media contents with all kinds of art pieces are released on the Internet every day.For art paintings,object detection and localization provide an efficient and ef-fective means of art authentication and copyright protection.However,the acquisition of a precise detector requires large amounts of ex-pensive pixel-level annotations.To alleviate this,we propose a novel weakly supervised object localization(WSOL)with background su-perposition erasing(BSE),which recognizes objects with inexpensive image-level labels.First,integrated adversarial erasing(IAE)for vanilla convolutional neural network(CNN)dropouts the most discriminative region by leveraging high-level semantic information.Second,a background suppression module(BSM)limits the activation area of the IAE to the object region through a self-guidance mechanism.Finally,in the inference phase,we utilize the refined importance map(RIM)of middle features to obtain class-agnostic loc-alization results.Extensive experiments are conducted on paintings,CUB-200-2011 and ILSVRC to validate the effectiveness of our BSE. 展开更多
关键词 weakly supervised object localization erasing method deep learning computer vision art authentication and copyright protection
原文传递
Weakly supervised target detection based on spatial attention
15
作者 Wenqing Zhao Lijiao Xu 《Visual Intelligence》 2024年第1期16-26,共11页
Due to the lack of annotations in target bounding boxes,most methods for weakly supervised target detection transform the problem of object detection into a classification problem of candidate regions,making it easy f... Due to the lack of annotations in target bounding boxes,most methods for weakly supervised target detection transform the problem of object detection into a classification problem of candidate regions,making it easy for weakly supervised target detectors to locate significant and highly discriminative local areas of objects.We propose a weak monitoring method that combines attention and erasure mechanisms.The supervised target detection method uses attention maps to search for areas with higher discrimination within candidate regions,and then uses an erasure mechanism to erase the region,forcing the model to enhance its learning of features in areas with weaker discrimination.To improve the positioning ability of the detector,we cascade the weakly supervised target detection network and the fully supervised target detection network,and jointly train the weakly supervised target detection network and the fully supervised target detection network through multi-task learning.Based on the validation trials,the category mean average precision(mAP)and the correct localization(CorLoc)on the two datasets,i.e.,VOC2007 and VOC2012,are 55.2% and 53.8%,respectively.In regard to the mAP and CorLoc,this approach significantly outperforms previous approaches,which creates opportunities for additional investigations into weakly supervised target identification algorithms. 展开更多
关键词 weakly supervised learning Object detection Multi instance learning Spatial attention Erasure concept
在线阅读 下载PDF
面向单目可见光环境的自适应双手重建网络
16
作者 廖国琼 黄龙杰 +2 位作者 李清新 辜勇 李海波 《图学学报》 北大核心 2025年第4期837-846,共10页
准确重建双手手部网格对于自然的人机交互体验来说是一个至关重要的过程,但由于双手的遮挡、户外收集双手交互数据集的复杂性和复杂的光照环境干扰等因素导致双手手部重建任务仍极具挑战性。目前已有的工作大多是在环境干扰比较小的实... 准确重建双手手部网格对于自然的人机交互体验来说是一个至关重要的过程,但由于双手的遮挡、户外收集双手交互数据集的复杂性和复杂的光照环境干扰等因素导致双手手部重建任务仍极具挑战性。目前已有的工作大多是在环境干扰比较小的实验室等场景下取得的的良好效果,而在复杂的光照场景中的重建效果仍不佳。为了解决上述问题,提出一种面向单目可见光环境的自适应手部重建网络。通过引入单手检测框和使用2D复杂光照场景数据集进行弱监督等策略使得模型得以对复杂光照场景产生泛化性;设计的双手特征交互器得以有效建立左右手特征的远距离依赖关系,缓解了单手检测框缺乏双手交互信息的问题;针对如何有效融合交互特征与单手特征的问题,设计了自适应融合的策略,增强了模型的鲁棒性。实验结果表明,在包含多个复杂光照场景的HIC数据集中取得了最佳的效果。 展开更多
关键词 复杂光照场景 手部网格 双手交互 弱监督 特征融合
在线阅读 下载PDF
基于弱监督学习的Text-to-SQL自动生成方法
17
作者 向宁 《无线电通信技术》 北大核心 2025年第3期520-529,共10页
结构化查询语言(Structured Query Language,SQL)生成模型对于非专业人员检索情报至关重要。通常训练SQL生成模型需要使用标注的SQL以及对应的自然语言问题,现有SQL生成模型难以推广到不同的训练数据。根据问题分解半结构化表示(Decompo... 结构化查询语言(Structured Query Language,SQL)生成模型对于非专业人员检索情报至关重要。通常训练SQL生成模型需要使用标注的SQL以及对应的自然语言问题,现有SQL生成模型难以推广到不同的训练数据。根据问题分解半结构化表示(Decomposition Semi-structed Representation,DSR),提出一种基于弱监督学习的Text-to-SQL自动生成方法(Text-to-SQL Automatic Generation Method Based on Weakly Supervised Learning,TS-WSL),给定问题、DSR和执行答案,能够自动合成用于训练Text-to-SQL模型的SQL查询。使用DSR解析器对问题进行解析,通过短语链接、连接路径推理以及SQL映射过程生成候选SQL;使用候选SQL搜索选择最佳的SQL查询;使用生成的SQL数据对T5模型进行训练。在5个基准数据集上进行实验,结果表明所提方法比基于注释SQL数据集上训练的模型更具泛化性,在无域内DSR场景下,仍然可以达到完全监督模型约90%的性能。 展开更多
关键词 结构化查询语言生成模型 分解半结构化表示 弱监督学习 大模型
在线阅读 下载PDF
基于视觉的非完全标注表面缺陷检测综述 被引量:1
18
作者 叶标华 康丹青 +1 位作者 谢晓华 赖剑煌 《中国图象图形学报》 北大核心 2025年第6期1661-1689,共29页
在现代制造业中,基于机器视觉的表面缺陷检测是保证产品质量的关键,在工业智能化发展中发挥着重要作用。然而,获取缺陷数据的标注需要花费大量人力和时间成本。随着深度学习、大数据和传感器等技术的发展,如何在非完全标注的情况下实现... 在现代制造业中,基于机器视觉的表面缺陷检测是保证产品质量的关键,在工业智能化发展中发挥着重要作用。然而,获取缺陷数据的标注需要花费大量人力和时间成本。随着深度学习、大数据和传感器等技术的发展,如何在非完全标注的情况下实现准确、快速和鲁棒的缺陷识别成为当前的研究热点。鉴于此,对非完全标注场景下的表面缺陷检测技术的研究进展进行全面的梳理回顾。首先简要介绍缺陷检测领域的研究背景、基础概念的定义、常用数据集和相关技术。在此基础上,从标签策略以及任务策略两个角度介绍多种非完全标注场景下的缺陷检测技术。在标签策略中,比较了基于无监督、半监督和弱监督学习下的不同缺陷检测算法的研究思路。在任务策略中,总结了领域自适应、小样本以及大模型的表面缺陷检测算法的最新进展。接着,在多个数据集上横向对比不同标签策略以及任务策略中前沿算法的性能。最后,对该任务中的弱小目标检测、伪标签质量评估以及大模型的知识迁移等问题进行总结和展望。总体而言,非完全标注的表面缺陷检测是一个充满挑战且技术性极强的问题。同时,如何进一步推动表面缺陷检测技术进一步利用非完全标注的数据,并切实在工业制造场景中落地应用还需要更深入的研究。 展开更多
关键词 表面缺陷检测 非完全标注 无监督学习 弱监督学习 半监督学习 域适应 小样本
原文传递
极其弱监督场景下的小样本图异常检测 被引量:1
19
作者 郑文捷 傅司超 +5 位作者 陈嘉真 彭勤牧 涂益群 邹斌 荆晓远 尤新革 《计算机学报》 北大核心 2025年第4期927-948,共22页
近年来,小样本图异常检测在各个领域中引起了广泛的研究兴趣,其旨在在少量有标记训练节点(支持集)的引导下去检测出大量无标记测试节点(查询集)中的异常行为。然而,现有的小样本图异常检测算法通常假设其可以从具有大量有标记节点的训... 近年来,小样本图异常检测在各个领域中引起了广泛的研究兴趣,其旨在在少量有标记训练节点(支持集)的引导下去检测出大量无标记测试节点(查询集)中的异常行为。然而,现有的小样本图异常检测算法通常假设其可以从具有大量有标记节点的训练任务(元训练任务)中学习,从而有效地推广到具有少量标记节点的测试任务(元测试任务),这一假设并不符合真实世界的应用条件。在实际应用中,用于小样本图异常检测训练的元训练任务通常只包含极其有限的有标记节点,其标签占比通常不超过0.1%,甚至更低。由于元训练和元测试任务之间存在的巨大任务差异,现有的小样本图异常检测算法很容易出现模型的过拟合问题。除此之外,现有的小样本图异常检测算法仅利用节点间的一阶邻域(局部结构信息)来学习节点的低维特征嵌入,反而忽略了节点间的长距离依赖关系(全局结构信息),进而导致学习到的低维特征嵌入的不准确性和失真问题。针对上述挑战,本文提出了极其弱监督场景下的小样本图异常检测算法——EWSFSGAD。具体来说,该方法首先提出了一个简单且有效的图神经网络框架——GLN(Global and Local Network),其能够同时有效地利用节点间的全局和局部结构信息,并进一步引入注意力机制实现节点间的信息交互,从而更加有效地学习节点鲁棒的低维特征嵌入;该方法还引入了图对比学习中的自监督重建损失,使得节点原始视图与其增强视图之间低维特征嵌入的互信息尽可能一致,为EWS-FSGAD模型的优化提供更多有效的自监督信息,进而提升模型的泛化性;为了提升模型在真实场景中小样本图异常检测任务的快速适应性,该方法引入跨网络元学习训练机制,从多个辅助网络学习可迁移元知识,为模型提供良好的参数初始化,从而能够通过在仅有很少甚至一个标记节点的目标网络上进行微调并有效泛化。在三个真实世界的数据集(Flickr、PubMed、Yelp)上的大量实验结果表明,本文所提方法的性能明显优于现有的图异常检测算法。特别是在PubMed数据集上,AUC-PR提升了28.8%~35.4%。这些实验结果强有力地证明了在极其有限标记的元训练任务引导下,本文所提方法能够更好地学习到异常节点本质特征,从而提升小样本图异常检测任务的有效性。 展开更多
关键词 图异常检测 小样本学习 极其弱监督 图神经网络 图对比学习 长距离依赖关系
在线阅读 下载PDF
融合知识嵌入评分的强化学习多跳问答模型
20
作者 赵小康 李书琴 《计算机工程与设计》 北大核心 2025年第9期2450-2456,共7页
为降低基于强化学习的知识图谱多跳问答模型中智能体搜索的盲目性,缓解模型训练中的稀疏奖励和延迟奖励,构建一种融合知识嵌入评分机制的强化学习多跳问答模型。创新地采用评分模块约束智能体的搜索方向,并构造一个集成该评分模块的奖... 为降低基于强化学习的知识图谱多跳问答模型中智能体搜索的盲目性,缓解模型训练中的稀疏奖励和延迟奖励,构建一种融合知识嵌入评分机制的强化学习多跳问答模型。创新地采用评分模块约束智能体的搜索方向,并构造一个集成该评分模块的奖励塑造策略,缓解稀疏奖励和延迟奖励。通过在PathQuestion和PathQuestion-Large数据集上与其它几种模型进行对比实验,展现了优于其它基准模型的准确性。通过消融实验,验证了评分模块和奖励塑造策略的有效性。通过收敛时长的验证实验,验证了评分模块在降低智能体搜索盲目性的有效性。 展开更多
关键词 知识图谱 知识问答 强化学习 知识图谱嵌入 奖励塑造 弱监督 多跳问答
在线阅读 下载PDF
上一页 1 2 9 下一页 到第
使用帮助 返回顶部