11 articles found
Complementary-Label Adversarial Domain Adaptation Fault Diagnosis Network under Time-Varying Rotational Speed and Weakly-Supervised Conditions (cited by: 1)
1
Authors: Siyuan Liu, Jinying Huang, Jiancheng Ma, Licheng Jing, Yuxuan Wang. Computers, Materials & Continua (SCIE, EI), 2024, No. 4, pp. 761-777 (17 pages)
Recent research in cross-domain intelligent fault diagnosis of machinery still has some problems, such as relatively ideal speed conditions and sample conditions. In engineering practice, the rotational speed of the machine is often transient and time-varying, which makes the sample annotation increasingly expensive. Meanwhile, the number of samples collected from different health states is often unbalanced. To deal with the above challenges, a complementary-label (CL) adversarial domain adaptation fault diagnosis network (CLADAN) is proposed under time-varying rotational speed and weakly-supervised conditions. In the weakly supervised learning condition, machine prior information is used for sample annotation via cost-friendly complementary label learning. A diagnostic model learning strategy with discretized category probabilities is designed to avoid multi-peak distribution of prediction results. In the adversarial training process, we developed a virtual adversarial regularization (VAR) strategy, which further enhances the robustness of the model by adding adversarial perturbations in the target domain. Comparative experiments on two case studies validated the superior performance of the proposed method.
Keywords: time-varying rotational speed; weakly-supervised; fault diagnosis; domain adaptation
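The abstract above does not give the loss used by CLADAN. As a rough illustration of what learning from complementary labels can look like, the sketch below uses a simple negative-learning style loss, -log(1 - p_k), where k is a class the sample is known not to belong to. The function names and this particular loss are assumptions made for illustration; they are not taken from the paper.

```python
import math

def softmax(logits):
    # Subtract the maximum logit for numerical stability.
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def complementary_label_loss(batch_logits, comp_labels, eps=1e-12):
    """Illustrative loss (an assumption, not the paper's): mean of -log(1 - p_k).

    comp_labels[i] is a class that sample i is known NOT to belong to,
    so the loss shrinks as the probability assigned to that class goes to zero.
    """
    losses = []
    for logits, k in zip(batch_logits, comp_labels):
        p_k = softmax(logits)[k]
        losses.append(-math.log(1.0 - p_k + eps))
    return sum(losses) / len(losses)
```

A prediction that puts most of its mass on the complementary class is penalised heavily, while a prediction that avoids that class incurs almost no loss, whichever of the remaining classes it favours.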
A Weakly-Supervised Crowd Density Estimation Method Based on Two-Stage Linear Feature Calibration (cited by: 3)
2
Authors: Yong-Chao Li, Rui-Sheng Jia, Ying-Xiang Hu, Hong-Mei Sun. IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2024, No. 4, pp. 965-981 (17 pages)
In a crowd density estimation dataset, the annotation of crowd locations is an extremely laborious task, and they are not taken into the evaluation metrics. In this paper, we aim to reduce the annotation cost of crowd datasets, and propose a crowd density estimation method based on weakly-supervised learning, in the absence of crowd position supervision information, which directly reduces the number of crowds by using the number of pedestrians in the image as the supervised information. For this purpose, we design a new training method, which exploits the correlation between global and local image features by incremental learning to train the network. Specifically, we design a parent-child network (PC-Net) focusing on the global and local image respectively, and propose a linear feature calibration structure to train the PC-Net simultaneously. The child network learns feature transfer factors and feature bias weights, and uses the transfer factors and bias weights to linearly calibrate the features extracted from the parent network, to improve the convergence of the network by using local features hidden in the crowd images. In addition, we use the pyramid vision transformer as the backbone of the PC-Net to extract crowd features at different levels, and design a global-local feature loss function (L2). We combine it with a crowd counting loss (LC) to enhance the sensitivity of the network to crowd features during the training process, which effectively improves the accuracy of crowd density estimation. The experimental results show that the PC-Net significantly reduces the gap between fully-supervised and weakly-supervised crowd density estimation, and outperforms the comparison methods on five datasets: ShanghaiTech Part A, ShanghaiTech Part B, UCF_CC_50, UCF_QNRF and JHU-CROWD++.
Keywords: crowd density estimation; linear feature calibration; vision transformer; weakly-supervision learning
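The abstract describes the calibration only in words: the child network supplies transfer factors and bias weights that linearly adjust the parent network's features. One plausible reading is an element-wise affine transform, sketched below. The function name, the element-wise form, and the flat-list feature representation are all assumptions for illustration; the paper's actual structure may differ.

```python
def linear_feature_calibration(parent_features, transfer_factors, bias_weights):
    """Element-wise affine calibration: factor * feature + bias.

    Hypothetical sketch: the three arguments are equal-length lists of floats
    standing in for a parent feature vector and the two child-network outputs.
    """
    if not (len(parent_features) == len(transfer_factors) == len(bias_weights)):
        raise ValueError("all inputs must have the same length")
    return [
        factor * feature + bias
        for feature, factor, bias in zip(parent_features, transfer_factors, bias_weights)
    ]
```

For example, `linear_feature_calibration([1.0, 2.0, 3.0], [2.0, 0.5, 1.0], [0.0, 1.0, -1.0])` scales and shifts each parent feature independently.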
Weakly-supervised instance co-segmentation via tensor-based salient co-peak search
3
Authors: Wuxiu QUAN, Yu HU, Tingting DAN, Junyu LI, Yue ZHANG, Hongmin CAI. Frontiers of Computer Science (SCIE, EI, CSCD), 2024, No. 2, pp. 83-92 (10 pages)
Instance co-segmentation aims to segment the co-occurrent instances among two images. This task heavily relies on instance-related cues provided by co-peaks, which are generally estimated by exhaustively exploiting all paired candidates in point-to-point patterns. However, such patterns could yield a high number of false-positive co-peaks, resulting in over-segmentation whenever there are mutual occlusions. To tackle this issue, this paper proposes an instance co-segmentation method via tensor-based salient co-peak search (TSCPS-ICS). The proposed method explores high-order correlations via triple-to-triple matching among feature maps to find reliable co-peaks with the help of co-saliency detection. The proposed method is shown to capture more accurate intra-peaks and inter-peaks among feature maps, reducing the false-positive rate of co-peak search. Upon having accurate co-peaks, one can efficiently infer responses of the targeted instance. Experiments on four benchmark datasets validate the superior performance of the proposed method.
Keywords: weakly-supervised; co-segmentation; co-peak; tensor matching; deep network; instance segmentation
4D foetal cardiac ultrasound image detection based on deep learning with weakly supervised localisation for rapid diagnosis of evolving hypoplastic left heart syndrome
4
Authors: Gang Wang, Weisheng Li, Mingliang Zhou, Haobo Zhu, Guang Yang, Choon Hwai Yap. CAAI Transactions on Intelligence Technology, 2024, No. 6, pp. 1485-1499 (15 pages)
Hypoplastic left heart syndrome (HLHS) is a rare, complex, and incredibly foetal congenital heart disease. To decrease neonatal mortality, evolving HLHS (eHLHS) in pregnant women should be critically diagnosed as soon as possible. However, diagnosis is currently heavily dependent on skilled medical professionals using foetal cardiac ultrasound images, making it difficult to rapidly and easily examine for this disease. Herein, the authors propose a cost-effective deep learning framework for rapid diagnosis of eHLHS (RDeH), which we have named RDeH-Net. Briefly, the framework implements a coarse-to-fine two-stage detection approach, with a structure classification network for 4D human foetal cardiac ultrasound images from various spatial and temporal domains, and a fine detection module with weakly-supervised localisation for high-precision nidus localisation and physician assistance. The experiments extensively compare the authors' network with other state-of-the-art methods on a 4D human foetal cardiac ultrasound image dataset and show two main benefits: (1) it achieved superior average accuracy of 99.37% on three categories of foetal ultrasound images from different cases; (2) it demonstrates visually fine detection performance with weakly supervised localisation. This framework could be used to accelerate the diagnosis of eHLHS, and hence significantly lessen reliance on experienced medical physicians.
Keywords: 4D; deep learning; fetal cardiac ultrasound; hypoplastic left heart syndrome; weakly-supervised learning
PT-MIL: Parallel transformer based on multi-instance learning for osteoporosis detection in panoramic oral radiography
5
Authors: 黄欣然, YANG Hongjie, CHEN Hu, ZHANG Yi, 廖培希. 《中国体视学与图像分析》, 2023, No. 4, pp. 410-418 (9 pages)
Osteoporosis is a systemic disease characterized by low bone mass, impaired bone microstructure, increased bone fragility, and a higher risk of fractures. It commonly affects postmenopausal women and the elderly. Orthopantomography, also known as panoramic radiography, is a widely used imaging technique in dental examinations due to its low cost and easy accessibility. Previous studies have shown that the mandibular cortical index (MCI) derived from orthopantomography can serve as an important indicator of osteoporosis risk. To address this, this study proposes a parallel Transformer network based on multiple instance learning. By introducing parallel modules that alleviate optimization issues and integrating multiple-instance learning with the Transformer architecture, our model effectively extracts information from image patches. Our model achieves an accuracy of 86% and an AUC score of 0.963 on an osteoporosis dataset, which demonstrates its promising and competitive performance.
Keywords: parallel transformer; multiple instance learning; weakly-supervised classification
SSA: semantic structure aware inference on CNN networks for weakly pixel-wise dense predictions without cost
6
Authors: Yanpeng SUN, Zechao LI. Frontiers of Computer Science, 2025, No. 2, pp. 1-10 (10 pages)
The pixel-wise dense prediction tasks based on weak supervision currently use Class Attention Maps (CAMs) to generate pseudo masks as ground-truth. However, existing methods often incorporate trainable modules to expand the immature class activation maps, which can result in significant computational overhead and complicate the training process. In this work, we investigate the semantic structure information concealed within the CNN network, and propose a semantic structure aware inference (SSA) method that utilizes this information to obtain high-quality CAMs without any additional training costs. Specifically, the semantic structure modeling module (SSM) is first proposed to generate the class-agnostic semantic correlation representation, where each item denotes the affinity degree between one category of objects and all the others. Then, the immature CAMs are refined through a dot product operation that utilizes semantic structure information. Finally, the polished CAMs from different backbone stages are fused as the output. The advantage of SSA lies in its parameter-free nature and the absence of additional training costs, which makes it suitable for various weakly supervised pixel-dense prediction tasks. We conducted extensive experiments on weakly supervised object localization and weakly supervised semantic segmentation, and the results confirm the effectiveness of SSA.
Keywords: class attention maps; semantic structure; weakly-supervised object localization; weakly-supervised semantic segmentation
RTS: learning robustly from time series data with noisy label
7
Authors: Zhi ZHOU, Yi-Xuan JIN, Yu-Feng LI. Frontiers of Computer Science (SCIE, EI, CSCD), 2024, No. 6, pp. 119-136 (18 pages)
Significant progress has been made in machine learning with large amounts of clean labels and static data. However, in many real-world applications, the data often changes with time and it is difficult to obtain massive clean annotations, that is, noisy labels and time series are faced simultaneously. For example, in product-buyer evaluation, each sample records the daily time behavior of users, but the long transaction period brings difficulties to analysis, and salespeople often erroneously annotate the user's purchase behavior. Such a novel setting, to our best knowledge, has not been thoroughly studied yet, and there is still a lack of effective machine learning methods. In this paper, we present a systematic approach, RTS, both theoretically and empirically, consisting of two components: Noise-Tolerant Time Series Representation and Purified Oversampling Learning. Specifically, we propose reducing label noise's destructive impact to obtain robust feature representations and potential clean samples. Then, a novel learning method based on the purified data and time series oversampling is adopted to train an unbiased model. Theoretical analysis proves that our proposal can improve the quality of the noisy data set. Empirical experiments on diverse tasks, such as the house-buyer evaluation task from real-world applications and various benchmark tasks, clearly demonstrate that our new algorithm robustly outperforms many competitive methods.
Keywords: weakly-supervised learning; time-series classification; class-imbalanced learning
Transformers in medical image analysis (cited by: 7)
8
Authors: Kelei He, Chen Gan, Zhuoyuan Li, Islem Rekik, Zihao Yin, Wen Ji, Yang Gao, Qian Wang, Junfeng Zhang, Dinggang Shen. Intelligent Medicine (CSCD), 2023, No. 1, pp. 59-78 (20 pages)
Transformers have dominated the field of natural language processing and have recently made an impact in the area of computer vision. In the field of medical image analysis, transformers have also been successfully used in full-stack clinical applications, including image synthesis/reconstruction, registration, segmentation, detection, and diagnosis. This paper aimed to promote awareness of the applications of transformers in medical image analysis. Specifically, we first provided an overview of the core concepts of the attention mechanism built into transformers and other basic components. Second, we reviewed various transformer architectures tailored for medical image applications and discussed their limitations. Within this review, we investigated key challenges including the use of transformers in different learning paradigms, improving model efficiency, and coupling with other techniques. We hope this review will provide a comprehensive picture of transformers to readers with an interest in medical image analysis.
Keywords: transformer; medical image analysis; deep learning; diagnosis; registration; segmentation; image synthesis; multi-task learning; multi-modal learning; weakly-supervised learning
Weakly- and Semi-Supervised Fast Region-Based CNN for Object Detection (cited by: 1)
9
Authors: Xing-Gang Wang, Jia-Si Wang, Peng Tang, Wen-Yu Liu. Journal of Computer Science & Technology (SCIE, EI, CSCD), 2019, No. 6, pp. 1269-1278 (10 pages)
Learning an effective object detector with little supervision is an essential but challenging problem in computer vision applications. In this paper, we consider the problem of learning a deep convolutional neural network (CNN) based object detector using weakly-supervised and semi-supervised information in the framework of fast region-based CNN (Fast R-CNN). The target is to obtain an object detector as accurate as the fully-supervised Fast R-CNN, but one that requires less image annotation effort. To solve this problem, we use weakly-supervised training images (i.e., only the image-level annotation is given) and a small proportion of fully-supervised training images (i.e., the bounding box level annotation is given), that is, a weakly- and semi-supervised (WASS) object detection setting. The proposed solution is termed WASS R-CNN, in which there are two main components. First, a weakly-supervised R-CNN is trained; after that, semi-supervised data are used for fine-tuning the weakly-supervised detector. We perform object detection experiments on the PASCAL VOC 2007 dataset. The proposed WASS R-CNN achieves more than 85% of a fully-supervised Fast R-CNN's performance (measured using mean average precision) with only 10% of fully-supervised annotations together with weak supervision for all training images. The results show that the proposed learning framework can significantly reduce the labeling efforts for obtaining reliable object detectors.
Keywords: object detection; weakly-supervised learning; semi-supervised learning; fast region-based convolutional neural network (Fast R-CNN)
Visual Superordinate Abstraction for Robust Concept Learning
10
Authors: Qi Zheng, Chao-Yue Wang, Dadong Wang, Da-Cheng Tao. Machine Intelligence Research (EI, CSCD), 2023, No. 1, pp. 79-91 (13 pages)
Concept learning constructs visual representations that are connected to linguistic semantics, which is fundamental to vision-language tasks. Although promising progress has been made, existing concept learners are still vulnerable to attribute perturbations and out-of-distribution compositions during inference. We ascribe the bottleneck to a failure to explore the intrinsic semantic hierarchy of visual concepts, e.g., {red, blue, ...} ∈ "color" subspace yet cube ∈ "shape". In this paper, we propose a visual superordinate abstraction framework for explicitly modeling semantic-aware visual subspaces (i.e., visual superordinates). With only natural visual question answering data, our model first acquires the semantic hierarchy from a linguistic view and then explores mutually exclusive visual superordinates under the guidance of linguistic hierarchy. In addition, quasi-center visual concept clustering and superordinate shortcut learning schemes are proposed to enhance the discrimination and independence of concepts within each visual superordinate. Experiments demonstrate the superiority of the proposed framework under diverse settings, which increases the overall answering accuracy relatively by 7.5% for reasoning with perturbations and 15.6% for compositional generalization tests.
Keywords: concept learning; visual question answering; weakly-supervised learning; multi-modal learning; curriculum learning
Multi-Label Image Classification with Weak Correlation Prior
11
Authors: Xiao Ouyang, Ruidong Fan, Hong Tao, Chenping Hou. CAAI Artificial Intelligence Research, 2022, No. 1, pp. 79-92 (14 pages)
Image classification is vital and basic in many data analysis domains. Since real-world images generally contain multiple diverse semantic labels, it amounts to a typical multi-label classification problem. Traditional multi-label image classification relies on a large amount of training data with plenty of labels, which requires a lot of human and financial costs. By contrast, one can easily obtain a correlation matrix of concerned categories in the current scene based on the historical image data in other application scenarios. How to perform image classification with only label correlation priors, without specific and costly annotated labels, is an important but rarely studied problem. In this paper, we propose a model to classify images with this kind of weak correlation prior. We use label correlation to recapitulate the sample similarity, employ the prior information to decompose the projection matrix when regressing the label indication matrix, and introduce the L_{2,1} norm to select features for each image. Finally, experimental results on several image datasets demonstrate that the proposed model has distinct advantages over current state-of-the-art multi-label classification methods.
Keywords: image recognition; label correlation; multi-label classification; weakly-supervised learning
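For readers unfamiliar with the L_{2,1} norm mentioned in the abstract above: under the common row-wise convention it is the sum of the Euclidean norms of a matrix's rows, so penalising it tends to push whole rows (features) to zero, which is why it is used for feature selection. The sketch below assumes that row-wise convention; the abstract does not say which orientation the paper uses, and the function name is hypothetical.

```python
import math

def l21_norm(matrix):
    """Row-wise L_{2,1} norm: sum over rows of each row's Euclidean (L2) norm.

    `matrix` is a list of rows, each a list of numbers. The row-wise
    orientation is an assumption; some texts sum over columns instead.
    """
    return sum(math.sqrt(sum(v * v for v in row)) for row in matrix)
```

A row of zeros contributes nothing to the total, so a matrix with few non-zero rows has a small L_{2,1} norm even when its surviving rows are large.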