期刊文献+
共找到23篇文章
< 1 2 >
每页显示 20 50 100
Unlocking the potential of unlabeled data:Self-supervised machine learning for battery aging diagnosis with real-world field data
1
作者 Qiao Wang Min Ye +4 位作者 Sehriban Celik Zhongwei Deng Bin Li Dirk Uwe Sauer Weihan Li 《Journal of Energy Chemistry》 SCIE EI CAS CSCD 2024年第12期681-691,共11页
Accurate aging diagnosis is crucial for the health and safety management of lithium-ion batteries in electric vehicles.Despite significant advancements achieved by data-driven methods,diagnosis accuracy remains constr... Accurate aging diagnosis is crucial for the health and safety management of lithium-ion batteries in electric vehicles.Despite significant advancements achieved by data-driven methods,diagnosis accuracy remains constrained by the high costs of check-up tests and the scarcity of labeled data.This paper presents a framework utilizing self-supervised machine learning to harness the potential of unlabeled data for diagnosing battery aging in electric vehicles during field operations.We validate our method using battery degradation datasets collected over more than two years from twenty real-world electric vehicles.Our analysis comprehensively addresses cell inconsistencies,physical interpretations,and charging uncertainties in real-world applications.This is achieved through self-supervised feature extraction using random short charging sequences in the main peak of incremental capacity curves.By leveraging inexpensive unlabeled data in a self-supervised approach,our method demonstrates improvements in average root mean square errors of 74.54%and 60.50%in the best and worst cases,respectively,compared to the supervised benchmark.This work underscores the potential of employing low-cost unlabeled data with self-supervised machine learning for effective battery health and safety management in realworld scenarios. 展开更多
关键词 Lithium-ion battery Aging diagnosis Self-supervised Machine learning unlabeled data
在线阅读 下载PDF
Semi-supervised LIBS quantitative analysis method based on co-training regression model with selection of effective unlabeled samples 被引量:1
2
作者 Xiaomeng LI Huili LU +1 位作者 Jianhong YANG Fu CHANG 《Plasma Science and Technology》 SCIE EI CAS CSCD 2019年第3期114-124,共11页
The accuracy of laser-induced breakdown spectroscopy(LIBS) quantitative method is greatly dependent on the amount of certified standard samples used for training. However, in practical applications, only limited stand... The accuracy of laser-induced breakdown spectroscopy(LIBS) quantitative method is greatly dependent on the amount of certified standard samples used for training. However, in practical applications, only limited standard samples with labeled certified concentrations are available. A novel semi-supervised LIBS quantitative analysis method is proposed, based on co-training regression model with selection of effective unlabeled samples. The main idea of the proposed method is to obtain better regression performance by adding effective unlabeled samples in semisupervised learning. First, effective unlabeled samples are selected according to the testing samples by Euclidean metric. Two original regression models based on least squares support vector machine with different parameters are trained by the labeled samples separately, and then the effective unlabeled samples predicted by the two models are used to enlarge the training dataset based on labeling confidence estimation. The final predictions of the proposed method on the testing samples will be determined by weighted combinations of the predictions of two updated regression models. Chromium concentration analysis experiments of 23 certified standard high-alloy steel samples were carried out, in which 5 samples with labeled concentrations and 11 unlabeled samples were used to train the regression models and the remaining 7 samples were used for testing. With the numbers of effective unlabeled samples increasing, the root mean square error of the proposed method went down from 1.80% to 0.84% and the relative prediction error was reduced from 9.15% to 4.04%. 展开更多
关键词 LIBS EFFECTIVE unlabeled samples CO-TRAINING SEMI-SUPERVISED LABELING CONFIDENCE estimation
在线阅读 下载PDF
Regularized canonical correlation analysis with unlabeled data 被引量:1
3
作者 Xi-chuan ZHOU Hai-bin SHEN 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2009年第4期504-511,共8页
In standard canonical correlation analysis (CCA), the data from definite datasets are used to estimate their canonical correlation. In real applications, for example in bilingual text retrieval, it may have a great po... In standard canonical correlation analysis (CCA), the data from definite datasets are used to estimate their canonical correlation. In real applications, for example in bilingual text retrieval, it may have a great portion of data that we do not know which set it belongs to. This part of data is called unlabeled data, while the rest from definite datasets is called labeled data. We propose a novel method called regularized canonical correlation analysis (RCCA), which makes use of both labeled and unlabeled samples. Specifically, we learn to approximate canonical correlation as if all data were labeled. Then, we describe a generalization of RCCA for the multi-set situation. Experiments on four real world datasets, Yeast, Cloud, Iris, and Haberman, demonstrate that, by incorporating the unlabeled data points, the accuracy of correlation coefficients can be improved by over 30%. 展开更多
关键词 Canonical correlation analysis (CCA) REGULARIZATION unlabeled data Generalized canonical correlation analysis(GCCA)
原文传递
Combining supervised classifiers with unlabeled data
4
作者 刘雪艳 张雪英 +1 位作者 李凤莲 黄丽霞 《Journal of Central South University》 SCIE EI CAS CSCD 2016年第5期1176-1182,共7页
Ensemble learning is a wildly concerned issue.Traditional ensemble techniques are always adopted to seek better results with labeled data and base classifiers.They fail to address the ensemble task where only unlabele... Ensemble learning is a wildly concerned issue.Traditional ensemble techniques are always adopted to seek better results with labeled data and base classifiers.They fail to address the ensemble task where only unlabeled data are available.A label propagation based ensemble(LPBE) approach is proposed to further combine base classification results with unlabeled data.First,a graph is constructed by taking unlabeled data as vertexes,and the weights in the graph are calculated by correntropy function.Average prediction results are gained from base classifiers,and then propagated under a regularization framework and adaptively enhanced over the graph.The proposed approach is further enriched when small labeled data are available.The proposed algorithms are evaluated on several UCI benchmark data sets.Results of simulations show that the proposed algorithms achieve satisfactory performance compared with existing ensemble methods. 展开更多
关键词 correntropy unlabeled data regularization framework ensemble learning
在线阅读 下载PDF
A FLUORESCENCE QUENCHING IMNUNOASSAY METHOD FOR DETERMINATION OF TRACE ALBUMIN USING UNLABELED EUROPIUM CHELATE
5
作者 Feng Ji YAO Mei HONG Xiao Da YANG Yun Xiang CI Department of Chemistry,Peking University,100871 《Chinese Chemical Letters》 SCIE CAS CSCD 1992年第12期1011-1014,共4页
A fluoroimmunoassay method using unlabeled europium chelate is described.The principle is similar to that of fluoroimmunoassay method using lanthanide chelate as labels.The procedure is simple because labeling process... A fluoroimmunoassay method using unlabeled europium chelate is described.The principle is similar to that of fluoroimmunoassay method using lanthanide chelate as labels.The procedure is simple because labeling process is omitted.The detection limit is about 10^(10) mol/L antigen.The relative standard deviation of immunoassay is less than 10%.The recoveries of human serum albumin and estradiol protein conjugate are 96-105% and 111% respectively. 展开更多
关键词 RSA Eu A FLUORESCENCE QUENCHING IMNUNOASSAY METHOD FOR DETERMINATION OF TRACE ALBUMIN USING unlabeled EUROPIUM CHELATE
在线阅读 下载PDF
A QUENCHING FLUORESCENCE IMMUNOASSAY METHOD FOR DETERMINATION OF TRACE ALBUMIN USING UNLABELED TERBIUM CHELATE
6
作者 Feng Ji YAO Pei Hong LI Xiao Da YANG Bo XING Yun Xiang CI (Department of Chemistry,Peking University,100871) 《Chinese Chemical Letters》 SCIE CAS CSCD 1991年第9期737-738,共2页
A fluoroimmunoassay method using unlabeled Terbium chelate is described.The principle is similar to that of fluoroimmunoassay method using lanthanide chelate as labels.The procedure is simpte because labeling process ... A fluoroimmunoassay method using unlabeled Terbium chelate is described.The principle is similar to that of fluoroimmunoassay method using lanthanide chelate as labels.The procedure is simpte because labeling process is unnecessary.The recovery of HSA and albumin in urine is 107% and 95% respectively.The standard deviation is tess than 10%. 展开更多
关键词 FIA A QUENCHING FLUORESCENCE IMMUNOASSAY METHOD FOR DETERMINATION OF TRACE ALBUMIN USING unlabeled TERBIUM CHELATE
在线阅读 下载PDF
A Novel Active Learning Approach for Improving Classification of Unlabeled Video Based on Deep Learning Techniques
7
作者 Mohamed Salama 《Journal of Social Computing》 2025年第1期1-17,共17页
Video classification typically requires large labeled datasets which are costly and time-consuming to obtain.This paper proposes a novel Active Learning(AL)framework to improve video classification performance while m... Video classification typically requires large labeled datasets which are costly and time-consuming to obtain.This paper proposes a novel Active Learning(AL)framework to improve video classification performance while minimizing the human annotation effort.Unlike passive learning methods that randomly select samples for labeling,our approach actively identifies the most informative unlabeled instances to be annotated.Specifically,we develop batch mode AL techniques that select useful videos based on uncertainty and diversity sampling.The algorithm then extracts a diverse set of representative keyframes from the queried videos.Human annotators only need to label these keyframes instead of watching the full videos.We implement this approach by leveraging recent advances in deep neural networks for visual feature extraction and sequence modeling.Our experiments on benchmark datasets demonstrate that our method achieves significant improvements in video classification accuracy with less training data.This enables more efficient video dataset construction and could make large-scale video annotation more feasible.Our AL framework minimizes the human effort needed to train accurate video classifiers. 展开更多
关键词 active learning unlabeled data classification query strategies deep learning video classification annotation costs
原文传递
A recent survey on instance-dependent positive and unlabeled learning
8
作者 Chen Gong Muhammad Imran Zulfiqar +2 位作者 Chuang Zhang Shahid Mahmood Jian Yang 《Fundamental Research》 2025年第2期796-803,共8页
Training with confident positive-labeled instances has received a lot of attention in Positive and Unlabeled(PU)learning tasks,and this is formally termed“Instance-Dependent PU learning”.In instance-dependent PU lea... Training with confident positive-labeled instances has received a lot of attention in Positive and Unlabeled(PU)learning tasks,and this is formally termed“Instance-Dependent PU learning”.In instance-dependent PU learning,whether a positive instance is labeled depends on its labeling confidence.In other words,it is assumed that not all positive instances have the same probability to be included by the positive set.Instead,the instances that are far from the potential decision boundary are with larger probability to be labeled than those that are close to the decision boundary.This setting has practical importance in many real-world applications such as medical diagnosis,outlier detection,object detection,etc.In this survey,we first present the preliminary knowledge of PU learning,and then review the representative instance-dependent PU learning settings and methods.After that,we thoroughly compare them with typical PU learning methods on various benchmark datasets and analyze their performances.Finally,we discuss the potential directions for future research. 展开更多
关键词 Instance-dependent positive and unlabeled LEARNING Weakly supervised learning Label noise learning Cost-sensitive learning Labeling assumption Scoring function
原文传递
Exploiting Unlabeled Data for Neural Grammatical Error Detection 被引量:3
9
作者 Zhuo-Ran Liu Yang Liu 《Journal of Computer Science & Technology》 SCIE EI CSCD 2017年第4期758-767,共10页
Identifying and correcting grammatical errors in the text written by non-native writers have received increasing attention in recent years. Although a number of annotated corpora have been established to facilitate da... Identifying and correcting grammatical errors in the text written by non-native writers have received increasing attention in recent years. Although a number of annotated corpora have been established to facilitate data-driven grammatical error detection and correction approaches, they are still limited in terms of quantity and coverage because human annotation is labor-intensive, time-consuming, and expensive. In this work, we propose to utilize unlabeled data to train neural network based grammatical error detection models. The basic idea is to cast error detection as a binary classification problem and derive positive and negative training examples from unlabeled data. We introduce an attention-based neural network to capture long-distance dependencies that influence the word being detected. Experiments show that the proposed approach significantly outperforms SVM and convolutional networks with fixed-size context window. 展开更多
关键词 unlabeled data grammatical error detection neural network
原文传递
Effcient poisoning attacks and defenses for unlabeled data in DDoS prediction of intelligent transportation systems 被引量:1
10
作者 Zhong Li Xianke Wu Changjun Jiang 《Security and Safety》 2022年第1期145-165,共21页
Nowadays,large numbers of smart sensors(e.g.,road-side cameras)which com-municate with nearby base stations could launch distributed denial of services(DDoS)attack storms in intelligent transportation systems.DDoS att... Nowadays,large numbers of smart sensors(e.g.,road-side cameras)which com-municate with nearby base stations could launch distributed denial of services(DDoS)attack storms in intelligent transportation systems.DDoS attacks disable the services provided by base stations.Thus in this paper,considering the uneven communication traffic ows and privacy preserving,we give a hidden Markov model-based prediction model by utilizing the multi-step characteristic of DDoS with a federated learning framework to predict whether DDoS attacks will happen on base stations in the future.However,in the federated learning,we need to consider the problem of poisoning attacks due to malicious participants.The poisoning attacks will lead to the intelligent transportation systems paralysis without security protection.Traditional poisoning attacks mainly apply to the classi cation model with labeled data.In this paper,we propose a reinforcement learning-based poisoningmethod speci cally for poisoning the prediction model with unlabeled data.Besides,previous related defense strategies rely on validation datasets with labeled data in the server.However,it is unrealistic since the local training datasets are not uploaded to the server due to privacy preserving,and our datasets are also unlabeled.Furthermore,we give a validation dataset-free defense strategy based on Dempster-Shafer(D-S)evidence theory avoiding anomaly aggregation to obtain a robust global model for precise DDoS prediction.In our experiments,we simulate 3000 points in combination with DARPA2000 dataset to carry out evaluations.The results indicate that our poisoning method can successfully poison the global prediction model with unlabeled data in a short time.Meanwhile,we compare our proposed defense algorithm with three popularly used defense algorithms.The results show that our defense method has a high accuracy rate of excluding poisoners and can obtain a high attack prediction probability. 展开更多
关键词 Poisoning attacks DEFENSES Multi-step DDoS prediction unlabeled data Intel-ligent transportation systems
原文传递
Entire Solution Path for Support Vector Machine for Positive and Unlabeled Classification
11
作者 姚利敏 唐杰 李涓子 《Tsinghua Science and Technology》 SCIE EI CAS 2009年第2期242-251,共10页
Support vector machines (SVMs) aim to find an optimal separating hyper-plane that maximizes separation between two classes of training examples (more precisely, maximizes the margin between the two classes of examp... Support vector machines (SVMs) aim to find an optimal separating hyper-plane that maximizes separation between two classes of training examples (more precisely, maximizes the margin between the two classes of examples). The choice of the cost parameter for training the SVM model is always a critical issue. This analysis studies how the cost parameter determines the hyper-plane; especially for classifications using only positive data and unlabeled data. An algorithm is given for the entire solution path by choosing the 'best' cost parameter while training the SVM model. The performance of the algorithm is compared with conventional implementations that use default values as the cost parameter on two synthetic data sets and two real-world data sets. The results show that the algorithm achieves better results when dealing with positive data and unlabeled classification. 展开更多
关键词 support vector machine cost parameter positive and unlabeled classification
原文传递
Shapelet Based Two-Step Time Series Positive and Unlabeled Learning
12
作者 张翰博 王鹏 +1 位作者 张明明 汪卫 《Journal of Computer Science & Technology》 SCIE EI CSCD 2023年第6期1387-1402,共16页
In the last decade,there has been significant progress in time series classification.However,in real-world in-dustrial settings,it is expensive and difficult to obtain high-quality labeled data.Therefore,the positive ... In the last decade,there has been significant progress in time series classification.However,in real-world in-dustrial settings,it is expensive and difficult to obtain high-quality labeled data.Therefore,the positive and unlabeled learning(PU-learning)problem has become more and more popular recently.The current PU-learning approaches of the time series data suffer from low accuracy due to the lack of negative labeled time series.In this paper,we propose a novel shapelet based two-step(2STEP)PU-learning approach.In the first step,we generate shapelet features based on the posi-tive time series,which are used to select a set of negative examples.In the second step,based on both positive and nega-tive time series,we select the final features and build the classification model.The experimental results show that our 2STEP approach can improve the average F1 score on 15 datasets by 9.1%compared with the baselines,and achieves the highest F1 score on 10 out of 15 time series datasets. 展开更多
关键词 positive unlabeled learning time series shapelet
原文传递
一种基于聚类的PU主动文本分类方法 被引量:24
13
作者 刘露 彭涛 +1 位作者 左万利 戴耀康 《软件学报》 EI CSCD 北大核心 2013年第11期2571-2583,共13页
文本分类是信息检索的关键问题之一.提取更多的可信反例和构造准确高效的分类器是PU(positive and unlabeled)文本分类的两个重要问题.然而,在现有的可信反例提取方法中,很多方法提取的可信反例数量较少,构建的分类器质量有待提高.分别... 文本分类是信息检索的关键问题之一.提取更多的可信反例和构造准确高效的分类器是PU(positive and unlabeled)文本分类的两个重要问题.然而,在现有的可信反例提取方法中,很多方法提取的可信反例数量较少,构建的分类器质量有待提高.分别针对这两个重要步骤提供了一种基于聚类的半监督主动分类方法.与传统的反例提取方法不同,利用聚类技术和正例文档应与反例文档共享尽可能少的特征项这一特点,从未标识数据集中尽可能多地移除正例,从而可以获得更多的可信反例.结合SVM主动学习和改进的Rocchio构建分类器,并采用改进的TFIDF(term frequency inverse document frequency)进行特征提取,可以显著提高分类的准确度.分别在3个不同的数据集中测试了分类结果(RCV1,Reuters-21578,20 Newsgoups).实验结果表明,基于聚类寻找可信反例可以在保持较低错误率的情况下获取更多的可信反例,而且主动学习方法的引入也显著提升了分类精度. 展开更多
关键词 PU(FIositive and unlabeled)文本分类 聚类 TFIPNDF(term FREQUENCY inverse positive negative document frequency) 主动学习 可信反例 改进的Rocchio
在线阅读 下载PDF
基于PUL算法及高分辨率WorldView影像的城市不透水面提取 被引量:6
14
作者 刘冉 李文楷 +2 位作者 刘小平 陈逸敏 刘珍环 《地理与地理信息科学》 CSCD 北大核心 2018年第1期40-46,130,共8页
准确提取城市不透水面对生态环境、水热循环及热岛效应等研究具有重要意义。该文利用WorldView高分辨遥感影像,提出基于PUL(Positive and Unlabeled Learning)算法的高分辨率影像城市不透水面提取方法,该方法不需要负样本数据,只需少量... 准确提取城市不透水面对生态环境、水热循环及热岛效应等研究具有重要意义。该文利用WorldView高分辨遥感影像,提出基于PUL(Positive and Unlabeled Learning)算法的高分辨率影像城市不透水面提取方法,该方法不需要负样本数据,只需少量的正样本和未标记样本即可训练分类模型。结果显示,PUL算法的提取结果优于一类支持向量机(OCSVM)以及最大熵(MAXENT)模型。使用不同正样本量时,PUL的提取结果总体精度和kappa系数均优于OCSVM和MAXENT,最高总体精度为91.27%,最高kappa系数可达0.8255,可快速、有效地从高分辨率遥感影像中提取不透水面。 展开更多
关键词 城市不透水面 POSITIVE and unlabeled Learning(PUL) 一类支持向量机(OCSVM) 最大熵(MAXENT)模型
在线阅读 下载PDF
Improved spatiotemporal resolution of anti-scattering super-resolution label-free microscopy via synthetic wave 3D metalens imaging 被引量:4
15
作者 Yuting Xiao Lianwei Chen +5 位作者 Mingbo Pu Mingfeng Xu Qi Zhang Yinghui Guo Tianqu Chen Xiangang Luo 《Opto-Electronic Science》 2023年第11期4-13,共10页
Super-resolution(SR)microscopy has dramatically enhanced our understanding of biological processes.However,scattering media in thick specimens severely limits the spatial resolution,often rendering the images unclear ... Super-resolution(SR)microscopy has dramatically enhanced our understanding of biological processes.However,scattering media in thick specimens severely limits the spatial resolution,often rendering the images unclear or indistinguishable.Additionally,live-cell imaging faces challenges in achieving high temporal resolution for fast-moving subcellular structures.Here,we present the principles of a synthetic wave microscopy(SWM)to extract three-dimensional information from thick unlabeled specimens,where photobleaching and phototoxicity are avoided.SWM exploits multiple-wave interferometry to reveal the specimen’s phase information in the area of interest,which is not affected by the scattering media in the optical path.SWM achieves~0.42λ/NA resolution at an imaging speed of up to 106 pixels/s.SWM proves better temporal resolution and sensitivity than the most conventional microscopes currently available while maintaining exceptional SR and anti-scattering capabilities.Penetrating through the scattering media is challenging for conventional imaging techniques.Remarkably,SWM retains its efficacy even in conditions of low signal-to-noise ratios.It facilitates the visualization of dynamic subcellular structures in live cells,encompassing tubular endoplasmic reticulum(ER),lipid droplets,mitochondria,and lysosomes. 展开更多
关键词 SUPER-RESOLUTION anti-scattering unlabeled high temporal resolution
在线阅读 下载PDF
Iterative Semi-Supervised Learning Using Softmax Probability 被引量:1
16
作者 Heewon Chung Jinseok Lee 《Computers, Materials & Continua》 SCIE EI 2022年第9期5607-5628,共22页
For the classification problem in practice,one of the challenging issues is to obtain enough labeled data for training.Moreover,even if such labeled data has been sufficiently accumulated,most datasets often exhibit l... For the classification problem in practice,one of the challenging issues is to obtain enough labeled data for training.Moreover,even if such labeled data has been sufficiently accumulated,most datasets often exhibit long-tailed distribution with heavy class imbalance,which results in a biased model towards a majority class.To alleviate such class imbalance,semisupervised learning methods using additional unlabeled data have been considered.However,as a matter of course,the accuracy is much lower than that from supervised learning.In this study,under the assumption that additional unlabeled data is available,we propose the iterative semi-supervised learning algorithms,which iteratively correct the labeling of the extra unlabeled data based on softmax probabilities.The results show that the proposed algorithms provide the accuracy as high as that from the supervised learning.To validate the proposed algorithms,we tested on the two scenarios:with the balanced unlabeled dataset and with the imbalanced unlabeled dataset.Under both scenarios,our proposed semi-supervised learning algorithms provided higher accuracy than previous state-of-the-arts.Code is available at https://github.com/HeewonChung92/iterative-semi-learning. 展开更多
关键词 Semi-supervised learning class imbalance iterative learning unlabeled data
在线阅读 下载PDF
Research and Implementation of Unsupervised Clustering-Based Intrusion Detection
17
作者 Luo Min, Zhang Huan\|guo, Wang Li\|na School of Computer, Wuhan University, Wuhan 430072, Hubei, China 《Wuhan University Journal of Natural Sciences》 CAS 2003年第03A期803-807,共5页
An unsupervised clustering\|based intrusion detection algorithm is discussed in this paper. The basic idea of the algorithm is to produce the cluster by comparing the distances of unlabeled training data sets. With th... An unsupervised clustering\|based intrusion detection algorithm is discussed in this paper. The basic idea of the algorithm is to produce the cluster by comparing the distances of unlabeled training data sets. With the classified data instances, anomaly data clusters can be easily identified by normal cluster ratio and the identified cluster can be used in real data detection. The benefit of the algorithm is that it doesn't need labeled training data sets. The experiment concludes that this approach can detect unknown intrusions efficiently in the real network connections via using the data sets of KDD99. 展开更多
关键词 intrusion detection data mining unsupervised clustering unlabeled data
在线阅读 下载PDF
Disease Prediction Based on Transfer Learning in Individual Healthcare
18
作者 Yang Song Tianbai Yue +2 位作者 Hongzhi Wang Jianzhong Li Hong Gao 《国际计算机前沿大会会议论文集》 2017年第1期28-30,共3页
Nowadays,emerging mobile medical technology and disease prevention become new trends of disease prevention and control.Based on this technology,we present disease prediction models based on transfer learning.Breast ca... Nowadays,emerging mobile medical technology and disease prevention become new trends of disease prevention and control.Based on this technology,we present disease prediction models based on transfer learning.Breast cancer disease data has been used to build our model.According to the neural networks,the basic model has been provided.With unlabeled data,transfer learning is a appropriate way to revise the module to increase accuracy.The test results show that the algorithm is suitable for data classification,especially for unlabeled health data. 展开更多
关键词 INDIVIDUAL healthcare TRANSFER LEARNING NEURAL NETWORKS DISEASE prediction unlabeled data
在线阅读 下载PDF
Geostatistical semi-supervised learning for spatial prediction
19
作者 Francky Fouedjio Hassan Talebi 《Artificial Intelligence in Geosciences》 2022年第1期162-178,共17页
Geoscientists are increasingly tasked with spatially predicting a target variable in the presence of auxiliary information using supervised machine learning algorithms.Typically,the target variable is observed at a fe... Geoscientists are increasingly tasked with spatially predicting a target variable in the presence of auxiliary information using supervised machine learning algorithms.Typically,the target variable is observed at a few sampling locations due to the relatively time-consuming and costly process of obtaining measurements.In contrast,auxiliary variables are often exhaustively observed within the region under study through the increasing development of remote sensing platforms and sensor networks.Supervised machine learning methods do not fully leverage this large amount of auxiliary spatial data.Indeed,in these methods,the training dataset includes only labeled data locations(where both target and auxiliary variables were measured).At the same time,unlabeled data locations(where auxiliary variables were measured but not the target variable)are not considered during the model training phase.Consequently,only a limited amount of auxiliary spatial data is utilized during the model training stage.As an alternative to supervised learning,semi-supervised learning,which learns from labeled as well as unlabeled data,can be used to address this problem.However,conventional semi-supervised learning techniques do not account for the specificities of spatial data.This paper introduces a spatial semi-supervised learning framework where geostatistics and machine learning are combined to harness a large amount of unlabeled spatial data in combination with typically a smaller set of labeled spatial data.The main idea consists of leveraging the target variable’s spatial autocorrelation to generate pseudo labels at unlabeled data points that are geographically close to labeled data points.This is achieved through geostatistical conditional simulation,where an ensemble of pseudo labels is generated to account for the uncertainty in the pseudo labeling process.The observed labels are augmented by this ensemble of pseudo labels to create an ensemble of pseudo training datasets.A supervised machine learning model is then trained on each pseudo training dataset,followed by an aggregation of trained models.The proposed geostatistical semi-supervised learning method is applied to synthetic and real-world spatial datasets.Its predictive performance is compared with some classical supervised and semi-supervised machine learning methods.It appears that it can effectively leverage a large amount of unlabeled spatial data to improve the target variable’s spatial prediction. 展开更多
关键词 Labeled spatial data unlabeled spatial data Spatial autocorrelation Pseudo labeling Spatial prediction
在线阅读 下载PDF
Improving sound event detection through enhanced feature extraction and attention mechanisms
20
作者 Dongping ZHANG Siyi WU +3 位作者 Zhanhong LU Zhehao ZHANG Haimiao HU Jiabin YU 《Frontiers of Computer Science》 2025年第10期143-145,共3页
1 Introduction Sound event detection(SED)aims to identify and locate specific sound event categories and their corresponding timestamps within continuous audio streams.To overcome the limitations posed by the scarcity... 1 Introduction Sound event detection(SED)aims to identify and locate specific sound event categories and their corresponding timestamps within continuous audio streams.To overcome the limitations posed by the scarcity of strongly labeled training data,researchers have increasingly turned to semi-supervised learning(SSL)[1],which leverages unlabeled data to augment training and improve detection performance.Among many SSL methods[2-4]. 展开更多
关键词 sound event detection semi supervised learning feature extraction sound event detection sed aims identify locate specific sound event categories augment training unlabeled data attention mechanisms
原文传递
上一页 1 2 下一页 到第
使用帮助 返回顶部