期刊文献+
共找到11篇文章
< 1 >
每页显示 20 50 100
Influence of different data selection criteria on internal geomagnetic field modeling 被引量:4
1
作者 HongBo Yao JuYuan Xu +3 位作者 Yi Jiang Qing Yan Liang Yin PengFei Liu 《Earth and Planetary Physics》 2025年第3期541-549,共9页
Earth’s internal core and crustal magnetic fields,as measured by geomagnetic satellites like MSS-1(Macao Science Satellite-1)and Swarm,are vital for understanding core dynamics and tectonic evolution.To model these i... Earth’s internal core and crustal magnetic fields,as measured by geomagnetic satellites like MSS-1(Macao Science Satellite-1)and Swarm,are vital for understanding core dynamics and tectonic evolution.To model these internal magnetic fields accurately,data selection based on specific criteria is often employed to minimize the influence of rapidly changing current systems in the ionosphere and magnetosphere.However,the quantitative impact of various data selection criteria on internal geomagnetic field modeling is not well understood.This study aims to address this issue and provide a reference for constructing and applying geomagnetic field models.First,we collect the latest MSS-1 and Swarm satellite magnetic data and summarize widely used data selection criteria in geomagnetic field modeling.Second,we briefly describe the method to co-estimate the core,crustal,and large-scale magnetospheric fields using satellite magnetic data.Finally,we conduct a series of field modeling experiments with different data selection criteria to quantitatively estimate their influence.Our numerical experiments confirm that without selecting data from dark regions and geomagnetically quiet times,the resulting internal field differences at the Earth’s surface can range from tens to hundreds of nanotesla(nT).Additionally,we find that the uncertainties introduced into field models by different data selection criteria are significantly larger than the measurement accuracy of modern geomagnetic satellites.These uncertainties should be considered when utilizing constructed magnetic field models for scientific research and applications. 展开更多
关键词 Macao Science Satellite-1 SWARM geomagnetic field modeling data selection core field crustal field
在线阅读 下载PDF
Machine Learning-Based Two-Stage Data Selection Scheme for Long-Term Influenza Forecasting 被引量:1
2
作者 Jaeuk Moon Seungwon Jung +1 位作者 Sungwoo Park Eenjun Hwang 《Computers, Materials & Continua》 SCIE EI 2021年第9期2945-2959,共15页
One popular strategy to reduce the enormous number of illnesses and deaths from a seasonal influenza pandemic is to obtain the influenza vaccine on time.Usually,vaccine production preparation must be done at least six... One popular strategy to reduce the enormous number of illnesses and deaths from a seasonal influenza pandemic is to obtain the influenza vaccine on time.Usually,vaccine production preparation must be done at least six months in advance,and accurate long-term influenza forecasting is essential for this.Although diverse machine learning models have been proposed for influenza forecasting,they focus on short-term forecasting,and their performance is too dependent on input variables.For a country’s longterm influenza forecasting,typical surveillance data are known to be more effective than diverse external data on the Internet.We propose a two-stage data selection scheme for worldwide surveillance data to construct a longterm forecasting model for influenza in the target country.In the first stage,using a simple forecasting model based on the country’s surveillance data,we measured the change in performance by adding surveillance data from other countries,shifted by up to 52 weeks.In the second stage,for each set of surveillance data sorted by accuracy,we incrementally added data as input if the data have a positive effect on the performance of the forecasting model in the first stage.Using the selected surveillance data,we trained a new longterm forecasting model for influenza and perform influenza forecasting for the target country.We conducted extensive experiments using six machine learning models for the three target countries to verify the effectiveness of the proposed method.We report some of the results. 展开更多
关键词 INFLUENZA data selection machine learning forecasting
在线阅读 下载PDF
Data Selection Using Support Vector Regression
3
作者 Michael B.RICHMAN Lance M.LESLIE +1 位作者 Theodore B.TRAFALIS Hicham MANSOURI 《Advances in Atmospheric Sciences》 SCIE CAS CSCD 2015年第3期277-286,共10页
Geophysical data sets are growing at an ever-increasing rate,requiring computationally efficient data selection (thinning) methods to preserve essential information.Satellites,such as WindSat,provide large data sets... Geophysical data sets are growing at an ever-increasing rate,requiring computationally efficient data selection (thinning) methods to preserve essential information.Satellites,such as WindSat,provide large data sets for assessing the accuracy and computational efficiency of data selection techniques.A new data thinning technique,based on support vector regression (SVR),is developed and tested.To manage large on-line satellite data streams,observations from WindSat are formed into subsets by Voronoi tessellation and then each is thinned by SVR (TSVR).Three experiments are performed.The first confirms the viability of TSVR for a relatively small sample,comparing it to several commonly used data thinning methods (random selection,averaging and Barnes filtering),producing a 10% thinning rate (90% data reduction),low mean absolute errors (MAE) and large correlations with the original data.A second experiment,using a larger dataset,shows TSVR retrievals with MAE < 1 m s-1 and correlations ≥ 0.98.TSVR was an order of magnitude faster than the commonly used thinning methods.A third experiment applies a two-stage pipeline to TSVR,to accommodate online data.The pipeline subsets reconstruct the wind field with the same accuracy as the second experiment,is an order of magnitude faster than the nonpipeline TSVR.Therefore,pipeline TSVR is two orders of magnitude faster than commonly used thinning methods that ingest the entire data set.This study demonstrates that TSVR pipeline thinning is an accurate and computationally efficient alternative to commonly used data selection techniques. 展开更多
关键词 data selection data thinning machine learning support vector regression Voronoi tessellation pipeline methods
在线阅读 下载PDF
SelectQ:Calibration Data Selection for Post-training Quantization
4
作者 Zhao Zhang Yangcheng Gao +3 位作者 Jicong Fan Zhongqiu Zhao Yi Yang Shuicheng Yan 《Machine Intelligence Research》 2025年第3期499-510,共12页
Post-training quantization(PTQ)can reduce the memory footprint and latency of deep model inference while still preserving the accuracy of model,with only a small unlabeled calibration set and without the retraining on... Post-training quantization(PTQ)can reduce the memory footprint and latency of deep model inference while still preserving the accuracy of model,with only a small unlabeled calibration set and without the retraining on full training set.To calibrate a quantized model,current PTQ methods usually randomly select some unlabeled data from the training set as calibration data.However,we show the random data selection would result in performance instability and degradation due to the activation distribution mismatch.In this paper,we attempt to solve the crucial task on appropriate calibration data selection,and propose a novel one-shot calibration data selection method termed SelectQ,which selects specific data for calibration via dynamic clustering.The setting of our SelectQ uses the statistic information of activation and performs layer-wise clustering to learn an activation distribution on training set.For that purpose,a new metric called knowledge distance is proposed to calculate the distances of the activation statistics to centroids.Finally,after calibration with the selected data,quantization noise can be alleviated by mitigating the distribution mismatch within activations.Extensive experiments on ImageNet dataset show that our SelectQ increases the top-1 accuracy of ResNet18 over 15% in 4-bit quantization,compared to randomly sampled calibration data.It's noteworthy that SelectQ does not involve both the backward propagation and batch normalization parameters,which means that it has fewer limitations in practical applications. 展开更多
关键词 Model compression low-bit model quantization less performance loss one-shot dynamic clustering calibration data selection
原文传递
Improving meta-analysis methodology:The necessity of prediction intervals and homogeneous data selection
5
作者 Hela Cherif Chadli Dziri 《Asia Pacific Journal of Clinical Nutrition》 2025年第2期262-264,共3页
Dear Editor,We read with interest the meta-analysis by Cho et al1 on Obesity and mortality in patients with COVID-19:A me ta-analysis of prospective studies,and congratulate the authors.However,we have a few comments ... Dear Editor,We read with interest the meta-analysis by Cho et al1 on Obesity and mortality in patients with COVID-19:A me ta-analysis of prospective studies,and congratulate the authors.However,we have a few comments concerning the methodology. 展开更多
关键词 meta analysis homogeneous data selection prediction intervals
暂未订购
Data point selection for weighted least square fitting of cavity decay time constant 被引量:1
6
作者 何星 晏虎 +2 位作者 董理治 杨平 许冰 《Chinese Physics B》 SCIE EI CAS CSCD 2016年第1期640-646,共7页
For the accurate extraction of cavity decay time, a selection of data points is supplemented to the weighted least square method. We derive the expected precision, accuracy and computation cost of this improved method... For the accurate extraction of cavity decay time, a selection of data points is supplemented to the weighted least square method. We derive the expected precision, accuracy and computation cost of this improved method, and examine these performances by simulation. By comparing this method with the nonlinear least square fitting (NLSF) method and the linear regression of the sum (LRS) method in derivations and simulations, we find that this method can achieve the same or even better precision, comparable accuracy, and lower computation cost. We test this method by experimental decay signals. The results are in agreement with the ones obtained from the nonlinear least square fitting method. 展开更多
关键词 cavity ring-down decay time extraction weighted least square method data point selection
原文传递
Cloud Data Center Selection Using a Modified Differential Evolution 被引量:1
7
作者 Yousef Sanjalawe Mohammed Anbar +3 位作者 Salam Al-E’mari Rosni Abdullah Iznan Hasbullah Mohammed Aladaileh 《Computers, Materials & Continua》 SCIE EI 2021年第12期3179-3204,共26页
The interest in selecting an appropriate cloud data center is exponentially increasing due to the popularity and continuous growth of the cloud computing sector.Cloud data center selection challenges are compounded by... The interest in selecting an appropriate cloud data center is exponentially increasing due to the popularity and continuous growth of the cloud computing sector.Cloud data center selection challenges are compounded by ever-increasing users’requests and the number of data centers required to execute these requests.Cloud service broker policy defines cloud data center’s selection,which is a case of an NP-hard problem that needs a precise solution for an efficient and superior solution.Differential evolution algorithm is a metaheuristic algorithm characterized by its speed and robustness,and it is well suited for selecting an appropriate cloud data center.This paper presents a modified differential evolution algorithm-based cloud service broker policy for the most appropriate data center selection in the cloud computing environment.The differential evolution algorithm is modified using the proposed new mutation technique ensuring enhanced performance and providing an appropriate selection of data centers.The proposed policy’s superiority in selecting the most suitable data center is evaluated using the CloudAnalyst simulator.The results are compared with the state-of-arts cloud service broker policies. 展开更多
关键词 Cloud computing data center data center selection cloud service broker differential evolution user request
在线阅读 下载PDF
Selection of Spectral Data for Classification of Steels Using Laser-Induced Breakdown Spectroscopy 被引量:3
8
作者 孔海洋 孙兰香 +2 位作者 胡静涛 辛勇 丛智博 《Plasma Science and Technology》 SCIE EI CAS CSCD 2015年第11期964-970,共7页
Principal component analysis (PCA) combined with artificial neural networks was used to classify the spectra of 27 steel samples acquired using laser-induced breakdown spectroscopy. Three methods of spectral data se... Principal component analysis (PCA) combined with artificial neural networks was used to classify the spectra of 27 steel samples acquired using laser-induced breakdown spectroscopy. Three methods of spectral data selection, selecting all the peak lines of the spectra, selecting intensive spectral partitions and the whole spectra, were utilized to compare the infiuence of different inputs of PCA on the classification of steels. Three intensive partitions were selected based on experience and prior knowledge to compare the classification, as the partitions can obtain the best results compared to all peak lines and the whole spectra. We also used two test data sets, mean spectra after being averaged and raw spectra without any pretreatment, to verify the results of the classification. The results of this comprehensive comparison show that a back propagation network trained using the principal components of appropriate, carefully selecred spectral partitions can obtain the best results accuracy can be achieved using the intensive spectral A perfect result with 100% classification partitions ranging of 357-367 nm. 展开更多
关键词 laser-induced breakdown spectroscopy classification of steel samples principal component analysis artificial neural networks selection of spectral data
在线阅读 下载PDF
Analyzing Electricity Consumption via Data Mining 被引量:1
9
作者 LIU Jinshuo LAN Huiying +2 位作者 FU Yizhen WU Hui LI Peng 《Wuhan University Journal of Natural Sciences》 CAS 2012年第2期121-125,共5页
This paper proposes a model to analyze the massive data of electricity.Feature subset is determined by the correla-tion-based feature selection and the data-driven methods.The attribute season can be classified succes... This paper proposes a model to analyze the massive data of electricity.Feature subset is determined by the correla-tion-based feature selection and the data-driven methods.The attribute season can be classified successfully through five classi-fiers using the selected feature subset,and the best model can be determined further.The effects on analyzing electricity consump-tion of the other three attributes,including months,businesses,and meters,can be estimated using the chosen model.The data used for the project is provided by Beijing Power Supply Bureau.We use WEKA as the machine learning tool.The models we built are promising for electricity scheduling and power theft detection. 展开更多
关键词 feature selection multi-classification prediction model data analysis
原文传递
A procedure to assess the spatial variability in the importance of abiotic factors affecting distributions: the case of world freshwater fishes 被引量:2
10
作者 Ana M. MANJARRES-HERNANDEZ Castor GUISANDE +7 位作者 Emilio GARCiA-ROSELLO Patricia PELAYO-VILLAMIL Jacinto GONZALEZ-DAcOSTA Juirgen HEINE Luis GONZALEZ VILAS Carlos GRANADO-LORENClO Santiago R. DUQUE Jorge M. LOBO 《Current Zoology》 SCIE CAS CSCD 2018年第5期549-557,共9页
Understanding the factors shaping species' distributions is a key longstanding topic in ecology with unresolved issues. The aims were to test whether the relative contribution of abiotic factors that set the geograph... Understanding the factors shaping species' distributions is a key longstanding topic in ecology with unresolved issues. The aims were to test whether the relative contribution of abiotic factors that set the geographical range of freshwater fish species may vary spatially and/or may depend on the geographical extent that is being considered. The relative contribution of factors, to discriminate between the conditions prevailing in the area where the species is present and those existing in the considered extent, was estimated with the instability index included in the R pack- age SPEDInstabR. We used 3 different extent sizes: 1) each river basin where the species is present (local); 2) all river basins where the species is present (regional); and 3) the whole Earth (global). We used a data set of 16,543 freshwater fish species with a total of 845,764 geographical records, together with bioclimatic and topographic variables. Factors associated with tempera- ture and altitude show the highest relative contribution to explain the distribution of freshwater fishes at the smaller considered extent. Altitude and a mix of factors associated with temperature and precipitation were more important when using the regional extent. Factors associated with precipitation show the highest contribution when using the global extent. There was also spatial variability in the importance of factors, both between species and within species and from region to region. Factors associated with precipitation show a clear latitudinal trend of decreasing in importance toward the equator. 展开更多
关键词 anisotropic predictors environmental data selection geographical background non-stationary predictors
原文传递
A novel pure data-selection framework for day-ahead wind power forecasting
11
作者 Ying Chen Jingjing Zhao +2 位作者 Jiancheng Qin Hua Li Zili Zhang 《Fundamental Research》 CAS CSCD 2023年第3期392-402,共11页
Numerical weather prediction(NWP)data possess internal inaccuracies,such as low NWP wind speed corresponding to high actual wind power generation.This study is intended to reduce the negative effects of such inaccurac... Numerical weather prediction(NWP)data possess internal inaccuracies,such as low NWP wind speed corresponding to high actual wind power generation.This study is intended to reduce the negative effects of such inaccuracies by proposing a pure data-selection framework(PDF)to choose useful data prior to modeling,thus improving the accuracy of day-ahead wind power forecasting.Briefly,we convert an entire NWP training dataset into many small subsets and then select the best subset combination via a validation set to build a forecasting model.Although a small subset can increase selection flexibility,it can also produce billions of subset combinations,resulting in computational issues.To address this problem,we incorporated metamodeling and optimization steps into PDF.We then proposed a design and analysis of the computer experiments-based metamodeling algorithm and heuristic-exhaustive search optimization algorithm,respectively.Experimental results demonstrate that(1)it is necessary to select data before constructing a forecasting model;(2)using a smaller subset will likely increase selection flexibility,leading to a more accurate forecasting model;(3)PDF can generate a better training dataset than similarity-based data selection methods(e.g.,K-means and support vector classification);and(4)choosing data before building a forecasting model produces a more accurate forecasting model compared with using a machine learning method to construct a model directly. 展开更多
关键词 Day-ahead wind power forecasting data selection Design and analysis of computer experiments Heuristic optimization Numerical weather prediction data
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部