The acquisition of precise soil data representative of the entire survey area, is a critical issue for many treatments such as irrigation or fertilization in precision agriculture. The aim of this study was to investi...The acquisition of precise soil data representative of the entire survey area, is a critical issue for many treatments such as irrigation or fertilization in precision agriculture. The aim of this study was to investigate the spatial variability of soil bulk electrical conductivity (ECb) in a coastal saline field and design an optimized spatial sampling scheme of ECb based on a sampling design algorithm, the variance quad-tree (VQT) method. Soil ECb data were collected from the field at 20 m interval in a regular grid scheme. The smooth contour map of the whole field was obtained by ordinary kriging interpolation, VQT algorithm was then used to split the smooth contour map into strata of different number desired, the sampling locations can be selected within each stratum in subsequent sampling. The result indicated that the probability of choosing representative sampling sites was increased significantly by using VQT method with the sampling number being greatly reduced compared to grid sampling design while retaining the same prediction accuracy. The advantage of the VQT method is that this scheme samples sparsely in fields where the spatial variability is relatively uniform and more intensive where the variability is large. Thus the sampling efficiency can be improved, hence facilitate an assessment methodology that can be applied in a rapid, practical and cost-effective manner.展开更多
The type 2 modified augmented design(MAD2) is an efficient unreplicated experimental design used for evaluating large numbers of lines in plant breeding and for assessing genetic variation in a population. Statistical...The type 2 modified augmented design(MAD2) is an efficient unreplicated experimental design used for evaluating large numbers of lines in plant breeding and for assessing genetic variation in a population. Statistical methods and data adjustment for soil heterogeneity have been previously described for this design. In the absence of replicated test genotypes in MAD2, their total variance cannot be partitioned into genetic and error components as required to estimate heritability and genetic correlation of quantitative traits, the two conventional genetic parameters used for breeding selection. We propose a method of estimating the error variance of unreplicated genotypes that uses replicated controls, and then of estimating the genetic parameters. Using the Delta method, we also derived formulas for estimating the sampling variances of the genetic parameters.Computer simulations indicated that the proposed method for estimating genetic parameters and their sampling variances was feasible and the reliability of the estimates was positively associated with the level of heritability of the trait. A case study of estimating the genetic parameters of three quantitative traits, iodine value, oil content, and linolenic acid content, in a biparental recombinant inbred line population of flax with 243 individuals, was conducted using our statistical models. A joint analysis of data over multiple years and sites was suggested for genetic parameter estimation. A pipeline module using SAS and Perl was developed to facilitate data analysis and appended to the previously developed MAD data analysis pipeline(http://probes.pw.usda.gov/bioinformatics_ tools/MADPipeline/index.html).展开更多
Global variance reduction is a bottleneck in Monte Carlo shielding calculations.The global variance reduction problem requires that the statistical error of the entire space is uniform.This study proposed a grid-AIS m...Global variance reduction is a bottleneck in Monte Carlo shielding calculations.The global variance reduction problem requires that the statistical error of the entire space is uniform.This study proposed a grid-AIS method for the global variance reduction problem based on the AIS method,which was implemented in the Monte Carlo program MCShield.The proposed method was validated using the VENUS-Ⅲ international benchmark problem and a self-shielding calculation example.The results from the VENUS-Ⅲ benchmark problem showed that the grid-AIS method achieved a significant reduction in the variance of the statistical errors of the MESH grids,decreasing from 1.08×10^(-2) to 3.84×10^(-3),representing a 64.00% reduction.This demonstrates that the grid-AIS method is effective in addressing global issues.The results of the selfshielding calculation demonstrate that the grid-AIS method produced accurate computational results.Moreover,the grid-AIS method exhibited a computational efficiency approximately one order of magnitude higher than that of the AIS method and approximately two orders of magnitude higher than that of the conventional Monte Carlo method.展开更多
Background:The double sampling method known as“big BAF sampling”has been advocated as a way to reduce sampling effort while still maintaining a reasonably precise estimate of volume.A well-known method for variance ...Background:The double sampling method known as“big BAF sampling”has been advocated as a way to reduce sampling effort while still maintaining a reasonably precise estimate of volume.A well-known method for variance determination,Bruce’s method,is customarily used because the volume estimator takes the form of a product of random variables.However,the genesis of Bruce’s method is not known to most foresters who use the method in practice.Methods:We establish that the Taylor series approximation known as the Delta method provides a plausible explanation for the origins of Bruce’s method.Simulations were conducted on two different tree populations to ascertain the similarities of the Delta method to the exact variance of a product.Additionally,two alternative estimators for the variance of individual tree volume-basal area ratios,which are part of the estimation process,were compared within the overall variance estimation procedure.Results:The simulation results demonstrate that Bruce’s method provides a robust method for estimating the variance of inventories conducted with the big BAF method.The simulations also demonstrate that the variance of the mean volume-basal area ratios can be computed using either the usual sample variance of the mean or the ratio variance estimators with equal accuracy,which had not been shown previously for Big BAF sampling.Conclusions:A plausible explanation for the origins of Bruce’s method has been set forth both historically and mathematically in the Delta Method.In most settings,there is evidently no practical difference between applying the exact variance of a product or the Delta method—either can be used.A caution is articulated concerning the aggregation of tree-wise attributes into point-wise summaries in order to test the correlation between the two as a possible indicator of the need for further covariance augmentation.展开更多
Background:Large area forest inventories often use regular grids(with a single random start)of sample locations to ensure a uniform sampling intensity across the space of the surveyed populations.A design-unbiased est...Background:Large area forest inventories often use regular grids(with a single random start)of sample locations to ensure a uniform sampling intensity across the space of the surveyed populations.A design-unbiased estimator of variance does not exist for this design.Oftentimes,a quasi-default estimator applicable to simple random sampling(SRS)is used,even if it carries with it the likely risk of overestimating the variance by a practically important margin.To better exploit the precision of systematic sampling we assess the performance of five estimators of variance,including the quasi default.In this study,simulated systematic sampling was applied to artificial populations with contrasting covariance structures and with or without linear trends.We compared the results obtained with the SRS,Matern’s,successive difference replication,Ripley’s,and D’Orazio’s variance estimators.Results:The variances obtained with the four alternatives to the SRS estimator of variance were strongly correlated,and in all study settings consistently closer to the target design variance than the estimator for SRS.The latter always produced the greatest overestimation.In populations with a near zero spatial autocorrelation,all estimators,performed equally,and delivered estimates close to the actual design variance.Conclusion:Without a linear trend,the SDR and DOR estimators were best with variance estimates more narrowly distributed around the benchmark;yet in terms of the least average absolute deviation,Matern’s estimator held a narrow lead.With a strong or moderate linear trend,Matern’s estimator is choice.In large populations,and a low sampling intensity,the performance of the investigated estimators becomes more similar.展开更多
Sampling plays an important role in acquiring precise soil information required in modern agricultural production worldwide, which determines both the cost and quality of final soil mapping products. For sampling desi...Sampling plays an important role in acquiring precise soil information required in modern agricultural production worldwide, which determines both the cost and quality of final soil mapping products. For sampling design, it has been proposed possibile to transfer the relationships between kriging variance and sampling grid spacing from an area with existing information to other areas with similar soil-forming environments. However, this approach is challenged in practice because of two problems: i) different population vaxiograms among similar areas and ii) sampling errors in estimated variograms. This study evaluated the effects of these two problems on the transferability of the relationships between kriging variance and sampling grid spacing, by using spatial data simulated with three variograms and soil samples collected from four grasslands in Ireland with similar soil-forming environments. Results showed that the variograms suggested by different samples collected with the same grid spacing in the same or similar areas were different, leading to a range of mean kriging variance (MKV) for each grid spacing. With increasing grid spacing, the variation of MKV for a specific grid spacing increased and deviated more from the MKV generated using the population variograms. As a result, the spatial transferability of the relationships between kriging variance and grid spacing for sampling design was limited.展开更多
In the recent research of network sampling, some sampling concepts are misunderstood, and the variance of subnets is not taken into account. We propose the correct definition of the sample and sampling rate in network...In the recent research of network sampling, some sampling concepts are misunderstood, and the variance of subnets is not taken into account. We propose the correct definition of the sample and sampling rate in network sampling, as well as the formula for calculating the variance of subnets. Then, three commonly used sampling strategies are applied to databases of the connecting nearest-neighbor(CNN) model, random network and small-world network to explore the variance in network sampling. As proved by the results, snowball sampling obtains the most variance of subnets, but does well in capturing the network structure. The variance of networks sampled by the hub and random strategy are much smaller. The hub strategy performs well in reflecting the property of the whole network, while random sampling obtains more accurate results in evaluating clustering coefficient.展开更多
This paper develops a sampling method to estimate the integral of a function of the area with a strategy to cover the area with parallel lines of observation. This sampling strategy is special in that lines very close...This paper develops a sampling method to estimate the integral of a function of the area with a strategy to cover the area with parallel lines of observation. This sampling strategy is special in that lines very close to each other are selected much more seldom than under a uniformly random design for the positions of the parallel lines. It is also special in that the positions of some of the lines are deterministic. Two different variance estimators are derived and investigated by sampling different man made signal functions. They show different properties in that the estimator that estimate the biggest variance gives an error interval that, in some situations, may be more than ten times the error interval computed from the other estimator. It is also obvious that the second estimator underestimates the variance. The author has not succeeded to derive an expression for the expectation of this estimator. This work is motivated towards finding the variance of acoustic abundance estimates.展开更多
Variance is one of the most vital measures of dispersion widely employed in practical aspects.A commonly used approach for variance estimation is the traditional method of moments that is strongly influenced by the pr...Variance is one of the most vital measures of dispersion widely employed in practical aspects.A commonly used approach for variance estimation is the traditional method of moments that is strongly influenced by the presence of extreme values,and thus its results cannot be relied on.Finding momentum from Koyuncu’s recent work,the present paper focuses first on proposing two classes of variance estimators based on linear moments(L-moments),and then employing them with auxiliary data under double stratified sampling to introduce a new class of calibration variance estimators using important properties of L-moments(L-location,L-cv,L-variance).Three populations are taken into account to assess the efficiency of the new estimators.The first and second populations are concerned with artificial data,and the third populations is concerned with real data.The percentage relative efficiency of the proposed estimators over existing ones is evaluated.In the presence of extreme values,our findings depict the superiority and high efficiency of the proposed classes over traditional classes.Hence,when auxiliary data is available along with extreme values,the proposed classes of estimators may be implemented in an extensive variety of sampling surveys.展开更多
Background:A new variance estimator is derived and tested for big BAF(Basal Area Factor)sampling which is a forest inventory system that utilizes Bitterlich sampling(point sampling)with two BAF sizes,a small BAF for t...Background:A new variance estimator is derived and tested for big BAF(Basal Area Factor)sampling which is a forest inventory system that utilizes Bitterlich sampling(point sampling)with two BAF sizes,a small BAF for tree counts and a larger BAF on which tree measurements are made usually including DBHs and heights needed for volume estimation.Methods:The new estimator is derived using the Delta method from an existing formulation of the big BAF estimator as consisting of three sample means.The new formula is compared to existing big BAF estimators including a popular estimator based on Bruce’s formula.Results:Several computer simulation studies were conducted comparing the new variance estimator to all known variance estimators for big BAF currently in the forest inventory literature.In simulations the new estimator performed well and comparably to existing variance formulas.Conclusions:A possible advantage of the new estimator is that it does not require the assumption of negligible correlation between basal area counts on the small BAF factor and volume-basal area ratios based on the large BAF factor selection trees,an assumption required by all previous big BAF variance estimation formulas.Although this correlation was negligible on the simulation stands used in this study,it is conceivable that the correlation could be significant in some forest types,such as those in which the DBH-height relationship can be affected substantially by density perhaps through competition.We derived a formula that can be used to estimate the covariance between estimates of mean basal area and the ratio of estimates of mean volume and mean basal area.We also mathematically derived expressions for bias in the big BAF estimator that can be used to show the bias approaches zero in large samples on the order of 1n where n is the number of sample points.展开更多
The study presents sampling interval impacts on variance components of the epoch-wise residual errors in relative GPS positioning. In the variance components estimation process, the 2-way nested ANOVA method was used....The study presents sampling interval impacts on variance components of the epoch-wise residual errors in relative GPS positioning. In the variance components estimation process, the 2-way nested ANOVA method was used. For that purpose, GPS observation data during four months at two permanent GPS stations, establishing a 40-km-long baseline as a part of the Montenegrin permanent network(Monte Pos), were used. The study results showed that there is no statistically significant impact of sampling interval changes on epoch-wise variance components related to the residual tropospheric and ionospheric delays(effect a) when it comes to such a baseline. However, it is not the case with epoch-wise variance components related to the interstation-distance-independent residual ‘far-field’ multipath effect(effect b). It turned out that the absolute values of relative differences of standard deviations of the effect a on the relative GPS coordinates(e, n and u) had maximum values 11.1%, 10.2% and 8.9%,respectively. Keeping the same order of presentation for the effect b, the values of 5.9%, 9.9% and 12.5%were obtained. In addition, absolute values of relative differences of standard deviations of horizontal and vertical position had maximum values of 3.8% and 7.7%, respectively.展开更多
In this study we have proposed a modified ratio type estimator for population variance of the study variable y under simple random sampling without replacement making use of coefficient of kurtosis and median of an au...In this study we have proposed a modified ratio type estimator for population variance of the study variable y under simple random sampling without replacement making use of coefficient of kurtosis and median of an auxiliary variable x. The estimator’s properties have been derived up to first order of Taylor’s series expansion. The efficiency conditions derived theoretically under which the proposed estimator performs better than existing estimators. Empirical studies have been done using real populations to demonstrate the performance of the developed estimator in comparison with the existing estimators. The proposed estimator as illustrated by the empirical studies performs better than the existing estimators under some specified conditions i.e. it has the smallest Mean Squared Error and the highest Percentage Relative Efficiency. The developed estimator therefore is suitable to be applied to situations in which the variable of interest has a positive correlation with the auxiliary variable.展开更多
In leading petroleum-producing countries like Kuwait, Brazil, Iran, Iraq and Mexico oil spills frequently occur on land, causing serious damage to crop fields. Soil remediation requires constant monitoring of the poll...In leading petroleum-producing countries like Kuwait, Brazil, Iran, Iraq and Mexico oil spills frequently occur on land, causing serious damage to crop fields. Soil remediation requires constant monitoring of the polluted area. One common monitoring method involves two-dimensional systematic sampling, which can be used to estimate the proportion of the contaminated soil and study the oil spills’ geographic distribution. A well-known issue using this sampling design involves the analytical derivation of variance of the sample mean (proportion), which requires at least two independent samples. To address the problem, this research proposed a variance estimator based on regression and a corrected estimator using the autocorrelation Geary Index under the model-assisted approach. The construction of the estimators was assisted by geo-statistical models by simulating an auxiliary variable. Similar populations to those in real oil spills were recreated, and the accuracy of proposed estimators was evaluated by comparing their performance with other well-known estimators. The factors considered in this simulation study were: a) the model for simulating the populations (exponential and wave), b) the mean and the variance of the process, c) the level of autocorrelation among units. Given the statistical and computing burdens (bias, ratio between estimated and real variance, convergence and computer time), under the exponential model, the regression estimator showed the best performance;and for the wave model, the corrected version performed even better.展开更多
Srivastava and Jhajj [ 1 6] proposed a class of estimators for estimating population variance using multi auxiliary variables in simple random sampling and they utilized the means and variances of auxiliary variables....Srivastava and Jhajj [ 1 6] proposed a class of estimators for estimating population variance using multi auxiliary variables in simple random sampling and they utilized the means and variances of auxiliary variables. In this paper, we adapted this class and motivated by Searle [13], and we suggested more generalized class of estimators for estimating the population variance in simple random sampling. The expressions for the mean square error of proposed class have been derived in general form. Besides obtaining the minimized MSE of the proposed and adapted class, it is shown that the adapted classis the special case of the proposed class. Moreover, these theoretical findings are supported by an empirical study of original data.展开更多
One of the detection objectives of the Chinese Asteroid Exploration mission is to investigate the space environment near the Main-belt Comet(MBC,Active Asteroid)311P/PANSTARRS.This paper outlines the scientific object...One of the detection objectives of the Chinese Asteroid Exploration mission is to investigate the space environment near the Main-belt Comet(MBC,Active Asteroid)311P/PANSTARRS.This paper outlines the scientific objectives,measurement targets,and measurement requirements for the proposed Gas and Ion Analyzer(GIA).The GIA is designed for in-situ mass spectrometry of neutral gases and low-energy ions,such as hydrogen,carbon,and oxygen,in the vicinity of 311P.Ion sampling techniques are essential for the GIA's Time-of-Flight(TOF)mass analysis capabilities.In this paper,we present an enhanced ion sampling technique through the development of an ion attraction model and an ion source model.The ion attraction model demonstrates that adjusting attraction grid voltage can enhance the detection efficiency of low-energy ions and mitigate the repulsive force of ions during sampling,which is influenced by the satellite's surface positive charging.The ion source model simulates the processes of gas ionization and ion multiplication.Simulation results indicate that the GIA can achieve a lower pressure limit below 10-13Pa and possess a dynamic range exceeding 10~9.These performances ensure the generation of ions with stable and consistent current,which is crucial for high-resolution and broad dynamic range mass spectrometer analysis.Preliminary testing experiments have verified GIA's capability to detect gas compositions such as H2O and N2.In-situ measurements near 311P using GIA are expected to significantly contribute to our understanding of asteroid activity mechanisms,the evolution of the atmospheric and ionized environments of main-belt comets,the interactions with solar wind,and the origin of Earth's water.展开更多
Electro-Optic Sampling(EOS)detection technique has been widely used in terahertz science and tech⁃nology,and it also can measure the field time waveform of the few-cycle laser pulse.Its frequency response and band lim...Electro-Optic Sampling(EOS)detection technique has been widely used in terahertz science and tech⁃nology,and it also can measure the field time waveform of the few-cycle laser pulse.Its frequency response and band limitation are determined directly by the electro-optic crystal and duration of the probe laser pulse.Here,we investigate the performance of the EOS with thin GaSe crystal in the measurement of the mid-infrared few-cycle la⁃ser pulse.The shift of the central frequency and change of the bandwidth induced by the EOS detection are calcu⁃lated,and then the pulse distortions induced in this detection process are discussed.It is found that this technique produces a red-shift of the central frequency and narrowing of the bandwidth.These changings decrease when the laser wavelength increases from 2μm to 10μm.This work can help to estimate the performance of the EOS de⁃tection technique in the mid-infrared band and offer a reference for the related experiment as well.展开更多
The prediction of the fracture plane orientation in fatigue is a scientific topic and remains relevant for every type of material. However, in this work, we compared the orientation of the fracture plane obtained expe...The prediction of the fracture plane orientation in fatigue is a scientific topic and remains relevant for every type of material. However, in this work, we compared the orientation of the fracture plane obtained experimentally through tests on specimens under multiaxial loading with that calculated by the variance method. In the statistical approach criteria, several methods have been developed but we have presented only one method, namely the variance method using the equivalent stress. She assumes that the fracture plane orientation is the one on which the variance of the equivalent stress is maximum. Three types of equivalent stress are defined for this method [1]: normal stress, shear stress and combined normal and shear stress. The results obtained were compared with experimental results for multiaxial cyclic stress states, and it emerges that the variance method for the case of combined loading is conservative as it gives a better prediction of the fracture plane.展开更多
Critical Height Sampling(CHS)estimates stand volume free from any model and tree form assumptions.Despite its introduction more than four decades ago,CHS has not been widely applied in the field due to perceived chall...Critical Height Sampling(CHS)estimates stand volume free from any model and tree form assumptions.Despite its introduction more than four decades ago,CHS has not been widely applied in the field due to perceived challenges in measurement.The objectives of this study were to compare estimated stand volume between CHS and sampling methods that used volume or taper models,the equivalence of the sampling methods,and their relative efficiency.We established 65 field plots in planted forests of two coniferous tree species.We estimated stand volume for a range of Basal Area Factors(BAFs).Results showed that CHS produced the most similar mean stand volume across BAFs and tree species with maximum differences between BAFs of 5-18m^(3)·ha^(−1).Horizontal Point Sampling(HPS)using volume models produced very large variability in mean stand volume across BAFs with the differences up to 126m^(3)·ha^(−1).However,CHS was less precise and less efficient than HPS.Furthermore,none of the sampling methods were statistically interchangeable with CHS at an allowable tolerance of≤55m^(3)·ha^(−1).About 72%of critical height measurements were below crown base indicating that critical height was more accessible to measurement than expected.Our study suggests that the consistency in the mean estimates of CHS is a major advantage when planning a forest inventory.When checking against CHS,results hint that HPS estimates might contain potential model bias.These strengths of CHS could outweigh its lower precision.Our study also implies serious implications in financial terms when choosing a sampling method.Lastly,CHS could potentially benefit forest management as an alternate option of estimating stand volume when volume or taper models are lacking or are not reliable.展开更多
Weighted exponential distribution W ED(α,λ)with shape parameterαand scale parameterλpossesses some good properties and can be used as a good fit to survival time data compared to other distributions such as gamma,...Weighted exponential distribution W ED(α,λ)with shape parameterαand scale parameterλpossesses some good properties and can be used as a good fit to survival time data compared to other distributions such as gamma,Weibull,or generalized exponential distribution.In this article,we proved the existence and uniqueness of the maximum likelihood estimator(MLE)of the parameters of W ED(α,λ)in simple random sampling(SRS)and provided explicit expressions for the Fisher information number in SRS.Moreover,we also proved the existence and uniqueness of the MLE of the parameters of W ED(α,λ)in ranked set sampling(RSS)and provided explicit expressions for the Fisher information number in RSS.Simulation studies show that these MLEs in RSS can be real competitors for those in SRS.展开更多
Selection of negative samples significantly influences landslide susceptibility assessment,especially when establishing the relationship between landslides and environmental factors in regions with complex geological ...Selection of negative samples significantly influences landslide susceptibility assessment,especially when establishing the relationship between landslides and environmental factors in regions with complex geological conditions.Traditional sampling strategies commonly used in landslide susceptibility models can lead to a misrepresentation of the distribution of negative samples,causing a deviation from actual geological conditions.This,in turn,negatively affects the discriminative ability and generalization performance of the models.To address this issue,we propose a novel approach for selecting negative samples to enhance the quality of machine learning models.We choose the Liangshan Yi Autonomous Prefecture,located in southwestern Sichuan,China,as the case study.This area,characterized by complex terrain,frequent tectonic activities,and steep slope erosion,experiences recurrent landslides,making it an ideal setting for validating our proposed method.We calculate the contribution values of environmental factors using the relief algorithm to construct the feature space,apply the Target Space Exteriorization Sampling(TSES)method to select negative samples,calculate landslide probability values by Random Forest(RF)modeling,and then create regional landslide susceptibility maps.We evaluate the performance of the RF model optimized by the Environmental Factor Selection-based TSES(EFSTSES)method using standard performance metrics.The results indicated that the model achieved an accuracy(ACC)of 0.962,precision(PRE)of 0.961,and an area under the curve(AUC)of 0.962.These findings demonstrate that the EFSTSES-based model effectively mitigates the negative sample imbalance issue,enhances the differentiation between landslide and non-landslide samples,and reduces misclassification,particularly in geologically complex areas.These improvements offer valuable insights for disaster prevention,land use planning,and risk mitigation strategies.展开更多
基金We thank the financial support from the National Natural Science Foundation of China(40701007,40571066)the Postdoctoral Science Foundation of China(20060401048).
文摘The acquisition of precise soil data representative of the entire survey area, is a critical issue for many treatments such as irrigation or fertilization in precision agriculture. The aim of this study was to investigate the spatial variability of soil bulk electrical conductivity (ECb) in a coastal saline field and design an optimized spatial sampling scheme of ECb based on a sampling design algorithm, the variance quad-tree (VQT) method. Soil ECb data were collected from the field at 20 m interval in a regular grid scheme. The smooth contour map of the whole field was obtained by ordinary kriging interpolation, VQT algorithm was then used to split the smooth contour map into strata of different number desired, the sampling locations can be selected within each stratum in subsequent sampling. The result indicated that the probability of choosing representative sampling sites was increased significantly by using VQT method with the sampling number being greatly reduced compared to grid sampling design while retaining the same prediction accuracy. The advantage of the VQT method is that this scheme samples sparsely in fields where the spatial variability is relatively uniform and more intensive where the variability is large. Thus the sampling efficiency can be improved, hence facilitate an assessment methodology that can be applied in a rapid, practical and cost-effective manner.
基金partly supported by an A-base project funded by Agriculture and Agri-Food Canadathe TUFGEN project funded by Genome Canada and other stakeholdersfunds from the Western Grains Research Foundation
文摘The type 2 modified augmented design(MAD2) is an efficient unreplicated experimental design used for evaluating large numbers of lines in plant breeding and for assessing genetic variation in a population. Statistical methods and data adjustment for soil heterogeneity have been previously described for this design. In the absence of replicated test genotypes in MAD2, their total variance cannot be partitioned into genetic and error components as required to estimate heritability and genetic correlation of quantitative traits, the two conventional genetic parameters used for breeding selection. We propose a method of estimating the error variance of unreplicated genotypes that uses replicated controls, and then of estimating the genetic parameters. Using the Delta method, we also derived formulas for estimating the sampling variances of the genetic parameters.Computer simulations indicated that the proposed method for estimating genetic parameters and their sampling variances was feasible and the reliability of the estimates was positively associated with the level of heritability of the trait. A case study of estimating the genetic parameters of three quantitative traits, iodine value, oil content, and linolenic acid content, in a biparental recombinant inbred line population of flax with 243 individuals, was conducted using our statistical models. A joint analysis of data over multiple years and sites was suggested for genetic parameter estimation. A pipeline module using SAS and Perl was developed to facilitate data analysis and appended to the previously developed MAD data analysis pipeline(http://probes.pw.usda.gov/bioinformatics_ tools/MADPipeline/index.html).
基金supported by the Platform Development Foundation of the China Institute for Radiation Protection(No.YP21030101)the National Natural Science Foundation of China(General Program)(Nos.12175114,U2167209)+1 种基金the National Key R&D Program of China(No.2021YFF0603600)the Tsinghua University Initiative Scientific Research Program(No.20211080081).
文摘Global variance reduction is a bottleneck in Monte Carlo shielding calculations.The global variance reduction problem requires that the statistical error of the entire space is uniform.This study proposed a grid-AIS method for the global variance reduction problem based on the AIS method,which was implemented in the Monte Carlo program MCShield.The proposed method was validated using the VENUS-Ⅲ international benchmark problem and a self-shielding calculation example.The results from the VENUS-Ⅲ benchmark problem showed that the grid-AIS method achieved a significant reduction in the variance of the statistical errors of the MESH grids,decreasing from 1.08×10^(-2) to 3.84×10^(-3),representing a 64.00% reduction.This demonstrates that the grid-AIS method is effective in addressing global issues.The results of the selfshielding calculation demonstrate that the grid-AIS method produced accurate computational results.Moreover,the grid-AIS method exhibited a computational efficiency approximately one order of magnitude higher than that of the AIS method and approximately two orders of magnitude higher than that of the conventional Monte Carlo method.
基金Research Joint Venture Agreement 17-JV-11242306045,“Old Growth Forest Dynamics and Structure,”between the USDA Forest Service and the University of New Hampshire.Additional support to MJD was provided by the USDA National Institute of Food and Agriculture McIntire-Stennis Project Accession Number 1020142,“Forest Structure,Volume,and Biomass in the Northeastern United States.”TBL:This work was supported by the USDA National Institute of Food and Agriculture,McIntire-Stennis project OKL02834 and the Division of Agricultural Sciences and Natural Resources at Oklahoma State University.
文摘Background:The double sampling method known as“big BAF sampling”has been advocated as a way to reduce sampling effort while still maintaining a reasonably precise estimate of volume.A well-known method for variance determination,Bruce’s method,is customarily used because the volume estimator takes the form of a product of random variables.However,the genesis of Bruce’s method is not known to most foresters who use the method in practice.Methods:We establish that the Taylor series approximation known as the Delta method provides a plausible explanation for the origins of Bruce’s method.Simulations were conducted on two different tree populations to ascertain the similarities of the Delta method to the exact variance of a product.Additionally,two alternative estimators for the variance of individual tree volume-basal area ratios,which are part of the estimation process,were compared within the overall variance estimation procedure.Results:The simulation results demonstrate that Bruce’s method provides a robust method for estimating the variance of inventories conducted with the big BAF method.The simulations also demonstrate that the variance of the mean volume-basal area ratios can be computed using either the usual sample variance of the mean or the ratio variance estimators with equal accuracy,which had not been shown previously for Big BAF sampling.Conclusions:A plausible explanation for the origins of Bruce’s method has been set forth both historically and mathematically in the Delta Method.In most settings,there is evidently no practical difference between applying the exact variance of a product or the Delta method—either can be used.A caution is articulated concerning the aggregation of tree-wise attributes into point-wise summaries in order to test the correlation between the two as a possible indicator of the need for further covariance augmentation.
文摘Background:Large area forest inventories often use regular grids(with a single random start)of sample locations to ensure a uniform sampling intensity across the space of the surveyed populations.A design-unbiased estimator of variance does not exist for this design.Oftentimes,a quasi-default estimator applicable to simple random sampling(SRS)is used,even if it carries with it the likely risk of overestimating the variance by a practically important margin.To better exploit the precision of systematic sampling we assess the performance of five estimators of variance,including the quasi default.In this study,simulated systematic sampling was applied to artificial populations with contrasting covariance structures and with or without linear trends.We compared the results obtained with the SRS,Matern’s,successive difference replication,Ripley’s,and D’Orazio’s variance estimators.Results:The variances obtained with the four alternatives to the SRS estimator of variance were strongly correlated,and in all study settings consistently closer to the target design variance than the estimator for SRS.The latter always produced the greatest overestimation.In populations with a near zero spatial autocorrelation,all estimators,performed equally,and delivered estimates close to the actual design variance.Conclusion:Without a linear trend,the SDR and DOR estimators were best with variance estimates more narrowly distributed around the benchmark;yet in terms of the least average absolute deviation,Matern’s estimator held a narrow lead.With a strong or moderate linear trend,Matern’s estimator is choice.In large populations,and a low sampling intensity,the performance of the investigated estimators becomes more similar.
基金?nancially supported by the National Natural Science Foundation of China (Nos. 41541006 and 41771246)co-funded by Enterprise Ireland and the European Regional Development Fund (ERDF) under the National Strategic Reference Framework (NSRF) 2007–2013
文摘Sampling plays an important role in acquiring precise soil information required in modern agricultural production worldwide, which determines both the cost and quality of final soil mapping products. For sampling design, it has been proposed possibile to transfer the relationships between kriging variance and sampling grid spacing from an area with existing information to other areas with similar soil-forming environments. However, this approach is challenged in practice because of two problems: i) different population vaxiograms among similar areas and ii) sampling errors in estimated variograms. This study evaluated the effects of these two problems on the transferability of the relationships between kriging variance and sampling grid spacing, by using spatial data simulated with three variograms and soil samples collected from four grasslands in Ireland with similar soil-forming environments. Results showed that the variograms suggested by different samples collected with the same grid spacing in the same or similar areas were different, leading to a range of mean kriging variance (MKV) for each grid spacing. With increasing grid spacing, the variation of MKV for a specific grid spacing increased and deviated more from the MKV generated using the population variograms. As a result, the spatial transferability of the relationships between kriging variance and grid spacing for sampling design was limited.
基金supported by the Basic Research Fund of Beijing Institute of Technology(20120642008)
文摘In the recent research of network sampling, some sampling concepts are misunderstood, and the variance of subnets is not taken into account. We propose the correct definition of the sample and sampling rate in network sampling, as well as the formula for calculating the variance of subnets. Then, three commonly used sampling strategies are applied to databases of the connecting nearest-neighbor(CNN) model, random network and small-world network to explore the variance in network sampling. As proved by the results, snowball sampling obtains the most variance of subnets, but does well in capturing the network structure. The variance of networks sampled by the hub and random strategy are much smaller. The hub strategy performs well in reflecting the property of the whole network, while random sampling obtains more accurate results in evaluating clustering coefficient.
文摘This paper develops a sampling method to estimate the integral of a function of the area with a strategy to cover the area with parallel lines of observation. This sampling strategy is special in that lines very close to each other are selected much more seldom than under a uniformly random design for the positions of the parallel lines. It is also special in that the positions of some of the lines are deterministic. Two different variance estimators are derived and investigated by sampling different man made signal functions. They show different properties in that the estimator that estimate the biggest variance gives an error interval that, in some situations, may be more than ten times the error interval computed from the other estimator. It is also obvious that the second estimator underestimates the variance. The author has not succeeded to derive an expression for the expectation of this estimator. This work is motivated towards finding the variance of acoustic abundance estimates.
基金The authors thank the Deanship of Scientific Research at King Khalid University,Kingdom of Saudi Arabia for funding this study through the research groups program under Project Number R.G.P.1/64/42.Ishfaq Ahmad and Ibrahim Mufrah Almanjahie received the grant.
文摘Variance is one of the most vital measures of dispersion widely employed in practical aspects.A commonly used approach for variance estimation is the traditional method of moments that is strongly influenced by the presence of extreme values,and thus its results cannot be relied on.Finding momentum from Koyuncu’s recent work,the present paper focuses first on proposing two classes of variance estimators based on linear moments(L-moments),and then employing them with auxiliary data under double stratified sampling to introduce a new class of calibration variance estimators using important properties of L-moments(L-location,L-cv,L-variance).Three populations are taken into account to assess the efficiency of the new estimators.The first and second populations are concerned with artificial data,and the third populations is concerned with real data.The percentage relative efficiency of the proposed estimators over existing ones is evaluated.In the presence of extreme values,our findings depict the superiority and high efficiency of the proposed classes over traditional classes.Hence,when auxiliary data is available along with extreme values,the proposed classes of estimators may be implemented in an extensive variety of sampling surveys.
基金Support was provided by Research Joint Venture Agreement 17-JV-11242306045,“Old Growth Forest Dynamics and Structure,”between the USDA Forest Service and the University of New HampshireAdditional support to MJD was provided by the USDA National Institute of Food and Agriculture McIntire-Stennis Project Accession Number 1020142,“Forest Structure,Volume,and Biomass in the Northeastern United States.”+1 种基金supported by the USDA National Institute of Food and Agriculture,McIntire-Stennis project OKL02834the Division of Agricultural Sciences and Natural Resources at Oklahoma State University.
文摘Background:A new variance estimator is derived and tested for big BAF(Basal Area Factor)sampling which is a forest inventory system that utilizes Bitterlich sampling(point sampling)with two BAF sizes,a small BAF for tree counts and a larger BAF on which tree measurements are made usually including DBHs and heights needed for volume estimation.Methods:The new estimator is derived using the Delta method from an existing formulation of the big BAF estimator as consisting of three sample means.The new formula is compared to existing big BAF estimators including a popular estimator based on Bruce’s formula.Results:Several computer simulation studies were conducted comparing the new variance estimator to all known variance estimators for big BAF currently in the forest inventory literature.In simulations the new estimator performed well and comparably to existing variance formulas.Conclusions:A possible advantage of the new estimator is that it does not require the assumption of negligible correlation between basal area counts on the small BAF factor and volume-basal area ratios based on the large BAF factor selection trees,an assumption required by all previous big BAF variance estimation formulas.Although this correlation was negligible on the simulation stands used in this study,it is conceivable that the correlation could be significant in some forest types,such as those in which the DBH-height relationship can be affected substantially by density perhaps through competition.We derived a formula that can be used to estimate the covariance between estimates of mean basal area and the ratio of estimates of mean volume and mean basal area.We also mathematically derived expressions for bias in the big BAF estimator that can be used to show the bias approaches zero in large samples on the order of 1n where n is the number of sample points.
文摘The study presents sampling interval impacts on variance components of the epoch-wise residual errors in relative GPS positioning. In the variance components estimation process, the 2-way nested ANOVA method was used. For that purpose, GPS observation data during four months at two permanent GPS stations, establishing a 40-km-long baseline as a part of the Montenegrin permanent network(Monte Pos), were used. The study results showed that there is no statistically significant impact of sampling interval changes on epoch-wise variance components related to the residual tropospheric and ionospheric delays(effect a) when it comes to such a baseline. However, it is not the case with epoch-wise variance components related to the interstation-distance-independent residual ‘far-field’ multipath effect(effect b). It turned out that the absolute values of relative differences of standard deviations of the effect a on the relative GPS coordinates(e, n and u) had maximum values 11.1%, 10.2% and 8.9%,respectively. Keeping the same order of presentation for the effect b, the values of 5.9%, 9.9% and 12.5%were obtained. In addition, absolute values of relative differences of standard deviations of horizontal and vertical position had maximum values of 3.8% and 7.7%, respectively.
文摘In this study we have proposed a modified ratio type estimator for population variance of the study variable y under simple random sampling without replacement making use of coefficient of kurtosis and median of an auxiliary variable x. The estimator’s properties have been derived up to first order of Taylor’s series expansion. The efficiency conditions derived theoretically under which the proposed estimator performs better than existing estimators. Empirical studies have been done using real populations to demonstrate the performance of the developed estimator in comparison with the existing estimators. The proposed estimator as illustrated by the empirical studies performs better than the existing estimators under some specified conditions i.e. it has the smallest Mean Squared Error and the highest Percentage Relative Efficiency. The developed estimator therefore is suitable to be applied to situations in which the variable of interest has a positive correlation with the auxiliary variable.
文摘In leading petroleum-producing countries like Kuwait, Brazil, Iran, Iraq and Mexico oil spills frequently occur on land, causing serious damage to crop fields. Soil remediation requires constant monitoring of the polluted area. One common monitoring method involves two-dimensional systematic sampling, which can be used to estimate the proportion of the contaminated soil and study the oil spills’ geographic distribution. A well-known issue using this sampling design involves the analytical derivation of variance of the sample mean (proportion), which requires at least two independent samples. To address the problem, this research proposed a variance estimator based on regression and a corrected estimator using the autocorrelation Geary Index under the model-assisted approach. The construction of the estimators was assisted by geo-statistical models by simulating an auxiliary variable. Similar populations to those in real oil spills were recreated, and the accuracy of proposed estimators was evaluated by comparing their performance with other well-known estimators. The factors considered in this simulation study were: a) the model for simulating the populations (exponential and wave), b) the mean and the variance of the process, c) the level of autocorrelation among units. Given the statistical and computing burdens (bias, ratio between estimated and real variance, convergence and computer time), under the exponential model, the regression estimator showed the best performance;and for the wave model, the corrected version performed even better.
文摘Srivastava and Jhajj [ 1 6] proposed a class of estimators for estimating population variance using multi auxiliary variables in simple random sampling and they utilized the means and variances of auxiliary variables. In this paper, we adapted this class and motivated by Searle [13], and we suggested more generalized class of estimators for estimating the population variance in simple random sampling. The expressions for the mean square error of proposed class have been derived in general form. Besides obtaining the minimized MSE of the proposed and adapted class, it is shown that the adapted classis the special case of the proposed class. Moreover, these theoretical findings are supported by an empirical study of original data.
基金Supported by the National Natural Science Foundation of China(42474239,41204128)China National Space Administration(Pre-research project on Civil Aerospace Technologies No.D010301)Strategic Priority Research Program of the Chinese Academy of Sciences(XDA17010303)。
文摘One of the detection objectives of the Chinese Asteroid Exploration mission is to investigate the space environment near the Main-belt Comet(MBC,Active Asteroid)311P/PANSTARRS.This paper outlines the scientific objectives,measurement targets,and measurement requirements for the proposed Gas and Ion Analyzer(GIA).The GIA is designed for in-situ mass spectrometry of neutral gases and low-energy ions,such as hydrogen,carbon,and oxygen,in the vicinity of 311P.Ion sampling techniques are essential for the GIA's Time-of-Flight(TOF)mass analysis capabilities.In this paper,we present an enhanced ion sampling technique through the development of an ion attraction model and an ion source model.The ion attraction model demonstrates that adjusting attraction grid voltage can enhance the detection efficiency of low-energy ions and mitigate the repulsive force of ions during sampling,which is influenced by the satellite's surface positive charging.The ion source model simulates the processes of gas ionization and ion multiplication.Simulation results indicate that the GIA can achieve a lower pressure limit below 10-13Pa and possess a dynamic range exceeding 10~9.These performances ensure the generation of ions with stable and consistent current,which is crucial for high-resolution and broad dynamic range mass spectrometer analysis.Preliminary testing experiments have verified GIA's capability to detect gas compositions such as H2O and N2.In-situ measurements near 311P using GIA are expected to significantly contribute to our understanding of asteroid activity mechanisms,the evolution of the atmospheric and ionized environments of main-belt comets,the interactions with solar wind,and the origin of Earth's water.
基金Supported by the National Natural Science Foundation of China(12064028)Jiangxi Provincial Natural Science Foundation(20232BAB201045).
文摘Electro-Optic Sampling(EOS)detection technique has been widely used in terahertz science and tech⁃nology,and it also can measure the field time waveform of the few-cycle laser pulse.Its frequency response and band limitation are determined directly by the electro-optic crystal and duration of the probe laser pulse.Here,we investigate the performance of the EOS with thin GaSe crystal in the measurement of the mid-infrared few-cycle la⁃ser pulse.The shift of the central frequency and change of the bandwidth induced by the EOS detection are calcu⁃lated,and then the pulse distortions induced in this detection process are discussed.It is found that this technique produces a red-shift of the central frequency and narrowing of the bandwidth.These changings decrease when the laser wavelength increases from 2μm to 10μm.This work can help to estimate the performance of the EOS de⁃tection technique in the mid-infrared band and offer a reference for the related experiment as well.
文摘The prediction of the fracture plane orientation in fatigue is a scientific topic and remains relevant for every type of material. However, in this work, we compared the orientation of the fracture plane obtained experimentally through tests on specimens under multiaxial loading with that calculated by the variance method. In the statistical approach criteria, several methods have been developed but we have presented only one method, namely the variance method using the equivalent stress. She assumes that the fracture plane orientation is the one on which the variance of the equivalent stress is maximum. Three types of equivalent stress are defined for this method [1]: normal stress, shear stress and combined normal and shear stress. The results obtained were compared with experimental results for multiaxial cyclic stress states, and it emerges that the variance method for the case of combined loading is conservative as it gives a better prediction of the fracture plane.
文摘Critical Height Sampling(CHS)estimates stand volume free from any model and tree form assumptions.Despite its introduction more than four decades ago,CHS has not been widely applied in the field due to perceived challenges in measurement.The objectives of this study were to compare estimated stand volume between CHS and sampling methods that used volume or taper models,the equivalence of the sampling methods,and their relative efficiency.We established 65 field plots in planted forests of two coniferous tree species.We estimated stand volume for a range of Basal Area Factors(BAFs).Results showed that CHS produced the most similar mean stand volume across BAFs and tree species with maximum differences between BAFs of 5-18m^(3)·ha^(−1).Horizontal Point Sampling(HPS)using volume models produced very large variability in mean stand volume across BAFs with the differences up to 126m^(3)·ha^(−1).However,CHS was less precise and less efficient than HPS.Furthermore,none of the sampling methods were statistically interchangeable with CHS at an allowable tolerance of≤55m^(3)·ha^(−1).About 72%of critical height measurements were below crown base indicating that critical height was more accessible to measurement than expected.Our study suggests that the consistency in the mean estimates of CHS is a major advantage when planning a forest inventory.When checking against CHS,results hint that HPS estimates might contain potential model bias.These strengths of CHS could outweigh its lower precision.Our study also implies serious implications in financial terms when choosing a sampling method.Lastly,CHS could potentially benefit forest management as an alternate option of estimating stand volume when volume or taper models are lacking or are not reliable.
基金Supported by the National Science Foundation of China(11901236,12261036)Scientific Research Fund of Hunan Provincial Education Department(21A0328)+2 种基金Provincial Natural Science Foundation of Hunan(2022JJ30469)Young Core Teacher Foundation of Hunan Province([2020]43)Provincial Postgraduate Innovation Foundation of Hunan(CX20221113)。
文摘Weighted exponential distribution W ED(α,λ)with shape parameterαand scale parameterλpossesses some good properties and can be used as a good fit to survival time data compared to other distributions such as gamma,Weibull,or generalized exponential distribution.In this article,we proved the existence and uniqueness of the maximum likelihood estimator(MLE)of the parameters of W ED(α,λ)in simple random sampling(SRS)and provided explicit expressions for the Fisher information number in SRS.Moreover,we also proved the existence and uniqueness of the MLE of the parameters of W ED(α,λ)in ranked set sampling(RSS)and provided explicit expressions for the Fisher information number in RSS.Simulation studies show that these MLEs in RSS can be real competitors for those in SRS.
基金supported by Natural Science Research Project of Anhui Educational Committee(2023AH030041)National Natural Science Foundation of China(42277136)Anhui Province Young and Middle-aged Teacher Training Action Project(DTR2023018).
文摘Selection of negative samples significantly influences landslide susceptibility assessment,especially when establishing the relationship between landslides and environmental factors in regions with complex geological conditions.Traditional sampling strategies commonly used in landslide susceptibility models can lead to a misrepresentation of the distribution of negative samples,causing a deviation from actual geological conditions.This,in turn,negatively affects the discriminative ability and generalization performance of the models.To address this issue,we propose a novel approach for selecting negative samples to enhance the quality of machine learning models.We choose the Liangshan Yi Autonomous Prefecture,located in southwestern Sichuan,China,as the case study.This area,characterized by complex terrain,frequent tectonic activities,and steep slope erosion,experiences recurrent landslides,making it an ideal setting for validating our proposed method.We calculate the contribution values of environmental factors using the relief algorithm to construct the feature space,apply the Target Space Exteriorization Sampling(TSES)method to select negative samples,calculate landslide probability values by Random Forest(RF)modeling,and then create regional landslide susceptibility maps.We evaluate the performance of the RF model optimized by the Environmental Factor Selection-based TSES(EFSTSES)method using standard performance metrics.The results indicated that the model achieved an accuracy(ACC)of 0.962,precision(PRE)of 0.961,and an area under the curve(AUC)of 0.962.These findings demonstrate that the EFSTSES-based model effectively mitigates the negative sample imbalance issue,enhances the differentiation between landslide and non-landslide samples,and reduces misclassification,particularly in geologically complex areas.These improvements offer valuable insights for disaster prevention,land use planning,and risk mitigation strategies.