Combining the advantages of the stratified sampling and the importance sampling, a stratified importance sampling method (SISM) is presented to analyze the reliability sensitivity for structure with multiple failure...Combining the advantages of the stratified sampling and the importance sampling, a stratified importance sampling method (SISM) is presented to analyze the reliability sensitivity for structure with multiple failure modes. In the presented method, the variable space is divided into several disjoint subspace by n-dimensional coordinate planes at the mean point of the random vec- tor, and the importance sampling functions in the subspaces are constructed by keeping the sampling center at the mean point and augmenting the standard deviation by a factor of 2. The sample size generated from the importance sampling function in each subspace is determined by the contribution of the subspace to the reliability sensitivity, which can be estimated by iterative simulation in the sampling process. The formulae of the reliability sensitivity estimation, the variance and the coefficient of variation are derived for the presented SISM. Comparing with the Monte Carlo method, the stratified sampling method and the importance sampling method, the presented SISM has wider applicability and higher calculation efficiency, which is demonstrated by numerical examples. Finally, the reliability sensitivity analysis of flap structure is illustrated that the SISM can be applied to engineering structure.展开更多
In general the accuracy of mean estimator can be improved by stratified random sampling. In this paper, we provide an idea different from empirical methods that the accuracy can be more improved through bootstrap resa...In general the accuracy of mean estimator can be improved by stratified random sampling. In this paper, we provide an idea different from empirical methods that the accuracy can be more improved through bootstrap resampling method under some conditions. The determination of sample size by bootstrap method is also discussed, and a simulation is made to verify the accuracy of the proposed method. The simulation results show that the sample size based on bootstrapping is smaller than that based on central limit theorem.展开更多
Variance is one of the most vital measures of dispersion widely employed in practical aspects.A commonly used approach for variance estimation is the traditional method of moments that is strongly influenced by the pr...Variance is one of the most vital measures of dispersion widely employed in practical aspects.A commonly used approach for variance estimation is the traditional method of moments that is strongly influenced by the presence of extreme values,and thus its results cannot be relied on.Finding momentum from Koyuncu’s recent work,the present paper focuses first on proposing two classes of variance estimators based on linear moments(L-moments),and then employing them with auxiliary data under double stratified sampling to introduce a new class of calibration variance estimators using important properties of L-moments(L-location,L-cv,L-variance).Three populations are taken into account to assess the efficiency of the new estimators.The first and second populations are concerned with artificial data,and the third populations is concerned with real data.The percentage relative efficiency of the proposed estimators over existing ones is evaluated.In the presence of extreme values,our findings depict the superiority and high efficiency of the proposed classes over traditional classes.Hence,when auxiliary data is available along with extreme values,the proposed classes of estimators may be implemented in an extensive variety of sampling surveys.展开更多
This paper mainly addresses maximum likelihood estimation for a response-selective stratified sampling scheme, the basic stratified sampling (BSS), in which the maximum subsample size in each stratum is fixed. We deri...This paper mainly addresses maximum likelihood estimation for a response-selective stratified sampling scheme, the basic stratified sampling (BSS), in which the maximum subsample size in each stratum is fixed. We derived the complete-data likelihood for BSS, and extended it as a full-data likelihood by incorporating incomplete data. We also similarly extended the empirical proportion likelihood approach for consistent and efficient estimation. We conducted a simulation study to compare these two new approaches with the existing estimation methods in BSS. Our result indicates that they perform as well as the standard full information likelihood approach. Methods were illustrated using a growth model for fish size at age, including between-individual variability. One of our major conclusions is that the fully observed BSS data, the partially observed data used for stratification, and the sampling strategy are all important in constructing a consistent and efficient estimator.展开更多
This paper is an attempt to work out a compromise allocation to construct combined ratio estimates under multivariate double sampling design in presence of non-response when the population mean of the auxiliary variab...This paper is an attempt to work out a compromise allocation to construct combined ratio estimates under multivariate double sampling design in presence of non-response when the population mean of the auxiliary variable is unknown. The problem has been formulated as a multi-objective integer non-linear programming problem. Two solution procedures are developed using goal programming and fuzzy programming techniques. A numerical example is also worked out to illustrate the computational details. A comparison of the two methods is also carried out.展开更多
Conflicting views had greeted the use of systematic sampling for sample selection and estimation in stratified sampling in terms of the precision of the population mean base on the inherent characteristics of the popu...Conflicting views had greeted the use of systematic sampling for sample selection and estimation in stratified sampling in terms of the precision of the population mean base on the inherent characteristics of the population. These conflicting views were analyzed using Cochran data (1977, p. 211) [1]. When the population units are ordered, variance of systematic sampling for all possible systematic samples provides equal, non-negative and most precise estimates for all the variance functions considered i.e. , unlike when a single systematic sample is used and when variance of simple random sampling is used to estimate selected systematic samples.展开更多
In this paper, we study the estimators of the population mean in stratified adaptive cluster sampling by using the information of the auxiliary variable. Simulations showed that if the variable of interest (y) and the...In this paper, we study the estimators of the population mean in stratified adaptive cluster sampling by using the information of the auxiliary variable. Simulations showed that if the variable of interest (y) and the auxiliary variables (x,z) have high positive correlation then the estimate of the mean square error of the ratio estimators is less than the estimate of the mean square error of the product estimator. The estimators which use only one auxiliary variable were better than the estimators which use two auxiliary variables.展开更多
In stratified survey sampling, sometimes we have complete auxiliary information. One of the fundamental questions is how to effectively use the complete auxiliary information at the estimation stage. In this paper, we...In stratified survey sampling, sometimes we have complete auxiliary information. One of the fundamental questions is how to effectively use the complete auxiliary information at the estimation stage. In this paper, we extend the model-calibration method to obtain estimators of the finite population mean by using complete auxiliary information from stratified sampling survey data. We show that the resulting estimators effectively use auxiliary information at the estimation stage and possess a number of attractive features such as asymptotically design-unbiased irrespective of the working model and approximately model-unbiased under the model. When a linear working-model is used, the resulting estimators reduce to the usual calibration estimator(or GREG).展开更多
In this paper, auxiliary information is used to determine an estimator of finite population total using nonparametric regression under stratified random sampling. To achieve this, a model-based approach is adopted by ...In this paper, auxiliary information is used to determine an estimator of finite population total using nonparametric regression under stratified random sampling. To achieve this, a model-based approach is adopted by making use of the local polynomial regression estimation to predict the nonsampled values of the survey variable y. The performance of the proposed estimator is investigated against some design-based and model-based regression estimators. The simulation experiments show that the resulting estimator exhibits good properties. Generally, good confidence intervals are seen for the nonparametric regression estimators, and use of the proposed estimator leads to relatively smaller values of RE compared to other estimators.展开更多
There is a growing interest in leveraging LiDAR-generated forest Aboveground Biomass(LG-AGB)data as a reference to retrieve AGB from satellite observations.However,the biases arising from the upscaling process and the...There is a growing interest in leveraging LiDAR-generated forest Aboveground Biomass(LG-AGB)data as a reference to retrieve AGB from satellite observations.However,the biases arising from the upscaling process and the impact of the sampling strategy on model accuracy still need to be resolved.In this study,we first corrected the bias arising from upscaling the LG-AGB map to match the spatial resolution of Landsat observations.Subsequently,the stratified random sampling method was used to select training samples from the corrected LG-AGB map(cLG-AGB)for the Random Forest(RF)regression model.The RF model features were extracted from the Landsat observations and auxiliary data.The impact of strata numbers on model accuracy was explored during the sampling process.Finally,independent validation was conducted using in situ measurements.The results indicated that:(1)about 68% of the biases can be corrected in the up-scale transformation;(2)compared to no stratification,a three-strata model achieved a 6.5% improvement in AGB estimation accuracy while requiring a 37.8% reduction in sample size;(3)the black locust forest had a low saturation point at 60.52±4.46 Mg/ha AGB and 72.4%AGB values were underestimated and the remaining were overestimated.In summary,our study provides a framework to harmonize near-surface LiDAR and satellite data for AGB estimation in plantation forest ecosystems with small patch sizes and fragmented distribution.展开更多
In this study, we propose a two stage randomized response model. Improved unbiased estimators of the mean number of persons possessing a rare sensitive attribute under two different situations are proposed. The propos...In this study, we propose a two stage randomized response model. Improved unbiased estimators of the mean number of persons possessing a rare sensitive attribute under two different situations are proposed. The proposed estimators are evaluated using a relative efficiency comparison. It is shown that our estimators are efficient as compared to existing estimators when the parameter of rare unrelated attribute is known and in unknown case, depending on the probability of selecting a question.展开更多
In this paper, analysis of methodology was realized for the application of stratified random sampling with optimum allocation in the case of a subject of research which concerns the rural population and presents high ...In this paper, analysis of methodology was realized for the application of stratified random sampling with optimum allocation in the case of a subject of research which concerns the rural population and presents high differentiations among the three strata in which this population could be classified. The rural population of Evros Prefecture (Greece) with criterion the mean altitude of settlements was classified in three strata, the mountainous, semi-mountainous and fiat population for the estimation of mean consumption of forest fuelwood for covering of heating and cooking needs in households of these three strata. The analysis of this methodology includes: (1) the determination of total size of sample for entire the rural population and its allocation to the various strata; (2) the investigation of effectiveness of stratification with the technique of analysis of variance (One-Way ANOVA); (3) the conduct of sampling research with the realization of face-to-face interviews in selected households and (4) the control of forms of the questionnaire and the analysis of data by using the statistical package for social sciences, SPSS for Windows. All data for the analysis of this methodology and its practical application were taken by the pilot sampling which was realized in each stratum. Relative paper was not found by the review of literature.展开更多
Many operations carried out by official statistical institutes use large-scale surveys obtained by stratified random sampling without replacement. Variables commonly examined in this type of surveys are binary, catego...Many operations carried out by official statistical institutes use large-scale surveys obtained by stratified random sampling without replacement. Variables commonly examined in this type of surveys are binary, categorical and continuous, and hence, the estimates of interest involve estimates of proportions, totals and means. The problem of approximating the sampling relative error of this kind of estimates is studied in this paper. Some new jackknife methods are proposed and compared with plug-in and bootstrap methods. An extensive simulation study is carried out to compare the behavior of all the methods considered in this paper.展开更多
In this paper, the problem of non-response with significant travel costs in multivariate stratified sample surveys has been formulated of as a Multi-Objective Geometric Programming Problem (MOGPP). The fuzzy programmi...In this paper, the problem of non-response with significant travel costs in multivariate stratified sample surveys has been formulated of as a Multi-Objective Geometric Programming Problem (MOGPP). The fuzzy programming approach has been described for solving the formulated MOGPP. The formulated MOGPP has been solved with the help of LINGO Software and the dual solution is obtained. The optimum allocations of sample sizes of respondents and non respondents are obtained with the help of dual solutions and primal-dual relationship theorem. A numerical example is given to illustrate the procedure.展开更多
The procedure of stratified double quartile ranked set sampling (SDQRSS) method is introduced to estimate the population mean. The SDQRSS is compared with the simple random sampling (SRS), stratified ranked set sa...The procedure of stratified double quartile ranked set sampling (SDQRSS) method is introduced to estimate the population mean. The SDQRSS is compared with the simple random sampling (SRS), stratified ranked set sampling (SRSS) and stratified simple random sampling (SSRS). It is shown that SDQRSS estimator is an unbiased of the population mean and more efficient than SRS, SRSS and SSRS for symmetric and asymmetric distributions. In addition, by SDQRSS we can increase the efficiency of mean estimator for specific value of the sample size.展开更多
Digital soil mapping (DSM) aims to produce detailed maps of soil properties or soil classes to improve agricultural management and soil quality assessment. Optimized sampling design can reduce the substantial costs an...Digital soil mapping (DSM) aims to produce detailed maps of soil properties or soil classes to improve agricultural management and soil quality assessment. Optimized sampling design can reduce the substantial costs and efforts associated with sampling, profile description, and laboratory analysis. The purpose of this study was to compare common sampling designs for DSM, including grid sampling (GS), grid random sampling (GRS), stratified random sampling (StRS), and conditioned Latin hypercube sampling (cLHS). In an agricultural field (11 ha) in Quebec, Canada, a total of unique 118 locations were selected using each of the four sampling designs (45 locations each), and additional 30 sample locations were selected as an independent testing dataset (evaluation dataset). Soil visible near-infrared (Vis-NIR) spectra were collected in situ at the 148 locations (1 m depth), and soil cores were collected from a subset of 32 locations and subdivided at 10-cm depth intervals, totaling 251 samples. The Cubist model was used to elucidate the relationship between Vis-NIR spectra and soil properties (soil organic matter (SOM) and clay), which was then used to predict the soil properties at all 148 sample locations. Digital maps of soil properties at multiple depths for the entire field (148 sample locations) were prepared using a quantile random forest model to obtain complete model maps (CM-maps). Soil properties were also mapped using the samples from each of the 45 locations for each sampling design to obtain sampling design maps (SD-maps). The SD-maps were evaluated using the independent testing dataset (30 sample locations), and the spatial distribution and model uncertainty of each SD-map were compared with those of the corresponding CM-map. The spatial and feature space coverage were compared across the four sampling designs. The results showed that GS resulted in the most even spatial coverage, cLHS resulted in the best coverage of the feature space, and GS and cLHS resulted in similar prediction accuracies and spatial distributions of soil properties. The SOM content was underestimated using GRS, with large errors at 0–50 cm depth, due to some values not being captured by this sampling design, whereas larger errors for the deeper soil layers were produced using StRS. Predictions of SOM and clay contents had higher accuracy for topsoil (0–30 cm) than for deep subsoil (60–100 cm). It was concluded that the soil sampling designs with either good spatial coverage or feature space coverage can provide good accuracy in 3D DSM, but their performances may be different for different soil properties.展开更多
The curve of relationship between fatigue crack growth rate and the stress strength factor amplitude represented an important fatigue property in designing of damage tolerance limits and predicting life of metallic co...The curve of relationship between fatigue crack growth rate and the stress strength factor amplitude represented an important fatigue property in designing of damage tolerance limits and predicting life of metallic component parts. In order to have a more reasonable use of testing data, samples from population were stratified suggested by the stratified random sample model (SRAM). The data in each stratum corresponded to the same experiment conditions. A suitable weight was assigned to each stratified sample according to the actual working states of the pressure vessel, so that the estimation of fatigue crack growth rate equation was more accurate for practice. An empirical study shows that the SRAM estimation by using fatigue crack growth rate data from different stoves is obviously better than the estimation from simple random sample model.展开更多
This paper reveaed some problems of the forest samling investigation from application.and pointed out the defects. Determining sample size method was precisely put forward from formla's origin in simple random Sam...This paper reveaed some problems of the forest samling investigation from application.and pointed out the defects. Determining sample size method was precisely put forward from formla's origin in simple random Samling procedure In stratified random samgling, two cases were distinguished: the variances Sh2 are equal for all h and not all Sh2 are equal This method made the assertion of making confidence interval more reliable.展开更多
基金National Natural Science Foundation of China (10572117,10802063,50875213)Aeronautical Science Foundation of China (2007ZA53012)+1 种基金New Century Program For Excellent Talents of Ministry of Education of China (NCET-05-0868)National High-tech Research and Development Program (2007AA04Z401)
文摘Combining the advantages of the stratified sampling and the importance sampling, a stratified importance sampling method (SISM) is presented to analyze the reliability sensitivity for structure with multiple failure modes. In the presented method, the variable space is divided into several disjoint subspace by n-dimensional coordinate planes at the mean point of the random vec- tor, and the importance sampling functions in the subspaces are constructed by keeping the sampling center at the mean point and augmenting the standard deviation by a factor of 2. The sample size generated from the importance sampling function in each subspace is determined by the contribution of the subspace to the reliability sensitivity, which can be estimated by iterative simulation in the sampling process. The formulae of the reliability sensitivity estimation, the variance and the coefficient of variation are derived for the presented SISM. Comparing with the Monte Carlo method, the stratified sampling method and the importance sampling method, the presented SISM has wider applicability and higher calculation efficiency, which is demonstrated by numerical examples. Finally, the reliability sensitivity analysis of flap structure is illustrated that the SISM can be applied to engineering structure.
基金The Science Research Start-up Foundation for Young Teachers of Southwest Jiaotong University(No.2007Q091)
文摘In general the accuracy of mean estimator can be improved by stratified random sampling. In this paper, we provide an idea different from empirical methods that the accuracy can be more improved through bootstrap resampling method under some conditions. The determination of sample size by bootstrap method is also discussed, and a simulation is made to verify the accuracy of the proposed method. The simulation results show that the sample size based on bootstrapping is smaller than that based on central limit theorem.
基金The authors thank the Deanship of Scientific Research at King Khalid University,Kingdom of Saudi Arabia for funding this study through the research groups program under Project Number R.G.P.1/64/42.Ishfaq Ahmad and Ibrahim Mufrah Almanjahie received the grant.
文摘Variance is one of the most vital measures of dispersion widely employed in practical aspects.A commonly used approach for variance estimation is the traditional method of moments that is strongly influenced by the presence of extreme values,and thus its results cannot be relied on.Finding momentum from Koyuncu’s recent work,the present paper focuses first on proposing two classes of variance estimators based on linear moments(L-moments),and then employing them with auxiliary data under double stratified sampling to introduce a new class of calibration variance estimators using important properties of L-moments(L-location,L-cv,L-variance).Three populations are taken into account to assess the efficiency of the new estimators.The first and second populations are concerned with artificial data,and the third populations is concerned with real data.The percentage relative efficiency of the proposed estimators over existing ones is evaluated.In the presence of extreme values,our findings depict the superiority and high efficiency of the proposed classes over traditional classes.Hence,when auxiliary data is available along with extreme values,the proposed classes of estimators may be implemented in an extensive variety of sampling surveys.
文摘This paper mainly addresses maximum likelihood estimation for a response-selective stratified sampling scheme, the basic stratified sampling (BSS), in which the maximum subsample size in each stratum is fixed. We derived the complete-data likelihood for BSS, and extended it as a full-data likelihood by incorporating incomplete data. We also similarly extended the empirical proportion likelihood approach for consistent and efficient estimation. We conducted a simulation study to compare these two new approaches with the existing estimation methods in BSS. Our result indicates that they perform as well as the standard full information likelihood approach. Methods were illustrated using a growth model for fish size at age, including between-individual variability. One of our major conclusions is that the fully observed BSS data, the partially observed data used for stratification, and the sampling strategy are all important in constructing a consistent and efficient estimator.
文摘This paper is an attempt to work out a compromise allocation to construct combined ratio estimates under multivariate double sampling design in presence of non-response when the population mean of the auxiliary variable is unknown. The problem has been formulated as a multi-objective integer non-linear programming problem. Two solution procedures are developed using goal programming and fuzzy programming techniques. A numerical example is also worked out to illustrate the computational details. A comparison of the two methods is also carried out.
文摘Conflicting views had greeted the use of systematic sampling for sample selection and estimation in stratified sampling in terms of the precision of the population mean base on the inherent characteristics of the population. These conflicting views were analyzed using Cochran data (1977, p. 211) [1]. When the population units are ordered, variance of systematic sampling for all possible systematic samples provides equal, non-negative and most precise estimates for all the variance functions considered i.e. , unlike when a single systematic sample is used and when variance of simple random sampling is used to estimate selected systematic samples.
文摘In this paper, we study the estimators of the population mean in stratified adaptive cluster sampling by using the information of the auxiliary variable. Simulations showed that if the variable of interest (y) and the auxiliary variables (x,z) have high positive correlation then the estimate of the mean square error of the ratio estimators is less than the estimate of the mean square error of the product estimator. The estimators which use only one auxiliary variable were better than the estimators which use two auxiliary variables.
基金Supported by the National Natural Science Foundation of China(10571093)
文摘In stratified survey sampling, sometimes we have complete auxiliary information. One of the fundamental questions is how to effectively use the complete auxiliary information at the estimation stage. In this paper, we extend the model-calibration method to obtain estimators of the finite population mean by using complete auxiliary information from stratified sampling survey data. We show that the resulting estimators effectively use auxiliary information at the estimation stage and possess a number of attractive features such as asymptotically design-unbiased irrespective of the working model and approximately model-unbiased under the model. When a linear working-model is used, the resulting estimators reduce to the usual calibration estimator(or GREG).
文摘In this paper, auxiliary information is used to determine an estimator of finite population total using nonparametric regression under stratified random sampling. To achieve this, a model-based approach is adopted by making use of the local polynomial regression estimation to predict the nonsampled values of the survey variable y. The performance of the proposed estimator is investigated against some design-based and model-based regression estimators. The simulation experiments show that the resulting estimator exhibits good properties. Generally, good confidence intervals are seen for the nonparametric regression estimators, and use of the proposed estimator leads to relatively smaller values of RE compared to other estimators.
基金supported by the National Natural Science Foundation of China[grant numbers 41471419 and 31971579].
文摘There is a growing interest in leveraging LiDAR-generated forest Aboveground Biomass(LG-AGB)data as a reference to retrieve AGB from satellite observations.However,the biases arising from the upscaling process and the impact of the sampling strategy on model accuracy still need to be resolved.In this study,we first corrected the bias arising from upscaling the LG-AGB map to match the spatial resolution of Landsat observations.Subsequently,the stratified random sampling method was used to select training samples from the corrected LG-AGB map(cLG-AGB)for the Random Forest(RF)regression model.The RF model features were extracted from the Landsat observations and auxiliary data.The impact of strata numbers on model accuracy was explored during the sampling process.Finally,independent validation was conducted using in situ measurements.The results indicated that:(1)about 68% of the biases can be corrected in the up-scale transformation;(2)compared to no stratification,a three-strata model achieved a 6.5% improvement in AGB estimation accuracy while requiring a 37.8% reduction in sample size;(3)the black locust forest had a low saturation point at 60.52±4.46 Mg/ha AGB and 72.4%AGB values were underestimated and the remaining were overestimated.In summary,our study provides a framework to harmonize near-surface LiDAR and satellite data for AGB estimation in plantation forest ecosystems with small patch sizes and fragmented distribution.
文摘In this study, we propose a two stage randomized response model. Improved unbiased estimators of the mean number of persons possessing a rare sensitive attribute under two different situations are proposed. The proposed estimators are evaluated using a relative efficiency comparison. It is shown that our estimators are efficient as compared to existing estimators when the parameter of rare unrelated attribute is known and in unknown case, depending on the probability of selecting a question.
文摘In this paper, analysis of methodology was realized for the application of stratified random sampling with optimum allocation in the case of a subject of research which concerns the rural population and presents high differentiations among the three strata in which this population could be classified. The rural population of Evros Prefecture (Greece) with criterion the mean altitude of settlements was classified in three strata, the mountainous, semi-mountainous and fiat population for the estimation of mean consumption of forest fuelwood for covering of heating and cooking needs in households of these three strata. The analysis of this methodology includes: (1) the determination of total size of sample for entire the rural population and its allocation to the various strata; (2) the investigation of effectiveness of stratification with the technique of analysis of variance (One-Way ANOVA); (3) the conduct of sampling research with the realization of face-to-face interviews in selected households and (4) the control of forms of the questionnaire and the analysis of data by using the statistical package for social sciences, SPSS for Windows. All data for the analysis of this methodology and its practical application were taken by the pilot sampling which was realized in each stratum. Relative paper was not found by the review of literature.
基金supported by the Galician Official Statistical Institute(IGE)and by Grants 10DPI105003PRCN2012/130 from Xunta de Galicia(Spain)by Grant number MTM2011-22392 from Ministerio de Ciencia e Innovacion(Spain).
文摘Many operations carried out by official statistical institutes use large-scale surveys obtained by stratified random sampling without replacement. Variables commonly examined in this type of surveys are binary, categorical and continuous, and hence, the estimates of interest involve estimates of proportions, totals and means. The problem of approximating the sampling relative error of this kind of estimates is studied in this paper. Some new jackknife methods are proposed and compared with plug-in and bootstrap methods. An extensive simulation study is carried out to compare the behavior of all the methods considered in this paper.
文摘In this paper, the problem of non-response with significant travel costs in multivariate stratified sample surveys has been formulated of as a Multi-Objective Geometric Programming Problem (MOGPP). The fuzzy programming approach has been described for solving the formulated MOGPP. The formulated MOGPP has been solved with the help of LINGO Software and the dual solution is obtained. The optimum allocations of sample sizes of respondents and non respondents are obtained with the help of dual solutions and primal-dual relationship theorem. A numerical example is given to illustrate the procedure.
文摘The procedure of stratified double quartile ranked set sampling (SDQRSS) method is introduced to estimate the population mean. The SDQRSS is compared with the simple random sampling (SRS), stratified ranked set sampling (SRSS) and stratified simple random sampling (SSRS). It is shown that SDQRSS estimator is an unbiased of the population mean and more efficient than SRS, SRSS and SSRS for symmetric and asymmetric distributions. In addition, by SDQRSS we can increase the efficiency of mean estimator for specific value of the sample size.
基金the National Science and Engineering Research Council of Canada(No.RGPIN-2014-04100)for funding this project.
文摘Digital soil mapping (DSM) aims to produce detailed maps of soil properties or soil classes to improve agricultural management and soil quality assessment. Optimized sampling design can reduce the substantial costs and efforts associated with sampling, profile description, and laboratory analysis. The purpose of this study was to compare common sampling designs for DSM, including grid sampling (GS), grid random sampling (GRS), stratified random sampling (StRS), and conditioned Latin hypercube sampling (cLHS). In an agricultural field (11 ha) in Quebec, Canada, a total of unique 118 locations were selected using each of the four sampling designs (45 locations each), and additional 30 sample locations were selected as an independent testing dataset (evaluation dataset). Soil visible near-infrared (Vis-NIR) spectra were collected in situ at the 148 locations (1 m depth), and soil cores were collected from a subset of 32 locations and subdivided at 10-cm depth intervals, totaling 251 samples. The Cubist model was used to elucidate the relationship between Vis-NIR spectra and soil properties (soil organic matter (SOM) and clay), which was then used to predict the soil properties at all 148 sample locations. Digital maps of soil properties at multiple depths for the entire field (148 sample locations) were prepared using a quantile random forest model to obtain complete model maps (CM-maps). Soil properties were also mapped using the samples from each of the 45 locations for each sampling design to obtain sampling design maps (SD-maps). The SD-maps were evaluated using the independent testing dataset (30 sample locations), and the spatial distribution and model uncertainty of each SD-map were compared with those of the corresponding CM-map. The spatial and feature space coverage were compared across the four sampling designs. The results showed that GS resulted in the most even spatial coverage, cLHS resulted in the best coverage of the feature space, and GS and cLHS resulted in similar prediction accuracies and spatial distributions of soil properties. The SOM content was underestimated using GRS, with large errors at 0–50 cm depth, due to some values not being captured by this sampling design, whereas larger errors for the deeper soil layers were produced using StRS. Predictions of SOM and clay contents had higher accuracy for topsoil (0–30 cm) than for deep subsoil (60–100 cm). It was concluded that the soil sampling designs with either good spatial coverage or feature space coverage can provide good accuracy in 3D DSM, but their performances may be different for different soil properties.
文摘The curve of relationship between fatigue crack growth rate and the stress strength factor amplitude represented an important fatigue property in designing of damage tolerance limits and predicting life of metallic component parts. In order to have a more reasonable use of testing data, samples from population were stratified suggested by the stratified random sample model (SRAM). The data in each stratum corresponded to the same experiment conditions. A suitable weight was assigned to each stratified sample according to the actual working states of the pressure vessel, so that the estimation of fatigue crack growth rate equation was more accurate for practice. An empirical study shows that the SRAM estimation by using fatigue crack growth rate data from different stoves is obviously better than the estimation from simple random sample model.
文摘This paper reveaed some problems of the forest samling investigation from application.and pointed out the defects. Determining sample size method was precisely put forward from formla's origin in simple random Samling procedure In stratified random samgling, two cases were distinguished: the variances Sh2 are equal for all h and not all Sh2 are equal This method made the assertion of making confidence interval more reliable.