There is growing interest in leveraging LiDAR-generated forest Aboveground Biomass (LG-AGB) data as a reference to retrieve AGB from satellite observations. However, the biases arising from the upscaling process and the impact of the sampling strategy on model accuracy still need to be resolved. In this study, we first corrected the bias arising from upscaling the LG-AGB map to match the spatial resolution of Landsat observations. Subsequently, the stratified random sampling method was used to select training samples from the corrected LG-AGB map (cLG-AGB) for the Random Forest (RF) regression model. The RF model features were extracted from the Landsat observations and auxiliary data. The impact of strata numbers on model accuracy was explored during the sampling process. Finally, independent validation was conducted using in situ measurements. The results indicated that: (1) about 68% of the biases can be corrected in the up-scale transformation; (2) compared to no stratification, a three-strata model achieved a 6.5% improvement in AGB estimation accuracy while requiring a 37.8% reduction in sample size; (3) the black locust forest had a low saturation point at 60.52±4.46 Mg/ha AGB, and 72.4% of AGB values were underestimated while the remainder were overestimated. In summary, our study provides a framework to harmonize near-surface LiDAR and satellite data for AGB estimation in plantation forest ecosystems with small patch sizes and fragmented distribution.
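The stratified selection of training samples described in this abstract can be illustrated with a minimal sketch. It assumes that strata are equal-count bins of the reference AGB values, which the abstract does not specify; the function name `stratified_sample` and its parameters are hypothetical, and feature extraction and the Random Forest fit are left out.

```python
import random


def stratified_sample(values, n_strata, n_per_stratum, seed=0):
    """Return indices of training pixels drawn at random within each stratum.

    Assumption: strata are equal-count bins of the sorted reference values
    (for example, corrected LG-AGB per pixel). The source does not state how
    its strata were defined.
    """
    rng = random.Random(seed)
    # Indices of the pixels ordered from lowest to highest reference value.
    order = sorted(range(len(values)), key=lambda i: values[i])
    bin_size = len(order) / n_strata
    chosen = []
    for k in range(n_strata):
        members = order[round(k * bin_size):round((k + 1) * bin_size)]
        # Simple random sampling without replacement inside the stratum.
        chosen.extend(rng.sample(members, min(n_per_stratum, len(members))))
    return chosen


agb = [float(v) for v in range(30)]  # toy reference AGB values
train_idx = stratified_sample(agb, n_strata=3, n_per_stratum=4)
```

Varying `n_strata` while holding the total sample size fixed is one way to reproduce the kind of strata-number comparison the abstract reports.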
In stratified survey sampling, sometimes we have complete auxiliary information. One of the fundamental questions is how to effectively use the complete auxiliary information at the estimation stage. In this paper, we extend the model-calibration method to obtain estimators of the finite population mean by using complete auxiliary information from stratified sampling survey data. We show that the resulting estimators effectively use auxiliary information at the estimation stage and possess a number of attractive features, such as being asymptotically design-unbiased irrespective of the working model and approximately model-unbiased under the model. When a linear working model is used, the resulting estimators reduce to the usual calibration estimator (or GREG).
In this study, we propose a two-stage randomized response model. Improved unbiased estimators of the mean number of persons possessing a rare sensitive attribute under two different situations are proposed. The proposed estimators are evaluated using a relative efficiency comparison. It is shown that our estimators are more efficient than existing estimators when the parameter of the rare unrelated attribute is known and, in the unknown case, depending on the probability of selecting a question.
Variance is one of the most vital measures of dispersion and is widely employed in practice. A commonly used approach for variance estimation is the traditional method of moments, which is strongly influenced by the presence of extreme values, and thus its results cannot be relied on. Building on Koyuncu's recent work, the present paper focuses first on proposing two classes of variance estimators based on linear moments (L-moments), and then on employing them with auxiliary data under double stratified sampling to introduce a new class of calibration variance estimators using important properties of L-moments (L-location, L-cv, L-variance). Three populations are taken into account to assess the efficiency of the new estimators. The first and second populations consist of artificial data, and the third population consists of real data. The percentage relative efficiency of the proposed estimators over existing ones is evaluated. In the presence of extreme values, our findings show the superiority and high efficiency of the proposed classes over traditional classes. Hence, when auxiliary data are available along with extreme values, the proposed classes of estimators may be implemented in an extensive variety of sampling surveys.
Combining the advantages of stratified sampling and importance sampling, a stratified importance sampling method (SISM) is presented to analyze the reliability sensitivity of structures with multiple failure modes. In the presented method, the variable space is divided into several disjoint subspaces by n-dimensional coordinate planes at the mean point of the random vector, and the importance sampling functions in the subspaces are constructed by keeping the sampling center at the mean point and augmenting the standard deviation by a factor of 2. The sample size generated from the importance sampling function in each subspace is determined by the contribution of the subspace to the reliability sensitivity, which can be estimated by iterative simulation in the sampling process. The formulae of the reliability sensitivity estimation, the variance and the coefficient of variation are derived for the presented SISM. Compared with the Monte Carlo method, the stratified sampling method and the importance sampling method, the presented SISM has wider applicability and higher calculation efficiency, which is demonstrated by numerical examples. Finally, the reliability sensitivity analysis of a flap structure illustrates that the SISM can be applied to engineering structures.
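The construction described in this abstract (subspaces cut at the mean point, sampling density centered at the mean with the standard deviation doubled) can be sketched for the simpler task of estimating a failure probability rather than a reliability sensitivity. The sketch assumes independent standard normal variables, equal sample sizes in every subspace instead of the contribution-based allocation the abstract describes, and a hypothetical linear limit state; all function names are illustrative.

```python
import math
import random
from itertools import product


def limit_state(x):
    # Hypothetical limit state: failure when g(x) < 0 (reliability index 3).
    return 3.0 - (x[0] + x[1]) / math.sqrt(2.0)


def normal_pdf(v, sd):
    return math.exp(-0.5 * (v / sd) ** 2) / (sd * math.sqrt(2.0 * math.pi))


def stratified_importance_estimate(g, dim=2, n_per_orthant=10000,
                                   scale=2.0, seed=0):
    """Estimate P[g(X) < 0] for X ~ N(0, I), one stratum per orthant.

    Inside each orthant, samples come from N(0, scale^2 I) truncated to that
    orthant, so the sampling density there is 2**dim times the untruncated one.
    Assumption: equal allocation across orthants.
    """
    rng = random.Random(seed)
    n_orthants = 2 ** dim
    total = 0.0
    for signs in product((-1.0, 1.0), repeat=dim):
        acc = 0.0
        for _ in range(n_per_orthant):
            # Half-normal draws with fixed signs stay inside this orthant.
            x = [s * abs(rng.gauss(0.0, scale)) for s in signs]
            if g(x) < 0.0:
                f = math.prod(normal_pdf(v, 1.0) for v in x)    # target density
                h = math.prod(normal_pdf(v, scale) for v in x)  # inflated density
                acc += f / (n_orthants * h)                     # importance weight
        total += acc / n_per_orthant
    return total


p_fail = stratified_importance_estimate(limit_state)
```

For this limit state the exact failure probability is Φ(−3), about 0.00135, which gives a reference value for checking the estimate.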
Beyond the difficulties of the highly non-linear and non-Gaussian observation process and state distribution in single-target tracking, the presence of a large, varying number of targets and their interactions poses a greater challenge for visual tracking. To overcome these difficulties, we formulate the multiple-target tracking problem in a dynamic Markov network which consists of three coupled Markov random fields that model the following: a field for the joint state of multiple targets, one binary process for the existence of each individual target, and another binary process for occlusion between two adjacent targets. By introducing two robust functions, we eliminate the two binary processes, and then apply a novel version of belief propagation, called the sequential stratified sampling belief propagation algorithm, to obtain the maximum a posteriori (MAP) estimation in the dynamic Markov network. By using a stratified sampler, we incorporate bottom-up information provided by a learned detector (e.g. SVM classifier) and belief information for the message updating. Other low-level visual cues (e.g. color and shape) can be easily incorporated in our multi-target tracking model to obtain better tracking results. Experimental results suggest that our method is comparable to the state-of-the-art multiple-target tracking methods in several test cases.
To analyze the efficiency of area estimations (i.e. estimation accuracy and variation of estimation) impacted by crop mapping error, we simulated error at eight levels for thematic maps using a stratified sampling estimation methodology. The results show that the estimation efficiency is influenced by the combination of the sample size and the error level. Evaluating the trade-offs between sample size and error level showed that reducing the crop mapping error level provides the most benefit (i.e. higher estimation efficiency). Further, sampling performance differed based on the heterogeneity of the crop area. The results demonstrated that the influence of increasing the error level on estimation efficiency is more detrimental in heterogeneous areas than in homogeneous ones. Therefore, to obtain higher estimation efficiency, a larger sample size, a lower error level, or both are needed, especially in heterogeneous areas. We suggest that existing land-cover maps should first be used to determine the heterogeneity of the area. The appropriate sample size for these areas can then be determined according to all three factors: heterogeneity, expected estimation efficiency, and sampling budget. Overall, extending our understanding of the impacts of crop mapping error is necessary for decision making to improve our ability to effectively estimate crop area.
Many operations carried out by official statistical institutes use large-scale surveys obtained by stratified random sampling without replacement. Variables commonly examined in this type of survey are binary, categorical and continuous, and hence the estimates of interest involve estimates of proportions, totals and means. The problem of approximating the sampling relative error of this kind of estimate is studied in this paper. Some new jackknife methods are proposed and compared with plug-in and bootstrap methods. An extensive simulation study is carried out to compare the behavior of all the methods considered in this paper.
In general, the accuracy of the mean estimator can be improved by stratified random sampling. In this paper, we propose an idea, different from empirical methods, that the accuracy can be improved further through the bootstrap resampling method under some conditions. The determination of sample size by the bootstrap method is also discussed, and a simulation is carried out to verify the accuracy of the proposed method. The simulation results show that the sample size based on bootstrapping is smaller than that based on the central limit theorem.
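As an illustration of bootstrap resampling in a stratified design, the following minimal sketch resamples with replacement inside each stratum and recomputes the weighted mean. It is not the authors' procedure: it ignores finite population corrections, and the function names and the `weights` argument (stratum population shares summing to 1) are assumptions made for this example.

```python
import random
import statistics


def stratified_mean(samples, weights):
    """Weighted mean of stratum sample means; weights are stratum shares."""
    return sum(w * statistics.fmean(s) for s, w in zip(samples, weights))


def stratified_bootstrap_se(samples, weights, n_boot=1000, seed=0):
    """Bootstrap standard error of the stratified mean.

    Each replicate resamples every stratum independently, with replacement,
    keeping the original stratum sample sizes.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        resampled = [rng.choices(s, k=len(s)) for s in samples]
        estimates.append(stratified_mean(resampled, weights))
    return statistics.stdev(estimates)


strata_samples = [[1.0, 2.0, 3.0, 4.0], [10.0, 20.0, 30.0]]
strata_weights = [0.4, 0.6]
estimate = stratified_mean(strata_samples, strata_weights)
std_error = stratified_bootstrap_se(strata_samples, strata_weights)
```

One way to explore sample size with such a sketch is to repeat the bootstrap for several candidate stratum sizes `k` and pick the smallest that meets a target standard error.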
This paper analyzes the methodology for applying stratified random sampling with optimum allocation to a research subject that concerns the rural population and shows strong differences among the three strata into which this population can be classified. The rural population of Evros Prefecture (Greece) was classified, using the mean altitude of settlements as the criterion, into three strata (mountainous, semi-mountainous and flat) in order to estimate the mean consumption of forest fuelwood for heating and cooking needs in the households of these three strata. The analysis of this methodology includes: (1) the determination of the total sample size for the entire rural population and its allocation to the various strata; (2) the investigation of the effectiveness of stratification with analysis of variance (one-way ANOVA); (3) the conduct of the sample survey through face-to-face interviews in selected households; and (4) the checking of the questionnaire forms and the analysis of data using the Statistical Package for the Social Sciences (SPSS for Windows). All data for the analysis of this methodology and its practical application were taken from the pilot sampling carried out in each stratum. No related paper was found in the literature review.
In this paper, auxiliary information is used to determine an estimator of the finite population total using nonparametric regression under stratified random sampling. To achieve this, a model-based approach is adopted by making use of local polynomial regression estimation to predict the nonsampled values of the survey variable y. The performance of the proposed estimator is investigated against some design-based and model-based regression estimators. The simulation experiments show that the resulting estimator exhibits good properties. Generally, good confidence intervals are seen for the nonparametric regression estimators, and use of the proposed estimator leads to relatively smaller values of RE compared to other estimators.
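The model-based idea in this abstract (predict the nonsampled y values from the auxiliary variable, then add them to the observed sample values) can be sketched with a local linear fit, the degree-1 case of local polynomial regression. The Gaussian kernel, the fixed bandwidth, the per-stratum fitting and all names below are assumptions for illustration, not the paper's specification.

```python
import math


def local_linear_predict(x0, xs, ys, bandwidth):
    """Local linear regression estimate of E[y | x = x0], Gaussian kernel."""
    s0 = s1 = s2 = t0 = t1 = 0.0
    for x, y in zip(xs, ys):
        d = x - x0
        w = math.exp(-0.5 * (d / bandwidth) ** 2)  # kernel weight
        s0 += w
        s1 += w * d
        s2 += w * d * d
        t0 += w * y
        t1 += w * d * y
    denom = s0 * s2 - s1 * s1
    if abs(denom) < 1e-12:
        return t0 / s0  # fall back to a kernel-weighted mean
    # Intercept of the weighted least-squares line centered at x0.
    return (s2 * t0 - s1 * t1) / denom


def predicted_total(strata, bandwidth):
    """Sum of observed sample values plus predictions for nonsampled units."""
    total = 0.0
    for stratum in strata:
        xs, ys = stratum["sample_x"], stratum["sample_y"]
        total += sum(ys)
        total += sum(local_linear_predict(x0, xs, ys, bandwidth)
                     for x0 in stratum["nonsample_x"])
    return total


strata = [
    {"sample_x": [1.0, 2.0, 4.0, 5.0], "sample_y": [3.0, 5.0, 9.0, 11.0],
     "nonsample_x": [3.0, 6.0]},
    {"sample_x": [0.0, 2.0, 4.0], "sample_y": [10.0, 8.0, 6.0],
     "nonsample_x": [1.0, 3.0]},
]
total_estimate = predicted_total(strata, bandwidth=1.5)
```

Because a local linear fit reproduces exactly linear data, the toy strata above (y = 2x + 1 and y = 10 − x) give a convenient sanity check on the estimator.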
This study examined gender differences in modal choice among residents of coastal communities of Yenagoa metropolis in Bayelsa State, Nigeria. The Four-Step model of transportation planning and modal choice provided the theoretical basis for this study. A survey research design involving a stratified sampling technique was adopted. The descriptive statistics on transport modes, amount and time spent revealed that 10 (76.9%) males and 3 (23.1%) females preferred the bicycle as a means of transportation, 7 (58.3%) males and 5 (41.7%) females preferred the motorcycle, while a significant proportion, 90 (53.9%) males and 77 (46.1%) females, preferred the tricycle, 80 (63.0%) males and 47 (37.0%) females preferred cars/taxis, and 12 (46.2%) males and 14 (53.8%) females preferred the mass transit bus. However, 14 (46.7%) males and 16 (53.3%) females in marshy terrain and coastal locations preferred canoes and boats. The result of the logistic regression model revealed that gender modal preference is more likely to be influenced by mode of transportation (beta weight of 1.140), safety considerations (1.139), ownership of transport (1.135) and distance to place of work (1.073). Hence, this study recommends that a combination of these factors should be incorporated into transport planning to achieve effective transport planning and sustainable development in the Yenagoa metropolis.
Digital soil mapping (DSM) aims to produce detailed maps of soil properties or soil classes to improve agricultural management and soil quality assessment. Optimized sampling design can reduce the substantial costs and efforts associated with sampling, profile description, and laboratory analysis. The purpose of this study was to compare common sampling designs for DSM, including grid sampling (GS), grid random sampling (GRS), stratified random sampling (StRS), and conditioned Latin hypercube sampling (cLHS). In an agricultural field (11 ha) in Quebec, Canada, a total of 118 unique locations were selected using each of the four sampling designs (45 locations each), and an additional 30 sample locations were selected as an independent testing dataset (evaluation dataset). Soil visible near-infrared (Vis-NIR) spectra were collected in situ at the 148 locations (1 m depth), and soil cores were collected from a subset of 32 locations and subdivided at 10-cm depth intervals, totaling 251 samples. The Cubist model was used to elucidate the relationship between Vis-NIR spectra and soil properties (soil organic matter (SOM) and clay), which was then used to predict the soil properties at all 148 sample locations. Digital maps of soil properties at multiple depths for the entire field (148 sample locations) were prepared using a quantile random forest model to obtain complete model maps (CM-maps). Soil properties were also mapped using the samples from each of the 45 locations for each sampling design to obtain sampling design maps (SD-maps). The SD-maps were evaluated using the independent testing dataset (30 sample locations), and the spatial distribution and model uncertainty of each SD-map were compared with those of the corresponding CM-map.
The spatial and feature space coverage were compared across the four sampling designs. The results showed that GS resulted in the most even spatial coverage, cLHS resulted in the best coverage of the feature space, and GS and cLHS resulted in similar prediction accuracies and spatial distributions of soil properties. The SOM content was underestimated using GRS, with large errors at 0–50 cm depth, due to some values not being captured by this sampling design, whereas larger errors for the deeper soil layers were produced using StRS. Predictions of SOM and clay contents had higher accuracy for topsoil (0–30 cm) than for deep subsoil (60–100 cm). It was concluded that soil sampling designs with either good spatial coverage or good feature space coverage can provide good accuracy in 3D DSM, but their performance may differ for different soil properties.
The curve relating fatigue crack growth rate to the stress intensity factor amplitude represents an important fatigue property in designing damage tolerance limits and predicting the life of metallic component parts. In order to make more reasonable use of testing data, samples from the population were stratified as suggested by the stratified random sample model (SRAM). The data in each stratum corresponded to the same experimental conditions. A suitable weight was assigned to each stratified sample according to the actual working states of the pressure vessel, so that the estimation of the fatigue crack growth rate equation was more accurate in practice. An empirical study shows that the SRAM estimation using fatigue crack growth rate data from different stoves is clearly better than the estimation from the simple random sample model.
This paper revealed some problems in the application of forest sampling investigation and pointed out the defects. A method for determining sample size was derived rigorously from the origin of the formula in the simple random sampling procedure. In stratified random sampling, two cases were distinguished: the variances S_h^2 are equal for all h, and not all S_h^2 are equal. This method made confidence interval statements more reliable.
The procedure of the stratified double quartile ranked set sampling (SDQRSS) method is introduced to estimate the population mean. The SDQRSS is compared with simple random sampling (SRS), stratified ranked set sampling (SRSS) and stratified simple random sampling (SSRS). It is shown that the SDQRSS estimator is an unbiased estimator of the population mean and is more efficient than SRS, SRSS and SSRS for symmetric and asymmetric distributions. In addition, with SDQRSS we can increase the efficiency of the mean estimator for a specific value of the sample size.
The random forest model is the mainstream research method used to accurately describe the distribution law and impact mechanism of regional population. We took Shijiazhuang as the research area, with comprehensive zoning based on endowments as the modeling unit, conducted stratified sampling on a hectare grid cell, and systematically carried out incremental selection experiments of population density impact factors, optimizing the population density random forest model throughout the process (zonal modeling, stratified sampling, factor selection, weighted output). The results are as follows: (1) Zonal modeling addresses the issue of confusion in population distribution laws caused by a single model. Sampling on a grid cell not only ensures the quality of training data by avoiding the modifiable areal unit problem (MAUP) but also attempts to mitigate the adverse effects of the ecological fallacy. Stratified sampling ensures the stability of population density label values (target variable) in the training sample. (2) Zonal selection experiments on population density impact factors help identify suitable combinations of factors, leading to a significant improvement in the goodness of fit (R^2) of the zonal models. (3) Weighted combination output of the population density prediction dataset substantially enhances the model's robustness. (4) The population density dataset exhibits multi-scale superposition characteristics. On a large scale, the population density in plains is higher than that in mountainous areas, while on a small scale, urban areas have higher density compared to rural areas. The optimization scheme for the population density random forest model that we propose offers a unified technical framework for uncovering local population distribution laws and their impact mechanisms.
Spatial variability of soil properties imposes a challenge for practical analysis and design in geotechnical engineering. This is particularly true for slope stability assessment, where the effects of uncertainty are synthesized in the so-called probability of failure. This probability quantifies the reliability of a slope, and its calculation is usually quite involved from a numerical viewpoint. In view of this issue, this paper proposes an approach for failure probability assessment based on Latinized partially stratified sampling and maximum entropy distribution with fractional moments. The spatial variability of geotechnical properties is represented by means of random fields and the Karhunen-Loève expansion. Then, failure probabilities are estimated employing maximum entropy distribution with fractional moments. The application of the proposed approach is examined with two examples: a case study of an undrained slope and a case study of a slope with cross-correlated random fields of strength parameters under drained conditions. The results show that the proposed approach has excellent accuracy and high efficiency, and it can be applied straightforwardly to similar geotechnical engineering problems.
Fishery-independent surveys are often used for collecting high-quality biological and ecological data to support fisheries management. A careful optimization of fishery-independent survey design is necessary to improve the precision of survey estimates with cost-effective sampling efforts. We developed a simulation approach to evaluate and optimize the stratification scheme for a fishery-independent survey with multiple goals, including estimation of abundance indices of individual species and species diversity indices. We compared the performances of the sampling designs with different stratification schemes for different goals over different months. Gains in precision of survey estimates from the stratification schemes were achieved compared to the simple random sampling design for most indices. The stratification scheme with five strata performed the best. This study showed that the loss of precision of survey estimates due to the reduction of sampling efforts could be compensated by improved stratification schemes, which would reduce the cost and negative impacts of survey trawling on those species with low abundance in the fishery-independent survey. This study also suggests that the optimization of a survey design differs with different survey objectives. A post-survey analysis can improve the stratification scheme of fishery-independent survey designs.
Stratified random surveys are commonly used to estimate abundance indices of fish populations in multispecies surveys, providing reliable data for stock assessment and fisheries management. In some cases, however, the sample size is relatively small because of the limitation of survey cost or other factors. The allocation methods of sampling efforts among strata in stratified random surveys with small sample size may need adjustment compared with traditional approaches. In this study, two sampling stations were allocated to each stratum first and then the remaining sampling units were allocated among strata using five traditional allocation methods. In order to distinguish them from traditional methods, we called them adjusted methods in this study. A simulation study was conducted to compare the performances of different allocation strategies of sampling efforts in a stratified random survey for estimating abundance indices of multiple target species. Relative estimation error (REE) and relative bias (RB) were used to measure the precision and accuracy of estimates of abundance indices under different allocation schemes of sampling efforts in the multispecies survey. The performances of different allocation schemes in estimating abundance indices varied greatly for different species over different seasons. The adjusted Neyman allocation scheme could significantly reduce the REE and RB of estimates of the abundance index for a single-species survey. For multiple-species surveys, the adjusted average-Neyman allocation method, the adjusted Yate allocation method, the adjusted proportional allocation method and the current allocation method had relatively high accuracy and precision of estimates of abundance indices for four species in terms of the total REE and total RB. Though the adjusted average-Neyman allocation scheme did not always have the best performance, it was the optimal one considering the accuracy and precision of estimates of abundance indices for all species simultaneously. The allocation of sampling efforts among strata in stratified random surveys aimed at estimating abundance indices of multiple species should comprehensively consider the variance of abundance of different species in each stratum and the seasonal changes.
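The adjusted allocation described in this abstract (two stations per stratum first, the remainder distributed by a traditional rule) can be sketched for the Neyman case, where the remaining stations are shared in proportion to N_h × S_h. The largest-remainder rounding and the function name are assumptions for this example; the abstract does not say how fractional allocations were rounded.

```python
import math


def adjusted_neyman_allocation(stratum_sizes, stratum_sds, total_n, minimum=2):
    """Give every stratum `minimum` stations, then split the rest by N_h * S_h.

    Assumption: fractional shares are rounded with the largest-remainder rule
    so that the allocation sums exactly to `total_n`.
    """
    k = len(stratum_sizes)
    remaining = total_n - minimum * k
    if remaining < 0:
        raise ValueError("total_n is too small for the per-stratum minimum")
    weights = [n * s for n, s in zip(stratum_sizes, stratum_sds)]
    weight_sum = sum(weights)
    if weight_sum == 0:
        shares = [remaining / k] * k  # no variability information: split evenly
    else:
        shares = [remaining * w / weight_sum for w in weights]
    extra = [math.floor(s) for s in shares]
    leftover = remaining - sum(extra)
    # Hand leftover stations to the strata with the largest fractional parts.
    by_fraction = sorted(range(k), key=lambda i: shares[i] - extra[i],
                         reverse=True)
    for i in by_fraction[:leftover]:
        extra[i] += 1
    return [minimum + e for e in extra]


allocation = adjusted_neyman_allocation([100, 200, 300], [1.0, 2.0, 3.0], 20)
```

With these toy inputs the 14 stations left after the minimum are shared in the ratio 1:4:9, so the strata receive 3, 6 and 11 stations.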
Funding (LiDAR-generated forest AGB study): Supported by the National Natural Science Foundation of China [grant numbers 41471419 and 31971579].
Funding (model-calibration estimators study): Supported by the National Natural Science Foundation of China (10571093).
Funding (L-moments calibration variance estimators study): The authors thank the Deanship of Scientific Research at King Khalid University, Kingdom of Saudi Arabia, for funding this study through the research groups program under Project Number R.G.P.1/64/42. Ishfaq Ahmad and Ibrahim Mufrah Almanjahie received the grant.
Funding: National Natural Science Foundation of China (10572117, 10802063, 50875213); Aeronautical Science Foundation of China (2007ZA53012); New Century Program for Excellent Talents of Ministry of Education of China (NCET-05-0868); National High-tech Research and Development Program (2007AA04Z401).
Abstract: Combining the advantages of stratified sampling and importance sampling, a stratified importance sampling method (SISM) is presented to analyze the reliability sensitivity of structures with multiple failure modes. In the presented method, the variable space is divided into several disjoint subspaces by n-dimensional coordinate planes at the mean point of the random vector, and the importance sampling functions in the subspaces are constructed by keeping the sampling center at the mean point and augmenting the standard deviation by a factor of 2. The sample size generated from the importance sampling function in each subspace is determined by the contribution of the subspace to the reliability sensitivity, which can be estimated by iterative simulation in the sampling process. The formulae for the reliability sensitivity estimate, its variance, and its coefficient of variation are derived for the presented SISM. Compared with the Monte Carlo method, the stratified sampling method, and the importance sampling method, the presented SISM has wider applicability and higher calculation efficiency, which is demonstrated by numerical examples. Finally, a reliability sensitivity analysis of a flap structure illustrates that the SISM can be applied to engineering structures.
Funding: Supported in part by the National Natural Science Foundation of China (Grant Nos. 60205001, 60405004 and 60021302).
Abstract: Beyond the difficulties of highly non-linear and non-Gaussian observation processes and state distributions in single-target tracking, the presence of a large, varying number of targets and their interactions poses a greater challenge for visual tracking. To overcome these difficulties, we formulate the multiple-target tracking problem in a dynamic Markov network which consists of three coupled Markov random fields that model the following: a field for the joint state of the multiple targets, one binary process for the existence of each individual target, and another binary process for occlusion between two adjacent targets. By introducing two robust functions, we eliminate the two binary processes, and then apply a novel version of belief propagation, called the sequential stratified sampling belief propagation algorithm, to obtain the maximum a posteriori (MAP) estimate in the dynamic Markov network. By using a stratified sampler, we incorporate bottom-up information provided by a learned detector (e.g. an SVM classifier) and belief information for the message updating. Other low-level visual cues (e.g. color and shape) can be easily incorporated in our multi-target tracking model to obtain better tracking results. Experimental results suggest that our method is comparable to state-of-the-art multiple-target tracking methods in several test cases.
Funding: The Major Project of High-Resolution Earth Observation System, China [grant number 09-20A05-9001-17/18]; the New Hampshire Agricultural Experiment Station (this is Scientific Contribution Number 2728); the USDA National Institute of Food and Agriculture McIntire Stennis Project #NH00077-M (Accession #1002519).
Abstract: To analyze how the efficiency of area estimation (i.e. estimation accuracy and variation of estimation) is affected by crop mapping error, we simulated error at eight levels for thematic maps using a stratified sampling estimation methodology. The results show that the estimation efficiency is influenced by the combination of the sample size and the error level. Evaluating the trade-offs between sample size and error level showed that reducing the crop mapping error level provides the most benefit (i.e. higher estimation efficiency). Further, sampling performance differed based on the heterogeneity of the crop area. The results demonstrated that increasing the error level is more detrimental to estimation efficiency in heterogeneous areas than in homogeneous ones. Therefore, to obtain higher estimation efficiency, a larger sample size, a lower error level, or both are needed, especially in heterogeneous areas. We suggest that existing land-cover maps should first be used to determine the heterogeneity of the area. The appropriate sample size for these areas can then be determined according to all three factors: heterogeneity, expected estimation efficiency, and sampling budget. Overall, extending our understanding of the impacts of crop mapping error is necessary for decision making to improve our ability to effectively estimate crop area.
Funding: Supported by the Galician Official Statistical Institute (IGE), by Grants 10DPI105003PRCN2012/130 from Xunta de Galicia (Spain), and by Grant number MTM2011-22392 from Ministerio de Ciencia e Innovacion (Spain).
Abstract: Many operations carried out by official statistical institutes use large-scale surveys obtained by stratified random sampling without replacement. Variables commonly examined in this type of survey are binary, categorical and continuous, and hence the estimates of interest involve estimates of proportions, totals and means. The problem of approximating the sampling relative error of these estimates is studied in this paper. Some new jackknife methods are proposed and compared with plug-in and bootstrap methods. An extensive simulation study is carried out to compare the behavior of all the methods considered in this paper.
Funding: The Science Research Start-up Foundation for Young Teachers of Southwest Jiaotong University (No. 2007Q091).
Abstract: In general, the accuracy of the mean estimator can be improved by stratified random sampling. In this paper, we propose an idea that differs from empirical methods: under some conditions, the accuracy can be improved further through the bootstrap resampling method. The determination of sample size by the bootstrap method is also discussed, and a simulation is carried out to verify the accuracy of the proposed method. The simulation results show that the sample size based on bootstrapping is smaller than that based on the central limit theorem.
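As a rough illustration of bootstrap resampling under stratified random sampling, the following Python sketch resamples independently within each stratum and recomputes the weighted mean. It is a naive within-stratum bootstrap, not necessarily the procedure of the paper above; the function names, the stratum weights `weights` (assumed to be the population shares N_h/N), and the omission of any finite population correction are all assumptions made for illustration.

```python
import random
import statistics


def stratified_mean(samples, weights):
    """Weighted mean of per-stratum sample means (weights assumed to sum to 1)."""
    return sum(w * statistics.fmean(s) for s, w in zip(samples, weights))


def stratified_bootstrap_se(samples, weights, n_boot=1000, seed=0):
    """Bootstrap standard error of the stratified mean.

    Each replicate resamples every stratum with replacement, keeping the
    stratum sample sizes fixed, and recomputes the stratified mean.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        resampled = [rng.choices(s, k=len(s)) for s in samples]
        estimates.append(stratified_mean(resampled, weights))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    # Hypothetical data: two strata with equal population shares.
    data = [[1, 2, 3], [10, 20, 30]]
    shares = [0.5, 0.5]
    print(stratified_mean(data, shares))
    print(stratified_bootstrap_se(data, shares))
```

Repeating the calculation for several candidate sample sizes and stopping once the bootstrap standard error falls below a target is one possible way to choose a sample size, though the paper's own rule may differ.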
Abstract: This paper analyzes the methodology for applying stratified random sampling with optimum allocation to a research subject that concerns the rural population and shows large differences among the three strata into which this population can be classified. The rural population of Evros Prefecture (Greece) was classified, using the mean altitude of settlements as the criterion, into three strata (mountainous, semi-mountainous and flat) in order to estimate the mean consumption of forest fuelwood for heating and cooking in the households of these strata. The analysis of this methodology includes: (1) determining the total sample size for the entire rural population and its allocation to the various strata; (2) investigating the effectiveness of the stratification with analysis of variance (One-Way ANOVA); (3) conducting the sampling survey through face-to-face interviews in selected households; and (4) checking the questionnaire forms and analyzing the data using the Statistical Package for the Social Sciences, SPSS for Windows. All data for the analysis of this methodology and its practical application were taken from the pilot sampling carried out in each stratum. The literature review found no comparable paper.
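The allocation step in item (1) can be sketched in Python as follows. The sketch assumes equal per-unit survey cost in every stratum, in which case optimum allocation reduces to Neyman allocation, n_h proportional to N_h * S_h. The function name, the largest-remainder rounding, and the example figures are illustrative assumptions and are not taken from the paper.

```python
import math


def neyman_allocation(total_n, sizes, stds):
    """Split total_n sample units across strata in proportion to N_h * S_h.

    sizes: stratum population sizes N_h
    stds:  stratum standard deviations S_h (e.g. from a pilot sample)
    """
    weights = [n_h * s_h for n_h, s_h in zip(sizes, stds)]
    total_w = sum(weights)
    if total_w == 0:
        raise ValueError("all strata have zero weight")
    raw = [total_n * w / total_w for w in weights]
    # Round down, then hand out the leftover units to the largest remainders
    # so that the allocation sums exactly to total_n.
    alloc = [math.floor(r) for r in raw]
    leftover = total_n - sum(alloc)
    order = sorted(range(len(raw)), key=lambda i: raw[i] - alloc[i], reverse=True)
    for i in order[:leftover]:
        alloc[i] += 1
    return alloc


if __name__ == "__main__":
    # Hypothetical strata: sizes and pilot standard deviations.
    print(neyman_allocation(100, [1000, 2000, 3000], [3.0, 2.0, 1.0]))
```

Strata that are larger or more variable receive more units, which is why a pilot sample in each stratum is needed to estimate S_h before the main survey.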
Abstract: In this paper, auxiliary information is used to determine an estimator of the finite population total using nonparametric regression under stratified random sampling. To achieve this, a model-based approach is adopted, using local polynomial regression estimation to predict the nonsampled values of the survey variable y. The performance of the proposed estimator is investigated against some design-based and model-based regression estimators. The simulation experiments show that the resulting estimator exhibits good properties. Generally, good confidence intervals are seen for the nonparametric regression estimators, and use of the proposed estimator leads to relatively smaller values of RE compared to other estimators.
Abstract: This study examined gender differences in modal choice among residents of coastal communities of Yenagoa metropolis in Bayelsa State, Nigeria. The Four-Step model of transportation planning and modal choice provided the theoretical basis for this study. A survey research design involving a stratified sampling technique was adopted. The descriptive statistics on transport modes, amount and time spent revealed that 10 (76.9%) males and 3 (23.1%) females preferred the bicycle as a means of transportation, 7 (58.3%) males and 5 (41.7%) females preferred the motorcycle, while a significant proportion, 90 (53.9%) males and 77 (46.1%) females, preferred the tricycle, 80 (63.0%) males and 47 (37.0%) females preferred cars/taxis, and 12 (46.2%) males and 14 (53.8%) females preferred the mass transit bus. However, 14 (46.7%) males and 16 (53.3%) females in marshy terrain and coastal locations preferred canoes and boats. The result of the logistic regression model revealed that gender modal preference is more likely to be influenced by mode of transportation, with a beta weight of 1.140, safety considerations (1.139), ownership of transport (1.135) and distance to place of work (1.073). Hence, this study recommends that a combination of these factors be incorporated into transport planning to achieve effective transport planning and sustainable development in the Yenagoa metropolis.
Funding: The National Science and Engineering Research Council of Canada (No. RGPIN-2014-04100) funded this project.
Abstract: Digital soil mapping (DSM) aims to produce detailed maps of soil properties or soil classes to improve agricultural management and soil quality assessment. Optimized sampling design can reduce the substantial costs and efforts associated with sampling, profile description, and laboratory analysis. The purpose of this study was to compare common sampling designs for DSM, including grid sampling (GS), grid random sampling (GRS), stratified random sampling (StRS), and conditioned Latin hypercube sampling (cLHS). In an agricultural field (11 ha) in Quebec, Canada, a total of 118 unique locations were selected using the four sampling designs (45 locations each), and an additional 30 sample locations were selected as an independent testing dataset (evaluation dataset). Soil visible near-infrared (Vis-NIR) spectra were collected in situ at the 148 locations (1 m depth), and soil cores were collected from a subset of 32 locations and subdivided at 10-cm depth intervals, totaling 251 samples. The Cubist model was used to elucidate the relationship between Vis-NIR spectra and soil properties (soil organic matter (SOM) and clay), which was then used to predict the soil properties at all 148 sample locations. Digital maps of soil properties at multiple depths for the entire field (148 sample locations) were prepared using a quantile random forest model to obtain complete model maps (CM-maps). Soil properties were also mapped using the samples from the 45 locations of each sampling design to obtain sampling design maps (SD-maps). The SD-maps were evaluated using the independent testing dataset (30 sample locations), and the spatial distribution and model uncertainty of each SD-map were compared with those of the corresponding CM-map. The spatial and feature space coverage were compared across the four sampling designs. The results showed that GS resulted in the most even spatial coverage, cLHS resulted in the best coverage of the feature space, and GS and cLHS resulted in similar prediction accuracies and spatial distributions of soil properties. The SOM content was underestimated using GRS, with large errors at 0–50 cm depth, because some values were not captured by this sampling design, whereas StRS produced larger errors for the deeper soil layers. Predictions of SOM and clay contents had higher accuracy for topsoil (0–30 cm) than for deep subsoil (60–100 cm). It was concluded that soil sampling designs with either good spatial coverage or good feature space coverage can provide good accuracy in 3D DSM, but their performance may differ for different soil properties.
Abstract: The curve relating fatigue crack growth rate to the stress strength factor amplitude represents an important fatigue property for designing damage tolerance limits and predicting the life of metallic component parts. To make more reasonable use of testing data, samples from the population were stratified as suggested by the stratified random sample model (SRAM). The data in each stratum corresponded to the same experimental conditions. A suitable weight was assigned to each stratified sample according to the actual working states of the pressure vessel, so that the estimate of the fatigue crack growth rate equation was more accurate in practice. An empirical study shows that the SRAM estimate using fatigue crack growth rate data from different stoves is clearly better than the estimate from the simple random sample model.
Abstract: This paper reveals some problems that arise in applying forest sampling surveys and points out their defects. A precise method for determining the sample size under simple random sampling is put forward, derived from the origin of the formula. For stratified random sampling, two cases are distinguished: the variances S_h^2 are equal for all h, and not all S_h^2 are equal. This method makes the confidence interval statements more reliable.
Abstract: The stratified double quartile ranked set sampling (SDQRSS) procedure is introduced to estimate the population mean. SDQRSS is compared with simple random sampling (SRS), stratified ranked set sampling (SRSS) and stratified simple random sampling (SSRS). It is shown that the SDQRSS estimator is an unbiased estimator of the population mean and is more efficient than SRS, SRSS and SSRS for symmetric and asymmetric distributions. In addition, SDQRSS can increase the efficiency of the mean estimator for specific values of the sample size.
Funding: National Natural Science Foundation of China, No. 42071167, No. 42201197, No. 40871073; The Second Tibetan Plateau Scientific Expedition and Research Program, No. 2019QZKK0406; Natural Science Foundation of Hebei Province, No. D2007000272.
Abstract: The random forest model is the mainstream research method used to accurately describe the distribution law and impact mechanism of regional population. We took Shijiazhuang as the research area, used comprehensive zoning based on endowments as the modeling unit, conducted stratified sampling on a hectare grid cell, and systematically carried out incremental selection experiments of population density impact factors, optimizing the population density random forest model throughout the process (zonal modeling, stratified sampling, factor selection, weighted output). The results are as follows: (1) Zonal modeling addresses the issue of confusion in population distribution laws caused by a single model. Sampling on a grid cell not only ensures the quality of training data by avoiding the modifiable areal unit problem (MAUP) but also attempts to mitigate the adverse effects of the ecological fallacy. Stratified sampling ensures the stability of population density label values (target variable) in the training sample. (2) Zonal selection experiments on population density impact factors help identify suitable combinations of factors, leading to a significant improvement in the goodness of fit (R^(2)) of the zonal models. (3) Weighted combination output of the population density prediction dataset substantially enhances the model's robustness. (4) The population density dataset exhibits multi-scale superposition characteristics. On a large scale, the population density in plains is higher than that in mountainous areas, while on a small scale, urban areas have higher density than rural areas. The optimization scheme that we propose for the population density random forest model offers a unified technical framework for uncovering local population distribution laws and impact mechanisms.
Funding: Funding support from the China Scholarship Council (CSC).
Abstract: Spatial variability of soil properties poses a challenge for practical analysis and design in geotechnical engineering. This is particularly true for slope stability assessment, where the effects of uncertainty are synthesized in the so-called probability of failure. This probability quantifies the reliability of a slope, and its calculation is usually quite involved numerically. In view of this issue, this paper proposes an approach for failure probability assessment based on Latinized partially stratified sampling and maximum entropy distribution with fractional moments. The spatial variability of geotechnical properties is represented by means of random fields and the Karhunen-Loève expansion. Then, failure probabilities are estimated employing maximum entropy distribution with fractional moments. The application of the proposed approach is examined with two examples: a case study of an undrained slope and a case study of a drained slope with cross-correlated random fields of strength parameters. The results show that the proposed approach has excellent accuracy and high efficiency, and that it can be applied straightforwardly to similar geotechnical engineering problems.
Funding: The Public Science and Technology Research Funds Projects of Ocean under contract No. 201305030; the Specialized Research Fund for the Doctoral Program of Higher Education under contract No. 20120132130001.
Abstract: Fishery-independent surveys are often used to collect high-quality biological and ecological data to support fisheries management. Careful optimization of fishery-independent survey design is necessary to improve the precision of survey estimates with cost-effective sampling efforts. We developed a simulation approach to evaluate and optimize the stratification scheme for a fishery-independent survey with multiple goals, including estimation of abundance indices of individual species and species diversity indices. We compared the performance of sampling designs with different stratification schemes for different goals over different months. For most indices, the stratification schemes yielded gains in the precision of survey estimates compared to a simple random sampling design. The stratification scheme with five strata performed the best. This study showed that the loss of precision of survey estimates due to reduced sampling effort could be compensated by improved stratification schemes, which would reduce the cost of survey trawling and its negative impacts on species with low abundance in the fishery-independent survey. This study also suggests that the optimal survey design differs with the survey objectives. A post-survey analysis can improve the stratification scheme of fishery-independent survey designs.
Funding: This work was funded by the National Key R&D Program of China (2018YFD0900904), the National Natural Science Foundation of China (31772852), and the Fundamental Research Funds for the Central Universities (No. 201562030, No. 201612004).
Abstract: Stratified random surveys are commonly used to estimate abundance indices of fish populations in multispecies surveys, providing reliable data for stock assessment and fisheries management. In some cases, however, the sample size is relatively small because of limited survey budgets or other factors. The methods for allocating sampling effort among strata in stratified random surveys with small sample sizes may need adjustment compared with traditional approaches. In this study, two sampling stations were first allocated to each stratum, and then the remaining sampling units were allocated among strata using five traditional allocation methods. To distinguish them from the traditional methods, we call them adjusted methods in this study. A simulation study was conducted to compare the performance of different strategies for allocating sampling effort in a stratified random survey for estimating abundance indices of multiple target species. Relative estimation error (REE) and relative bias (RB) were used to measure the precision and accuracy of estimates of abundance indices under the different allocation schemes in the multispecies survey. The performance of the allocation schemes in estimating abundance indices varied greatly among species and seasons. The adjusted Neyman allocation scheme could significantly reduce the REE and RB of estimates of the abundance index for a single-species survey. For multiple-species surveys, the adjusted average-Neyman allocation method, the adjusted Yate allocation method, the adjusted proportional allocation method and the current allocation method had relatively high accuracy and precision of estimates of abundance indices for four species in terms of total_(REE) and total_(RB). Though the adjusted average-Neyman allocation scheme did not always have the best performance, it was the optimal one when the accuracy and precision of estimates of abundance indices for all species were considered simultaneously. The allocation of sampling effort among strata in stratified random surveys that aim to estimate abundance indices of multiple species should comprehensively consider the variance in abundance of the different species within each stratum and the seasonal changes.
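The two-step "adjusted" allocation described above can be sketched in Python for the Neyman case: every stratum first receives a fixed minimum of two stations, and the remaining stations are then split in proportion to N_h * S_h. This is a minimal sketch under assumptions; the function name, the largest-remainder rounding, the equal-split fallback when all weights are zero, and the example numbers are illustrative and do not come from the study.

```python
import math


def adjusted_neyman_allocation(total_n, sizes, stds, minimum=2):
    """Give each stratum `minimum` stations, then Neyman-allocate the rest.

    sizes: stratum sizes N_h (e.g. area or number of candidate stations)
    stds:  stratum standard deviations S_h of the abundance index
    """
    k = len(sizes)
    remaining = total_n - minimum * k
    if remaining < 0:
        raise ValueError("total_n is too small for the per-stratum minimum")
    weights = [n_h * s_h for n_h, s_h in zip(sizes, stds)]
    total_w = sum(weights)
    if total_w == 0:
        # Assumed fallback: split the remainder equally when no variance data exist.
        raw = [remaining / k] * k
    else:
        raw = [remaining * w / total_w for w in weights]
    # Largest-remainder rounding keeps the total exactly equal to total_n.
    extra = [math.floor(r) for r in raw]
    leftover = remaining - sum(extra)
    order = sorted(range(k), key=lambda i: raw[i] - extra[i], reverse=True)
    for i in order[:leftover]:
        extra[i] += 1
    return [minimum + e for e in extra]


if __name__ == "__main__":
    # Hypothetical survey: 20 stations over three strata.
    print(adjusted_neyman_allocation(20, [100, 100, 200], [1.0, 3.0, 2.0]))
```

Guaranteeing at least two stations per stratum keeps a within-stratum variance estimate available even when the total sample size is small.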