We propose a new nonparametric test based on the rank difference between the paired sample for testing the equality of the marginal distributions from a bivariate distribution. We also consider a modification of the n...We propose a new nonparametric test based on the rank difference between the paired sample for testing the equality of the marginal distributions from a bivariate distribution. We also consider a modification of the novel nonparametric test based on the test proposed by Baumgartern, Weiβ, and Schindler (1998). An extensive numerical power comparison for various parametric and nonparametric tests was conducted under a wide range of bivariate distributions for small sample sizes. The two new nonparametric tests have comparable power to the paired t test for the data simulated from bivariate normal distributions, and are generally more powerful than the paired t test and other commonly used nonparametric tests in several important bivariate distributions.展开更多
Healthcare decisions are based on scientific evidence obtained from medical studies by gathering data and analyzing it to obtain the best results. When analyzing data, biostatistics is a powerful tool, but healthcare ...Healthcare decisions are based on scientific evidence obtained from medical studies by gathering data and analyzing it to obtain the best results. When analyzing data, biostatistics is a powerful tool, but healthcare professionals lack knowledge in this field. This lack of knowledge can manifest itself in situations such as choosing the wrong statistical test for the right situation or applying a statistical test without checking its assumptions, leading to inaccurate results and misleading conclusions. With the help of this “narrative review”, the aim is to bring biostatistics closer to healthcare professionals by answering certain questions: how to describe the distribution of data? how to assess the normality of data? how to transform data? and how to choose between nonparametric and parametric tests? Through this work, our hope is that the reader will be able to choose the right test for the right situation, in order to obtain the most accurate results.展开更多
Choosing appropriate statistical tests is crucial but deciding which tests to use can be challenging. Different tests suit different types of data and research questions, so it is important to choose the right one. Kn...Choosing appropriate statistical tests is crucial but deciding which tests to use can be challenging. Different tests suit different types of data and research questions, so it is important to choose the right one. Knowing how to select an appropriate test can lead to more accurate results. Invalid results and misleading conclusions may be drawn from a study if an incorrect statistical test is used. Therefore, to avoid these it is essential to understand the nature of the data, the research question, and the assumptions of the tests before selecting one. This is because there are a wide variety of tests available. This paper provides a step-by-step approach to selecting the right statistical test for any study, with an explanation of when it is appropriate to use it and relevant examples of each statistical test. Furthermore, this guide provides a comprehensive overview of the assumptions of each test and what to do if these assumptions are violated.展开更多
Choosing appropriate statistical tests is crucial but deciding which tests to use can be challenging. Different tests suit different types of data and research questions, so it is important to choose the right one. Kn...Choosing appropriate statistical tests is crucial but deciding which tests to use can be challenging. Different tests suit different types of data and research questions, so it is important to choose the right one. Knowing how to select an appropriate test can lead to more accurate results. Invalid results and misleading conclusions may be drawn from a study if an incorrect statistical test is used. Therefore, to avoid these it is essential to understand the nature of the data, the research question, and the assumptions of the tests before selecting one. This is because there are a wide variety of tests available. This paper provides a step-by-step approach to selecting the right statistical test for any study, with an explanation of when it is appropriate to use it and relevant examples of each statistical test. Furthermore, this guide provides a comprehensive overview of the assumptions of each test and what to do if these assumptions are violated.展开更多
Nonparametric time-of-arrival(TOA) estimators for impulse radio ultra-wideband(IR-UWB) signals are proposed. Nonparametric detection is obviously useful in situations where detailed information about the statistic...Nonparametric time-of-arrival(TOA) estimators for impulse radio ultra-wideband(IR-UWB) signals are proposed. Nonparametric detection is obviously useful in situations where detailed information about the statistics of the noise is unavailable or not accurate. Such TOA estimators are obtained based on conditional statistical tests with only a symmetry distribution assumption on the noise probability density function. The nonparametric estimators are attractive choices for low-resolution IR-UWB digital receivers which can be implemented by fast comparators or high sampling rate low resolution analog-to-digital converters(ADCs),in place of high sampling rate high resolution ADCs which may not be available in practice. Simulation results demonstrate that nonparametric TOA estimators provide more effective and robust performance than typical energy detection(ED) based estimators.展开更多
This study presents the results of a Monte Carlo simulation to compare the statistical power of Siegel-Tukey and Savage tests.The main purpose of the study is to evaluate the statistical power of both tests in scenari...This study presents the results of a Monte Carlo simulation to compare the statistical power of Siegel-Tukey and Savage tests.The main purpose of the study is to evaluate the statistical power of both tests in scenarios involving Normal,Platykurtic and Skewed distributions over different sample sizes and standard deviation values.In the study,standard deviation ratios were set as 2,3,4,1/2,1/3 and 1/4 and power comparisons were made between small and large sample sizes.For equal sample sizes,small sample sizes of 5,8,10,12,16 and 20 and large sample sizes of 25,50,75 and 100 were used.For different sample sizes,the combinations of(4,16),(8,16),(10,20),(16,4),(16,8)and(20,10)small sample sizes and(10,30),(30,10),(50,75),(50,100),(75,50),(75,100),(100,50)and(100,75)large sample sizes were examined in detail.According to the findings,the power analysis under variance heterogeneity conditions shows that the Siegel-Tukey test has a higher statistical power than the other nonparametric Savage test at small and large sample sizes.In particular,the Siegel-Tukey test was reported to offer higher precision and power under variance heterogeneity,regardless of having equal or different sample sizes.展开更多
Based on monthly runoff and climate datasets spanning 2000–2024,this study employed the Theil–Sen’s slope estimation,Mann–Kendall(M–K)trend test,as well as Pearson correlation and Spearman rank correlation analys...Based on monthly runoff and climate datasets spanning 2000–2024,this study employed the Theil–Sen’s slope estimation,Mann–Kendall(M–K)trend test,as well as Pearson correlation and Spearman rank correlation analyses to systematically examine the spatiotemporal patterns of runoff and its climatic driving mechanisms across Tajikistan,providing a scientific basis for sustainable water resource utilization and management in the study area.Results indicated that during 2000–2024,the annual runoff in Tajikistan exhibited statistically non-significant long-term trend(P=0.76),while displaying pronounced seasonal variability and strong spatial heterogeneity.Spring and summer average runoff primarily exhibited slight declining tendencies,while winter average runoff exhibited pronounced reduction in localized regions,such as the Syr Darya Basin,the Vakhsh River Basin,and the lower reaches of the Zeravshan River Basin.Precipitation emerged as the dominant positive driver of runoff,exhibiting moderate to strong positive correlations across over 78.00%of the country,whereas potential evapotranspiration consistently functioned as a negative driver.Rising temperatures exerted a dual competitive effect on runoff:in high-elevation,glacier-covered regions,rising temperatures temporarily increased runoff by accelerating glacier melt;however,at the national scale,the negative impact of rising temperature on runoff has played a slightly dominant role to a certain extent by enhancing evapotranspiration.Collectively,these results indicated that the present stability of runoff in Tajikistan is strongly dependent on the short-term compensatory effects of glacier melt and the risk of future runoff decline is likely to intensify as glacier reserves continue to diminish.This study provides a critical scientific evidence to inform sustainable water resource management in Tajikistan and underscores the need for glacier conservation and integrated water resource management strategies.展开更多
We present a new nonparametric predictive inference(NPI)method using a power-normal model for accelerated life testing(ALT).Combined with the accelerating link function and imprecise probability theory,the proposed me...We present a new nonparametric predictive inference(NPI)method using a power-normal model for accelerated life testing(ALT).Combined with the accelerating link function and imprecise probability theory,the proposed method is a feasible way to predict the life of the product using ALT failure data.To validate the method,we run a series of simulations and conduct accelerated life tests with real products.The NPI lower and upper survival functions show the robustness of our method for life prediction.This is a continuous research,and some progresses have been made by updating the link function between different stress levels.We also explain how to renew and apply our model.Moreover,discussions have been made about the performance.展开更多
In large sample studies where distributions may be skewed and not readily transformed to symmetry, it may be of greater interest to compare different distributions in terms of percentiles rather than means. For exampl...In large sample studies where distributions may be skewed and not readily transformed to symmetry, it may be of greater interest to compare different distributions in terms of percentiles rather than means. For example, it may be more informative to compare two or more populations with respect to their within population distributions by testing the hypothesis that their corresponding respective 10th, 50th, and 90th percentiles are equal. As a generalization of the median test, the proposed test statistic is asymptotically distributed as Chi-square with degrees of freedom dependent upon the number of percentiles tested and constraints of the null hypothesis. Results from simulation studies are used to validate the nominal 0.05 significance level under the null hypothesis, and asymptotic power properties that are suitable for testing equality of percentile profiles against selected profile discrepancies for a variety of underlying distributions. A pragmatic example is provided to illustrate the comparison of the percentile profiles for four body mass index distributions.展开更多
Zero-inflated distributions are common in statistical problems where there is interest in testing homogeneity of two or more independent groups. Often, the underlying distribution that has an inflated number of zero-v...Zero-inflated distributions are common in statistical problems where there is interest in testing homogeneity of two or more independent groups. Often, the underlying distribution that has an inflated number of zero-valued observations is asymmetric, and its functional form may not be known or easily characterized. In this case, comparisons of the groups in terms of their respective percentiles may be appropriate as these estimates are nonparametric and more robust to outliers and other irregularities. The median test is often used to compare distributions with similar but asymmetric shapes but may be uninformative when there are excess zeros or dissimilar shapes. For zero-inflated distributions, it is useful to compare the distributions with respect to their proportion of zeros, coupled with the comparison of percentile profiles for the observed non-zero values. A simple chi-square test for simultaneous testing of these two components is proposed, applicable to both continuous and discrete data. Results of simulation studies are reported to summarize empirical power under several scenarios. We give recommendations for the minimum sample size which is necessary to achieve suitable test performance in specific examples.展开更多
This paper, comparison of two sample tests, is motivated by the fact that in the test of significant difference between two independent samples, numerous methods can be adopted;each may lead to significant different r...This paper, comparison of two sample tests, is motivated by the fact that in the test of significant difference between two independent samples, numerous methods can be adopted;each may lead to significant different results;this implies that wrong choice of test statistic could lead to erroneous conclusion. To prevent misleading information, there is a need for proper investigation of some selected methods for test of significant difference between variables/subjects most especially, independent samples. The paper examines the efficiency and sensitivity of four test statistics to ascertain which test performs better. Based on the results, the relative efficiency favours median test as being more efficient than modified median test for both symmetric and asymmetric distributions. In terms of power of test, median test is more sensitive than Modified Median (MMED) test since it has higher power irrespective of the sample sizes for both symmetric and asymmetric distribution. In terms of relative efficiency for asymmetric distribution Modified Mann-Whitney U test is more efficient than Mann-Whitney U test (MMWU), and then for symmetric distribution, Mann-Whitney U test (MMWU) is more efficient than Modified Mann-Whitney in sample size of 5;but for other sample sizes considered Modified Mann-Whitney U test (MMWU) is better than Mann-Whitney. Using power of test for both symmetric and asymmetric distributions, Mann-Whitney is more sensitive than Modified Mann-Whitney U test (MMWU) because it has higher power.展开更多
Testing the equality of percentiles (quantiles) between populations is an effective method for robust, nonparametric comparison, especially when the distributions are asymmetric or irregularly shaped. Unlike global no...Testing the equality of percentiles (quantiles) between populations is an effective method for robust, nonparametric comparison, especially when the distributions are asymmetric or irregularly shaped. Unlike global nonparametric tests for homogeneity such as the Kolmogorv-Smirnov test, testing the equality of a set of percentiles (i.e., a percentile profile) yields an estimate of the location and extent of the differences between the populations along the entire domain. The Wald test using bootstrap estimates of variance of the order statistics provides a unified method for hypothesis testing of functions of the population percentiles. Simulation studies are conducted to show performance of the method under various scenarios and to give suggestions on its use. Several examples are given to illustrate some useful applications to real data.展开更多
The objectives of this paper are to demonstrate the algorithms employed by three statistical software programs (R, Real Statistics using Excel, and SPSS) for calculating the exact two-tailed probability of the Wald-Wo...The objectives of this paper are to demonstrate the algorithms employed by three statistical software programs (R, Real Statistics using Excel, and SPSS) for calculating the exact two-tailed probability of the Wald-Wolfowitz one-sample runs test for randomness, to present a novel approach for computing this probability, and to compare the four procedures by generating samples of 10 and 11 data points, varying the parameters n<sub>0</sub> (number of zeros) and n<sub>1</sub> (number of ones), as well as the number of runs. Fifty-nine samples are created to replicate the behavior of the distribution of the number of runs with 10 and 11 data points. The exact two-tailed probabilities for the four procedures were compared using Friedman’s test. Given the significant difference in central tendency, post-hoc comparisons were conducted using Conover’s test with Benjamini-Yekutielli correction. It is concluded that the procedures of Real Statistics using Excel and R exhibit some inadequacies in the calculation of the exact two-tailed probability, whereas the new proposal and the SPSS procedure are deemed more suitable. The proposed robust algorithm has a more transparent rationale than the SPSS one, albeit being somewhat more conservative. We recommend its implementation for this test and its application to others, such as the binomial and sign test.展开更多
Normality testing is a fundamental hypothesis test in the statistical analysis of key biological indicators of diabetes.If this assumption is violated,it may cause the test results to deviate from the true value,leadi...Normality testing is a fundamental hypothesis test in the statistical analysis of key biological indicators of diabetes.If this assumption is violated,it may cause the test results to deviate from the true value,leading to incorrect inferences and conclusions,and ultimately affecting the validity and accuracy of statistical inferences.Considering this,the study designs a unified analysis scheme for different data types based on parametric statistical test methods and non-parametric test methods.The data were grouped according to sample type and divided into discrete data and continuous data.To account for differences among subgroups,the conventional chi-squared test was used for discrete data.The normal distribution is the basis of many statistical methods;if the data does not follow a normal distribution,many statistical methods will fail or produce incorrect results.Therefore,before data analysis and modeling,the data were divided into normal and non-normal groups through normality testing.For normally distributed data,parametric statistical methods were used to judge the differences between groups.For non-normal data,non-parametric tests were employed to improve the accuracy of the analysis.Statistically significant indicators were retained according to the significance index P-value of the statistical test or corresponding statistics.These indicators were then combined with relevant medical background to further explore the etiology leading to the occurrence or transformation of diabetes status.展开更多
Sanjiang Plain,located in the northeast of Heilongjiang Province,is one of the important grain producing areas in Heilongjiang Province,which has made great contributions to China's grain industry.However,the rapi...Sanjiang Plain,located in the northeast of Heilongjiang Province,is one of the important grain producing areas in Heilongjiang Province,which has made great contributions to China's grain industry.However,the rapid development of agriculture has increased the demand for water resources,and the water cycle process has been damaged,which has caused a series of problems,and the adverse effects have aroused great concern from all sides.In order to study the evolution law of precipitation and runoff in the lower Songhua River basin of Sanjiang Plain,the monthly precipitation data of Fujin,Jiamusi and Yilan meteorological stations and the measured runoff data of Jiamusi and Changjiangtun hydrological stations of the Songhua River from 1956 to 2011 were used.Mann-Kendall method,Hurst index method,sliding T test and wavelet analysis were used to analyze the trend,mutability and periodicity of precipitation and runoff evolution.The results show that the annual precipitation and runoff of the lower Songhua River basin in Sanjiang Plain showed a downward trend in the past 56 years(1956-2011);the mutation time of precipitation at Fujin Station,Jiamusi Station and Yilan Station was 1987,1959 and 2007,1973,respectively,the mutation time of runoff at Jiamusi Station and Changjiangtun Station appeared in 1967 and 1988;the first main periods of the annual precipitation wavelets are 42 years,54 years and56 years,respectively,and the periods are about 28-31 years,36-39 years and 36-38 years,the annual runoff of Jiamusi Station has the first main period of 57 years and the period is about 37-40 years,and the annual runoff of Changjiangtun Station has the first main period of 35 years and the period is about 20-24 years;the freeze-thaw process has significant influence on runoff in the study area.The results of this study have practical significance for rational planning and utilization of surface water resources and joint operation of surface water and groundwater in Sanjiang Plain.展开更多
We employ the Multi-Scale Log-Periodic Power Law Singularity Confidence Indicator(MS-LPPLS-CI)approach to identify positive and negative bubbles in the short-,medium,and long-term for the Indian stock market,using wee...We employ the Multi-Scale Log-Periodic Power Law Singularity Confidence Indicator(MS-LPPLS-CI)approach to identify positive and negative bubbles in the short-,medium,and long-term for the Indian stock market,using weekly data from November 2003 to December 2020.We use a nonparametric causality-in-quantiles approach to analyze the predictive impact of monetary policy shocks on bubble indicators.We find,in general,strong evidence of predictability across the entire conditional distribution for the two monetary policy shock factors,with stronger impacts for negative bubbles.Our findings have critical implications for the Reserve Bank of India,academics,and investors.展开更多
For high-dimensional nonparametric Behrens-Fisher problem in which the data dimension is larger than the sample size,the authors propose two test statistics in which one is U-statistic Rankbased Test(URT)and another i...For high-dimensional nonparametric Behrens-Fisher problem in which the data dimension is larger than the sample size,the authors propose two test statistics in which one is U-statistic Rankbased Test(URT)and another is Cauchy Combination Test(CCT).CCT is analogous to the maximumtype test,while URT takes into account the sum of squares of differences of ranked samples in different dimensions,which is free of shapes of distributions and robust to outliers.The asymptotic distribution of URT is derived and the closed form for calculating the statistical significance of CCT is given.Extensive simulation studies are conducted to evaluate the finite sample power performance of the statistics by comparing with the existing method.The simulation results show that our URT is robust and powerful method,meanwhile,its practicability and effectiveness can be illustrated by an application to the gene expression data.展开更多
This paper is focused on the goodness-of-fit test of the functional linear composite quantile regression model.A nonparametric test is proposed by using the orthogonality of the residual and its conditional expectatio...This paper is focused on the goodness-of-fit test of the functional linear composite quantile regression model.A nonparametric test is proposed by using the orthogonality of the residual and its conditional expectation under the null model.The proposed test statistic has an asymptotic standard normal distribution under the null hypothesis,and tends to infinity in probability under the alternative hypothesis,which implies the consistency of the test.Furthermore,it is proved that the test statistic converges to a normal distribution with nonzero mean under a local alternative hypothesis.Extensive simulations are reported,and the results show that the proposed test has proper sizes and is sensitive to the considered model discrepancies.The proposed methods are also applied to two real datasets.展开更多
In this thesis,we construct test statistic for association test and independence test in high dimension,respectively,and study the corresponding theoretical properties under some regularity conditions.Meanwhile,we pro...In this thesis,we construct test statistic for association test and independence test in high dimension,respectively,and study the corresponding theoretical properties under some regularity conditions.Meanwhile,we propose a nonparametric variable screening procedure for sparse additive model with multivariate response in untra-high dimension and established some screening properties.展开更多
文摘We propose a new nonparametric test based on the rank difference between the paired sample for testing the equality of the marginal distributions from a bivariate distribution. We also consider a modification of the novel nonparametric test based on the test proposed by Baumgartern, Weiβ, and Schindler (1998). An extensive numerical power comparison for various parametric and nonparametric tests was conducted under a wide range of bivariate distributions for small sample sizes. The two new nonparametric tests have comparable power to the paired t test for the data simulated from bivariate normal distributions, and are generally more powerful than the paired t test and other commonly used nonparametric tests in several important bivariate distributions.
文摘Healthcare decisions are based on scientific evidence obtained from medical studies by gathering data and analyzing it to obtain the best results. When analyzing data, biostatistics is a powerful tool, but healthcare professionals lack knowledge in this field. This lack of knowledge can manifest itself in situations such as choosing the wrong statistical test for the right situation or applying a statistical test without checking its assumptions, leading to inaccurate results and misleading conclusions. With the help of this “narrative review”, the aim is to bring biostatistics closer to healthcare professionals by answering certain questions: how to describe the distribution of data? how to assess the normality of data? how to transform data? and how to choose between nonparametric and parametric tests? Through this work, our hope is that the reader will be able to choose the right test for the right situation, in order to obtain the most accurate results.
文摘Choosing appropriate statistical tests is crucial but deciding which tests to use can be challenging. Different tests suit different types of data and research questions, so it is important to choose the right one. Knowing how to select an appropriate test can lead to more accurate results. Invalid results and misleading conclusions may be drawn from a study if an incorrect statistical test is used. Therefore, to avoid these it is essential to understand the nature of the data, the research question, and the assumptions of the tests before selecting one. This is because there are a wide variety of tests available. This paper provides a step-by-step approach to selecting the right statistical test for any study, with an explanation of when it is appropriate to use it and relevant examples of each statistical test. Furthermore, this guide provides a comprehensive overview of the assumptions of each test and what to do if these assumptions are violated.
文摘Choosing appropriate statistical tests is crucial but deciding which tests to use can be challenging. Different tests suit different types of data and research questions, so it is important to choose the right one. Knowing how to select an appropriate test can lead to more accurate results. Invalid results and misleading conclusions may be drawn from a study if an incorrect statistical test is used. Therefore, to avoid these it is essential to understand the nature of the data, the research question, and the assumptions of the tests before selecting one. This is because there are a wide variety of tests available. This paper provides a step-by-step approach to selecting the right statistical test for any study, with an explanation of when it is appropriate to use it and relevant examples of each statistical test. Furthermore, this guide provides a comprehensive overview of the assumptions of each test and what to do if these assumptions are violated.
基金supported by the National High Technology Research and Development Program of China(863 Program)(2009AA011204)
文摘Nonparametric time-of-arrival(TOA) estimators for impulse radio ultra-wideband(IR-UWB) signals are proposed. Nonparametric detection is obviously useful in situations where detailed information about the statistics of the noise is unavailable or not accurate. Such TOA estimators are obtained based on conditional statistical tests with only a symmetry distribution assumption on the noise probability density function. The nonparametric estimators are attractive choices for low-resolution IR-UWB digital receivers which can be implemented by fast comparators or high sampling rate low resolution analog-to-digital converters(ADCs),in place of high sampling rate high resolution ADCs which may not be available in practice. Simulation results demonstrate that nonparametric TOA estimators provide more effective and robust performance than typical energy detection(ED) based estimators.
文摘This study presents the results of a Monte Carlo simulation to compare the statistical power of Siegel-Tukey and Savage tests.The main purpose of the study is to evaluate the statistical power of both tests in scenarios involving Normal,Platykurtic and Skewed distributions over different sample sizes and standard deviation values.In the study,standard deviation ratios were set as 2,3,4,1/2,1/3 and 1/4 and power comparisons were made between small and large sample sizes.For equal sample sizes,small sample sizes of 5,8,10,12,16 and 20 and large sample sizes of 25,50,75 and 100 were used.For different sample sizes,the combinations of(4,16),(8,16),(10,20),(16,4),(16,8)and(20,10)small sample sizes and(10,30),(30,10),(50,75),(50,100),(75,50),(75,100),(100,50)and(100,75)large sample sizes were examined in detail.According to the findings,the power analysis under variance heterogeneity conditions shows that the Siegel-Tukey test has a higher statistical power than the other nonparametric Savage test at small and large sample sizes.In particular,the Siegel-Tukey test was reported to offer higher precision and power under variance heterogeneity,regardless of having equal or different sample sizes.
基金funded by the Strategic Priority Research Program of the Chinese Academy of Sciences(XDB0720203)the National Key Research and Development Program of China(2023YFF0805603).
文摘Based on monthly runoff and climate datasets spanning 2000–2024,this study employed the Theil–Sen’s slope estimation,Mann–Kendall(M–K)trend test,as well as Pearson correlation and Spearman rank correlation analyses to systematically examine the spatiotemporal patterns of runoff and its climatic driving mechanisms across Tajikistan,providing a scientific basis for sustainable water resource utilization and management in the study area.Results indicated that during 2000–2024,the annual runoff in Tajikistan exhibited statistically non-significant long-term trend(P=0.76),while displaying pronounced seasonal variability and strong spatial heterogeneity.Spring and summer average runoff primarily exhibited slight declining tendencies,while winter average runoff exhibited pronounced reduction in localized regions,such as the Syr Darya Basin,the Vakhsh River Basin,and the lower reaches of the Zeravshan River Basin.Precipitation emerged as the dominant positive driver of runoff,exhibiting moderate to strong positive correlations across over 78.00%of the country,whereas potential evapotranspiration consistently functioned as a negative driver.Rising temperatures exerted a dual competitive effect on runoff:in high-elevation,glacier-covered regions,rising temperatures temporarily increased runoff by accelerating glacier melt;however,at the national scale,the negative impact of rising temperature on runoff has played a slightly dominant role to a certain extent by enhancing evapotranspiration.Collectively,these results indicated that the present stability of runoff in Tajikistan is strongly dependent on the short-term compensatory effects of glacier melt and the risk of future runoff decline is likely to intensify as glacier reserves continue to diminish.This study provides a critical scientific evidence to inform sustainable water resource management in Tajikistan and underscores the need for glacier conservation and integrated water resource management strategies.
基金the National Natural Science Foundation of China(No.11272082)the China Scholarship Council State Scholarship Fund(No.201506070017)
文摘We present a new nonparametric predictive inference(NPI)method using a power-normal model for accelerated life testing(ALT).Combined with the accelerating link function and imprecise probability theory,the proposed method is a feasible way to predict the life of the product using ALT failure data.To validate the method,we run a series of simulations and conduct accelerated life tests with real products.The NPI lower and upper survival functions show the robustness of our method for life prediction.This is a continuous research,and some progresses have been made by updating the link function between different stress levels.We also explain how to renew and apply our model.Moreover,discussions have been made about the performance.
文摘In large sample studies where distributions may be skewed and not readily transformed to symmetry, it may be of greater interest to compare different distributions in terms of percentiles rather than means. For example, it may be more informative to compare two or more populations with respect to their within population distributions by testing the hypothesis that their corresponding respective 10th, 50th, and 90th percentiles are equal. As a generalization of the median test, the proposed test statistic is asymptotically distributed as Chi-square with degrees of freedom dependent upon the number of percentiles tested and constraints of the null hypothesis. Results from simulation studies are used to validate the nominal 0.05 significance level under the null hypothesis, and asymptotic power properties that are suitable for testing equality of percentile profiles against selected profile discrepancies for a variety of underlying distributions. A pragmatic example is provided to illustrate the comparison of the percentile profiles for four body mass index distributions.
文摘Zero-inflated distributions are common in statistical problems where there is interest in testing homogeneity of two or more independent groups. Often, the underlying distribution that has an inflated number of zero-valued observations is asymmetric, and its functional form may not be known or easily characterized. In this case, comparisons of the groups in terms of their respective percentiles may be appropriate as these estimates are nonparametric and more robust to outliers and other irregularities. The median test is often used to compare distributions with similar but asymmetric shapes but may be uninformative when there are excess zeros or dissimilar shapes. For zero-inflated distributions, it is useful to compare the distributions with respect to their proportion of zeros, coupled with the comparison of percentile profiles for the observed non-zero values. A simple chi-square test for simultaneous testing of these two components is proposed, applicable to both continuous and discrete data. Results of simulation studies are reported to summarize empirical power under several scenarios. We give recommendations for the minimum sample size which is necessary to achieve suitable test performance in specific examples.
文摘This paper, comparison of two sample tests, is motivated by the fact that in the test of significant difference between two independent samples, numerous methods can be adopted;each may lead to significant different results;this implies that wrong choice of test statistic could lead to erroneous conclusion. To prevent misleading information, there is a need for proper investigation of some selected methods for test of significant difference between variables/subjects most especially, independent samples. The paper examines the efficiency and sensitivity of four test statistics to ascertain which test performs better. Based on the results, the relative efficiency favours median test as being more efficient than modified median test for both symmetric and asymmetric distributions. In terms of power of test, median test is more sensitive than Modified Median (MMED) test since it has higher power irrespective of the sample sizes for both symmetric and asymmetric distribution. In terms of relative efficiency for asymmetric distribution Modified Mann-Whitney U test is more efficient than Mann-Whitney U test (MMWU), and then for symmetric distribution, Mann-Whitney U test (MMWU) is more efficient than Modified Mann-Whitney in sample size of 5;but for other sample sizes considered Modified Mann-Whitney U test (MMWU) is better than Mann-Whitney. Using power of test for both symmetric and asymmetric distributions, Mann-Whitney is more sensitive than Modified Mann-Whitney U test (MMWU) because it has higher power.
文摘Testing the equality of percentiles (quantiles) between populations is an effective method for robust, nonparametric comparison, especially when the distributions are asymmetric or irregularly shaped. Unlike global nonparametric tests for homogeneity such as the Kolmogorv-Smirnov test, testing the equality of a set of percentiles (i.e., a percentile profile) yields an estimate of the location and extent of the differences between the populations along the entire domain. The Wald test using bootstrap estimates of variance of the order statistics provides a unified method for hypothesis testing of functions of the population percentiles. Simulation studies are conducted to show performance of the method under various scenarios and to give suggestions on its use. Several examples are given to illustrate some useful applications to real data.
文摘The objectives of this paper are to demonstrate the algorithms employed by three statistical software programs (R, Real Statistics using Excel, and SPSS) for calculating the exact two-tailed probability of the Wald-Wolfowitz one-sample runs test for randomness, to present a novel approach for computing this probability, and to compare the four procedures by generating samples of 10 and 11 data points, varying the parameters n<sub>0</sub> (number of zeros) and n<sub>1</sub> (number of ones), as well as the number of runs. Fifty-nine samples are created to replicate the behavior of the distribution of the number of runs with 10 and 11 data points. The exact two-tailed probabilities for the four procedures were compared using Friedman’s test. Given the significant difference in central tendency, post-hoc comparisons were conducted using Conover’s test with Benjamini-Yekutielli correction. It is concluded that the procedures of Real Statistics using Excel and R exhibit some inadequacies in the calculation of the exact two-tailed probability, whereas the new proposal and the SPSS procedure are deemed more suitable. The proposed robust algorithm has a more transparent rationale than the SPSS one, albeit being somewhat more conservative. We recommend its implementation for this test and its application to others, such as the binomial and sign test.
基金National Natural Science Foundation of China(No.12271261)Postgraduate Research and Practice Innovation Program of Jiangsu Province,China(Grant No.SJCX230368)。
文摘Normality testing is a fundamental hypothesis test in the statistical analysis of key biological indicators of diabetes.If this assumption is violated,it may cause the test results to deviate from the true value,leading to incorrect inferences and conclusions,and ultimately affecting the validity and accuracy of statistical inferences.Considering this,the study designs a unified analysis scheme for different data types based on parametric statistical test methods and non-parametric test methods.The data were grouped according to sample type and divided into discrete data and continuous data.To account for differences among subgroups,the conventional chi-squared test was used for discrete data.The normal distribution is the basis of many statistical methods;if the data does not follow a normal distribution,many statistical methods will fail or produce incorrect results.Therefore,before data analysis and modeling,the data were divided into normal and non-normal groups through normality testing.For normally distributed data,parametric statistical methods were used to judge the differences between groups.For non-normal data,non-parametric tests were employed to improve the accuracy of the analysis.Statistically significant indicators were retained according to the significance index P-value of the statistical test or corresponding statistics.These indicators were then combined with relevant medical background to further explore the etiology leading to the occurrence or transformation of diabetes status.
基金funded by Northeast Institute of Geography and Agroecology,Chinese Academy of Sciences commissioned the project Research and Application of Key Technologies of Water Resources Allocation in Ten Million mu Irrigated Area of Sanjiang Plain,grant number XDA28100105。
文摘Sanjiang Plain,located in the northeast of Heilongjiang Province,is one of the important grain producing areas in Heilongjiang Province,which has made great contributions to China's grain industry.However,the rapid development of agriculture has increased the demand for water resources,and the water cycle process has been damaged,which has caused a series of problems,and the adverse effects have aroused great concern from all sides.In order to study the evolution law of precipitation and runoff in the lower Songhua River basin of Sanjiang Plain,the monthly precipitation data of Fujin,Jiamusi and Yilan meteorological stations and the measured runoff data of Jiamusi and Changjiangtun hydrological stations of the Songhua River from 1956 to 2011 were used.Mann-Kendall method,Hurst index method,sliding T test and wavelet analysis were used to analyze the trend,mutability and periodicity of precipitation and runoff evolution.The results show that the annual precipitation and runoff of the lower Songhua River basin in Sanjiang Plain showed a downward trend in the past 56 years(1956-2011);the mutation time of precipitation at Fujin Station,Jiamusi Station and Yilan Station was 1987,1959 and 2007,1973,respectively,the mutation time of runoff at Jiamusi Station and Changjiangtun Station appeared in 1967 and 1988;the first main periods of the annual precipitation wavelets are 42 years,54 years and56 years,respectively,and the periods are about 28-31 years,36-39 years and 36-38 years,the annual runoff of Jiamusi Station has the first main period of 57 years and the period is about 37-40 years,and the annual runoff of Changjiangtun Station has the first main period of 35 years and the period is about 20-24 years;the freeze-thaw process has significant influence on runoff in the study area.The results of this study have practical significance for rational planning and utilization of surface water resources and joint operation of surface water and groundwater in Sanjiang Plain.
文摘We employ the Multi-Scale Log-Periodic Power Law Singularity Confidence Indicator(MS-LPPLS-CI)approach to identify positive and negative bubbles in the short-,medium,and long-term for the Indian stock market,using weekly data from November 2003 to December 2020.We use a nonparametric causality-in-quantiles approach to analyze the predictive impact of monetary policy shocks on bubble indicators.We find,in general,strong evidence of predictability across the entire conditional distribution for the two monetary policy shock factors,with stronger impacts for negative bubbles.Our findings have critical implications for the Reserve Bank of India,academics,and investors.
基金supported by Beijing Natural Science Foundation under Grant No.Z180006the National Nature Science Foundation of China under Grant No.11722113。
文摘For high-dimensional nonparametric Behrens-Fisher problem in which the data dimension is larger than the sample size,the authors propose two test statistics in which one is U-statistic Rankbased Test(URT)and another is Cauchy Combination Test(CCT).CCT is analogous to the maximumtype test,while URT takes into account the sum of squares of differences of ranked samples in different dimensions,which is free of shapes of distributions and robust to outliers.The asymptotic distribution of URT is derived and the closed form for calculating the statistical significance of CCT is given.Extensive simulation studies are conducted to evaluate the finite sample power performance of the statistics by comparing with the existing method.The simulation results show that our URT is robust and powerful method,meanwhile,its practicability and effectiveness can be illustrated by an application to the gene expression data.
基金supported by the Natural Science Foundation of China under Grant Nos.11271014 and 11971045。
文摘This paper is focused on the goodness-of-fit test of the functional linear composite quantile regression model.A nonparametric test is proposed by using the orthogonality of the residual and its conditional expectation under the null model.The proposed test statistic has an asymptotic standard normal distribution under the null hypothesis,and tends to infinity in probability under the alternative hypothesis,which implies the consistency of the test.Furthermore,it is proved that the test statistic converges to a normal distribution with nonzero mean under a local alternative hypothesis.Extensive simulations are reported,and the results show that the proposed test has proper sizes and is sensitive to the considered model discrepancies.The proposed methods are also applied to two real datasets.
文摘In this thesis,we construct test statistic for association test and independence test in high dimension,respectively,and study the corresponding theoretical properties under some regularity conditions.Meanwhile,we propose a nonparametric variable screening procedure for sparse additive model with multivariate response in untra-high dimension and established some screening properties.