Abstract: This paper studies the evolutionary mechanism of parameter selection in constructing the weight function for the nearest-neighbour estimate in nonparametric regression. An algorithm is constructed that adaptively evolves a suitable weight function and predicts well at unknown points. Numerical experiments indicate that the method is effective. The work offers a useful discussion of the practicability of nonparametric regression and of the methodology of adaptive model building.
Funding: Supported by the Science Development Foundation of HFUT (041002F).
Abstract: In this paper, we study the strong consistency of the partitioning estimate of a regression function when the samples are identically distributed φ-mixing sequences. Key words: nonparametric regression function; partitioning estimation; strong convergence; φ-mixing sequences.
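To make the object of study concrete, a partitioning estimate averages the responses that fall in the same cell as the query point. The following is a minimal sketch under stated assumptions, not the construction analysed in the paper: the cells are equal-width intervals of width `h` on the real line, empty cells return 0 by convention, and the names `fit_partition_estimate`, `m_hat` and `h` are hypothetical.

```python
import math
from collections import defaultdict


def fit_partition_estimate(xs, ys, h):
    """Return m_hat(x): the mean of y over the width-h cell containing x."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    for x, y in zip(xs, ys):
        cell = math.floor(x / h)  # index of the cell [cell*h, (cell+1)*h)
        sums[cell] += y
        counts[cell] += 1

    def m_hat(x):
        cell = math.floor(x / h)
        if counts.get(cell, 0) == 0:
            return 0.0  # assumed convention for cells with no observations
        return sums[cell] / counts[cell]

    return m_hat


m_hat = fit_partition_estimate([0.1, 0.2, 0.6, 0.9], [1.0, 3.0, 10.0, 20.0], h=0.5)
print(m_hat(0.3), m_hat(0.75), m_hat(1.2))
```

Consistency results of the kind described above concern how such an estimate behaves as the sample grows and the cells shrink; the sketch fixes `h` and says nothing about the dependence structure of the data.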
Funding: Supported by NNSF project (10371059), China, and the Youth Teacher Foundation of Nankai University.
Abstract: This paper introduces a method of bootstrap wavelet estimation in a nonparametric regression model with weakly dependent processes for both fixed and random designs. Asymptotic bounds for the bias and variance of the bootstrap wavelet estimators are given in the fixed design model. Conditional normality of a modified version of the bootstrap wavelet estimators is obtained in the fixed design model. Consistency of the bootstrap wavelet estimator is also proved in the random design model. These results show that the bootstrap wavelet method is valid for the model with weakly dependent processes.
Funding: The Project of Research on Technology and Devices for Traffic Guidance (Vehicle Navigation) System of Beijing Municipal Commission of Science and Technology (No. H030630340320); the Project of Research on the Intelligence Traffic Information Platform of Beijing Education Committee.
Abstract: A K-nearest neighbor (K-NN) based nonparametric regression model was proposed to predict travel speed on Beijing expressways. Using historical traffic data collected from detectors on Beijing expressways, a specifically designed database was developed through data filtering, wavelet analysis and clustering. The relativity-based weighted Euclidean distance was used as the distance metric to identify the K groups of nearest data series. Then, a K-NN nonparametric regression model was built to predict average travel speeds up to 6 min into the future. Several randomly selected travel speed data series, collected from the floating car data (FCD) system, were used to validate the model. The results indicate that, using the FCD, the model can predict average travel speeds with an accuracy above 90%, and hence is feasible and effective.
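The core step described above, finding the K nearest historical series under a weighted Euclidean distance and averaging their outcomes, can be sketched as follows. This is an illustration only: the abstract does not say how the relativity-based weights are computed, so `weights` is simply passed in, and the function names and toy data are hypothetical.

```python
import math


def weighted_distance(a, b, weights):
    """Weighted Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum(w * (ai - bi) ** 2 for w, ai, bi in zip(weights, a, b)))


def knn_predict(history, query, weights, k):
    """Average the targets of the k historical records closest to `query`.

    `history` is a list of (feature_vector, target) pairs.
    """
    ranked = sorted(history, key=lambda item: weighted_distance(item[0], query, weights))
    nearest = ranked[:k]
    return sum(target for _, target in nearest) / len(nearest)


# Toy records: recent speeds (km/h) as features, the following speed as target.
history = [([60, 62], 61), ([30, 28], 27), ([58, 61], 60), ([90, 88], 91)]
print(knn_predict(history, [59, 61], weights=[0.3, 0.7], k=2))
```

An unweighted average of the neighbours is the simplest choice; a distance-weighted average would be an equally plausible reading of the abstract.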
Funding: The National Natural Science Foundation of China (10531030).
Abstract: The importance of detecting heteroscedasticity in regression analysis is widely recognized, because efficient inference for the regression function requires that heteroscedasticity be taken into account. In this paper, a simple test for heteroscedasticity in nonparametric regression is proposed, based on residual analysis. Simulations comparing the proposed test with Dette and Munk's method are conducted to evaluate its performance. The results demonstrate that the proposed method performs quite satisfactorily and is much more powerful than Dette and Munk's method in some cases.
Abstract: A number of statistical tests are proposed for change-point detection in a general nonparametric regression model under mild conditions. New proofs are given for the weak convergence of the underlying processes; these remove the stringent condition of bounded total variation of the regression function and need only second moments. Since many quantities, such as the regression function, the distribution of the covariates and the distribution of the errors, are unspecified, the results are not distribution-free. A weighted bootstrap approach is proposed to approximate the limiting distributions. A simulation study shows good performance for moderate sample sizes.
Abstract: Composite quantile regression should provide an estimation efficiency gain over a single quantile regression. In this paper, we extend composite quantile regression to the nonparametric model with randomly censored data. The asymptotic normality of the proposed estimator is established. The proposed methods are applied to lung cancer data. Extensive simulations are reported, showing that the proposed method works well in practical settings.
Funding: Supported by the Research Teaching Model Curriculum of Anhui University (xjyjkc1407); the Students Innovative Training Project of Anhui University (201310357004, 201410357117, 201410357249); the Quality Improvement Projects for Undergraduate Education of Anhui University (ZLTS2015035).
Abstract: In this paper, using some inequalities for negatively orthant dependent (NOD, for short) random variables and the truncation method for random variables, we investigate the nonparametric regression model. A complete consistency result for the estimator of g(x) is presented.
Funding: Project supported by the National Natural Science Foundation of China.
Abstract: Let Y_i = M(X_i) + e_i, where M(x) = E(Y | X = x) is an unknown real function on B (⊂ R), {(X_i, Y_i)} is a stationary and m(n)-dependent sample from (X, Y), and the residuals {e_i} are independent of {X_i} and have an unknown common density f(x). In [2], a nonparametric estimate f_n(x) of f(x) was proposed on the basis of the residual estimates. In this paper, we further obtain the asymptotic normality and the law of the iterated logarithm for f_n(x) under some suitable conditions. These results, together with those in [2], bring the asymptotic theory for the residual density estimate in nonparametric regression under m(n)-dependent samples to completion.
Abstract: We consider the problem of estimating a function g in a nonparametric regression model when only some of the covariates are measured with error and validation data are available. Without specifying any error model structure between the surrogate and true covariables, we propose an estimator that integrates orthogonal series estimation and the truncated series approximation method. Under general regularity conditions, we obtain the convergence rate of this estimator. Simulations demonstrate the finite-sample properties of the new estimator.
Abstract: In this article we study estimation in the nonparametric regression measurement error model based on validation data. The estimation procedures are based on orthogonal series estimation and truncated series approximation methods, without specifying any structural equation or distributional assumption. The convergence rates of the proposed estimator are derived. An example and simulations show that the method is robust against misspecification of the measurement error model.
Abstract: In this article, we develop estimation approaches for nonparametric multiple regression measurement error models when both independent validation data on the covariables and primary data on the response variable and surrogate covariables are available. An estimator that integrates Fourier series estimation and truncated series approximation methods is derived without any error model structure assumption between the true covariables and the surrogate variables. Most importantly, the proposed methodology can be readily extended to the case in which only some of the covariates are measured with error, with the assistance of validation data. Under mild conditions, we derive the convergence rates of the proposed estimators. The finite-sample properties of the estimators are investigated through simulation studies.
Abstract: The study focuses on imputation for longitudinal survey data, which often have nonignorable nonrespondents. Local linear regression is used to impute the missing values, and the imputed data are then used to estimate the time-dependent finite population means. The asymptotic properties (unbiasedness and consistency) of the proposed estimator are investigated. Different parametric and nonparametric estimators are compared on the basis of the bootstrap standard deviation, mean square error and percentage relative bias. A simulation study is carried out to determine the best-performing estimator of the time-dependent finite population means. The simulation results show that the local linear regression estimator has good properties.
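The local linear fit used for imputation above can be sketched as a kernel-weighted least-squares line whose intercept at the target point is the fitted value. This is a minimal sketch under stated assumptions: a Gaussian kernel, a fixed bandwidth `h`, and a single covariate; it does not model the nonignorable nonresponse mechanism, and the name `local_linear` is hypothetical.

```python
import math


def local_linear(xs, ys, x0, h):
    """Local linear estimate at x0: intercept of a kernel-weighted line fit."""
    s0 = s1 = s2 = t0 = t1 = 0.0
    for x, y in zip(xs, ys):
        d = x - x0
        w = math.exp(-0.5 * (d / h) ** 2)  # Gaussian kernel weight (assumed)
        s0 += w
        s1 += w * d
        s2 += w * d * d
        t0 += w * y
        t1 += w * d * y
    denom = s0 * s2 - s1 * s1
    if denom == 0.0:
        raise ValueError("design is degenerate at this bandwidth")
    # Solve the 2x2 weighted normal equations for the intercept.
    return (s2 * t0 - s1 * t1) / denom


# Observed respondents; the value at x = 2.5 is missing and gets imputed.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
print(local_linear(xs, ys, 2.5, h=1.0))
```

Because the fit is a weighted line, exactly linear data are reproduced regardless of the bandwidth, which is a convenient sanity check.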
Abstract: This article develops a procedure for screening variables, in ultra-high-dimensional settings, based on their predictive significance. This is achieved by ranking the variables according to the variance of their respective marginal regression functions (RV-SIS). We show that, under some mild technical conditions, RV-SIS possesses the sure screening property defined by Fan and Lv (2008). Numerical comparisons suggest that RV-SIS is competitive with other screening procedures and outperforms them in many different model settings.
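The ranking idea above can be illustrated with a rough sketch. For each variable, the marginal regression function is estimated here by averaging the response within equal-count bins of that variable, and the variance of the fitted values serves as the screening score. The binning estimator, the bin count and the function names are assumptions made for illustration; the abstract does not specify the estimator the authors use.

```python
import statistics


def marginal_variance(xj, y, n_bins):
    """Variance of a binned estimate of E[Y | X_j] over the sample."""
    order = sorted(range(len(xj)), key=lambda i: xj[i])
    fitted = [0.0] * len(y)
    n = len(order)
    for b in range(n_bins):
        idx = order[b * n // n_bins:(b + 1) * n // n_bins]
        if not idx:
            continue  # more bins than observations
        bin_mean = sum(y[i] for i in idx) / len(idx)
        for i in idx:
            fitted[i] = bin_mean
    return statistics.pvariance(fitted)


def rank_variables(columns, y, n_bins=4):
    """Return variable indices ordered from highest to lowest score."""
    scores = [marginal_variance(col, y, n_bins) for col in columns]
    return sorted(range(len(columns)), key=lambda j: scores[j], reverse=True)


y = [1, 2, 3, 4, 5, 6, 7, 8]
columns = [[1, 2, 3, 4, 5, 6, 7, 8],   # drives the response
           [5, 1, 8, 2, 7, 3, 6, 4]]   # unrelated ordering
print(rank_variables(columns, y))
```

In a screening application one would keep only the top-ranked indices and pass them to a downstream model.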
Abstract: The Contingent Valuation Method is used to evaluate individual preferences for a change concerning a public non-market resource or property. The objective is to build a nonparametric forecasting model of an individual's Willingness To Pay according to geographical location. Within this framework, a Nadaraya-Watson type estimator is proposed for the regression on the geolocation variable. The specific characteristics of the location variable lead to a more general regression model than the traditional ones. Convergence results are established for the estimator.
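For reference, the classical Nadaraya-Watson estimator is a kernel-weighted average of the observed responses. The sketch below uses a scalar covariate and a Gaussian kernel; both are assumptions, and the geolocation-specific generalisation described above is not reproduced. The name `nadaraya_watson` and the bandwidth `h` are hypothetical.

```python
import math


def nadaraya_watson(xs, ys, x0, h):
    """Kernel-weighted average of ys, with weights centred at x0."""
    weights = [math.exp(-0.5 * ((x - x0) / h) ** 2) for x in xs]
    total = sum(weights)
    if total == 0.0:
        raise ValueError("all kernel weights underflowed; increase h")
    return sum(w * y for w, y in zip(weights, ys)) / total


# Two observations equally far from the query receive equal weight.
print(nadaraya_watson([-1.0, 1.0], [2.0, 4.0], 0.0, h=1.0))
```

A smaller `h` makes the estimate follow nearby observations more closely, while a larger `h` pulls it toward the overall mean of the responses.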
Funding: Supported by the National Natural Science Foundation of China (11426032, 11501005); the Natural Science Foundation of Anhui Province (1408085QA02, 1508085QA01, 1508085J06); the Provincial Natural Science Research Project of Anhui Colleges (KJ2014A010, KJ2014A020, KJ2015A065); the Higher Education Talent Revitalization Project of Anhui Province (2013SQRL005ZD); the Quality Engineering Project of Anhui Province (2015jyxm054, 2015jyxm057); the Students Science Research Training Program of Anhui University (KYXL2014016, KYXL2014013); the Applied Teaching Model Curriculum of Anhui University (XJYYKC1401, ZLTS2015052, ZLTS2015053); the Doctoral Research Start-up Funds Projects of Anhui University.
Abstract: In this paper, we investigate the nonparametric regression model based on ρ-mixing errors that are stochastically dominated by a nonnegative random variable. We obtain the convergence rate in pth mean for the weighted estimator of the unknown function g(x), which yields the convergence rate in probability. Moreover, the nearest neighbor estimator is given as an example, and the convergence rates of this estimator are presented.
Abstract: Defect prediction plays a significant role in improving software quality. Such predictions are used to identify defective modules before testing and to minimize time and cost. Software with defects increases operational costs and ultimately affects customer satisfaction. Numerous approaches exist to predict software defects. However, timely and accurate detection of software bugs remains a major challenge. To improve the timeliness and accuracy of software defect prediction, a novel technique called Nonparametric Statistical feature scaled QuAdratic regressive convolution Deep nEural Network (SQADEN) is introduced. The proposed SQADEN technique comprises two major processes, namely metric (feature) selection and classification. First, SQADEN uses the nonparametric statistical Torgerson–Gower scaling technique to identify the relevant software metrics by measuring similarity with the Dice coefficient. The feature selection process is used to minimize the time complexity of software fault prediction. With the selected metrics, software fault prediction is performed with the help of the Quadratic Censored regressive convolution deep neural network-based classification. The deep learning classifier analyzes the training and testing samples using the contingency correlation coefficient. The softstep activation function is used to provide the final fault prediction results. To minimize the error, the Nelder–Mead method is applied to solve nonlinear least-squares problems. Finally, accurate classification results with minimum error are obtained at the output layer. Experimental evaluation is carried out with different quantitative metrics such as accuracy, precision, recall, F-measure, and time complexity. The results demonstrate the superior performance of the proposed SQADEN technique, which increases accuracy, sensitivity and specificity by 3%, 3%, 2% and 3% and reduces time and space by 13% and 15% when compared with the two state-of-the-art methods.
Funding: Supported by PRFU of the Ministry of Higher Education and Scientific Research, Algeria (MESRS), University of Sciences and Technology Oran Mohamed Boudiaf (USTO-MB), Code: C00L03UN310220230005.
Abstract: In this work, we construct and study a family of robust nonparametric estimators for a regression function based on kernel methods. The data are functional, independent and identically distributed, and are linked to a single-index model. Under general conditions, we establish the pointwise and uniform almost complete convergence, as well as the asymptotic normality, of the estimator. We explicitly derive the asymptotic variance and, as a result, provide confidence bands for the theoretical parameter. A simulation study is conducted to illustrate the proposed methodology.
Funding: Supported by JSPS KAKENHI (Grants 17K06633 and 18K18898).
Abstract: This paper presents a simple nonparametric regression approach to data-driven computing in elasticity. We apply kernel regression to the material data set and formulate a system of nonlinear equations, which is solved to obtain a static equilibrium state of an elastic structure. Preliminary numerical experiments illustrate that, compared with existing methods, the proposed method finds a reasonable solution even if the data points are coarsely distributed in a given material data set.
Abstract: It is well known that nonparametric estimation of the regression function is highly sensitive to the presence of even a small proportion of outliers in the data. To address the problem of atypical observations when the covariates of the nonparametric component are functional, robust estimates for the regression parameter and the regression operator are introduced. The main purpose of the paper is to consider data-driven methods for selecting the number of neighbors, in order to make the proposed procedures fully automatic. We use the k Nearest Neighbors (kNN) procedure to construct the kernel estimator of the proposed robust model. Under some regularity conditions, we state consistency results for kNN functional estimators that are uniform in the number of neighbors (UINN). Furthermore, a simulation study and an empirical application to a real-data analysis of gasoline octane prediction are carried out to illustrate the higher predictive performance and the usefulness of the kNN approach.