In this paper, we establish asymptotically optimal simultaneous confidence bands for the copula function based on the local linear kernel estimator proposed by Chen and Huang [1]. For this, we prove under smoothness c...In this paper, we establish asymptotically optimal simultaneous confidence bands for the copula function based on the local linear kernel estimator proposed by Chen and Huang [1]. For this, we prove under smoothness conditions on the derivatives of the copula a uniform in bandwidth law of the iterated logarithm for the maximal deviation of this estimator from its expectation. We also show that the bias term converges uniformly to zero with a precise rate. The performance of these bands is illustrated by a simulation study. An application based on pseudo-panel data is also provided for modeling the dependence structure of Senegalese households’ expense data in 2001 and 2006.展开更多
Under some mild conditions, we derive the asymptotic normality of the Nadaraya-Watson and local linear estimators of the conditional hazard function for left-truncated and dependent data. The estimators were proposed ...Under some mild conditions, we derive the asymptotic normality of the Nadaraya-Watson and local linear estimators of the conditional hazard function for left-truncated and dependent data. The estimators were proposed by Liang and Ould-Sa?d [1]. The results confirm the guess in Liang and Ould-Sa?d [1].展开更多
A kernel-type estimator of the quantile function Q(p) = inf{t:F(t) ≥ p}, 0 ≤ p ≤ 1, is proposed based on the kernel smoother when the data are subjected to random truncation. The Bahadur-type representations o...A kernel-type estimator of the quantile function Q(p) = inf{t:F(t) ≥ p}, 0 ≤ p ≤ 1, is proposed based on the kernel smoother when the data are subjected to random truncation. The Bahadur-type representations of the kernel smooth estimator are established, and from Bahadur representations the authors can show that this estimator is strongly consistent, asymptotically normal, and weakly convergent.展开更多
In this paper, we define the Weibull kernel and use it to nonparametric estimation of the probability density function (pdf) and the hazard rate function for independent and identically distributed (iid) data. The bia...In this paper, we define the Weibull kernel and use it to nonparametric estimation of the probability density function (pdf) and the hazard rate function for independent and identically distributed (iid) data. The bias, variance and the optimal bandwidth of the proposed estimator are investigated. Moreover, the asymptotic normality of the proposed estimator is investigated. The performance of the proposed estimator is tested using simulation study and real data.展开更多
In this paper we study the estimation of the regression function.We establish a law ofthe iterated logarithm for the random window-width kernel estimator and,as an application,fora nearest neighbor estimator.These res...In this paper we study the estimation of the regression function.We establish a law ofthe iterated logarithm for the random window-width kernel estimator and,as an application,fora nearest neighbor estimator.These results give sharp pointwise rates of strong consistency ofthese estimators.展开更多
In this paper, based on random left truncated and right censored data, the authors derive strong representations of the cumulative hazard function estimator and the product-limit estimator of the survival function. wh...In this paper, based on random left truncated and right censored data, the authors derive strong representations of the cumulative hazard function estimator and the product-limit estimator of the survival function. which are valid up to a given order statistic of the observations. A precise bound for the errors is obtained which only depends on the index of the last order statistic to be included.展开更多
Let {Xn, n≥1} be a strictly stationary sequence of random variables, which are either associated or negatively associated, f(.) be their common density. In this paper, the author shows a central limit theorem for a k...Let {Xn, n≥1} be a strictly stationary sequence of random variables, which are either associated or negatively associated, f(.) be their common density. In this paper, the author shows a central limit theorem for a kernel estimate of f(.) under certain regular conditions.展开更多
Let (X,Y) be an R^d×R^1 valued random vector (X_1,Y_1),…, (X_n,Y_n) be a random sample drawn from (X,Y), and let E|Y|<∞. The regression function m(x)=E(Y|X=x) for x∈R^d is estimated by where, and h_n is a p...Let (X,Y) be an R^d×R^1 valued random vector (X_1,Y_1),…, (X_n,Y_n) be a random sample drawn from (X,Y), and let E|Y|<∞. The regression function m(x)=E(Y|X=x) for x∈R^d is estimated by where, and h_n is a positive number depending upon n only, nad K is a given nonnegative function on R^d. In the paper, we study the L_p convergence rate of kernel estimate m_n(x) of m(x) in suitable condition, and improve and extend the results of Wei Lansheng.展开更多
The study of estimation of conditional extreme quantile in incomplete data frameworks is of growing interest. Specially, the estimation of the extreme value index in a censorship framework has been the purpose of many...The study of estimation of conditional extreme quantile in incomplete data frameworks is of growing interest. Specially, the estimation of the extreme value index in a censorship framework has been the purpose of many inves<span style="font-family:Verdana;">tigations when finite dimension covariate information has been considered. In this paper, the estimation of the conditional extreme quantile of a </span><span style="font-family:Verdana;">heavy-tailed distribution is discussed when some functional random covariate (</span><i><span style="font-family:Verdana;">i.e.</span></i><span style="font-family:Verdana;"> valued in some infinite-dimensional space) information is available and the scalar response variable is right-censored. A Weissman-type estimator of conditional extreme quantiles is proposed and its asymptotic normality is established under mild assumptions. A simulation study is conducted to assess the finite-sample behavior of the proposed estimator and a comparison with two simple estimations strategies is provided.</span>展开更多
There have been vast amount of studies on background modeling to detect moving objects. Two recent reviews[1,2] showed that kernel density estimation(KDE) method and Gaussian mixture model(GMM) perform about equally b...There have been vast amount of studies on background modeling to detect moving objects. Two recent reviews[1,2] showed that kernel density estimation(KDE) method and Gaussian mixture model(GMM) perform about equally best among possible background models. For KDE, the selection of kernel functions and their bandwidths greatly influence the performance. There were few attempts to compare the adequacy of functions for KDE. In this paper, we evaluate the performance of various functions for KDE. Functions tested include almost everyone cited in the literature and a new function, Laplacian of Gaussian(LoG) is also introduced for comparison. All tests were done on real videos with vary-ing background dynamics and results were analyzed both qualitatively and quantitatively. Effect of different bandwidths was also investigated.展开更多
In this paper, the normal approximation rate and the random weighting approximation rate of error distribution of the kernel estimator of conditional density function f(y|x) are studied. The results may be used to...In this paper, the normal approximation rate and the random weighting approximation rate of error distribution of the kernel estimator of conditional density function f(y|x) are studied. The results may be used to construct the confidence interval of f(y|x) .展开更多
The accurate estimation of road traffic states can provide decision making for travelers and traffic managers. In this work,an algorithm based on kernel-k nearest neighbor(KNN) matching of road traffic spatial charact...The accurate estimation of road traffic states can provide decision making for travelers and traffic managers. In this work,an algorithm based on kernel-k nearest neighbor(KNN) matching of road traffic spatial characteristics is presented to estimate road traffic states. Firstly, the representative road traffic state data were extracted to establish the reference sequences of road traffic running characteristics(RSRTRC). Secondly, the spatial road traffic state data sequence was selected and the kernel function was constructed, with which the spatial road traffic data sequence could be mapped into a high dimensional feature space. Thirdly, the referenced and current spatial road traffic data sequences were extracted and the Euclidean distances in the feature space between them were obtained. Finally, the road traffic states were estimated from weighted averages of the selected k road traffic states, which corresponded to the nearest Euclidean distances. Several typical links in Beijing were adopted for case studies. The final results of the experiments show that the accuracy of this algorithm for estimating speed and volume is 95.27% and 91.32% respectively, which prove that this road traffic states estimation approach based on kernel-KNN matching of road traffic spatial characteristics is feasible and can achieve a high accuracy.展开更多
Traditional methods for early warning of dam displacements usually assume that residual displacements follow a normal distribution.This assumption deviates from the reality,thereby affecting the reliability of early w...Traditional methods for early warning of dam displacements usually assume that residual displacements follow a normal distribution.This assumption deviates from the reality,thereby affecting the reliability of early warning results and leading to misjudgments of dam displacement behavior.To solve this problem,this study proposed an early warning method using a non-normal distribution function.A new early warning index was developed using cumulative distribution function(CDF)values.The method of kernel density estimation was used to calculate the CDF values of residual displacements at a single point.The copula function was used to compute the CDF values of residual displacements at multiple points.Numerical results showed that,with residual displacements in a non-normal distribution,the early warning method proposed in this study accurately reflected the dam displacement behavior and effectively reduced the frequency of false alarms.This method is expected to aid in the safe operation of dams.展开更多
Functional statistics is a new technique for dealing with data thatcan be viewed as curves or images. Parallel to this approach, the Near-InfraredReflectance (NIR) spectroscopymethodology has been used in modern chemi...Functional statistics is a new technique for dealing with data thatcan be viewed as curves or images. Parallel to this approach, the Near-InfraredReflectance (NIR) spectroscopymethodology has been used in modern chemistryas a rapid, low-cost, and exact means of assessing an object’s chemicalproperties. In this research, we investigate the quality of corn and cookiedough by analyzing the spectroscopic technique using certain cutting-edgestatistical models. By analyzing spectral data and applying functional modelsto it, we could predict the chemical components of corn and cookie dough.Kernel Functional Classical Estimation (KFCE), Kernel Functional QuantileEstimation (KFQE), Kernel Functional Expectile Estimation (KFEE),Semi-Partial Linear Functional Classical Estimation (SPLFCE), Semi-PartialLinear Functional Quantile Estimation (SPLFQE), and Semi-Partial LinearFunctional Expectile Estimation (SPLFEE) are models used to accuratelyestimate the different quantities present in Corn and Cookie dough. Theselection of these functional models is based on their ability to constructa forecast region with a high level of confidence. We demonstrate that theconsidered models outperform traditional models such as the partial leastsquaresregression and the principal component regression in terms of predictionaccuracy. Furthermore, we show that the proposed models are morerobust than competing models such as SPLFQE and SPLFEE in the sensethat data heterogeneity has no effect on their efficiency.展开更多
Logistic regression is often used to solve linear binary classification problems such as machine vision,speech recognition,and handwriting recognition.However,it usually fails to solve certain nonlinear multi-classifi...Logistic regression is often used to solve linear binary classification problems such as machine vision,speech recognition,and handwriting recognition.However,it usually fails to solve certain nonlinear multi-classification problem,such as problem with non-equilibrium samples.Many scholars have proposed some methods,such as neural network,least square support vector machine,AdaBoost meta-algorithm,etc.These methods essentially belong to machine learning categories.In this work,based on the probability theory and statistical principle,we propose an improved logistic regression algorithm based on kernel density estimation for solving nonlinear multi-classification.We have compared our approach with other methods using non-equilibrium samples,the results show that our approach guarantees sample integrity and achieves superior classification.展开更多
The spherical approximation between two nested reproducing kernels Hilbert spaces generated from different smooth kernels is investigated. It is shown that the functions of a space can be approximated by that of the s...The spherical approximation between two nested reproducing kernels Hilbert spaces generated from different smooth kernels is investigated. It is shown that the functions of a space can be approximated by that of the subspace with better smoothness. Furthermore, the upper bound of approximation error is given.展开更多
令H_n(x)是基于来自密度函数f(x)的容量为n的一个随机样本的hazard函数H(x)=f^(x)/[1-integral from x=∞ to x(f(t)dt)]的一个核形估计.对于非参数hazard估计的积分均方误差integral ((H_n(x)-H(x))~2ω(x)f(x)dx)中心极限定理成立的...令H_n(x)是基于来自密度函数f(x)的容量为n的一个随机样本的hazard函数H(x)=f^(x)/[1-integral from x=∞ to x(f(t)dt)]的一个核形估计.对于非参数hazard估计的积分均方误差integral ((H_n(x)-H(x))~2ω(x)f(x)dx)中心极限定理成立的一个充分条件被给出,这里ω(x)是一个权函数.展开更多
The authors derive laws of the iterated logarithm for kernel estimator of regression function based on directional data. The results are distribution free in the sense that they are true for all distributions of desig...The authors derive laws of the iterated logarithm for kernel estimator of regression function based on directional data. The results are distribution free in the sense that they are true for all distributions of design variable.展开更多
文摘In this paper, we establish asymptotically optimal simultaneous confidence bands for the copula function based on the local linear kernel estimator proposed by Chen and Huang [1]. For this, we prove under smoothness conditions on the derivatives of the copula a uniform in bandwidth law of the iterated logarithm for the maximal deviation of this estimator from its expectation. We also show that the bias term converges uniformly to zero with a precise rate. The performance of these bands is illustrated by a simulation study. An application based on pseudo-panel data is also provided for modeling the dependence structure of Senegalese households’ expense data in 2001 and 2006.
基金supported by National Natural Science Foundation of China(No.11301084)Natural Science Foundation of Fujian Province(No.2014J01010)
文摘Under some mild conditions, we derive the asymptotic normality of the Nadaraya-Watson and local linear estimators of the conditional hazard function for left-truncated and dependent data. The estimators were proposed by Liang and Ould-Sa?d [1]. The results confirm the guess in Liang and Ould-Sa?d [1].
基金Zhou's research was partially supported by the NNSF of China (10471140, 10571169)Wu's research was partially supported by NNSF of China (0571170)
文摘A kernel-type estimator of the quantile function Q(p) = inf{t:F(t) ≥ p}, 0 ≤ p ≤ 1, is proposed based on the kernel smoother when the data are subjected to random truncation. The Bahadur-type representations of the kernel smooth estimator are established, and from Bahadur representations the authors can show that this estimator is strongly consistent, asymptotically normal, and weakly convergent.
文摘In this paper, we define the Weibull kernel and use it to nonparametric estimation of the probability density function (pdf) and the hazard rate function for independent and identically distributed (iid) data. The bias, variance and the optimal bandwidth of the proposed estimator are investigated. Moreover, the asymptotic normality of the proposed estimator is investigated. The performance of the proposed estimator is tested using simulation study and real data.
文摘In this paper we study the estimation of the regression function.We establish a law ofthe iterated logarithm for the random window-width kernel estimator and,as an application,fora nearest neighbor estimator.These results give sharp pointwise rates of strong consistency ofthese estimators.
文摘In this paper, based on random left truncated and right censored data, the authors derive strong representations of the cumulative hazard function estimator and the product-limit estimator of the survival function. which are valid up to a given order statistic of the observations. A precise bound for the errors is obtained which only depends on the index of the last order statistic to be included.
文摘Let {Xn, n≥1} be a strictly stationary sequence of random variables, which are either associated or negatively associated, f(.) be their common density. In this paper, the author shows a central limit theorem for a kernel estimate of f(.) under certain regular conditions.
文摘Let (X,Y) be an R^d×R^1 valued random vector (X_1,Y_1),…, (X_n,Y_n) be a random sample drawn from (X,Y), and let E|Y|<∞. The regression function m(x)=E(Y|X=x) for x∈R^d is estimated by where, and h_n is a positive number depending upon n only, nad K is a given nonnegative function on R^d. In the paper, we study the L_p convergence rate of kernel estimate m_n(x) of m(x) in suitable condition, and improve and extend the results of Wei Lansheng.
文摘The study of estimation of conditional extreme quantile in incomplete data frameworks is of growing interest. Specially, the estimation of the extreme value index in a censorship framework has been the purpose of many inves<span style="font-family:Verdana;">tigations when finite dimension covariate information has been considered. In this paper, the estimation of the conditional extreme quantile of a </span><span style="font-family:Verdana;">heavy-tailed distribution is discussed when some functional random covariate (</span><i><span style="font-family:Verdana;">i.e.</span></i><span style="font-family:Verdana;"> valued in some infinite-dimensional space) information is available and the scalar response variable is right-censored. A Weissman-type estimator of conditional extreme quantiles is proposed and its asymptotic normality is established under mild assumptions. A simulation study is conducted to assess the finite-sample behavior of the proposed estimator and a comparison with two simple estimations strategies is provided.</span>
文摘There have been vast amount of studies on background modeling to detect moving objects. Two recent reviews[1,2] showed that kernel density estimation(KDE) method and Gaussian mixture model(GMM) perform about equally best among possible background models. For KDE, the selection of kernel functions and their bandwidths greatly influence the performance. There were few attempts to compare the adequacy of functions for KDE. In this paper, we evaluate the performance of various functions for KDE. Functions tested include almost everyone cited in the literature and a new function, Laplacian of Gaussian(LoG) is also introduced for comparison. All tests were done on real videos with vary-ing background dynamics and results were analyzed both qualitatively and quantitatively. Effect of different bandwidths was also investigated.
基金Supported by Natural Science Foundation of Beijing City and National Natural Science Foundation ofChina(2 2 30 4 1 0 0 1 30 1
文摘In this paper, the normal approximation rate and the random weighting approximation rate of error distribution of the kernel estimator of conditional density function f(y|x) are studied. The results may be used to construct the confidence interval of f(y|x) .
基金Projects(LQ16E080012,LY14F030012)supported by the Zhejiang Provincial Natural Science Foundation,ChinaProject(61573317)supported by the National Natural Science Foundation of ChinaProject(2015001)supported by the Open Fund for a Key-Key Discipline of Zhejiang University of Technology,China
文摘The accurate estimation of road traffic states can provide decision making for travelers and traffic managers. In this work,an algorithm based on kernel-k nearest neighbor(KNN) matching of road traffic spatial characteristics is presented to estimate road traffic states. Firstly, the representative road traffic state data were extracted to establish the reference sequences of road traffic running characteristics(RSRTRC). Secondly, the spatial road traffic state data sequence was selected and the kernel function was constructed, with which the spatial road traffic data sequence could be mapped into a high dimensional feature space. Thirdly, the referenced and current spatial road traffic data sequences were extracted and the Euclidean distances in the feature space between them were obtained. Finally, the road traffic states were estimated from weighted averages of the selected k road traffic states, which corresponded to the nearest Euclidean distances. Several typical links in Beijing were adopted for case studies. The final results of the experiments show that the accuracy of this algorithm for estimating speed and volume is 95.27% and 91.32% respectively, which prove that this road traffic states estimation approach based on kernel-KNN matching of road traffic spatial characteristics is feasible and can achieve a high accuracy.
基金supported by the National Natural Science Foundation of China(Grant No.52109156)the Science and Technology Project of the Jiangxi Provincial Education Department(Grant No.GJJ190970).
文摘Traditional methods for early warning of dam displacements usually assume that residual displacements follow a normal distribution.This assumption deviates from the reality,thereby affecting the reliability of early warning results and leading to misjudgments of dam displacement behavior.To solve this problem,this study proposed an early warning method using a non-normal distribution function.A new early warning index was developed using cumulative distribution function(CDF)values.The method of kernel density estimation was used to calculate the CDF values of residual displacements at a single point.The copula function was used to compute the CDF values of residual displacements at multiple points.Numerical results showed that,with residual displacements in a non-normal distribution,the early warning method proposed in this study accurately reflected the dam displacement behavior and effectively reduced the frequency of false alarms.This method is expected to aid in the safe operation of dams.
基金This work is funded by the Deanship of Scientific Research at King Khalid University for funding this work through Large Groups Project under grant number RGP.2/132/43.
文摘Functional statistics is a new technique for dealing with data thatcan be viewed as curves or images. Parallel to this approach, the Near-InfraredReflectance (NIR) spectroscopymethodology has been used in modern chemistryas a rapid, low-cost, and exact means of assessing an object’s chemicalproperties. In this research, we investigate the quality of corn and cookiedough by analyzing the spectroscopic technique using certain cutting-edgestatistical models. By analyzing spectral data and applying functional modelsto it, we could predict the chemical components of corn and cookie dough.Kernel Functional Classical Estimation (KFCE), Kernel Functional QuantileEstimation (KFQE), Kernel Functional Expectile Estimation (KFEE),Semi-Partial Linear Functional Classical Estimation (SPLFCE), Semi-PartialLinear Functional Quantile Estimation (SPLFQE), and Semi-Partial LinearFunctional Expectile Estimation (SPLFEE) are models used to accuratelyestimate the different quantities present in Corn and Cookie dough. Theselection of these functional models is based on their ability to constructa forecast region with a high level of confidence. We demonstrate that theconsidered models outperform traditional models such as the partial leastsquaresregression and the principal component regression in terms of predictionaccuracy. Furthermore, we show that the proposed models are morerobust than competing models such as SPLFQE and SPLFEE in the sensethat data heterogeneity has no effect on their efficiency.
基金The authors would like to thank all anonymous reviewers for their suggestions and feedback.This work was supported by National Natural Science Foundation of China(Grant No.61379103).
文摘Logistic regression is often used to solve linear binary classification problems such as machine vision,speech recognition,and handwriting recognition.However,it usually fails to solve certain nonlinear multi-classification problem,such as problem with non-equilibrium samples.Many scholars have proposed some methods,such as neural network,least square support vector machine,AdaBoost meta-algorithm,etc.These methods essentially belong to machine learning categories.In this work,based on the probability theory and statistical principle,we propose an improved logistic regression algorithm based on kernel density estimation for solving nonlinear multi-classification.We have compared our approach with other methods using non-equilibrium samples,the results show that our approach guarantees sample integrity and achieves superior classification.
基金the NSFC(60473034)the Science Foundation of Zhejiang Province(Y604003).
文摘The spherical approximation between two nested reproducing kernels Hilbert spaces generated from different smooth kernels is investigated. It is shown that the functions of a space can be approximated by that of the subspace with better smoothness. Furthermore, the upper bound of approximation error is given.
文摘令H_n(x)是基于来自密度函数f(x)的容量为n的一个随机样本的hazard函数H(x)=f^(x)/[1-integral from x=∞ to x(f(t)dt)]的一个核形估计.对于非参数hazard估计的积分均方误差integral ((H_n(x)-H(x))~2ω(x)f(x)dx)中心极限定理成立的一个充分条件被给出,这里ω(x)是一个权函数.
基金Project supported by the National Natural Science Foundation of China (Nos. 19631040 19971085), the Doctoral Program Foundatio
文摘The authors derive laws of the iterated logarithm for kernel estimator of regression function based on directional data. The results are distribution free in the sense that they are true for all distributions of design variable.