3 articles found
Bayesian Analysis of Simple Random Densities
1
Authors: Paulo C. Marques F., Carlos A. de B. Pereira. Open Journal of Statistics, 2014, No. 5, pp. 377-390 (14 pages)
A tractable nonparametric prior over densities is introduced which is closed under sampling and exhibits proper posterior asymptotics.
Keywords: Bayesian nonparametrics; Bayesian density estimation; random densities; random partitions; stochastic simulations; smoothing
Density estimation-based method to determine sample size for random sample partition of big data
2
Authors: Yulin HE, Jiaqi CHEN, Jiaxing SHEN, Philippe FOURNIER-VIGER, Joshua Zhexue HUANG. Frontiers of Computer Science (SCIE, EI, CSCD), 2024, No. 5, pp. 57-70 (14 pages)
Random sample partition (RSP) is a newly developed big data representation and management model to deal with big data approximate computation problems. Academic research and practical applications have confirmed that RSP is an efficient solution for big data processing and analysis. However, a challenge for implementing RSP is determining an appropriate sample size for RSP data blocks. While a large sample size increases the burden of big data computation, a small size will lead to insufficient distribution information for RSP data blocks. To address this problem, this paper presents a novel density estimation-based method (DEM) to determine the optimal sample size for RSP data blocks. First, a theoretical sample size is calculated based on the multivariate Dvoretzky-Kiefer-Wolfowitz (DKW) inequality by using the fixed-point iteration (FPI) method. Second, a practical sample size is determined by minimizing the validation error of a kernel density estimator (KDE) constructed on RSP data blocks for an increasing sample size. Finally, a series of persuasive experiments are conducted to validate the feasibility, rationality, and effectiveness of DEM. Experimental results show that (1) the iteration function of the FPI method is convergent for calculating the theoretical sample size from the multivariate DKW inequality; (2) the KDE constructed on RSP data blocks with sample size determined by DEM can yield a good approximation of the probability density function (p.d.f.); and (3) DEM provides more accurate sample sizes than the existing sample size determination methods from the perspective of p.d.f. estimation. This demonstrates that DEM is a viable approach to deal with the sample size determination problem for big data RSP implementation.
Keywords: random sample partition; big data; sample size; Dvoretzky-Kiefer-Wolfowitz inequality; kernel density estimator; probability density function
Krigings over space and time based on latent low-dimensional structures (cited by: 1)
3
Authors: Da Huang, Qiwei Yao, Rongmao Zhang. Science China Mathematics (SCIE, CSCD), 2021, No. 4, pp. 823-848 (26 pages)
We propose a new nonparametric approach to represent the linear dependence structure of a spatiotemporal process in terms of latent common factors. Though it is formally similar to the existing reduced rank approximation methods, the fundamental difference is that the low-dimensional structure is completely unknown in our setting, which is learned from the data collected irregularly over space but regularly in time. Furthermore, a graph Laplacian is incorporated in the learning in order to take advantage of the continuity over space, and a new aggregation method via randomly partitioning space is introduced to improve the efficiency. We do not impose any stationarity conditions over space either, as the learning is facilitated by the stationarity in time. Krigings over space and time are carried out based on the learned low-dimensional structure, which is scalable to the cases when the data are taken over a large number of locations and/or over a long time period. Asymptotic properties of the proposed methods are established. An illustration with both simulated and real data sets is also reported.
Keywords: aggregation via random partitioning; common factors; eigenanalysis; graph Laplacian; nugget effect; spatio-temporal processes