期刊文献+
共找到26篇文章
< 1 2 >
每页显示 20 50 100
Testing conditional independence with data missing at random
1
作者 LIU Yi LIU Xiao-hui 《Applied Mathematics(A Journal of Chinese Universities)》 SCIE CSCD 2018年第3期298-312,共15页
It is known that conditional independence is a quite basic assumption in many fields of statistics. How to test its validity is of great importance and has been extensively studied by the literature. Nevertheless, all... It is known that conditional independence is a quite basic assumption in many fields of statistics. How to test its validity is of great importance and has been extensively studied by the literature. Nevertheless, all of the existing methods focus on the case that data are fully observed, but none of them seems having taken into account of the scenario when missing data are present. Motivated by this, this paper develops two testing statistics to handle such a situation relying on the idea of inverse probability weighted and augmented inverse probability weighted techniques. The asymptotic distributions of the proposed statistics are also derived under the null hypothesis. The simulation studies indicate that both testing statistics perform well in terms of size and power. 展开更多
关键词 conditional independence cumulative sum process of residuals missing at random inverse probability weighting re-sampling
在线阅读 下载PDF
Cross Validation Based Model Averaging for Varying-Coefficient Models with Response Missing at Random
2
作者 Huixin Li Xiuli Wang 《Journal of Applied Mathematics and Physics》 2024年第3期764-777,共14页
In this paper, a model averaging method is proposed for varying-coefficient models with response missing at random by establishing a weight selection criterion based on cross-validation. Under certain regularity condi... In this paper, a model averaging method is proposed for varying-coefficient models with response missing at random by establishing a weight selection criterion based on cross-validation. Under certain regularity conditions, it is proved that the proposed method is asymptotically optimal in the sense of achieving the minimum squared error. 展开更多
关键词 Response missing at random Model Averaging Asymptotic Optimality B-Spline Approximation
在线阅读 下载PDF
High-dimensional large-scale mixed-type data imputation under missing at random
3
作者 Wei Liu Guizhen Li +1 位作者 Ling Zhou Lan Luo 《Science China Mathematics》 2025年第4期969-1000,共32页
Missingness in mixed-type variables is commonly encountered in a variety of areas.The requirement of complete observations necessitates data imputation when a moderate or large proportion of data is missing.However,in... Missingness in mixed-type variables is commonly encountered in a variety of areas.The requirement of complete observations necessitates data imputation when a moderate or large proportion of data is missing.However,inappropriate imputation would downgrade the performance of machine learning algorithms,leading to bad predictions and unreliable statistical inference.For high-dimensional large-scale mixed-type missing data,we develop a computationally efficient imputation method,missing value imputation via generalized factor models(MIG),under missing at random.The proposed MIG method allows missing variables to be of different types,including continuous,binary,and count variables,and are scalable to both data size n and variable dimension p while existing imputation methods rely on restrictive assumptions such as the same type of missing variables,the low dimensionality of variables,and a limited sample size.We explicitly show that the imputation error of the proposed MIG method diminishes to zero with the rate Op(max{n^(-1/2),p^(-1/2)})as both n and p tend to infinity.Five real datasets demonstrate the superior empirical performance of the proposed MIG method over existing methods that the average normalized absolute imputation error is reduced by 5.3%–34.1%. 展开更多
关键词 IMPUTatION high-dimensional mixed-type data missing at random generalized factor model
原文传递
An Efficient Multiple Imputation Approach for Estimating Equations with Response Missing at Random and High-Dimensional Covariates 被引量:1
4
作者 WANG Lei SUN Siying XIA Zheng 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2021年第1期440-464,共25页
Empirical-likelihood-based inference for parameters defined by the general estimating equations of Qin and Lawless(1994) remains an active research topic. When the response is missing at random(MAR) and the dimension ... Empirical-likelihood-based inference for parameters defined by the general estimating equations of Qin and Lawless(1994) remains an active research topic. When the response is missing at random(MAR) and the dimension of covariate is not low, the authors propose a two-stage estimation procedure by using the dimension-reduced kernel estimators in conjunction with an unbiased estimating function based on augmented inverse probability weighting and multiple imputation(AIPW-MI) methods. The authors show that the resulting estimator achieves consistency and asymptotic normality. In addition, the corresponding empirical likelihood ratio statistics asymptotically follow central chi-square distributions when evaluated at the true parameter. The finite-sample performance of the proposed estimator is studied through simulation, and an application to HIV-CD4 data set is also presented. 展开更多
关键词 Consistency and asymptotic normality dimension reduction kernel-assisted missing at random multiple imputation
原文传递
A class of weighted estimating equations for additive hazards models with covariates missing at random
5
作者 Jin Jin Peng Ye Liuquan Sun 《Science China Mathematics》 SCIE CSCD 2022年第3期583-602,共20页
Missing covariate data arise frequently in biomedical studies.In this article,we propose a class of weighted estimating equations for the additive hazards regression model when some of the covariates are missing at ra... Missing covariate data arise frequently in biomedical studies.In this article,we propose a class of weighted estimating equations for the additive hazards regression model when some of the covariates are missing at random.Time-specific and subject-specific weights are incorporated into the formulation of weighted estimating equations.Unified results are established for estimating selection probabilities that cover both parametric and non-parametric modelling schemes.The resulting estimators have closed forms and are shown to be consistent and asymptotically normal.Simulation studies indicate that the proposed estimators perform well for practical settings.An application to a mouse leukemia study is illustrated. 展开更多
关键词 additive hazards model censored data kernel smoothing missing at random weighted estimating equation
原文传递
Weighted local polynomial estimations of a non-parametric function with censoring indicators missing at random and their applications
6
作者 Jiangfeng WANG Yangcheng ZHOU Ju TANG 《Frontiers of Mathematics in China》 SCIE CSCD 2022年第1期117-139,共23页
In this paper,we consider the weighted local polynomial calibration estimation and imputation estimation of a non-parametric function when the data are right censored and the censoring indicators are missing at random... In this paper,we consider the weighted local polynomial calibration estimation and imputation estimation of a non-parametric function when the data are right censored and the censoring indicators are missing at random,and establish the asymptotic normality of these estimators.As their applications,we derive the weighted local linear calibration estimators and imputation estimations of the conditional distribution function,the conditional density function and the conditional quantile function,and investigate the asymptotic normality of these estimators.Finally,the simulation studies are conducted to illustrate the finite sample performance of the estimators. 展开更多
关键词 Local polynomial estimation asymptotic normality non-parametric function censoring indicator missing at random
原文传递
The abstract of doctoral dissertation‘nonlinear wavelet density estimation and hazard rate estimation with data missing at random’
7
作者 Yuye Zou Guoliang Fan Riquan Zhang 《Statistical Theory and Related Fields》 2020年第1期117-119,共3页
In this thesis,we establish non-linear wavelet density estimators and studying the asymptotic properties of the estimators with data missing at random when covariates are present.The outstanding advantage of non-linea... In this thesis,we establish non-linear wavelet density estimators and studying the asymptotic properties of the estimators with data missing at random when covariates are present.The outstanding advantage of non-linear wavelet method is estimating the unsoothed functions,however,the classical kernel estimation cannot do this work.At the same time,we study the larger sample properties of the ISE for hazard rate estimator. 展开更多
关键词 Asymptotic normality integral square error mean integral square error missing at random non-linear wavelet
原文传递
CBPS-Based Inference in Nonlinear Regression Models with Missing Data 被引量:1
8
作者 Donglin Guo Liugen Xue Haiqing Chen 《Open Journal of Statistics》 2016年第4期675-684,共11页
In this article, to improve the doubly robust estimator, the nonlinear regression models with missing responses are studied. Based on the covariate balancing propensity score (CBPS), estimators for the regression coef... In this article, to improve the doubly robust estimator, the nonlinear regression models with missing responses are studied. Based on the covariate balancing propensity score (CBPS), estimators for the regression coefficients and the population mean are obtained. It is proved that the proposed estimators are asymptotically normal. In simulation studies, the proposed estimators show improved performance relative to usual augmented inverse probability weighted estimators. 展开更多
关键词 Nonlinear Regression Model missing at random Covariate Balancing Propensity Score GMM Augmented Inverse Probability Weighted
在线阅读 下载PDF
Using Statistical Learning to Treat Missing Data: A Case of HIV/TB Co-Infection in Kenya
9
作者 Joshua O. Mwaro Linda Chaba Collins Odhiambo 《Journal of Data Analysis and Information Processing》 2020年第3期110-133,共24页
In this study, we investigate the effects of missing data when estimating HIV/TB co-infection. We revisit the concept of missing data and examine three available approaches for dealing with missingness. The main objec... In this study, we investigate the effects of missing data when estimating HIV/TB co-infection. We revisit the concept of missing data and examine three available approaches for dealing with missingness. The main objective is to identify the best method for correcting missing data in TB/HIV Co-infection setting. We employ both empirical data analysis and extensive simulation study to examine the effects of missing data, the accuracy, sensitivity, specificity and train and test error for different approaches. The novelty of this work hinges on the use of modern statistical learning algorithm when treating missingness. In the empirical analysis, both HIV data and TB-HIV co-infection data imputations were performed, and the missing values were imputed using different approaches. In the simulation study, sets of 0% (Complete case), 10%, 30%, 50% and 80% of the data were drawn randomly and replaced with missing values. Results show complete cases only had a co-infection rate (95% Confidence Interval band) of 29% (25%, 33%), weighted method 27% (23%, 31%), likelihood-based approach 26% (24%, 28%) and multiple imputation approach 21% (20%, 22%). In conclusion, MI remains the best approach for dealing with missing data and failure to apply it, results to overestimation of HIV/TB co-infection rate by 8%. 展开更多
关键词 missing Data HIV/TB Co-Infection IMPUTatION missing at random Count Data
暂未订购
Smoothed Empirical Likelihood Inference for Nonlinear Quantile Regression Models with Missing Response
10
作者 Honghua Dong Xiuli Wang 《Open Journal of Applied Sciences》 2023年第6期921-933,共13页
In this paper, three smoothed empirical log-likelihood ratio functions for the parameters of nonlinear models with missing response are suggested. Under some regular conditions, the corresponding Wilks phenomena are o... In this paper, three smoothed empirical log-likelihood ratio functions for the parameters of nonlinear models with missing response are suggested. Under some regular conditions, the corresponding Wilks phenomena are obtained and the confidence regions for the parameter can be constructed easily. 展开更多
关键词 Nonlinear Model Quantile Regression Smoothed Empirical Likelihood missing at random
在线阅读 下载PDF
CONFIDENCE INTERVALS FOR NONPARAMETRIC REGRESSION FUNCTIONS WITH MISSING DATA: MULTIPLE DESIGN CASE 被引量:2
11
作者 Qingzhu LEI Yongsong QIN 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2011年第6期1204-1217,共14页
This paper considers two estimators of θ= g(x) in a nonparametric regression model Y = g(x) + ε(x∈ (0, 1)p) with missing responses: Imputation and inverse probability weighted esti- mators. Asymptotic nor... This paper considers two estimators of θ= g(x) in a nonparametric regression model Y = g(x) + ε(x∈ (0, 1)p) with missing responses: Imputation and inverse probability weighted esti- mators. Asymptotic normality of the two estimators is established, which is used to construct normal approximation based confidence intervals on θ. 展开更多
关键词 Confidence interval missing at random nonparametric regression normal approximation.
原文传递
Empirical likelihood inference for estimating equation with missing data 被引量:2
12
作者 WANG XiuLi CHEN Fang LIN Lu 《Science China Mathematics》 SCIE 2013年第6期1233-1245,共13页
In this article, empirical likelihood inference for estimating equation with missing data is considered. Based on the weighted-corrected estimating function, an empirical log-likelihood ratio is proved to be a standar... In this article, empirical likelihood inference for estimating equation with missing data is considered. Based on the weighted-corrected estimating function, an empirical log-likelihood ratio is proved to be a standard chiqsquare distribution asymptotically under some suitable conditions. This result is different from those derived before. So it is convenient to construct confidence regions for the parameters of interest. We also prove that our proposed maximum empirical likelihood estimator θ is asymptotically normal and attains the semiparametric efficiency bound of missing data. Some simulations indicate that the proposed method performs the best. 展开更多
关键词 empirical likelihood estimating equation kernel regression missing at random
原文传递
A fusion of least squares and empirical likelihood for regression models with a missing binary covariate 被引量:1
13
作者 DUAN XiaoGang WANG Zhi 《Science China Mathematics》 SCIE CSCD 2016年第10期2027-2036,共10页
Multiply robust inference has attracted much attention recently in the context of missing response data. An estimation procedure is multiply robust, if it can incorporate information from multiple candidate models, an... Multiply robust inference has attracted much attention recently in the context of missing response data. An estimation procedure is multiply robust, if it can incorporate information from multiple candidate models, and meanwhile the resulting estimator is consistent as long as one of the candidate models is correctly specified. This property is appealing, since it provides the user a flexible modeling strategy with better protection against model misspecification. We explore this attractive property for the regression models with a binary covariate that is missing at random. We start from a reformulation of the celebrated augmented inverse probability weighted estimating equation, and based on this reformulation, we propose a novel combination of the least squares and empirical likelihood to separately handle each of the two types of multiple candidate models,one for the missing variable regression and the other for the missingness mechanism. Due to the separation, all the working models are fused concisely and effectively. The asymptotic normality of our estimator is established through the theory of estimating function with plugged-in nuisance parameter estimates. The finite-sample performance of our procedure is illustrated both through the simulation studies and the analysis of a dementia data collected by the national Alzheimer's coordinating center. 展开更多
关键词 calibration covariate adjustment effect modification missing at random multiple robustness refitting
原文传递
Empirical Likelihood of Quantile Difference with Missing Response When High-dimensional Covariates Are Present
14
作者 Cui Juan KONG Han Ying LIANG 《Acta Mathematica Sinica,English Series》 SCIE CSCD 2021年第12期1803-1825,共23页
We,in this paper,investigate two-sample quantile difference by empirical likelihood method when the responses with high-dimensional covariates of the two populations are missing at random.In particular,based on suffic... We,in this paper,investigate two-sample quantile difference by empirical likelihood method when the responses with high-dimensional covariates of the two populations are missing at random.In particular,based on sufficient dimension reduction technique,we construct three empirical log-likelihood ratios for the quantile difference between two samples by using inverse probability weighting imputation,regression imputation as well as augmented inverse probability weighting imputation,respectively,and prove their asymptotic distributions.At the same time,we give a test to check whether two populations have the same distribution.A simulation study is carried out to investigate finite sample behavior of the proposed methods too. 展开更多
关键词 Empirical likelihood HIGH-DIMENSIONAL missing at random sufficient dimension reduction two-sample quantile difference
原文传递
Regression Analysis of Right-censored Failure Time Data with Missing Censoring Indicators
15
作者 Ping Chen Ren He +1 位作者 Jun-shan Shen Jian-guo Sun 《Acta Mathematicae Applicatae Sinica》 SCIE CSCD 2009年第3期415-426,共12页
This paper discusses regression analysis of right-censored failure time data when censoring indicators are missing for some subjects. Several methods have been developed for the analysis under different situations and... This paper discusses regression analysis of right-censored failure time data when censoring indicators are missing for some subjects. Several methods have been developed for the analysis under different situations and especially, Goetghebeur and Ryan considered the situation where both the failure time and the censoring time follow the proportional hazards models marginally and developed an estimating equation approach. One limitation of their approach is that the two baseline hazard functions were assumed to be proportional to each other. We consider the same problem and present an efficient estimation procedure for regression parameters that does not require the proportionality assumption. An EM algorithm is developed and the method is evaluated by a simulation study, which indicates that the proposed methodology performs well for practical situations. An illustrative example is provided. 展开更多
关键词 Efficient estimation em algorithm incomplete data missing at random Proportional hazards model
原文传递
Empirical Likelihood for Response Differences in Two Linear Regression Models with Missing Data
16
作者 Yong-song QIN Tao QIU Qing-zhu LEI 《Acta Mathematicae Applicatae Sinica》 SCIE CSCD 2015年第4期963-976,共14页
Oonsider two linear models Xi = U'β + ei, Yj = V1/2y + ηj with response variables missing at random. In this paper, we assume that X, Y are missing at random (MAR) and use the inverse probability weighted imput... Oonsider two linear models Xi = U'β + ei, Yj = V1/2y + ηj with response variables missing at random. In this paper, we assume that X, Y are missing at random (MAR) and use the inverse probability weighted imputation to produce 'complete' data sets for X and Y. Based on these data sets, we construct an empirical likelihood (EL) statistic for the difference of X and Y (denoted as A), and show that the EL statistic has the limiting distribution of X~, which is used to construct a confidence interval for A. Results of a simulation study on the finite sample performance of EL-based confidence intervals on A are reported. 展开更多
关键词 linear model inverse probability weighted imputation empirical likelihood missing at random confidence interval
原文传递
Integrated Square Error of Hazard Rate Estimation for Survival Data with Missing Censoring Indicators
17
作者 ZOU Yuye FAN Guoliang ZHANG Riquan 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2021年第2期735-758,共24页
The problem of hazard rate estimation under right-censored assumption has been investigated extensively.Integrated square error(ISE)of estimation is one of the most widely accepted measurements of the global performan... The problem of hazard rate estimation under right-censored assumption has been investigated extensively.Integrated square error(ISE)of estimation is one of the most widely accepted measurements of the global performance for nonparametric kernel estimation.But there are no results available for ISE of hazard rate estimation under right-censored model with censoring indicators missing at random(MAR)so far.This paper constructs an imputation estimator of the hazard rate function and establish asymptotic normality of the ISE for the kernel hazard rate estimator with censoring indicators MAR.At the same time,an asymptotic representation of the mean integrated square error(MISE)is also presented.The finite sample behavior of the estimator is investigated via one simple simulation. 展开更多
关键词 Asymptotic normality integrated square error missing at random right-censored model
原文传递
Estimation for Partially Linear Models with Missing Responses:the Fixed Design Case
18
作者 Yong-song QIN Ying-hua LI 《Acta Mathematicae Applicatae Sinica》 SCIE CSCD 2014年第2期447-472,共26页
Suppose that we have a partially linear model Yi = xiβ + g(ti) +εi with independent zero mean errors εi, where (xi,ti, i = 1, ... ,n} are non-random and observed completely and (Yi, i = 1,...,n} are missing a... Suppose that we have a partially linear model Yi = xiβ + g(ti) +εi with independent zero mean errors εi, where (xi,ti, i = 1, ... ,n} are non-random and observed completely and (Yi, i = 1,...,n} are missing at random(MAR). Two types of estimators of β and g(t) for fixed t are investigated: estimators based on semiparametric regression and inverse probability weighted imputations. Asymptotic normality of the estimators is established, which is used to construct normal approximation based confidence intervals on β and g(t). Results are reported of a simulation study on the finite sample performance of the estimators and confidence intervals proposed in this paper. 展开更多
关键词 partially linear model fixed design point missing at random confidence interval
原文传递
Variable screening with missing covariates: a discussion of ‘statistical inferencefor nonignorable missing data problems: a selective review’ by NianshengTang and Yuanyuan Ju
19
作者 Fang Fang Lyu Ni 《Statistical Theory and Related Fields》 2018年第2期134-136,共3页
Feature screening with missing data is a critical problem but has not been well addressed in theliterature. In this discussion we propose a new screening index based on “information value” andapply it to feature scr... Feature screening with missing data is a critical problem but has not been well addressed in theliterature. In this discussion we propose a new screening index based on “information value” andapply it to feature screening with missing covariates. 展开更多
关键词 Feature screening missing at random missing covariates
原文传递
Empirical Likelihood-based Inferences in Varying Coefficient Models with Missing Data
20
作者 Xiao-hui LIU 《Acta Mathematicae Applicatae Sinica》 SCIE CSCD 2015年第3期823-840,共18页
In this paper, we consider the empirical likelihood-based inferences for varying coefficient models Y = X^τα(U) + ε when X are subject to missing at random. Based on the inverse probability-weighted idea, a clas... In this paper, we consider the empirical likelihood-based inferences for varying coefficient models Y = X^τα(U) + ε when X are subject to missing at random. Based on the inverse probability-weighted idea, a class of empirical log-likelihood ratios, as well as two maximum empirical likelihood estimators, are developed for α(u). The resulting statistics are shown to have standard chi-squared or normal distributions asymptotically.Simulation studies are also constructed to illustrate the finite sample properties of the proposed statistics. 展开更多
关键词 varying coefficient models missing at random empirical likelihood maximum empirical likelihood estimator
原文传递
上一页 1 2 下一页 到第
使用帮助 返回顶部