期刊文献+
共找到28篇文章
< 1 2 >
每页显示 20 50 100
Cross Validation Based Model Averaging for Varying-Coefficient Models with Response Missing at Random 被引量:1
1
作者 Huixin Li Xiuli Wang 《Journal of Applied Mathematics and Physics》 2024年第3期764-777,共14页
In this paper, a model averaging method is proposed for varying-coefficient models with response missing at random by establishing a weight selection criterion based on cross-validation. Under certain regularity condi... In this paper, a model averaging method is proposed for varying-coefficient models with response missing at random by establishing a weight selection criterion based on cross-validation. Under certain regularity conditions, it is proved that the proposed method is asymptotically optimal in the sense of achieving the minimum squared error. 展开更多
关键词 Response missing at random Model Averaging Asymptotic Optimality B-Spline Approximation
在线阅读 下载PDF
Testing conditional independence with data missing at random
2
作者 LIU Yi LIU Xiao-hui 《Applied Mathematics(A Journal of Chinese Universities)》 SCIE CSCD 2018年第3期298-312,共15页
It is known that conditional independence is a quite basic assumption in many fields of statistics. How to test its validity is of great importance and has been extensively studied by the literature. Nevertheless, all... It is known that conditional independence is a quite basic assumption in many fields of statistics. How to test its validity is of great importance and has been extensively studied by the literature. Nevertheless, all of the existing methods focus on the case that data are fully observed, but none of them seems having taken into account of the scenario when missing data are present. Motivated by this, this paper develops two testing statistics to handle such a situation relying on the idea of inverse probability weighted and augmented inverse probability weighted techniques. The asymptotic distributions of the proposed statistics are also derived under the null hypothesis. The simulation studies indicate that both testing statistics perform well in terms of size and power. 展开更多
关键词 conditional independence cumulative sum process of residuals missing at random inverse probability weighting re-sampling
在线阅读 下载PDF
An Adaptive Multivariate EWMA Control Chart for Monitoring Missing Data 被引量:1
3
作者 PU Xiaolong XIANG Dongdong CHEN Xinyan 《应用概率统计》 CSCD 北大核心 2024年第2期343-363,共21页
With the increasing complexity of production processes,there has been a growing focus on online algorithms within the domain of multivariate statistical process control(SPC).Nonetheless,conventional methods,based on t... With the increasing complexity of production processes,there has been a growing focus on online algorithms within the domain of multivariate statistical process control(SPC).Nonetheless,conventional methods,based on the assumption of complete data obtained at uniform time intervals,exhibit suboptimal performance in the presence of missing data.In our pursuit of maximizing available information,we propose an adaptive exponentially weighted moving average(EWMA)control chart employing a weighted imputation approach that leverages the relationships between complete and incomplete data.Specifically,we introduce two recovery methods:an improved K-Nearest Neighbors imputing value and the conventional univariate EWMA statistic.We then formulate an adaptive weighting function to amalgamate these methods,assigning a diminished weight to the EWMA statistic when the sample information suggests an increased likelihood of the process being out of control,and vice versa.The robustness and sensitivity of the proposed scheme are shown through simulation results and an illustrative example. 展开更多
关键词 online monitoring completely random missing weighted imputing values EWMA improved K-nearest neighbors
在线阅读 下载PDF
CBPS-Based Inference in Nonlinear Regression Models with Missing Data 被引量:1
4
作者 Donglin Guo Liugen Xue Haiqing Chen 《Open Journal of Statistics》 2016年第4期675-684,共11页
In this article, to improve the doubly robust estimator, the nonlinear regression models with missing responses are studied. Based on the covariate balancing propensity score (CBPS), estimators for the regression coef... In this article, to improve the doubly robust estimator, the nonlinear regression models with missing responses are studied. Based on the covariate balancing propensity score (CBPS), estimators for the regression coefficients and the population mean are obtained. It is proved that the proposed estimators are asymptotically normal. In simulation studies, the proposed estimators show improved performance relative to usual augmented inverse probability weighted estimators. 展开更多
关键词 Nonlinear Regression Model missing at random Covariate Balancing Propensity Score GMM Augmented Inverse Probability Weighted
在线阅读 下载PDF
Using Statistical Learning to Treat Missing Data: A Case of HIV/TB Co-Infection in Kenya
5
作者 Joshua O. Mwaro Linda Chaba Collins Odhiambo 《Journal of Data Analysis and Information Processing》 2020年第3期110-133,共24页
In this study, we investigate the effects of missing data when estimating HIV/TB co-infection. We revisit the concept of missing data and examine three available approaches for dealing with missingness. The main objec... In this study, we investigate the effects of missing data when estimating HIV/TB co-infection. We revisit the concept of missing data and examine three available approaches for dealing with missingness. The main objective is to identify the best method for correcting missing data in TB/HIV Co-infection setting. We employ both empirical data analysis and extensive simulation study to examine the effects of missing data, the accuracy, sensitivity, specificity and train and test error for different approaches. The novelty of this work hinges on the use of modern statistical learning algorithm when treating missingness. In the empirical analysis, both HIV data and TB-HIV co-infection data imputations were performed, and the missing values were imputed using different approaches. In the simulation study, sets of 0% (Complete case), 10%, 30%, 50% and 80% of the data were drawn randomly and replaced with missing values. Results show complete cases only had a co-infection rate (95% Confidence Interval band) of 29% (25%, 33%), weighted method 27% (23%, 31%), likelihood-based approach 26% (24%, 28%) and multiple imputation approach 21% (20%, 22%). In conclusion, MI remains the best approach for dealing with missing data and failure to apply it, results to overestimation of HIV/TB co-infection rate by 8%. 展开更多
关键词 missing Data HIV/TB Co-Infection IMPUTATION missing at random Count Data
暂未订购
Smoothed Empirical Likelihood Inference for Nonlinear Quantile Regression Models with Missing Response
6
作者 Honghua Dong Xiuli Wang 《Open Journal of Applied Sciences》 2023年第6期921-933,共13页
In this paper, three smoothed empirical log-likelihood ratio functions for the parameters of nonlinear models with missing response are suggested. Under some regular conditions, the corresponding Wilks phenomena are o... In this paper, three smoothed empirical log-likelihood ratio functions for the parameters of nonlinear models with missing response are suggested. Under some regular conditions, the corresponding Wilks phenomena are obtained and the confidence regions for the parameter can be constructed easily. 展开更多
关键词 Nonlinear Model Quantile Regression Smoothed Empirical Likelihood missing at random
在线阅读 下载PDF
High-dimensional large-scale mixed-type data imputation under missing at random
7
作者 Wei Liu Guizhen Li +1 位作者 Ling Zhou Lan Luo 《Science China Mathematics》 2025年第4期969-1000,共32页
Missingness in mixed-type variables is commonly encountered in a variety of areas.The requirement of complete observations necessitates data imputation when a moderate or large proportion of data is missing.However,in... Missingness in mixed-type variables is commonly encountered in a variety of areas.The requirement of complete observations necessitates data imputation when a moderate or large proportion of data is missing.However,inappropriate imputation would downgrade the performance of machine learning algorithms,leading to bad predictions and unreliable statistical inference.For high-dimensional large-scale mixed-type missing data,we develop a computationally efficient imputation method,missing value imputation via generalized factor models(MIG),under missing at random.The proposed MIG method allows missing variables to be of different types,including continuous,binary,and count variables,and are scalable to both data size n and variable dimension p while existing imputation methods rely on restrictive assumptions such as the same type of missing variables,the low dimensionality of variables,and a limited sample size.We explicitly show that the imputation error of the proposed MIG method diminishes to zero with the rate Op(max{n^(-1/2),p^(-1/2)})as both n and p tend to infinity.Five real datasets demonstrate the superior empirical performance of the proposed MIG method over existing methods that the average normalized absolute imputation error is reduced by 5.3%–34.1%. 展开更多
关键词 IMPUTATION high-dimensional mixed-type data missing at random generalized factor model
原文传递
An Efficient Multiple Imputation Approach for Estimating Equations with Response Missing at Random and High-Dimensional Covariates 被引量:1
8
作者 WANG Lei SUN Siying XIA Zheng 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2021年第1期440-464,共25页
Empirical-likelihood-based inference for parameters defined by the general estimating equations of Qin and Lawless(1994) remains an active research topic. When the response is missing at random(MAR) and the dimension ... Empirical-likelihood-based inference for parameters defined by the general estimating equations of Qin and Lawless(1994) remains an active research topic. When the response is missing at random(MAR) and the dimension of covariate is not low, the authors propose a two-stage estimation procedure by using the dimension-reduced kernel estimators in conjunction with an unbiased estimating function based on augmented inverse probability weighting and multiple imputation(AIPW-MI) methods. The authors show that the resulting estimator achieves consistency and asymptotic normality. In addition, the corresponding empirical likelihood ratio statistics asymptotically follow central chi-square distributions when evaluated at the true parameter. The finite-sample performance of the proposed estimator is studied through simulation, and an application to HIV-CD4 data set is also presented. 展开更多
关键词 Consistency and asymptotic normality dimension reduction kernel-assisted missing at random multiple imputation
原文传递
Weighted local polynomial estimations of a non-parametric function with censoring indicators missing at random and their applications
9
作者 Jiangfeng WANG Yangcheng ZHOU Ju TANG 《Frontiers of Mathematics in China》 SCIE CSCD 2022年第1期117-139,共23页
In this paper,we consider the weighted local polynomial calibration estimation and imputation estimation of a non-parametric function when the data are right censored and the censoring indicators are missing at random... In this paper,we consider the weighted local polynomial calibration estimation and imputation estimation of a non-parametric function when the data are right censored and the censoring indicators are missing at random,and establish the asymptotic normality of these estimators.As their applications,we derive the weighted local linear calibration estimators and imputation estimations of the conditional distribution function,the conditional density function and the conditional quantile function,and investigate the asymptotic normality of these estimators.Finally,the simulation studies are conducted to illustrate the finite sample performance of the estimators. 展开更多
关键词 Local polynomial estimation asymptotic normality non-parametric function censoring indicator missing at random
原文传递
A class of weighted estimating equations for additive hazards models with covariates missing at random
10
作者 Jin Jin Peng Ye Liuquan Sun 《Science China Mathematics》 SCIE CSCD 2022年第3期583-602,共20页
Missing covariate data arise frequently in biomedical studies.In this article,we propose a class of weighted estimating equations for the additive hazards regression model when some of the covariates are missing at ra... Missing covariate data arise frequently in biomedical studies.In this article,we propose a class of weighted estimating equations for the additive hazards regression model when some of the covariates are missing at random.Time-specific and subject-specific weights are incorporated into the formulation of weighted estimating equations.Unified results are established for estimating selection probabilities that cover both parametric and non-parametric modelling schemes.The resulting estimators have closed forms and are shown to be consistent and asymptotically normal.Simulation studies indicate that the proposed estimators perform well for practical settings.An application to a mouse leukemia study is illustrated. 展开更多
关键词 additive hazards model censored data kernel smoothing missing at random weighted estimating equation
原文传递
The abstract of doctoral dissertation‘nonlinear wavelet density estimation and hazard rate estimation with data missing at random’
11
作者 Yuye Zou Guoliang Fan Riquan Zhang 《Statistical Theory and Related Fields》 2020年第1期117-119,共3页
In this thesis,we establish non-linear wavelet density estimators and studying the asymptotic properties of the estimators with data missing at random when covariates are present.The outstanding advantage of non-linea... In this thesis,we establish non-linear wavelet density estimators and studying the asymptotic properties of the estimators with data missing at random when covariates are present.The outstanding advantage of non-linear wavelet method is estimating the unsoothed functions,however,the classical kernel estimation cannot do this work.At the same time,we study the larger sample properties of the ISE for hazard rate estimator. 展开更多
关键词 Asymptotic normality integral square error mean integral square error missing at random non-linear wavelet
原文传递
非实验性药物流行病学研究数据缺失的预防、检查和处理
12
作者 CD Mack Z Su +1 位作者 AB Mendelsohn N Dreyer 《药物流行病学杂志》 CAS 2015年第1期14-22,共9页
与精心开展的随机化临床试验相比,非实验性研究中的缺失数据会对有效性形成更大的威胁。然而,非实验性真实世界研究可以更加准确地描述医疗干预措施在实际情境中对不同的患者群体的作用,从而补偿这些限制条件。如果研究者认识到缺失一... 与精心开展的随机化临床试验相比,非实验性研究中的缺失数据会对有效性形成更大的威胁。然而,非实验性真实世界研究可以更加准确地描述医疗干预措施在实际情境中对不同的患者群体的作用,从而补偿这些限制条件。如果研究者认识到缺失一定数量的数据(无论关于暴露、结局还是混杂因素)不可避免,就应该在研究开始时针对缺失数据制定计划,尽可能防止缺失数据,同时为处理重要变量的缺失数据制定规划。如不能获得所有患者的全部数据元素,就必须认真检查和处理缺失数据。可采用多重填补等统计技术填补空缺。这些方法都需要尽可能了解导致数据缺失的因素的相关假设及其与研究结果的关联。此文描述了预防缺失数据的策略和处理非实验性真实世界研究中缺失数据的分析方法,并加入了例证说明。 展开更多
关键词 数据缺失 缺失 非随机缺失 填补 观察性研究 非实验性研究 注册登记
原文传递
CONFIDENCE INTERVALS FOR NONPARAMETRIC REGRESSION FUNCTIONS WITH MISSING DATA: MULTIPLE DESIGN CASE 被引量:2
13
作者 Qingzhu LEI Yongsong QIN 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2011年第6期1204-1217,共14页
This paper considers two estimators of θ= g(x) in a nonparametric regression model Y = g(x) + ε(x∈ (0, 1)p) with missing responses: Imputation and inverse probability weighted esti- mators. Asymptotic nor... This paper considers two estimators of θ= g(x) in a nonparametric regression model Y = g(x) + ε(x∈ (0, 1)p) with missing responses: Imputation and inverse probability weighted esti- mators. Asymptotic normality of the two estimators is established, which is used to construct normal approximation based confidence intervals on θ. 展开更多
关键词 Confidence interval missing at random nonparametric regression normal approximation.
原文传递
Empilrical Likelihood for Non-parametric Regression Models with Missing Responses:Multiple Design Case 被引量:2
14
作者 Qing-zhu Lei Yong-song Qin 《Acta Mathematicae Applicatae Sinica》 SCIE CSCD 2011年第1期1-12,共12页
Empirical likelihood (EL) ratio statistic on θ=g(x) is constructed based on the inverse probability weighted imputation approach in a nonparametric regression model Y = g(x) +ε (x ∈ [0, 1]p) with fixed des... Empirical likelihood (EL) ratio statistic on θ=g(x) is constructed based on the inverse probability weighted imputation approach in a nonparametric regression model Y = g(x) +ε (x ∈ [0, 1]p) with fixed designs and missing responses, which asymptotically has X1^2 distribution. This result is used to obtain a EL based confidence interval on θ. 展开更多
关键词 Nonparametric regression empirical likelihood missing at random confidence interval
原文传递
Empirical likelihood inference for estimating equation with missing data 被引量:2
15
作者 WANG XiuLi CHEN Fang LIN Lu 《Science China Mathematics》 SCIE 2013年第6期1233-1245,共13页
In this article, empirical likelihood inference for estimating equation with missing data is considered. Based on the weighted-corrected estimating function, an empirical log-likelihood ratio is proved to be a standar... In this article, empirical likelihood inference for estimating equation with missing data is considered. Based on the weighted-corrected estimating function, an empirical log-likelihood ratio is proved to be a standard chiqsquare distribution asymptotically under some suitable conditions. This result is different from those derived before. So it is convenient to construct confidence regions for the parameters of interest. We also prove that our proposed maximum empirical likelihood estimator θ is asymptotically normal and attains the semiparametric efficiency bound of missing data. Some simulations indicate that the proposed method performs the best. 展开更多
关键词 empirical likelihood estimating equation kernel regression missing at random
原文传递
A fusion of least squares and empirical likelihood for regression models with a missing binary covariate 被引量:1
16
作者 DUAN XiaoGang WANG Zhi 《Science China Mathematics》 SCIE CSCD 2016年第10期2027-2036,共10页
Multiply robust inference has attracted much attention recently in the context of missing response data. An estimation procedure is multiply robust, if it can incorporate information from multiple candidate models, an... Multiply robust inference has attracted much attention recently in the context of missing response data. An estimation procedure is multiply robust, if it can incorporate information from multiple candidate models, and meanwhile the resulting estimator is consistent as long as one of the candidate models is correctly specified. This property is appealing, since it provides the user a flexible modeling strategy with better protection against model misspecification. We explore this attractive property for the regression models with a binary covariate that is missing at random. We start from a reformulation of the celebrated augmented inverse probability weighted estimating equation, and based on this reformulation, we propose a novel combination of the least squares and empirical likelihood to separately handle each of the two types of multiple candidate models,one for the missing variable regression and the other for the missingness mechanism. Due to the separation, all the working models are fused concisely and effectively. The asymptotic normality of our estimator is established through the theory of estimating function with plugged-in nuisance parameter estimates. The finite-sample performance of our procedure is illustrated both through the simulation studies and the analysis of a dementia data collected by the national Alzheimer's coordinating center. 展开更多
关键词 calibration covariate adjustment effect modification missing at random multiple robustness refitting
原文传递
Integrated Square Error of Hazard Rate Estimation for Survival Data with Missing Censoring Indicators
17
作者 ZOU Yuye FAN Guoliang ZHANG Riquan 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2021年第2期735-758,共24页
The problem of hazard rate estimation under right-censored assumption has been investigated extensively.Integrated square error(ISE)of estimation is one of the most widely accepted measurements of the global performan... The problem of hazard rate estimation under right-censored assumption has been investigated extensively.Integrated square error(ISE)of estimation is one of the most widely accepted measurements of the global performance for nonparametric kernel estimation.But there are no results available for ISE of hazard rate estimation under right-censored model with censoring indicators missing at random(MAR)so far.This paper constructs an imputation estimator of the hazard rate function and establish asymptotic normality of the ISE for the kernel hazard rate estimator with censoring indicators MAR.At the same time,an asymptotic representation of the mean integrated square error(MISE)is also presented.The finite sample behavior of the estimator is investigated via one simple simulation. 展开更多
关键词 Asymptotic normality integrated square error missing at random right-censored model
原文传递
Empirical Likelihood of Quantile Difference with Missing Response When High-dimensional Covariates Are Present
18
作者 Cui Juan KONG Han Ying LIANG 《Acta Mathematica Sinica,English Series》 SCIE CSCD 2021年第12期1803-1825,共23页
We,in this paper,investigate two-sample quantile difference by empirical likelihood method when the responses with high-dimensional covariates of the two populations are missing at random.In particular,based on suffic... We,in this paper,investigate two-sample quantile difference by empirical likelihood method when the responses with high-dimensional covariates of the two populations are missing at random.In particular,based on sufficient dimension reduction technique,we construct three empirical log-likelihood ratios for the quantile difference between two samples by using inverse probability weighting imputation,regression imputation as well as augmented inverse probability weighting imputation,respectively,and prove their asymptotic distributions.At the same time,we give a test to check whether two populations have the same distribution.A simulation study is carried out to investigate finite sample behavior of the proposed methods too. 展开更多
关键词 Empirical likelihood HIGH-DIMENSIONAL missing at random sufficient dimension reduction two-sample quantile difference
原文传递
Regression Analysis of Right-censored Failure Time Data with Missing Censoring Indicators
19
作者 Ping Chen Ren He +1 位作者 Jun-shan Shen Jian-guo Sun 《Acta Mathematicae Applicatae Sinica》 SCIE CSCD 2009年第3期415-426,共12页
This paper discusses regression analysis of right-censored failure time data when censoring indicators are missing for some subjects. Several methods have been developed for the analysis under different situations and... This paper discusses regression analysis of right-censored failure time data when censoring indicators are missing for some subjects. Several methods have been developed for the analysis under different situations and especially, Goetghebeur and Ryan considered the situation where both the failure time and the censoring time follow the proportional hazards models marginally and developed an estimating equation approach. One limitation of their approach is that the two baseline hazard functions were assumed to be proportional to each other. We consider the same problem and present an efficient estimation procedure for regression parameters that does not require the proportionality assumption. An EM algorithm is developed and the method is evaluated by a simulation study, which indicates that the proposed methodology performs well for practical situations. An illustrative example is provided. 展开更多
关键词 Efficient estimation em algorithm incomplete data missing at random Proportional hazards model
原文传递
Estimation for Partially Linear Models with Missing Responses:the Fixed Design Case
20
作者 Yong-song QIN Ying-hua LI 《Acta Mathematicae Applicatae Sinica》 SCIE CSCD 2014年第2期447-472,共26页
Suppose that we have a partially linear model Yi = xiβ + g(ti) +εi with independent zero mean errors εi, where (xi,ti, i = 1, ... ,n} are non-random and observed completely and (Yi, i = 1,...,n} are missing a... Suppose that we have a partially linear model Yi = xiβ + g(ti) +εi with independent zero mean errors εi, where (xi,ti, i = 1, ... ,n} are non-random and observed completely and (Yi, i = 1,...,n} are missing at random(MAR). Two types of estimators of β and g(t) for fixed t are investigated: estimators based on semiparametric regression and inverse probability weighted imputations. Asymptotic normality of the estimators is established, which is used to construct normal approximation based confidence intervals on β and g(t). Results are reported of a simulation study on the finite sample performance of the estimators and confidence intervals proposed in this paper. 展开更多
关键词 partially linear model fixed design point missing at random confidence interval
原文传递
上一页 1 2 下一页 到第
使用帮助 返回顶部