期刊文献+
共找到293篇文章
< 1 2 15 >
每页显示 20 50 100
Panel Count Data模型参数的经验似然推断
1
作者 胡宏昌 崔恒建 《数理统计与管理》 CSSCI 北大核心 2014年第4期647-654,共8页
对Panel Count Data的处理越来越受到人们的关注,Sun与Wei^([1-2])基于简单的半参数模型,提出了Panel Count Data的回归分析,并且给出了参数的估计方程。本文则基于经验似然的思想,讨论了上述Panel Count Data模型参数的置信域构造问题... 对Panel Count Data的处理越来越受到人们的关注,Sun与Wei^([1-2])基于简单的半参数模型,提出了Panel Count Data的回归分析,并且给出了参数的估计方程。本文则基于经验似然的思想,讨论了上述Panel Count Data模型参数的置信域构造问题,特别仅通过经验似然置信区域给出了参数估计的方差阵估计,证明了估计的1/n相合性。基于Sun与Wei所给的数据,给出了参数置信区域的具体构造过程和结果。通过作图比较可以看出经验似然置信域要优于依据渐近正态性所构造的置信域。我们还依据所作出的经验似然置信域对参数估计的方差矩阵进行了估计,与用传统渐近正态性得到的矩阵较为接近。 展开更多
关键词 PANEL count data 经验似然 置信域 协方差矩阵估计
原文传递
Effect Modeling of Count Data Using Logistic Regression with Qualitative Predictors
2
作者 Haeil Ahn 《Engineering(科研)》 2014年第12期758-772,共15页
We modeled binary count data with categorical predictors, using logistic regression to develop a statistical method. We found that ANOVA-type analyses often performed unsatisfactorily, even when using different transf... We modeled binary count data with categorical predictors, using logistic regression to develop a statistical method. We found that ANOVA-type analyses often performed unsatisfactorily, even when using different transformations. The logistic transformation of fraction data could be an alternative, but it is not desirable in the statistical sense. We concluded that such methods are not appropriate, especially in cases where the fractions were close to 0 or 1. The major purpose of this paper is to demonstrate that logistic regression with an ANOVA-model like parameterization aids our understanding and provides a somewhat different, but sound, statistical background. We examined a simple real world example to show that we can efficiently test the significance of regression parameters, look for interactions, estimate related confidence intervals, and calculate the difference between the mean values of the referent and experimental subgroups. This paper demonstrates that precise confidence interval estimates can be obtained using the proposed ANOVA-model like approach. The method discussed here can be extended to any type of experimental fraction data analysis, particularly for experimental design. 展开更多
关键词 LOGISTIC Regression LOGIT LOGISTIC Response CATEGORICAL BINARY count data
暂未订购
CountNet:一种用于堆叠胶合板计数的自监督学习框架
3
作者 苏凡 王若琪 王海涛 《软件导刊》 2025年第2期19-25,共7页
自动化胶合板计数是工业生产中的一大难题,传统基于人工计数和物理计数的方法耗时且低效。然而,堆叠胶合板图像又存在着边缘不均匀、厚度不规律等干扰因素,现有的深度学习算法提取到的特征代表性不强,导致计数结果不准确。针对上述问题... 自动化胶合板计数是工业生产中的一大难题,传统基于人工计数和物理计数的方法耗时且低效。然而,堆叠胶合板图像又存在着边缘不均匀、厚度不规律等干扰因素,现有的深度学习算法提取到的特征代表性不强,导致计数结果不准确。针对上述问题,提出一种用于堆叠胶合板计数的自监督学习框架——CountNet。CountNet针对计数问题优化了损失函数的使用方式,该损失函数利用对比学习的方法,可以进一步放大正负样本间的差异,使网络能提取到更具代表性的视觉特征。最后将该特征投入下游任务中,完成计数。实验结果表明,该方法在准确率、损失下降等方面均优于其他常见的计数模型,证明了其在计数能力上的优越性。 展开更多
关键词 自监督对比学习 计算机视觉 堆叠胶合板计数 数据增强技术 损失函数优化
在线阅读 下载PDF
Robust Estimation of Semiparametric Transformation Model for Panel Count Data 被引量:2
4
作者 FENG Yan WANG Yijun +1 位作者 WANG Weiwei CHEN Zhuo 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2021年第6期2334-2356,共23页
Panel count data are frequently encountered when study subjects are under discrete observations.However,limited literature has been found on variable selection for panel count data.In this paper,without considering th... Panel count data are frequently encountered when study subjects are under discrete observations.However,limited literature has been found on variable selection for panel count data.In this paper,without considering the model assumption of observation process,a more general semiparametric transformation model for panel count data with informative observation process is developed.A penalized estimation procedure based on the quantile regression function is proposed for variable selection and parameter estimation simultaneously.The consistency and oracle properties of the estimators are established under some mild conditions.Some simulations and an application are reported to evaluate the proposed approach. 展开更多
关键词 B-spline function panel count data quantile regression semiparametric transformation model variable selection
原文传递
Some Additional Moment Conditions for a Dynamic Count Panel Data Model with Predetermined Explanatory Variables
5
作者 Yoshitsugu Kitazawa 《Open Journal of Statistics》 2013年第5期319-333,共15页
This paper proposes some additional moment conditions for the linear feedback model with explanatory variables being predetermined, which is proposed by [1] for the purpose of dealing with count panel data. The newly ... This paper proposes some additional moment conditions for the linear feedback model with explanatory variables being predetermined, which is proposed by [1] for the purpose of dealing with count panel data. The newly proposed moment conditions include those associated with the equidispersion, the Negbin I-type model and the stationarity. The GMM estimators are constructed incorporating the additional moment conditions. Some Monte Carlo experiments indicate that the GMM estimators incorporating the additional moment conditions perform well, compared to that using only the conventional moment conditions proposed by [2,3]. 展开更多
关键词 count PANEL data Linear Feedback Model MOMENT Conditions GMM MONTE Carlo Experiments
在线阅读 下载PDF
Dynamically Computing Approximate Frequency Counts in Sliding Window over Data Stream 被引量:1
6
作者 NIE Guo-liang LU Zheng-ding 《Wuhan University Journal of Natural Sciences》 EI CAS 2006年第1期283-288,共6页
This paper presents two one-pass algorithms for dynamically computing frequency counts in sliding window over a data stream-computing frequency counts exceeding user-specified threshold ε. The first algorithm constru... This paper presents two one-pass algorithms for dynamically computing frequency counts in sliding window over a data stream-computing frequency counts exceeding user-specified threshold ε. The first algorithm constructs subwindows and deletes expired sub-windows periodically in sliding window, and each sub-window maintains a summary data structure. The first algorithm outputs at most 1/ε + 1 elements for frequency queries over the most recent N elements. The second algorithm adapts multiple levels method to deal with data stream. Once the sketch of the most recent N elements has been constructed, the second algorithm can provides the answers to the frequency queries over the most recent n ( n≤N) elements. The second algorithm outputs at most 1/ε + 2 elements. The analytical and experimental results show that our algorithms are accurate and effective. 展开更多
关键词 data stream sliding window approximation algorithms frequency counts
在线阅读 下载PDF
Accuracy Assessment and Guidelines for Manual Traffic Counts from Pre-Recorded Video Data
7
作者 Mishuk Majumder Chester Wilmot 《Journal of Transportation Technologies》 2023年第4期497-523,共27页
Traffic count is the fundamental data source for transportation planning, management, design, and effectiveness evaluation. Recording traffic flow and counting from the recorded videos are increasingly used due to con... Traffic count is the fundamental data source for transportation planning, management, design, and effectiveness evaluation. Recording traffic flow and counting from the recorded videos are increasingly used due to convenience, high accuracy, and cost-effectiveness. Manual counting from pre-recorded video footage can be prone to inconsistencies and errors, leading to inaccurate counts. Besides, there are no standard guidelines for collecting video data and conducting manual counts from the recorded videos. This paper aims to comprehensively assess the accuracy of manual counts from pre-recorded videos and introduces guidelines for efficiently collecting video data and conducting manual counts by trained individuals. The accuracy assessment of the manual counts was conducted based on repeated counts, and the guidelines were provided from the experience of conducting a traffic survey on forty strip mall access points in Baton Rouge, Louisiana, USA. The percentage of total error, classification error, and interval error were found to be 1.05 percent, 1.08 percent, and 1.29 percent, respectively. Besides, the percent root mean square errors (RMSE) were found to be 1.13 percent, 1.21 percent, and 1.48 percent, respectively. Guidelines were provided for selecting survey sites, instruments and timeframe, fieldwork, and manual counts for an efficient traffic data collection survey. 展开更多
关键词 Traffic Survey counting Error Transportation Planning Total Error Collecting Video data Classification Error Standard Guidelines Repeated counts Interval Error
在线阅读 下载PDF
经济发展、价值观念与育龄群体生育意愿——基于计数数据分位回归模型
8
作者 戴丽娜 《统计理论与实践》 2025年第6期71-80,共10页
生育水平与社会发展息息相关,生育意愿是保证生育水平的前提条件。基于计数数据分位回归模型,利用CGSS数据库2012年、2013年、2015年、2017年、2018年、2021年的调查数据,从经济发展与价值观念入手,深入剖析其对生育意愿的影响机制。研... 生育水平与社会发展息息相关,生育意愿是保证生育水平的前提条件。基于计数数据分位回归模型,利用CGSS数据库2012年、2013年、2015年、2017年、2018年、2021年的调查数据,从经济发展与价值观念入手,深入剖析其对生育意愿的影响机制。研究表明:地区经济发展水平对育龄群体具有负向的影响,并且在低分位点的影响更大;地区经济发展水平对育龄群体生育意愿有直接的影响,且通过影响育龄群体的价值观念影响其生育意愿;性别、年龄、受教育程度、住房、户口、养老方式选择、婚姻状况等对育龄群体的生育意愿都具有显著影响。根据研究结果,从经济、价值观念、教育与职业、社会保障、公共服务等方面提出提升育龄群体生育意愿的政策建议。 展开更多
关键词 生育意愿 经济发展 价值观念 计数数据分位回归模型
在线阅读 下载PDF
海量面板计数数据的分位数回归模型及在社交网络分析中的应用
9
作者 徐璇 傅梦逸 赵晓兵 《数理统计与管理》 北大核心 2025年第3期407-426,共20页
面板计数数据是离散观测的复发事件数据,它在保险精算行业、生物医学等领域有广泛的应用。在大数据环境下基于分位数回归的面板计数数据分析较为少见,本论文主要研究海量面板计数数据的分位数回归分析,并在两种并行计算框架下讨论该模... 面板计数数据是离散观测的复发事件数据,它在保险精算行业、生物医学等领域有广泛的应用。在大数据环境下基于分位数回归的面板计数数据分析较为少见,本论文主要研究海量面板计数数据的分位数回归分析,并在两种并行计算框架下讨论该模型的具体算法设计。最后利用数值模拟验证了模型的有效性,并在实证分析中讨论了在流行病预防和控制中的社交网络数据。 展开更多
关键词 面板计数数据 海量数据 并行计算 分位数回归 社交网络
原文传递
带有测量误差的零膨胀负二项模型的统计推断及其应用
10
作者 赵霞 徐泽轩 钟玉洁 《浙江大学学报(理学版)》 北大核心 2025年第6期706-718,共13页
在数据收集过程中,忽略测量误差往往会导致分析结果出现偏差。为此,针对带有测量误差的响应变量,构建了零膨胀负二项模型,讨论了该模型的识别性问题,得到零膨胀负二项模型的潜变量表达式,并基于改进的贝叶斯方法对模型进行了统计推断。... 在数据收集过程中,忽略测量误差往往会导致分析结果出现偏差。为此,针对带有测量误差的响应变量,构建了零膨胀负二项模型,讨论了该模型的识别性问题,得到零膨胀负二项模型的潜变量表达式,并基于改进的贝叶斯方法对模型进行了统计推断。通过多次重复模拟实验,与未考虑测量误差的常规方法及零膨胀泊松模型进行了对比,从不同零值占比和超参数条件及不同误差程度两个角度验证了零膨胀负二项模型的有效性。最后,基于人体乳腺癌基因组数据,实证分析了肿瘤分期和信号通路被激活概率之间的关系。结果表明,带有测量误差的零膨胀负二项模型表现较优。研究丰富了计数模型和测量误差理论,为处理带有测量误差的过度离散性数据提供了新的方法。 展开更多
关键词 测量误差 零膨胀负二项模型 计数数据 模型可识别性 贝叶斯框架
在线阅读 下载PDF
Bayesian Computation for the Parameters of a Zero-Inflated Cosine Geometric Distribution with Application to COVID-19 Pandemic Data
11
作者 Sunisa Junnumtuam Sa-Aat Niwitpong Suparat Niwitpong 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第5期1229-1254,共26页
A new three-parameter discrete distribution called the zero-inflated cosine geometric(ZICG)distribution is proposed for the first time herein.It can be used to analyze over-dispersed count data with excess zeros.The b... A new three-parameter discrete distribution called the zero-inflated cosine geometric(ZICG)distribution is proposed for the first time herein.It can be used to analyze over-dispersed count data with excess zeros.The basic statistical properties of the new distribution,such as the moment generating function,mean,and variance are presented.Furthermore,confidence intervals are constructed by using the Wald,Bayesian,and highest posterior density(HPD)methods to estimate the true confidence intervals for the parameters of the ZICG distribution.Their efficacies were investigated by using both simulation and real-world data comprising the number of daily COVID-19 positive cases at the Olympic Games in Tokyo 2020.The results show that the HPD interval performed better than the other methods in terms of coverage probability and average length in most cases studied. 展开更多
关键词 Bayesian analysis confidence interval gibbs sampling random-walk metropolis zero-inflated count data
暂未订购
Using Statistical Learning to Treat Missing Data: A Case of HIV/TB Co-Infection in Kenya
12
作者 Joshua O. Mwaro Linda Chaba Collins Odhiambo 《Journal of Data Analysis and Information Processing》 2020年第3期110-133,共24页
In this study, we investigate the effects of missing data when estimating HIV/TB co-infection. We revisit the concept of missing data and examine three available approaches for dealing with missingness. The main objec... In this study, we investigate the effects of missing data when estimating HIV/TB co-infection. We revisit the concept of missing data and examine three available approaches for dealing with missingness. The main objective is to identify the best method for correcting missing data in TB/HIV Co-infection setting. We employ both empirical data analysis and extensive simulation study to examine the effects of missing data, the accuracy, sensitivity, specificity and train and test error for different approaches. The novelty of this work hinges on the use of modern statistical learning algorithm when treating missingness. In the empirical analysis, both HIV data and TB-HIV co-infection data imputations were performed, and the missing values were imputed using different approaches. In the simulation study, sets of 0% (Complete case), 10%, 30%, 50% and 80% of the data were drawn randomly and replaced with missing values. Results show complete cases only had a co-infection rate (95% Confidence Interval band) of 29% (25%, 33%), weighted method 27% (23%, 31%), likelihood-based approach 26% (24%, 28%) and multiple imputation approach 21% (20%, 22%). In conclusion, MI remains the best approach for dealing with missing data and failure to apply it, results to overestimation of HIV/TB co-infection rate by 8%. 展开更多
关键词 Missing data HIV/TB Co-Infection IMPUTATION Missing at Random count data
暂未订购
Challenges Analyzing RNA-Seq Gene Expression Data
13
作者 Liliana López-Kleine Cristian González-Prieto 《Open Journal of Statistics》 2016年第4期628-636,共9页
The analysis of messenger Ribonucleic acid obtained through sequencing techniques (RNA-se- quencing) data is very challenging. Once technical difficulties have been sorted, an important choice has to be made during pr... The analysis of messenger Ribonucleic acid obtained through sequencing techniques (RNA-se- quencing) data is very challenging. Once technical difficulties have been sorted, an important choice has to be made during pre-processing: Two different paths can be chosen: Transform RNA- sequencing count data to a continuous variable or continue to work with count data. For each data type, analysis tools have been developed and seem appropriate at first sight, but a deeper analysis of data distribution and structure, are a discussion worth. In this review, open questions regarding RNA-sequencing data nature are discussed and highlighted, indicating important future research topics in statistics that should be addressed for a better analysis of already available and new appearing gene expression data. Moreover, a comparative analysis of RNAseq count and transformed data is presented. This comparison indicates that transforming RNA-seq count data seems appropriate, at least for differential expression detection. 展开更多
关键词 RNA-Seq Analysis count data PREPROCESSING Differential Expression Gene Co-Expression Network
暂未订购
基于区间Ⅰ型删失计数数据的泊松部分线性回归模型
14
作者 董小刚 马琴 刘新蕊 《长春工业大学学报》 2025年第1期72-79,共8页
区间删失数据经常出现在生物学、医学、社会学、工程学等学科中。目前在生存分析研究领域对区间删失数据的研究多集中在连续数据上,常见是建立参数与半参数模型。而在生物医学与流行病学的研究中,感兴趣对象常常为某事件发生的次数,即... 区间删失数据经常出现在生物学、医学、社会学、工程学等学科中。目前在生存分析研究领域对区间删失数据的研究多集中在连续数据上,常见是建立参数与半参数模型。而在生物医学与流行病学的研究中,感兴趣对象常常为某事件发生的次数,即计数数据。针对区间Ⅰ型删失计数数据,考虑部分协变量与计数响应变量间存在非线性关系,建立泊松部分线性回归模型,运用B样条与极大似然法估计参数。最后通过模拟分析验证所提方法的有效性,并将该方法应用于心血管病的实例问题分析中。 展开更多
关键词 区间Ⅰ型删失计数数据 泊松部分线性模型 极大似然估计 B样条
在线阅读 下载PDF
航空发动机故障数据自动统计方法研究
15
作者 张琴 周毅 黄浩 《新潮电子》 2025年第13期193-195,共3页
文章分析了从航空发动机故障报告中的非结构化故障数据中获取关键信息形成结构化故障数据的思路,并提出了具体实现方法。研究结论表明:借助零件的件号遍历Word文档定位相关的故障描述和故障图片信息,再使用io.BytesIO、openpyxl、matplo... 文章分析了从航空发动机故障报告中的非结构化故障数据中获取关键信息形成结构化故障数据的思路,并提出了具体实现方法。研究结论表明:借助零件的件号遍历Word文档定位相关的故障描述和故障图片信息,再使用io.BytesIO、openpyxl、matplotlib等函数依次将需要的信息形成结构化数据,并可视化展示航空发动机各个零件故障数据,能够实现航空发动机故障数据的自动统计。 展开更多
关键词 航空发动机 非结构化数据 自动统计 PYTHON 故障数据
在线阅读 下载PDF
电表座智能识别与计数系统的设计与实现
16
作者 吴高群 李喻 蒋秋亮 《信息与电脑》 2025年第6期165-167,共3页
文章阐述了电表座智能识别与计数系统,介绍了其涵盖感知层等的架构,分析了复杂环境图像识别等关键技术难题,讲述了核心算法与技术实现方法,以及功能性及稳定性测试情况。该系统在实际应用中提高了电力管理效率,增强了用电安全保障,助力... 文章阐述了电表座智能识别与计数系统,介绍了其涵盖感知层等的架构,分析了复杂环境图像识别等关键技术难题,讲述了核心算法与技术实现方法,以及功能性及稳定性测试情况。该系统在实际应用中提高了电力管理效率,增强了用电安全保障,助力能源数据分析与决策。 展开更多
关键词 电表座 智能识别 计数系统 数据处理 应用成效
在线阅读 下载PDF
Modelling fertility:an application of count regression models
17
作者 Ranjita Pandey Charanjit Kaur 《Chinese Journal of Population,Resources and Environment》 2015年第4期349-357,共9页
Often the lifecycle data occur as count of the vital events and are recorded as integers.The purpose of this article is to model the fertility behavior based on religious,educational,economic,and occupational characte... Often the lifecycle data occur as count of the vital events and are recorded as integers.The purpose of this article is to model the fertility behavior based on religious,educational,economic,and occupational characteristics.The responses of classified groups according to these determinants are examined for significant influence on fertility using Poisson regression model(PRM) based on the National Family Health Survey-3 dataset.The observed and predicted probabilities under PRM indicate modal value of two children for the Poisson distribution modeled data.Presence of dominance of two child in the data motivates the authors to adopt multinomial regression model(MRM) in order to link fertility with various socioeconomic indicators responsible for fertility variation.Choice of the explanatory factors is limited to the availability of data.Trends and patterns of preference for birth counts suggest that religion,caste,wealth,female education,and occupation are the dominant factors shaping the observed birth process.Empirical analysis suggests that both the models used in the study perform similarly on the sample data.However,fitting of MRM by taking birth count of two as comparison category shows improved Akaike information criterion and consistent Akaike information criterion values.Current work contributes to the existing literature as it attempts to provide more insight into the determinants of Indian fertility using Poisson and MRM. 展开更多
关键词 count data FERTILITY POISSON model MULTINOMIAL regression MODELS
在线阅读 下载PDF
Determinants of Antenatal Health Care Utilization in Egypt (2000-2014) Using Binary and Count Outcomes
18
作者 Hassan H. M. Zaky Dina M. Armanious Mohamed Ali Hussein 《Health》 2019年第1期25-39,共15页
Aim: This study seeks to investigate the factors determining the utilization of antenatal care services, the frequency of that use, and the timing of receiving antenatal care among Egyptian women utilizing a national ... Aim: This study seeks to investigate the factors determining the utilization of antenatal care services, the frequency of that use, and the timing of receiving antenatal care among Egyptian women utilizing a national representative data from Egypt Demographic and Health Surveys (EDHS) in 2000 and 2014. Methods: The paper estimates the logistic regression model, zero-inflated negative binomial model (ZINB), and negative binomial regression model (NB) to identify the most important determinants of antenatal health care utilization. Results: The findings indicate that the period 2000-2014 has experienced a significant increase in the use of antenatal health care services. The use of the public sector antenatal care services relative to that of the private sector has been decreasing over time. Moreover, wealth index, women’s education and quality of health services play significant roles in increasing accessibility of antenatal health care services. On the other hand, women’s empowerment has shown a positive effect in 2000 only. Conclusion: The study highlights the most vulnerable groups that are less likely to have access to antenatal health care services, mainly women who are less educated, poor and living in rural areas especially Upper Egypt. This certainly requires a more targeted health strategy with an equity lens. 展开更多
关键词 ANTENATAL Health Care Services BINARY and count data Negative BINOMIAL Regression Determinants EGYPT
暂未订购
基于广义线性混合效应模型的森林树木死亡研究 被引量:3
19
作者 闫明 陈艳梅 +1 位作者 闫静 奚为民 《生态学报》 CAS CSCD 北大核心 2024年第6期2420-2436,共17页
基于计数模型方法,同时考虑样地的随机效应,构建林分水平死亡模型,探究影响树木死亡的因素,以期为森林资源的监测与管理提供参考依据。以美国德州东部森林连续清查的样地数据为数据源,按4∶1的比例将其进行随机抽样,划分为训练集和验证... 基于计数模型方法,同时考虑样地的随机效应,构建林分水平死亡模型,探究影响树木死亡的因素,以期为森林资源的监测与管理提供参考依据。以美国德州东部森林连续清查的样地数据为数据源,按4∶1的比例将其进行随机抽样,划分为训练集和验证集数据,将立地因子、林分因子和气候因子作为模型的自变量,林木死亡株数则作为模型的因变量,运用计数模型和混合效应模型方法进行模型的构建,并分析影响林木死亡株数的因子。使用赤池信息准则(AIC)、贝叶斯信息准则(BIC)和-2倍对数似然函数值(-2logL)3种模型评价指标评估各模型间的拟合效果;采用平均绝对误差(MAE)和均方根误差(RMSE)2种评价指标评估其预测效果,以便筛选出最佳的林分水平死亡模型。结果表明:立地因子方面,林木死亡株数与海拔(P<0.01)呈显著的负效应,与坡度(P<0.05)呈显著的正效应,说明林木死亡株数随海拔的升高而减少,随坡度的增加而增多;林分因子方面,林木死亡株数与林分年龄(P<0.001)和树木基面积(P<0.001)呈显著的正效应,与林分平方平均胸径(P<0.001)和林分密度(P<0.05)呈显著的负效应,说明林木死亡株数随林分年龄的增加和树木基面积的增大而增加,随林分平方平均胸径和林分密度的增大而减少;气候因子方面,林木死亡株数与SPEI(P<0.05)、干旱长度(P<0.001)、年平均温度(P<0.001)和夏季平均降雨量(P<0.05)均呈显著的负效应,与夏季平均温度(P<0.001)呈显著的正效应,说明林木死亡株数随干旱强度和夏季平均温度的增加而增多,随干旱长度、年平均温度和夏季平均降雨量的增加而减少。在基础计数模型中,零膨胀负二项(ZINB)模型的拟合效果最好。而加入样地随机效应后,混合效应模型的拟合精度明显有所提高。基于所有模型模拟结果的比较,得出德州东部森林的林分水平死亡模型以ZINB-mixed模型为最优模型。 展开更多
关键词 树木死亡 计数模型 混合效应模型 影响因子
在线阅读 下载PDF
Comparative Assessment of Zero-Inflated Models with Application to HIV Exposed Infants Data
20
作者 Faith Nekesa Collins Odhiambo Linda Chaba 《Open Journal of Statistics》 2019年第6期664-685,共22页
In a typical Kenyan HIV clinical setting, there is a likelihood of registering many zeros during the routine monthly data collection of new HIV infections among HIV exposed infants (HEI). This is attributed to the imp... In a typical Kenyan HIV clinical setting, there is a likelihood of registering many zeros during the routine monthly data collection of new HIV infections among HIV exposed infants (HEI). This is attributed to the implementation of the prevention of mother to child transmission (PMTCT) policies. However, even though the PMTCT policy is implemented uniformly across all public health facilities, implementation naturally differs from every facility due to differential health systems and infrastructure. This leads to structured zero among reported positive HEI (where PMTCT implementation is optimum) and non-structured zero among reported positive HEI (where PMTCT implementation is not optimum). Hence the classical zero-inflated and hurdle models that do not account for the abundance of structured and non-structured zeros in the data can give misleading results. The purpose of this study is to systematically compare performance of the various zero-inflated models with an application to HIV Exposed Infants (HEI) in the context of structured and unstructured zeros. We revisit zero-inflated, hurdle models, Poisson and negative binomial count models and conduct the simulations by varying sample size and levels of abundance zeros. Results from simulation study and real data analysis of exposed infant diagnosis show the negative binomial emerging as the best performing model when fitting data with both structured and non-structured zeros under various settings. 展开更多
关键词 ZERO-INFLATED Models HIV EXPOSED INFANTS Structured Zeroes Mother-to-Child Transmission count data
暂未订购
上一页 1 2 15 下一页 到第
使用帮助 返回顶部