10 articles found
Bayesian Computation for the Parameters of a Zero-Inflated Cosine Geometric Distribution with Application to COVID-19 Pandemic Data
1
Authors: Sunisa Junnumtuam, Sa-Aat Niwitpong, Suparat Niwitpong. Computer Modeling in Engineering & Sciences (SCIE, EI), 2023, Issue 5, pp. 1229-1254 (26 pages)
A new three-parameter discrete distribution called the zero-inflated cosine geometric (ZICG) distribution is proposed for the first time herein. It can be used to analyze over-dispersed count data with excess zeros. The basic statistical properties of the new distribution, such as the moment generating function, mean, and variance, are presented. Furthermore, confidence intervals are constructed by using the Wald, Bayesian, and highest posterior density (HPD) methods to estimate the true confidence intervals for the parameters of the ZICG distribution. Their efficacies were investigated by using both simulation and real-world data comprising the number of daily COVID-19 positive cases at the Tokyo 2020 Olympic Games. The results show that the HPD interval performed better than the other methods in terms of coverage probability and average length in most cases studied.
Keywords: Bayesian analysis; confidence interval; Gibbs sampling; random-walk Metropolis; zero-inflated count data
Using Statistical Learning to Treat Missing Data: A Case of HIV/TB Co-Infection in Kenya
2
Authors: Joshua O. Mwaro, Linda Chaba, Collins Odhiambo. Journal of Data Analysis and Information Processing, 2020, Issue 3, pp. 110-133 (24 pages)
In this study, we investigate the effects of missing data when estimating HIV/TB co-infection. We revisit the concept of missing data and examine three available approaches for dealing with missingness. The main objective is to identify the best method for correcting missing data in a TB/HIV co-infection setting. We employ both empirical data analysis and an extensive simulation study to examine the effects of missing data and the accuracy, sensitivity, specificity, and train and test error of the different approaches. The novelty of this work hinges on the use of modern statistical learning algorithms when treating missingness. In the empirical analysis, both HIV data and TB-HIV co-infection data imputations were performed, and the missing values were imputed using different approaches. In the simulation study, sets of 0% (complete case), 10%, 30%, 50% and 80% of the data were drawn randomly and replaced with missing values. Results show that complete cases only had a co-infection rate (95% confidence interval band) of 29% (25%, 33%), the weighted method 27% (23%, 31%), the likelihood-based approach 26% (24%, 28%) and the multiple imputation (MI) approach 21% (20%, 22%). In conclusion, MI remains the best approach for dealing with missing data, and failure to apply it results in overestimation of the HIV/TB co-infection rate by 8%.
Keywords: Missing data; HIV/TB Co-Infection; Imputation; Missing at Random; Count data
Challenges Analyzing RNA-Seq Gene Expression Data
3
Authors: Liliana López-Kleine, Cristian González-Prieto. Open Journal of Statistics, 2016, Issue 4, pp. 628-636 (9 pages)
The analysis of messenger ribonucleic acid data obtained through sequencing techniques (RNA-sequencing) is very challenging. Once technical difficulties have been sorted, an important choice has to be made during pre-processing. Two different paths can be chosen: transform RNA-sequencing count data to a continuous variable, or continue to work with count data. For each data type, analysis tools have been developed and seem appropriate at first sight, but a deeper analysis of data distribution and structure is worth discussing. In this review, open questions regarding the nature of RNA-sequencing data are discussed and highlighted, indicating important future research topics in statistics that should be addressed for a better analysis of already available and newly appearing gene expression data. Moreover, a comparative analysis of RNA-seq count and transformed data is presented. This comparison indicates that transforming RNA-seq count data seems appropriate, at least for differential expression detection.
Keywords: RNA-Seq Analysis; Count data; Preprocessing; Differential Expression; Gene Co-Expression Network
Factors Associated with Physical-Activity Performance by Older Individuals in a Medium-Sized City in São Paulo State, Brazil
4
Authors: José Eduardo Corrente, Giovana Fumes, Tania Ruiz. Journal of Life Sciences, 2013, Issue 2, pp. 210-218 (9 pages)
Physical activity has been scientifically discussed as fundamental in the process of healthy ageing. Hence, this study aimed at determining the factors that influence older people to perform physical activities. The complete IPAQ (International Physical Activity Questionnaire) was applied to a population-based sample consisting of 364 elderly persons in the city of Botucatu, São Paulo, Brazil. Days of physical activity performed by the older people were considered by taking into account household and leisure activities. Models for count data were fitted by including socio-demographic variables as well as those related to life satisfaction. It was shown that housework physical-activity performance is associated with being female, and women were predominantly more active at all levels. Men seemed to be more predisposed to perform lighter recreation, sports and leisure-time physical activities, such as walking. Additionally, poor schooling proved to be decisive for not performing physical activities both at home and during leisure.
Keywords: Older people; physical activity; models for count data
Empirical Bayesian Approach to Testing Homogeneity of Several Means of Inflated Poisson Distributions (IPD)
5
Authors: Mohamed M. Shoukri, Maha Aleid. Open Journal of Statistics, 2023, Issue 3, pp. 285-299 (15 pages)
Objectives: We introduce a special form of the Generalized Poisson Distribution. The distribution has one parameter, yet it has a variance that is larger than the mean, a phenomenon known as "overdispersion". We discuss potential applications of the distribution as a model of counts, and under the assumption of independence we perform statistical inference on the ratio of two means, with generalization to testing the homogeneity of several means. Methods: Bayesian methods depend on the choice of the prior distributions of the population parameters. In this paper, we describe a Bayesian approach for estimation and inference on the parameters of several independent Inflated Poisson (IPD) distributions with two possible priors: the first is the reciprocal of the square root of the Poisson parameter, and the other is a conjugate Gamma prior. The parameters of the Gamma distribution are estimated in the empirical Bayesian framework using the maximum likelihood (ML) solution obtained with the nonlinear mixed model procedure (NLMIXED) in SAS. With these priors we construct the highest posterior confidence intervals on the ratio of two IPD parameters and test the homogeneity of several populations. Results: We encountered a convergence problem in estimating the hyperparameters of the posterior distribution using NLMIXED. However, direct maximization of the predictive density produced solutions to the maximum likelihood equations. We apply the methodologies to RNA-SEQ read count data of gene expression values.
Keywords: Distributions of Over-Dispersed Counts; Lagrange Class of Distributions; Knowledge Transfer; Gamma Prior; Posterior Inference; Wilson-Hilferty Transformation; RNA_SEQ Read Counts data
A First Order Stationary Branching Negative Binomial Autoregressive Model with Application
6
Authors: Bakary Traore, Bonface Miya Malenje, Herbert Imboga. Open Journal of Statistics, 2022, Issue 6, pp. 810-826 (17 pages)
In the area of time series modelling, several real-life applications involve the analysis of count time series data. The distribution characteristics and dependence structure are the major issues that arise while specifying a modelling strategy to handle the analysis of those kinds of data. Owing to the numerous applications, there is a need to develop models that can capture these features. However, accounting for both aspects simultaneously presents complexities while specifying a modelling strategy. In this paper, an alternative statistical model able to deal with issues of discreteness, overdispersion, and serial correlation over time is proposed. In particular, we adopt a branching mechanism to develop a first-order stationary negative binomial autoregressive model. Inference is based on maximum likelihood estimation, and a simulation study is conducted to evaluate the performance of the proposed approach. As an illustration, the model is applied to a real-life dataset in crime analysis.
Keywords: Branching Process; Negative Binomial; Time Series of Count data; Serial Dependence; Overdispersion
On Bivariate Self-Exciting Hysteretic Integer-Valued Autoregressive Processes
7
Authors: YANG Kai, CHEN Xiaoman, LI Han, XIA Chao, WANG Xinyang. Journal of Systems Science & Complexity, 2025, Issue 5, pp. 2204-2225 (22 pages)
This paper introduces a bivariate hysteretic integer-valued autoregressive (INAR) process driven by a bivariate Poisson innovation. It deals well with the buffered or hysteretic characteristics of the data. Model properties such as stationarity and ergodicity are studied in detail. The parameter estimation problem is also addressed via two-step conditional least squares (CLS) and conditional maximum likelihood (CML) methods. The boundary parameters are estimated via a triangular grid searching algorithm. The estimation effect is verified through simulations based on three scenarios. Finally, the new model is applied to the offence counts in New South Wales (NSW), Australia.
Keywords: Bivariate integer-valued time series; buffered autoregressive process; count data; hysteretic autoregressive process; TGSM algorithm
Robust Estimation of Semiparametric Transformation Model for Panel Count Data (cited by: 2)
8
Authors: FENG Yan, WANG Yijun, WANG Weiwei, CHEN Zhuo. Journal of Systems Science & Complexity (SCIE, EI, CSCD), 2021, Issue 6, pp. 2334-2356 (23 pages)
Panel count data are frequently encountered when study subjects are under discrete observations. However, limited literature has been found on variable selection for panel count data. In this paper, without considering the model assumption of the observation process, a more general semiparametric transformation model for panel count data with an informative observation process is developed. A penalized estimation procedure based on the quantile regression function is proposed for simultaneous variable selection and parameter estimation. The consistency and oracle properties of the estimators are established under some mild conditions. Some simulations and an application are reported to evaluate the proposed approach.
Keywords: B-spline function; panel count data; quantile regression; semiparametric transformation model; variable selection
On random coefficient INAR(1) processes (cited by: 1)
9
Authors: ROITERSHTEIN Alexander, ZHONG Zheng. Science China Mathematics (SCIE), 2013, Issue 1, pp. 177-200 (24 pages)
The random coefficient integer-valued autoregressive process was introduced by Zheng, Basawa, and Datta in 2007. In this paper we study the asymptotic behavior of this model (in particular, weak limits of extreme values and the growth rate of partial sums) in the case where the additive term in the underlying random linear recursion belongs to the domain of attraction of a stable law.
Keywords: models for count data; thinning models; branching processes; random environment; limit theorems
On testing for infections during epidemics, with application to Covid-19 in Ontario, Canada (cited by: 1)
10
Authors: Jerald F. Lawless, Ping Yan. Infectious Disease Modelling, 2021, Issue 1, pp. 930-941 (12 pages)
During an epidemic, accurate estimation of the numbers of viral infections in different regions and groups is important for understanding transmission and guiding public health actions. This depends on effective testing strategies that identify a high proportion of infections (that is, provide high ascertainment rates). For the novel coronavirus SARS-CoV-2, ascertainment rates do not appear to be high in most jurisdictions, but quantitative analysis of testing has been limited. We provide statistical models for studying testing and ascertainment rates, and illustrate them on public data on testing and case counts in Ontario, Canada.
Keywords: Count data; COVID-19; Modelling; Testing strategies; Ascertainment rate