In clinical research,subgroup analysis can help identify patient groups that respond better or worse to specific treatments,improve therapeutic effect and safety,and is of great significance in precision medicine.This...In clinical research,subgroup analysis can help identify patient groups that respond better or worse to specific treatments,improve therapeutic effect and safety,and is of great significance in precision medicine.This article considers subgroup analysis methods for longitudinal data containing multiple covariates and biomarkers.We divide subgroups based on whether a linear combination of these biomarkers exceeds a predetermined threshold,and assess the heterogeneity of treatment effects across subgroups using the interaction between subgroups and exposure variables.Quantile regression is used to better characterize the global distribution of the response variable and sparsity penalties are imposed to achieve variable selection of covariates and biomarkers.The effectiveness of our proposed methodology for both variable selection and parameter estimation is verified through random simulations.Finally,we demonstrate the application of this method by analyzing data from the PA.3 trial,further illustrating the practicality of the method proposed in this paper.展开更多
Copula functions have been widely used in stochastic simulation and prediction of streamflow.However,existing models are usually limited to single two-dimensional or three-dimensional copulas with the same bivariate b...Copula functions have been widely used in stochastic simulation and prediction of streamflow.However,existing models are usually limited to single two-dimensional or three-dimensional copulas with the same bivariate block for all months.To address this limitation,this study developed a mixed D-vine copula-based conditional quantile model that can capture temporal correlations.This model can generate streamflow by selecting different historical streamflow variables as the conditions for different months and by exploiting the conditional quantile functions of streamflows in different months with mixed D-vine copulas.The up-to-down sequential method,which couples the maximum weight approach with the Akaike information criteria and the maximum likelihood approach,was used to determine the structures of multivariate Dvine copulas.The developed model was used in a case study to synthesize the monthly streamflow at the Tangnaihai hydrological station,the inflow control station of the Longyangxia Reservoir in the Yellow River Basin.The results showed that the developed model outperformed the commonly used bivariate copula model in terms of the performance in simulating the seasonality and interannual variability of streamflow.This model provides useful information for water-related natural hazard risk assessment and integrated water resources management and utilization.展开更多
In this paper, we propose the double-penalized quantile regression estimators in partially linear models. An iterative algorithm is proposed for solving the proposed optimization problem. Some numerical examples illus...In this paper, we propose the double-penalized quantile regression estimators in partially linear models. An iterative algorithm is proposed for solving the proposed optimization problem. Some numerical examples illustrate that the finite sample performances of proposed method perform better than the least squares based method with regard to the non-causal selection rate (NSR) and the median of model error (MME) when the error distribution is heavy-tail. Finally, we apply the proposed methodology to analyze the ragweed pollen level dataset.展开更多
The composite quantile regression should provide estimation efficiency gain over a single quantile regression. In this paper, we extend composite quantile regression to nonparametric model with random censored data. T...The composite quantile regression should provide estimation efficiency gain over a single quantile regression. In this paper, we extend composite quantile regression to nonparametric model with random censored data. The asymptotic normality of the proposed estimator is established. The proposed methods are applied to the lung cancer data. Extensive simulations are reported, showing that the proposed method works well in practical settings.展开更多
In this paper,we consider testing the hypothesis concerning the means of two independent semicontinuous distributions whose observations are zero-inflated,characterized by a sizable number of zeros and positive observ...In this paper,we consider testing the hypothesis concerning the means of two independent semicontinuous distributions whose observations are zero-inflated,characterized by a sizable number of zeros and positive observations from a continuous distribution.The continuous parts of the two semicontinuous distributions are assumed to follow a density ratio model.A new two-part test is developed for this kind of data.The proposed test takes the sum of one test for equality of proportions of zero values and one conditional test for the continuous distribution.The test is proved to follow a2 distribution with two degrees of freedom.Simulation studies show that the proposed test controls the type I error rates at the desired level,and is competitive to,and most of the time more powerful than two popular tests.A real data example from a dietary intervention study is used to illustrate the usefulness of the proposed test.展开更多
In this paper, three smoothed empirical log-likelihood ratio functions for the parameters of nonlinear models with missing response are suggested. Under some regular conditions, the corresponding Wilks phenomena are o...In this paper, three smoothed empirical log-likelihood ratio functions for the parameters of nonlinear models with missing response are suggested. Under some regular conditions, the corresponding Wilks phenomena are obtained and the confidence regions for the parameter can be constructed easily.展开更多
Wastewater treatment is one of critical issues faced by water utilities, and receives more and more attentions recently. The energy consumption modeling in biochemical wastewater treatment was investigated in the stud...Wastewater treatment is one of critical issues faced by water utilities, and receives more and more attentions recently. The energy consumption modeling in biochemical wastewater treatment was investigated in the study via a general and robust approach based on Bayesian semi-parametric quantile regression. The dataset was derived from a municipal wastewater treatment plant, where the energy consumption of unit chemical oxygen demand(COD) reduction was the response variable of interest. Via the proposed approach,the comprehensive regression pictures of the energy consumption and truly influencing factors, i.e., the regression relationships at lower, median and higher energy consumption levels were characterized respectively. Meanwhile, the proposals for energy saving in different cases were also facilitated specifically. First, the lower level of energy consumption was closely associated with the temperature of influent wastewater, and the chroma-rich wastewater also showed helpful in the execution of energy saving. Second, at median energy consumption level, the COD-rich wastewater played a determinative role in the reduction of energy consumption, while the higher quality of treated water led to slightly energy intensive. Third, the higher level of energy consumption was most likely to be attributed to the relatively high temperature of wastewater and total nitrogen(TN)-rich wastewater,and both of the factors were preferably to be avoided to alleviate the burden of energy consumption. The study provided an efficient approach to controlling the energy consumption of wastewater treatment in the perspective of statistical regression modeling, and offered valuable suggestions for the future energy saving.展开更多
This paper studies how the price movements of pork,chicken and egg respond to those of related cost factors in short terms in Chinese market.We employ a linear quantile approach not only to explore potential data hete...This paper studies how the price movements of pork,chicken and egg respond to those of related cost factors in short terms in Chinese market.We employ a linear quantile approach not only to explore potential data heteroscedasticity but also to generate confidence bands for the purpose of price stability study.We then evaluate our models by comparing the prediction intervals generated from the quantile regression models with in-sample and out-of-sample forecasts.Using monthly data from January 2000 to October 2010,we observed these findings:(i) the price changes of cost factors asymmetrically and unequally influence those of the livestock across different quantiles;(ii) the performance of our models is robust and consistent for both in-sample and out-of-sample forecasts;(iii) the confidence intervals generated from 0.05th and 0.95th quantile regression models are good methods to forecast livestock price fluctuation.展开更多
We introduce a new wavelet based procedure for detecting outliers in financial discrete time series.The procedure focuses on the analysis of residuals obtained from a model fit,and applied to the Generalized Autoregre...We introduce a new wavelet based procedure for detecting outliers in financial discrete time series.The procedure focuses on the analysis of residuals obtained from a model fit,and applied to the Generalized Autoregressive Conditional Heteroskedasticity(GARCH)like model,but not limited to these models.We apply the Maximal-Overlap Discrete Wavelet Transform(MODWT)to the residuals and compare their wavelet coefficients against quantile thresholds to detect outliers.Our methodology has several advantages over existing methods that make use of the standard Discrete Wavelet Transform(DWT).The series sample size does not need to be a power of 2 and the transform can explore any wavelet filter and be run up to the desired level.Simulated wavelet quantiles from a Normal and Student t-distribution are used as threshold for the maximum of the absolute value of wavelet coefficients.The performance of the procedure is illustrated and applied to two real series:the closed price of the Saudi Stock market and the S&P 500 index respectively.The efficiency of the proposed method is demonstrated and can be considered as a distinct important addition to the existing methods.展开更多
Scour has been widely accepted as a key reason for bridge failures.Bridges are susceptible and sensitive to the scour phenomenon,which describes the loss of riverbed sediments around the bridge supports because of flo...Scour has been widely accepted as a key reason for bridge failures.Bridges are susceptible and sensitive to the scour phenomenon,which describes the loss of riverbed sediments around the bridge supports because of flow.The carrying capacity of a deep-water foundation is influenced by the formation of a scour hole,which means that a severe scour can lead to a bridge failure without warning.Most of the current scour predictions are based on deterministic models,while other loads at bridges are usually provided as probabilistic values.To integrate scour factors with other loads in bridge design and research,a quantile regression model was utilized to estimate scour depth.Field data and experimental data from previous studies were collected to build the model.Moreover,scour estimations using the HEC-18 equation and the proposed method were compared.By using the“CCC(Calculate,Confirm,and Check)”procedure,the probabilistic concept could be used to calculate various scour depths with the targeted likelihood according to a specified chance of bridge failure.The study shows that with a sufficiently large and continuously updated database,the proposed model could present reasonable results and provide guidance for scour mitigation.展开更多
This paper considers quantile regression analysis based on semi-competing risks data in which a non-terminal event may be dependently censored by a terminal event. The major interest is the covariate effects on the qu...This paper considers quantile regression analysis based on semi-competing risks data in which a non-terminal event may be dependently censored by a terminal event. The major interest is the covariate effects on the quantile of the non-terminal event time. Dependent censoring is handled by assuming that the joint distribution of the two event times follows a parametric copula model with unspecified marginal distributions. The technique of inverse probability weighting (IPW) is adopted to adjust for the selection bias. Large-sample properties of the proposed estimator are derived and a model diagnostic procedure is developed to check the adequacy of the model assumption. Simulation results show that the proposed estimator performs well. For illustrative purposes, our method is applied to analyze the bone marrow transplant data in [1].展开更多
Because the U.S.is a major player in the international oil market,it is interesting to study whether aggregate and state-level economic conditions can predict the subse-quent realized volatility of oil price returns.T...Because the U.S.is a major player in the international oil market,it is interesting to study whether aggregate and state-level economic conditions can predict the subse-quent realized volatility of oil price returns.To address this research question,we frame our analysis in terms of variants of the popular heterogeneous autoregressive realized volatility(HAR-RV)model.To estimate the models,we use quantile-regression and quantile machine learning(Lasso)estimators.Our estimation results highlights the dif-ferential effects of economic conditions on the quantiles of the conditional distribution of realized volatility.Using weekly data for the period April 1987 to December 2021,we document evidence of predictability at a biweekly and monthly horizon.展开更多
Regional climate change impact assessments are becoming increasingly important for developing adaptation strategies in an uncertain future with respect to hydro-climatic extremes. There are a number of Global Climate ...Regional climate change impact assessments are becoming increasingly important for developing adaptation strategies in an uncertain future with respect to hydro-climatic extremes. There are a number of Global Climate Models (GCMs) and emission scenarios providing predictions of future changes in climate. As a result, there is a level of uncertainty associated with the decision of which climate models to use for the assessment of climate change impacts. The IPCC has recommended using as many global climate model scenarios as possible;however, this approach may be impractical for regional assessments that are computationally demanding. Methods have been developed to select climate model scenarios, generally consisting of selecting a model with the highest skill (validation), creating an ensemble, or selecting one or more extremes. Validation methods limit analyses to models with higher skill in simulating historical climate, ensemble methods typically take multi model means, median, or percentiles, and extremes methods tend to use scenarios which bound the projected changes in precipitation and temperature. In this paper a quantile regression based validation method is developed and applied to generate a reduced set of GCM-scenarios to analyze daily maximum streamflow uncertainty in the Upper Thames River Basin, Canada, while extremes and percentile ensemble approaches are also used for comparison. Results indicate that the validation method was able to effectively rank and reduce the set of scenarios, while the extremes and percentile ensemble methods were found not to necessarily correlate well with the range of extreme flows for all calendar months and return periods.展开更多
The sparse phase retrieval aims to recover the sparse signal from quadratic measurements. However, the measurements are often affected by outliers and asymmetric distribution noise. This paper introduces a novel metho...The sparse phase retrieval aims to recover the sparse signal from quadratic measurements. However, the measurements are often affected by outliers and asymmetric distribution noise. This paper introduces a novel method that combines the quantile regression and the L<sub>1/2</sub>-regularizer. It is a non-convex, non-smooth, non-Lipschitz optimization problem. We propose an efficient algorithm based on the Alternating Direction Methods of Multiplier (ADMM) to solve the corresponding optimization problem. Numerous numerical experiments show that this method can recover sparse signals with fewer measurements and is robust to dense bounded noise and Laplace noise.展开更多
The rise of non-marital fertility, which seems to defy the Bongaarts model by decoupling marriage from fertility, has become a subject of interest in both the developed and developing world. Consequences of non-marita...The rise of non-marital fertility, which seems to defy the Bongaarts model by decoupling marriage from fertility, has become a subject of interest in both the developed and developing world. Consequences of non-marital fertility are mostly negative particularly in developing countries. In Namibia, although premarital childbearing has been reported to be high and increasing, no studies have explicitly analyzed factors influencing non-marital fertility. This paper uses data from the 2006/7 Namibia DHS to establish the determinants of non-marital fertility among women by applying a two-part model, with one part to describe the presence of non-marital birth and the other part to explain its intensity (number of children born). Using the number of children ever born as an outcome, we explored various count data models. Based on the Voung statistics model comparison, we settled for the Hurdle logit Negative Binomial regression to model the number of non-marital births. Non-marital fertility in Namibia is associated with the age, with young women likely to have lower fertility compared to older women. Women with secondary or higher education had lower fertility compared those with no formal education. Findings also show that rural women higher fertility propensity compared to their urban counterparts even though there was no significant difference in fertility intensity. With regard to socio-economic status, fertility intensity decreased as the women got richer. Intervention efforts should focus on promoting education among girls and women especially in rural areas to improve their socio-economic status, reduce teenage pregnancy and non-marital fertility.展开更多
In analyzing semi-continuous data,two-part model is a widely appreciated tool,in which two components are enclosed to characterize the mixing proportion of zeros and the actual level of positive values in semi-continu...In analyzing semi-continuous data,two-part model is a widely appreciated tool,in which two components are enclosed to characterize the mixing proportion of zeros and the actual level of positive values in semi-continuous data.The primary interest underlying such a model is primarily to exploit the dependence of the observed covariates on the semi-continuous variables;as such,the exploitation of unobserved heterogeneity is sometimes ignored.In this paper,we extend the conventional two-part regression model to much more general situations where multiple latent factors are considered to interpret the latent heterogeneity arising from the absence of covariates.A structural equation is constructed to describe the interrelationships between the latent factors.Moreover,a general statistical analysis procedure is developed to accommodate semi-continuous,ordered and unordered data simultaneously.A procedure for parameter estimation and model assessment is developed under a Bayesian framework.Empirical results including a simulation study and a real example are presented to illustrate the proposed methodology.展开更多
基金Supported by the Natural Science Foundation of Fujian Province(2022J011177,2024J01903)the Key Project of Fujian Provincial Education Department(JZ230054)。
文摘In clinical research,subgroup analysis can help identify patient groups that respond better or worse to specific treatments,improve therapeutic effect and safety,and is of great significance in precision medicine.This article considers subgroup analysis methods for longitudinal data containing multiple covariates and biomarkers.We divide subgroups based on whether a linear combination of these biomarkers exceeds a predetermined threshold,and assess the heterogeneity of treatment effects across subgroups using the interaction between subgroups and exposure variables.Quantile regression is used to better characterize the global distribution of the response variable and sparsity penalties are imposed to achieve variable selection of covariates and biomarkers.The effectiveness of our proposed methodology for both variable selection and parameter estimation is verified through random simulations.Finally,we demonstrate the application of this method by analyzing data from the PA.3 trial,further illustrating the practicality of the method proposed in this paper.
基金supported by the National Natural Science Foundation of China(Grant No.52109010)the Postdoctoral Science Foundation of China(Grant No.2021M701047)the China National Postdoctoral Program for Innovative Talents(Grant No.BX20200113).
文摘Copula functions have been widely used in stochastic simulation and prediction of streamflow.However,existing models are usually limited to single two-dimensional or three-dimensional copulas with the same bivariate block for all months.To address this limitation,this study developed a mixed D-vine copula-based conditional quantile model that can capture temporal correlations.This model can generate streamflow by selecting different historical streamflow variables as the conditions for different months and by exploiting the conditional quantile functions of streamflows in different months with mixed D-vine copulas.The up-to-down sequential method,which couples the maximum weight approach with the Akaike information criteria and the maximum likelihood approach,was used to determine the structures of multivariate Dvine copulas.The developed model was used in a case study to synthesize the monthly streamflow at the Tangnaihai hydrological station,the inflow control station of the Longyangxia Reservoir in the Yellow River Basin.The results showed that the developed model outperformed the commonly used bivariate copula model in terms of the performance in simulating the seasonality and interannual variability of streamflow.This model provides useful information for water-related natural hazard risk assessment and integrated water resources management and utilization.
文摘In this paper, we propose the double-penalized quantile regression estimators in partially linear models. An iterative algorithm is proposed for solving the proposed optimization problem. Some numerical examples illustrate that the finite sample performances of proposed method perform better than the least squares based method with regard to the non-causal selection rate (NSR) and the median of model error (MME) when the error distribution is heavy-tail. Finally, we apply the proposed methodology to analyze the ragweed pollen level dataset.
文摘The composite quantile regression should provide estimation efficiency gain over a single quantile regression. In this paper, we extend composite quantile regression to nonparametric model with random censored data. The asymptotic normality of the proposed estimator is established. The proposed methods are applied to the lung cancer data. Extensive simulations are reported, showing that the proposed method works well in practical settings.
基金Supported by the National Natural Science Foundation of China(No.11971433)the First Class Discipline of Zhejiang-A(Zhejiang Gongshang University-Statistics)the Intramural Research Program of the Eunice Kennedy Shriver National Institute of Child Health and Human Development.
文摘In this paper,we consider testing the hypothesis concerning the means of two independent semicontinuous distributions whose observations are zero-inflated,characterized by a sizable number of zeros and positive observations from a continuous distribution.The continuous parts of the two semicontinuous distributions are assumed to follow a density ratio model.A new two-part test is developed for this kind of data.The proposed test takes the sum of one test for equality of proportions of zero values and one conditional test for the continuous distribution.The test is proved to follow a2 distribution with two degrees of freedom.Simulation studies show that the proposed test controls the type I error rates at the desired level,and is competitive to,and most of the time more powerful than two popular tests.A real data example from a dietary intervention study is used to illustrate the usefulness of the proposed test.
文摘In this paper, three smoothed empirical log-likelihood ratio functions for the parameters of nonlinear models with missing response are suggested. Under some regular conditions, the corresponding Wilks phenomena are obtained and the confidence regions for the parameter can be constructed easily.
基金supported by the National Natural Science Foundation of China (Nos.51478025,11701023,71420107025)
文摘Wastewater treatment is one of critical issues faced by water utilities, and receives more and more attentions recently. The energy consumption modeling in biochemical wastewater treatment was investigated in the study via a general and robust approach based on Bayesian semi-parametric quantile regression. The dataset was derived from a municipal wastewater treatment plant, where the energy consumption of unit chemical oxygen demand(COD) reduction was the response variable of interest. Via the proposed approach,the comprehensive regression pictures of the energy consumption and truly influencing factors, i.e., the regression relationships at lower, median and higher energy consumption levels were characterized respectively. Meanwhile, the proposals for energy saving in different cases were also facilitated specifically. First, the lower level of energy consumption was closely associated with the temperature of influent wastewater, and the chroma-rich wastewater also showed helpful in the execution of energy saving. Second, at median energy consumption level, the COD-rich wastewater played a determinative role in the reduction of energy consumption, while the higher quality of treated water led to slightly energy intensive. Third, the higher level of energy consumption was most likely to be attributed to the relatively high temperature of wastewater and total nitrogen(TN)-rich wastewater,and both of the factors were preferably to be avoided to alleviate the burden of energy consumption. The study provided an efficient approach to controlling the energy consumption of wastewater treatment in the perspective of statistical regression modeling, and offered valuable suggestions for the future energy saving.
基金supported by the Key Project of National Key Technology R&D Program of China(2009BADA9B01)
文摘This paper studies how the price movements of pork,chicken and egg respond to those of related cost factors in short terms in Chinese market.We employ a linear quantile approach not only to explore potential data heteroscedasticity but also to generate confidence bands for the purpose of price stability study.We then evaluate our models by comparing the prediction intervals generated from the quantile regression models with in-sample and out-of-sample forecasts.Using monthly data from January 2000 to October 2010,we observed these findings:(i) the price changes of cost factors asymmetrically and unequally influence those of the livestock across different quantiles;(ii) the performance of our models is robust and consistent for both in-sample and out-of-sample forecasts;(iii) the confidence intervals generated from 0.05th and 0.95th quantile regression models are good methods to forecast livestock price fluctuation.
文摘We introduce a new wavelet based procedure for detecting outliers in financial discrete time series.The procedure focuses on the analysis of residuals obtained from a model fit,and applied to the Generalized Autoregressive Conditional Heteroskedasticity(GARCH)like model,but not limited to these models.We apply the Maximal-Overlap Discrete Wavelet Transform(MODWT)to the residuals and compare their wavelet coefficients against quantile thresholds to detect outliers.Our methodology has several advantages over existing methods that make use of the standard Discrete Wavelet Transform(DWT).The series sample size does not need to be a power of 2 and the transform can explore any wavelet filter and be run up to the desired level.Simulated wavelet quantiles from a Normal and Student t-distribution are used as threshold for the maximum of the absolute value of wavelet coefficients.The performance of the procedure is illustrated and applied to two real series:the closed price of the Saudi Stock market and the S&P 500 index respectively.The efficiency of the proposed method is demonstrated and can be considered as a distinct important addition to the existing methods.
基金Sponsored by the National Natural Science Foundation of China(Grant Nos.51908421 and 41172246).
文摘Scour has been widely accepted as a key reason for bridge failures.Bridges are susceptible and sensitive to the scour phenomenon,which describes the loss of riverbed sediments around the bridge supports because of flow.The carrying capacity of a deep-water foundation is influenced by the formation of a scour hole,which means that a severe scour can lead to a bridge failure without warning.Most of the current scour predictions are based on deterministic models,while other loads at bridges are usually provided as probabilistic values.To integrate scour factors with other loads in bridge design and research,a quantile regression model was utilized to estimate scour depth.Field data and experimental data from previous studies were collected to build the model.Moreover,scour estimations using the HEC-18 equation and the proposed method were compared.By using the“CCC(Calculate,Confirm,and Check)”procedure,the probabilistic concept could be used to calculate various scour depths with the targeted likelihood according to a specified chance of bridge failure.The study shows that with a sufficiently large and continuously updated database,the proposed model could present reasonable results and provide guidance for scour mitigation.
文摘This paper considers quantile regression analysis based on semi-competing risks data in which a non-terminal event may be dependently censored by a terminal event. The major interest is the covariate effects on the quantile of the non-terminal event time. Dependent censoring is handled by assuming that the joint distribution of the two event times follows a parametric copula model with unspecified marginal distributions. The technique of inverse probability weighting (IPW) is adopted to adjust for the selection bias. Large-sample properties of the proposed estimator are derived and a model diagnostic procedure is developed to check the adequacy of the model assumption. Simulation results show that the proposed estimator performs well. For illustrative purposes, our method is applied to analyze the bone marrow transplant data in [1].
文摘Because the U.S.is a major player in the international oil market,it is interesting to study whether aggregate and state-level economic conditions can predict the subse-quent realized volatility of oil price returns.To address this research question,we frame our analysis in terms of variants of the popular heterogeneous autoregressive realized volatility(HAR-RV)model.To estimate the models,we use quantile-regression and quantile machine learning(Lasso)estimators.Our estimation results highlights the dif-ferential effects of economic conditions on the quantiles of the conditional distribution of realized volatility.Using weekly data for the period April 1987 to December 2021,we document evidence of predictability at a biweekly and monthly horizon.
文摘Regional climate change impact assessments are becoming increasingly important for developing adaptation strategies in an uncertain future with respect to hydro-climatic extremes. There are a number of Global Climate Models (GCMs) and emission scenarios providing predictions of future changes in climate. As a result, there is a level of uncertainty associated with the decision of which climate models to use for the assessment of climate change impacts. The IPCC has recommended using as many global climate model scenarios as possible;however, this approach may be impractical for regional assessments that are computationally demanding. Methods have been developed to select climate model scenarios, generally consisting of selecting a model with the highest skill (validation), creating an ensemble, or selecting one or more extremes. Validation methods limit analyses to models with higher skill in simulating historical climate, ensemble methods typically take multi model means, median, or percentiles, and extremes methods tend to use scenarios which bound the projected changes in precipitation and temperature. In this paper a quantile regression based validation method is developed and applied to generate a reduced set of GCM-scenarios to analyze daily maximum streamflow uncertainty in the Upper Thames River Basin, Canada, while extremes and percentile ensemble approaches are also used for comparison. Results indicate that the validation method was able to effectively rank and reduce the set of scenarios, while the extremes and percentile ensemble methods were found not to necessarily correlate well with the range of extreme flows for all calendar months and return periods.
文摘The sparse phase retrieval aims to recover the sparse signal from quadratic measurements. However, the measurements are often affected by outliers and asymmetric distribution noise. This paper introduces a novel method that combines the quantile regression and the L<sub>1/2</sub>-regularizer. It is a non-convex, non-smooth, non-Lipschitz optimization problem. We propose an efficient algorithm based on the Alternating Direction Methods of Multiplier (ADMM) to solve the corresponding optimization problem. Numerous numerical experiments show that this method can recover sparse signals with fewer measurements and is robust to dense bounded noise and Laplace noise.
文摘The rise of non-marital fertility, which seems to defy the Bongaarts model by decoupling marriage from fertility, has become a subject of interest in both the developed and developing world. Consequences of non-marital fertility are mostly negative particularly in developing countries. In Namibia, although premarital childbearing has been reported to be high and increasing, no studies have explicitly analyzed factors influencing non-marital fertility. This paper uses data from the 2006/7 Namibia DHS to establish the determinants of non-marital fertility among women by applying a two-part model, with one part to describe the presence of non-marital birth and the other part to explain its intensity (number of children born). Using the number of children ever born as an outcome, we explored various count data models. Based on the Voung statistics model comparison, we settled for the Hurdle logit Negative Binomial regression to model the number of non-marital births. Non-marital fertility in Namibia is associated with the age, with young women likely to have lower fertility compared to older women. Women with secondary or higher education had lower fertility compared those with no formal education. Findings also show that rural women higher fertility propensity compared to their urban counterparts even though there was no significant difference in fertility intensity. With regard to socio-economic status, fertility intensity decreased as the women got richer. Intervention efforts should focus on promoting education among girls and women especially in rural areas to improve their socio-economic status, reduce teenage pregnancy and non-marital fertility.
基金supported by National Nature Science Foundation of China(No.11471161)the Natural Science Foundation of the Higher Education Institutions of Jiangsu Province(No.15KJB110010).
文摘In analyzing semi-continuous data,two-part model is a widely appreciated tool,in which two components are enclosed to characterize the mixing proportion of zeros and the actual level of positive values in semi-continuous data.The primary interest underlying such a model is primarily to exploit the dependence of the observed covariates on the semi-continuous variables;as such,the exploitation of unobserved heterogeneity is sometimes ignored.In this paper,we extend the conventional two-part regression model to much more general situations where multiple latent factors are considered to interpret the latent heterogeneity arising from the absence of covariates.A structural equation is constructed to describe the interrelationships between the latent factors.Moreover,a general statistical analysis procedure is developed to accommodate semi-continuous,ordered and unordered data simultaneously.A procedure for parameter estimation and model assessment is developed under a Bayesian framework.Empirical results including a simulation study and a real example are presented to illustrate the proposed methodology.