Abstract: Building a well-off society in an all-round way is the goal put forward at the 16th CPC National Congress for the first two decades of this century. According to the 'Statistical Monitoring Program on Building a Well-off Society' [1], the Institute of Statistical Science of the National Bureau of Statistics of China and local statistics research departments conducted statistical monitoring of the process of building a well-off society in an all-round way from 2000 to 2010, both nationwide and locally. The results show that, over the past decade, under the correct leadership of the CPC Central Committee and the State Council, China succeeded in overcoming the impacts of many unfavorable factors, including a serious international financial crisis, rising production costs, the SARS epidemic, rare snow disasters, earthquakes, landslides, and the European sovereign debt crisis.
Funding: Supported by JSPS KAKENHI Grant Number 18K13459; Grace S. Shieh was supported in part by MOST 106-2118-M-001-017 and MOST 107-2118-M-001-009-MY2.
Abstract: For structural comparisons of paired prokaryotic genomes, an important topic in synthetic and evolutionary biology, the locations of shared orthologous genes (henceforth orthologs) are observed as binned data. These and other data, e.g., wind directions recorded at monitoring sites and intensive care unit arrival times on the 24-hour clock, are counted in binned circular arcs, so they need to be modeled by discrete circular distributions (DCDs). We propose a novel method to construct a DCD from a base continuous circular distribution (CCD). The probability mass function is defined to take the normalized values of the probability density function at some pre-fixed equidistant points on the circle. Five families of constructed DCDs that have normalizing constants in closed form are presented. Simulation studies show that DCDs outperform the corresponding CCDs in modeling grouped (discrete) circular data, and that minimum chi-square estimation outperforms maximum likelihood estimation of the parameters. We apply the constructed DCDs, the invariant wrapped Poisson and the wrapped discrete skew Laplace distributions to compare the structures of paired bacterial genomes. Specifically, the discrete four-parameter wrapped Cauchy (nonnegative trigonometric sums) distribution models the multi-modal shared orthologs in Clostridium (Sulfolobus) better than the other distributions considered, in terms of AIC and Freedman’s goodness-of-fit test. The result that different DCDs fit the shared orthologs is consistent with the fact that they belong to two kingdoms. Nevertheless, these prokaryotes have a common favored site around 70° on the unit circle; this finding is important for building synthetic prokaryotic genomes in synthetic biology. These DCDs can also be applied to other binned circular data.
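The construction described in this abstract (normalizing the values of a continuous circular density at equidistant points on the circle) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the von Mises kernel used as the base density, the function names `von_mises_kernel` and `discretize_circular_density`, and the parameter values are all assumptions made for the example, and the normalizing constant is computed by numerical summation rather than in closed form.

```python
import math


def von_mises_kernel(theta, mu, kappa):
    """Unnormalized von Mises density; an assumed base CCD for illustration."""
    return math.exp(kappa * math.cos(theta - mu))


def discretize_circular_density(density, m):
    """Build a discrete circular distribution on m equidistant points.

    The support is 2*pi*j/m for j = 0..m-1, and the pmf at each point is the
    density value there divided by the sum of the values over all m points.
    """
    points = [2.0 * math.pi * j / m for j in range(m)]
    weights = [density(t) for t in points]
    total = sum(weights)
    pmf = [w / total for w in weights]
    return points, pmf


# Hypothetical example: 36 bins of 10 degrees each, mode placed at 90 degrees.
points, pmf = discretize_circular_density(
    lambda t: von_mises_kernel(t, mu=math.pi / 2, kappa=2.0), m=36
)
mode_index = pmf.index(max(pmf))
```

Because the pmf is normalized over the support points themselves, any constant factor in the base density cancels, which is why the unnormalized kernel is sufficient here.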
Abstract: The World Wide Web is essential to the general public nowadays. From a data analysis viewpoint, it provides rich opportunities to gather observational data on a large scale. This paper focuses on modeling the behavior of visitors to an academic website. Although the conventional probability models, which have been used in other literature to fit data from a commercial website, capture the power-law behavior in our data, they fail to capture other important features such as the long tail. We propose a new model based on the identities of the users. Qualitative and quantitative tests comparing how well the models fit our data show that the new model outperforms the other two conventional probability models.
Abstract: This paper proposes a new personal tour planning problem with time-dependent satisfactions, traveling times and activity duration times for sightseeing. It is difficult to represent the time-dependent model using general static network models, and hence a Time-Expanded Network (TEN) is introduced. The TEN contains a copy of the set of nodes in the underlying static network for each discrete time step, and it turns the problem of determining an optimal flow over time into a classical static network flow problem. Using the proposed TEN-based model, it is possible not only to represent various time-dependent costs and satisfactions flexibly in a single network, but also to simultaneously select optimal departure places and accommodations according to the tour route through the tourist’s favorite places and obtain the time schedule of the tour route. The proposed model is formulated as a 0-1 integer programming problem to which existing combinatorial optimization and soft computing algorithms can be applied. It can also be equivalently transformed into several existing tour planning problems under some natural assumptions. Furthermore, by comparing the proposed model with some previous models on a numerical example with time-dependent parameters, both the similarity of these models in the static network and the advantage of the proposed TEN-based model are shown.
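The time-expansion step described in this abstract (one copy of every static node per discrete time step, with arcs linking a departure copy to the matching arrival copy) can be sketched as follows. This is only an illustration under stated assumptions, not the paper's formulation: the function names, the two-node example, the waiting arcs and the travel times are hypothetical, and the satisfactions and the 0-1 integer program are not modeled.

```python
def build_time_expanded_network(nodes, travel_time, horizon):
    """Return the arcs of a time-expanded network over steps 0..horizon.

    Each static node u gets one copy (u, t) per time step t. A travel arc
    (u, t) -> (v, t + d) is added when the time-dependent travel time d
    keeps the arrival within the horizon. Waiting arcs (u, t) -> (u, t + 1)
    are an assumption of this sketch.
    """
    arcs = []
    for t in range(horizon + 1):
        for u in nodes:
            if t + 1 <= horizon:
                arcs.append(((u, t), (u, t + 1)))  # wait one step at u
            for v in nodes:
                if u == v:
                    continue
                d = travel_time(u, v, t)
                if d is not None and t + d <= horizon:
                    arcs.append(((u, t), (v, t + d)))  # travel u -> v
    return arcs


def travel_time(u, v, t):
    """Hypothetical time-dependent travel times between two places."""
    if (u, v) == ("A", "B"):
        return 1 if t < 2 else 2  # slower when departing at t >= 2
    if (u, v) == ("B", "A"):
        return 1
    return None  # no direct connection


arcs = build_time_expanded_network(["A", "B"], travel_time, horizon=3)
```

Once the arcs are built, time dependence is encoded entirely in the network structure, so a static path or flow algorithm can be run on the expanded node set.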
Abstract: The world today is undergoing profound changes unseen in a century, accelerated by the global spread of the Covid-19 pandemic. On November 15, 2020, 15 member parties including China signed the Regional Comprehensive Economic Partnership (RCEP). On March 8, 2021, the Chinese Government declared its official ratification of the RCEP. In the future, as other signatories ratify the agreement (when six ASEAN member states and three non-ASEAN countries do so), this Free Trade Area (FTA) will become operational, one that boasts the world’s largest population, largest economic and trade size, and greatest development potential. Not only is the RCEP the most important milestone in Asia-Pacific regional economic cooperation, it has also injected a strong driving force into the ebbing economic globalization and into international economic and trade cooperation.
Abstract: Following the order of events, this paper makes a systematic and comprehensive summary of how the global financial crisis of 2008 affected China. It includes an econometric assessment using by-industry and by-region data, and describes the role of government regulation from a new perspective. China's economic recovery is a result of regulatory intervention, and strengthening economic momentum has created the conditions for such intervention to be phased out.
Funding: Supported by National Natural Science Foundation of China (Grant Nos. 11601197, 11461029, 71463020, 61263014 and 61563018); National Natural Science Foundation of China (Grant Nos. General program 11171331 and Key program 11331011); National Science Foundation of Jiangxi Province (Grant Nos. 20161BAB201024, 20142BAB211014, 20143ACB21012 and 20151BAB211016); the Key Science Fund Project of Jiangxi Provincial Education Department (Grant Nos. GJJ150439, KJLD13033 and KJLD14034); the National Science Fund for Distinguished Young Scholars in China (Grant No. 10725106); a grant from the Key Lab of Random Complex Structure and Data Science, Chinese Academy of Sciences; Natural Science Foundation of Shenzhen University.
Abstract: Tukey's halfspace median (HM), serving as the multivariate counterpart of the univariate median, has been introduced and extensively studied in the literature. It is expected to preserve the robustness property (the most outstanding property) of the univariate median. One of the prevalent quantitative assessments of robustness is the finite sample breakdown point (FSBP). Indeed, the FSBPs of many multivariate medians have been identified, except for the most prevalent one, Tukey's halfspace median. This paper presents a precise result on the FSBP of Tukey's halfspace median. The result gives a complete picture of the global robustness of HM in the practical finite sample scenario, revealing the effect of dimension on the breakdown point robustness and complementing the existing asymptotic breakdown point result.
Funding: Supported in part by the National Science Foundation of China under Grant Nos. 12071305 and 71803001; in part by the National Social Science Foundation of China under Grant No. 19BTJ014; in part by the University Social Science Research Project of Anhui Province under Grant No. SK2020A0051; in part by the Social Science Foundation of the Ministry of Education of China under Grant Nos. 19YJCZH250 and 21YJAZH081.
Abstract: Variable selection for varying coefficient models includes the separation of varying and constant effects, and the selection of variables with nonzero varying effects and those with nonzero constant effects. This paper proposes a unified variable selection approach, called the double-penalized quadratic inference functions method, for varying coefficient models of longitudinal data. The proposed method can not only separate varying coefficients from constant coefficients, but also estimate and select the nonzero varying coefficients and nonzero constant coefficients. It is suitable for variable selection in linear models, varying coefficient models, and partial linear varying coefficient models. Under regularity conditions, the proposed method is consistent in both separation and selection of varying coefficients and constant coefficients. The obtained estimators of varying coefficients possess the optimal convergence rate of non-parametric function estimation, and the estimators of nonzero constant coefficients are consistent and asymptotically normal. Finally, the authors investigate the finite sample performance of the proposed method through simulation studies and a real data analysis. The results show that the proposed method performs better than the existing competitor.
Funding: We gratefully acknowledge funding from Academia Sinica [Grant Number AS-CDA-111-M05] and from National Science and Technology Council [Grant Number 111-2118-M-001-001-MY3].
Abstract: In two-level fractional factorial designs, conditional main effects can provide insights by which to analyze factorial effects and facilitate the de-aliasing of fully aliased two-factor interactions. Conditional main effects are of particular interest in situations where some factors are nested within others. Most of the relevant literature has focused on the development of data analysis tools that use conditional main effects, while the issue of optimal factorial design for a given linear model involving conditional main effects has been largely overlooked. Mukerjee, Wu and Chang [Statist. Sinica 27 (2017) 997-1016] established a framework by which to optimize designs under a conditional effect model. Although theoretically sound, their results were limited to a single pair of conditional and conditioning factors. In this paper, we extend the applicability of their framework to double pairs of conditional and conditioning factors by providing the corresponding parameterization and effect hierarchy. We propose a minimum contamination-based criterion by which to evaluate designs and develop a complementary set theory to facilitate the search for minimum contamination designs. Catalogues of 16- and 32-run minimum contamination designs are provided. For five to twelve factors, we show that all 16-run minimum contamination designs under the conditional effect model are also minimum aberration designs according to Fries and Hunter [Technometrics 22 (1980) 601-608].
Funding: Supported by the National Natural Science Foundation of China under Grant No. 11401391.
Abstract: Quadratic discriminant analysis is a classical and popular classification tool, but it fails to work in high-dimensional situations where the dimension p is larger than the sample size n. To address this issue, the authors propose a ridge-forward quadratic discriminant (RFQD) analysis method that screens relevant predictors in a successive manner to reduce the misclassification rate. The authors use the extended Bayesian information criterion to determine the final model and prove that RFQD is selection consistent. Monte Carlo simulations are conducted to examine its performance.
Funding: Supported by National Natural Science Foundation of China (Grant Nos. 11171331, 11561006, 11331011); Program for Creative Research Group of National Natural Science Foundation of China (Grant No. 61621003); a Grant from the Key Lab of Random Complex Structure and Data Science, Chinese Academy of Sciences; the Natural Science Foundation of Shenzhen University; Research Projects of Colleges and Universities in Guangxi (Grant No. KY2015YB171); Innovation Project of Guangxi Graduate Education (Grant No. JGY2015122); a Grant from the Key Base of Humanities and Social Sciences in Guangxi College.
Abstract: We propose a method that uses functional singular components to establish functional additive models. The proposed methodology reduces the curve regression problem to ordinary (i.e., scalar) additive regression problems for the singular components of the predictor process and the response process. The consistency of the estimators of the nonparametric functions and of the prediction is proved. A simulation study is conducted to investigate the finite sample performance of the proposed estimators.