期刊文献+
共找到10篇文章
< 1 >
每页显示 20 50 100
AN INFORMATIC APPROACH TO A LONG MEMORY STATIONARY PROCESS
1
作者 丁义明 吴量 向绪言 《Acta Mathematica Scientia》 SCIE CSCD 2023年第6期2629-2648,共20页
Long memory is an important phenomenon that arises sometimes in the analysis of time series or spatial data.Most of the definitions concerning the long memory of a stationary process are based on the second-order prop... Long memory is an important phenomenon that arises sometimes in the analysis of time series or spatial data.Most of the definitions concerning the long memory of a stationary process are based on the second-order properties of the process.The mutual information between the past and future I_(p−f) of a stationary process represents the information stored in the history of the process which can be used to predict the future.We suggest that a stationary process can be referred to as long memory if its I_(p−f) is infinite.For a stationary process with finite block entropy,I_(p−f) is equal to the excess entropy,which is the summation of redundancies that relate the convergence rate of the conditional(differential)entropy to the entropy rate.Since the definitions of the I_(p−f) and the excess entropy of a stationary process require a very weak moment condition on the distribution of the process,it can be applied to processes whose distributions are without a bounded second moment.A significant property of I_(p−f) is that it is invariant under one-to-one transformation;this enables us to know the I_(p−f) of a stationary process from other processes.For a stationary Gaussian process,the long memory in the sense of mutual information is more strict than that in the sense of covariance.We demonstrate that the I_(p−f) of fractional Gaussian noise is infinite if and only if the Hurst parameter is H∈(1/2,1). 展开更多
关键词 mutual information between past and future long memory stationary process excess entropy fractional Gaussian noise
在线阅读 下载PDF
Analysis of Heterogeneous Networks with Unknown Dependence Structure
2
作者 Fang Mei HOU Jia Xin LIU +1 位作者 Shao Gao Lü Hua Zhen LIN 《Acta Mathematica Sinica,English Series》 SCIE CSCD 2024年第12期2953-2983,共31页
In multiple heterogeneous networks,developing a model that considers both individual and shared structures is crucial for improving estimation efficiency and interpretability.In this paper,we introduce a semi-parametr... In multiple heterogeneous networks,developing a model that considers both individual and shared structures is crucial for improving estimation efficiency and interpretability.In this paper,we introduce a semi-parametric individual network autoregressive model.We allow autoregression and regression coefficients to vary across networks with subgroup structure,and integrate both covariates and node relationships into network dependence using a single-index structure with unknown links.To estimate all individual and commonly shared parameters and functions,we introduce a novel penalized semiparametric approach based on the generalized method of moments.Theoretically,our proposed semiparametric estimator for heterogeneous networks exhibits estimation and selection consistency under regular conditions.Numerical experiments are conducted to illustrate the effectiveness of the proposed estimator.The proposed method is applied to analyze patient distribution in hospitals to further demonstrate its utility. 展开更多
关键词 Network data single-indexing network weights nonparametric GMM estimation pairwise penalty
原文传递
Inference for High-Dimensional Streamed Longitudinal Data
3
作者 Senyuan Zheng Ling Zhou 《Acta Mathematica Sinica,English Series》 2025年第2期757-779,共23页
With the advent of modern devices,such as smartphones and wearable devices,high-dimensional data are collected on many participants for a period of time or even in perpetuity.For this type of data,dependencies between... With the advent of modern devices,such as smartphones and wearable devices,high-dimensional data are collected on many participants for a period of time or even in perpetuity.For this type of data,dependencies between and within data batches exist because data are collected from the same individual over time.Under the framework of streamed data,individual historical data are not available due to the storage and computation burden.It is urgent to develop computationally efficient methods with statistical guarantees to analyze high-dimensional streamed data and make reliable inferences in practice.In addition,the homogeneity assumption on the model parameters may not be valid in practice over time.To address the above issues,in this paper,we develop a new renewable debiased-lasso inference method for high-dimensional streamed data allowing dependences between and within data batches to exist and model parameters to gradually change.We establish the large sample properties of the proposed estimators,including consistency and asymptotic normality.The numerical results,including simulations and real data analysis,show the superior performance of the proposed method. 展开更多
关键词 Debiased lasso high-dimensional inference streamed longitudinal data renewable inference
原文传递
Tensor Decomposition-assisted Multiview Subgroup Analysis
4
作者 Xun Zhao Ling Zhou +1 位作者 Weijia Zhang Huazhen Lin 《Acta Mathematica Sinica,English Series》 2025年第2期588-618,共31页
To learn the subgroup structure generated by multidimensional interaction, we propose a novel multiview subgroup integration technique based on tensor decomposition. Compared to the traditional subgroup analysis that ... To learn the subgroup structure generated by multidimensional interaction, we propose a novel multiview subgroup integration technique based on tensor decomposition. Compared to the traditional subgroup analysis that can only handle single-view heterogeneity, our proposed method achieves a greater level of homogeneity within the subgroups, leading to enhanced interpretability and predictive power. For computational readiness of the proposed method, we build an algorithm that incorporates pairwise shrinkage-encouraging penalties and ADMM techniques. Theoretically, we establish the asymptotic consistency and normality of the proposed estimators. Extensive simulation studies and real data analysis demonstrate that our proposal outperforms other methods in terms of prediction accuracy and grouping consistency. In addition, the analysis based on the proposed method indicates that intergenerational care significantly increases the risk of chronic diseases associated with diet and fatigue in all provinces while only reducing the risk of emotion-related chronic diseases in the eastern coastal and central regions of China. 展开更多
关键词 Multiview subgroup analysis tensor decomposition data integration ADMM algorithm
原文传递
High-dimensional large-scale mixed-type data imputation under missing at random
5
作者 Wei Liu Guizhen Li +1 位作者 Ling Zhou Lan Luo 《Science China Mathematics》 2025年第4期969-1000,共32页
Missingness in mixed-type variables is commonly encountered in a variety of areas.The requirement of complete observations necessitates data imputation when a moderate or large proportion of data is missing.However,in... Missingness in mixed-type variables is commonly encountered in a variety of areas.The requirement of complete observations necessitates data imputation when a moderate or large proportion of data is missing.However,inappropriate imputation would downgrade the performance of machine learning algorithms,leading to bad predictions and unreliable statistical inference.For high-dimensional large-scale mixed-type missing data,we develop a computationally efficient imputation method,missing value imputation via generalized factor models(MIG),under missing at random.The proposed MIG method allows missing variables to be of different types,including continuous,binary,and count variables,and are scalable to both data size n and variable dimension p while existing imputation methods rely on restrictive assumptions such as the same type of missing variables,the low dimensionality of variables,and a limited sample size.We explicitly show that the imputation error of the proposed MIG method diminishes to zero with the rate Op(max{n^(-1/2),p^(-1/2)})as both n and p tend to infinity.Five real datasets demonstrate the superior empirical performance of the proposed MIG method over existing methods that the average normalized absolute imputation error is reduced by 5.3%–34.1%. 展开更多
关键词 IMPUTATION high-dimensional mixed-type data missing at random generalized factor model
原文传递
A case study on the shareholder network effect of stock market data:An SARMA approach
6
作者 Rong Zhang Jing Zhou +1 位作者 Wei Lan Hansheng Wang 《Science China Mathematics》 SCIE CSCD 2022年第11期2219-2242,共24页
One of the key research problems in financial markets is the investigation of inter-stock dependence.A good understanding in this regard is crucial for portfolio optimization.To this end,various econometric models hav... One of the key research problems in financial markets is the investigation of inter-stock dependence.A good understanding in this regard is crucial for portfolio optimization.To this end,various econometric models have been proposed.Most of them assume that the random noise associated with each subject is independent.However,dependence might still exist within this random noise.Ignoring this valuable information might lead to biased estimations and inaccurate predictions.In this article,we study a spatial autoregressive moving average model with exogenous covariates.Spatial dependence from both response and random noise is considered simultaneously.A quasi-maximum likelihood estimator is developed,and the estimated parameters are shown to be consistent and asymptotically normal.We then conduct an extensive analysis of the proposed method by applying it to the Chinese stock market data. 展开更多
关键词 spatial autoregressive moving average model shareholder network effect quasi-maximum likelihood estimator stock market data
原文传递
High dimensional cross-sectional dependence test under arbitrary serial correlation 被引量:1
7
作者 LAN Wei PAN Rui +1 位作者 LUO RongHua CHENG YongWei 《Science China Mathematics》 SCIE CSCD 2017年第2期345-360,共16页
In panel data analysis,the cross-sectional dependence(CD)test has been extensively used to test the cross-sectional dependence.However,this traditional CD test does not take serial correlation into consideration,which... In panel data analysis,the cross-sectional dependence(CD)test has been extensively used to test the cross-sectional dependence.However,this traditional CD test does not take serial correlation into consideration,which commonly occurs in many fields.To solve this problem,we propose an adjusted CD test which is able to effectively handle serial correlation.More specifically,the serial correlation can be of arbitrary form in our work.Furthermore,we establish the theoretical properties of the proposed adjusted CD test.Our extensive Monte Carlo experiments show that the traditional CD test cannot work well under serial correlation,while the proposed adjusted CD test does provide rather satisfactory performance. 展开更多
关键词 CD test cross-sectional dependence panel data serial correlation
原文传递
Dynamically integrated regression model for online auction data
8
作者 Mengying You Huazhen Lin Hua Liang 《Science China Mathematics》 SCIE CSCD 2022年第7期1531-1552,共22页
We propose a dynamically integrated regression model to predict the price of online auctions,including the final price.Different from existing models,the proposed method uses not only the historical price but also the... We propose a dynamically integrated regression model to predict the price of online auctions,including the final price.Different from existing models,the proposed method uses not only the historical price but also the information from bidding time.Consequently,the prediction accuracy is improved compared with the existing methods.An estimation method based on B-spline approximation is proposed for the estimation and the inference of parameters and nonparametric functions in this model.The minimax rate of convergence for the prediction risk and large-sample results including the consistency and the asymptotic normality are established.Simulation studies verify the finite sample performance and the appealing prediction accuracy and robustness.Finally,when we apply our method to a 7-day auction of iPhone 6s during December 2015 and March 2016,the proposed method predicts the ending price with a much smaller error than the existing models. 展开更多
关键词 B-SPLINE dynamic forecasting model functional linear regression model minimax rate online auction
原文传递
Polynomial network autoregressive models with divergent orders
9
作者 Bo Lei Wei Lan +1 位作者 Nengsheng Fang Jing Zhou 《Science China Mathematics》 SCIE CSCD 2023年第5期1073-1086,共14页
We propose a novel polynomial network autoregressive model by incorporating higher-order connected relationships to simultaneously model the effects of both direct and indirect connections. A quasimaximum likelihood e... We propose a novel polynomial network autoregressive model by incorporating higher-order connected relationships to simultaneously model the effects of both direct and indirect connections. A quasimaximum likelihood estimation method is proposed to estimate the unknown influence parameters, and we demonstrate its consistency and asymptotic normality without imposing any distribution assumption. Moreover,an extended Bayesian information criterion is set for order selection with a divergent upper order. The application of the proposed polynomial network autoregressive model is demonstrated through both the simulation and the real data analysis. 展开更多
关键词 diverging order extended Bayesian information criterion polynomial network autoregressive model quasi-maximum likelihood estimation
原文传递
Truncated L1 Regularized Linear Regression:Theory and Algorithm
10
作者 Mingwei Dai Shuyang Dai +2 位作者 Junjun Huang Lican Kang Xiliang Lu 《Communications in Computational Physics》 SCIE 2021年第6期190-209,共20页
Truncated L1 regularization proposed by Fan in[5],is an approximation to the L0 regularization in high-dimensional sparse models.In this work,we prove the non-asymptotic error bound for the global optimal solution to ... Truncated L1 regularization proposed by Fan in[5],is an approximation to the L0 regularization in high-dimensional sparse models.In this work,we prove the non-asymptotic error bound for the global optimal solution to the truncated L1 regularized linear regression problem and study the support recovery property.Moreover,a primal dual active set algorithm(PDAS)for variable estimation and selection is proposed.Coupled with continuation by a warm-start strategy leads to a primal dual active set with continuation algorithm(PDASC).Data-driven parameter selection rules such as cross validation,BIC or voting method can be applied to select a proper regularization parameter.The application of the proposed method is demonstrated by applying it to simulation data and a breast cancer gene expression data set(bcTCGA). 展开更多
关键词 High-dimensional linear regression SPARSITY truncated L1 regularization primal dual active set algorithm
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部