It is quite common that both categorical and continuous covariates appear in the data. But, most feature screening methods for ultrahigh-dimensional classification assume the covariates are continuous. And applicable ...It is quite common that both categorical and continuous covariates appear in the data. But, most feature screening methods for ultrahigh-dimensional classification assume the covariates are continuous. And applicable feature screening method is very limited;to handle this non-trivial situation, we propose a model-free feature screening for ultrahigh-dimensional multi-classification with both categorical and continuous covariates. The proposed feature screening method will be based on Gini impurity to evaluate the prediction power of covariates. Under certain regularity conditions, it is proved that the proposed screening procedure possesses the sure screening property and ranking consistency properties. We demonstrate the finite sample performance of the proposed procedure by simulation studies and illustrate using real data analysis.展开更多
It is common for datasets to contain both categorical and continuous variables. However, many feature screening methods designed for high-dimensional classification assume that the variables are continuous. This limit...It is common for datasets to contain both categorical and continuous variables. However, many feature screening methods designed for high-dimensional classification assume that the variables are continuous. This limits the applicability of existing methods in handling this complex scenario. To address this issue, we propose a model-free feature screening approach for ultra-high-dimensional multi-classification that can handle both categorical and continuous variables. Our proposed feature screening method utilizes the Maximal Information Coefficient to assess the predictive power of the variables. By satisfying certain regularity conditions, we have proven that our screening procedure possesses the sure screening property and ranking consistency properties. To validate the effectiveness of our approach, we conduct simulation studies and provide real data analysis examples to demonstrate its performance in finite samples. In summary, our proposed method offers a solution for effectively screening features in ultra-high-dimensional datasets with a mixture of categorical and continuous covariates.展开更多
With the rapid-growth-in-size scientific data in various disciplines, feature screening plays an important role to reduce the high-dimensionality to a moderate scale in many scientific fields. In this paper, we introd...With the rapid-growth-in-size scientific data in various disciplines, feature screening plays an important role to reduce the high-dimensionality to a moderate scale in many scientific fields. In this paper, we introduce a unified and robust model-free feature screening approach for high-dimensional survival data with censoring, which has several advantages: it is a model-free approach under a general model framework, and hence avoids the complication to specify an actual model form with huge number of candidate variables; under mild conditions without requiring the existence of any moment of the response, it enjoys the ranking consistency and sure screening properties in ultra-high dimension. In particular, we impose a conditional independence assumption of the response and the censoring variable given each covariate, instead of assuming the censoring variable is independent of the response and the covariates. Moreover, we also propose a more robust variant to the new procedure, which possesses desirable theoretical properties without any finite moment condition of the predictors and the response. The computation of the newly proposed methods does not require any complicated numerical optimization and it is fast and easy to implement. Extensive numerical studies demonstrate that the proposed methods perform competitively for various configurations. Application is illustrated with an analysis of a genetic data set.展开更多
In ultra-high-dimensional data, it is common for the response variable to be multi-classified. Therefore, this paper proposes a model-free screening method for variables whose response variable is multi-classified fro...In ultra-high-dimensional data, it is common for the response variable to be multi-classified. Therefore, this paper proposes a model-free screening method for variables whose response variable is multi-classified from the point of view of introducing Jensen-Shannon divergence to measure the importance of covariates. The idea of the method is to calculate the Jensen-Shannon divergence between the conditional probability distribution of the covariates on a given response variable and the unconditional probability distribution of the covariates, and then use the probabilities of the response variables as weights to calculate the weighted Jensen-Shannon divergence, where a larger weighted Jensen-Shannon divergence means that the covariates are more important. Additionally, we also investigated an adapted version of the method, which is to measure the relationship between the covariates and the response variable using the weighted Jensen-Shannon divergence adjusted by the logarithmic factor of the number of categories when the number of categories in each covariate varies. Then, through both theoretical and simulation experiments, it was demonstrated that the proposed methods have sure screening and ranking consistency properties. Finally, the results from simulation and real-dataset experiments show that in feature screening, the proposed methods investigated are robust in performance and faster in computational speed compared with an existing method.展开更多
Three-dimensional(3D)reconstruction based on aerial images has broad prospects,and feature matching is an important step of it.However,for high-resolution aerial images,there are usually problems such as long time,mis...Three-dimensional(3D)reconstruction based on aerial images has broad prospects,and feature matching is an important step of it.However,for high-resolution aerial images,there are usually problems such as long time,mismatching and sparse feature pairs using traditional algorithms.Therefore,an algorithm is proposed to realize fast,accurate and dense feature matching.The algorithm consists of four steps.Firstly,we achieve a balance between the feature matching time and the number of matching pairs by appropriately reducing the image resolution.Secondly,to realize further screening of the mismatches,a feature screening algorithm based on similarity judgment or local optimization is proposed.Thirdly,to make the algorithm more widely applicable,we combine the results of different algorithms to get dense results.Finally,all matching feature pairs in the low-resolution images are restored to the original images.Comparisons between the original algorithms and our algorithm show that the proposed algorithm can effectively reduce the matching time,screen out the mismatches,and improve the number of matches.展开更多
Understanding the correlation between the fundamental descriptors and catalytic performance is meaningful to guide the design of high-performance electrochemical catalysts.However,exploring key factors that affect cat...Understanding the correlation between the fundamental descriptors and catalytic performance is meaningful to guide the design of high-performance electrochemical catalysts.However,exploring key factors that affect catalytic performance in the vast catalyst space remains challenging for people.Herein,to accurately identify the factors that affect the performance of N2 reduction,we apply interpretable machine learning(ML)to analyze high-throughput screening results,which is also suited to other surface reactions in catalysis.To expound on the paradigm,33 promising catalysts are screened from 168 carbon-supported candidates,specifically single-atom catalysts(SACs)supported by a BC_(3)monolayer(TM@V_(B/C)-N_(n)=_(0-3)-BC_(3))via high-throughput screening.Subsequently,the hybrid sampling method and XGBoost model are selected to classify eligible and non-eligible catalysts.Through feature interpretation using Shapley Additive Explanations(SHAP)analysis,two crucial features,that is,the number of valence electrons(N_(v))and nitrogen substitution(N_(n)),are screened out.Combining SHAP analysis and electronic structure calculations,the synergistic effect between an active center with low valence electron numbers and reasonable C-N coordination(a medium fraction of nitrogen substitution)can exhibit high catalytic performance.Finally,six superior catalysts with a limiting potential lower than-0.4 V are predicted.Our workflow offers a rational approach to obtaining key information on catalytic performance from high-throughput screening results to design efficient catalysts that can be applied to other materials and reactions.展开更多
In this paper,we mainly study the feature screening and error variance estimation in ultrahigh-dimensional linear model with errors-in-variables(EV).Given that sure independence screening(SIS)method by marginal Pearso...In this paper,we mainly study the feature screening and error variance estimation in ultrahigh-dimensional linear model with errors-in-variables(EV).Given that sure independence screening(SIS)method by marginal Pearson’s correlation learning may omit some important observation variables due to measurement errors,a corrected SIS called EVSIS is proposed to rank the importance of features according to their corrected marginal correlation with the response variable.Also,a corrected error variance procedure is proposed to accurately estimate the error variance,which could greatly attenuate the influence of measurement errors and spurious correlations,simultaneously.Under some regularization conditions,the proposed EVSIS possesses sure screening property and consistency in ranking and the corrected error variance estimator is also proved to be asymptotically normal.The two methodologies are illustrated by some simulations and a real data example,which suggests that the proposed methods perform well.展开更多
Current high-dimensional feature screening methods still face significant challenges in handling mixed linear and nonlinear relationships,controlling redundant information,and improving model robustness.In this study,...Current high-dimensional feature screening methods still face significant challenges in handling mixed linear and nonlinear relationships,controlling redundant information,and improving model robustness.In this study,we propose a Dynamic Conditional Feature Screening(DCFS)method tailored for high-dimensional economic forecasting tasks.Our goal is to accurately identify key variables,enhance predictive performance,and provide both theoretical foundations and practical tools for macroeconomic modeling.The DCFS method constructs a comprehensive test statistic by integrating conditional mutual information with conditional regression error differences.By introducing a dynamic weighting mechanism,DCFS adaptively balances the linear and nonlinear contributions of features during the screening process.In addition,a dynamic thresholding mechanism is designed to effectively control the false discovery rate(FDR),thereby improving the stability and reliability of the screening results.On the theoretical front,we rigorously prove that the proposed method satisfies the sure screening property and rank consistency,ensuring accurate identification of the truly important feature set in high-dimensional settings.Simulation results demonstrate that under purely linear,purely nonlinear,and mixed dependency structures,DCFS consistently outperforms classical screening methods such as SIS,CSIS,and IG-SIS in terms of true positive rate(TPR),false discovery rate(FDR),and rank correlation.These results highlight the superior accuracy,robustness,and stability of our method.Furthermore,an empirical analysis based on the U.S.FRED-MD macroeconomic dataset confirms the practical value of DCFS in real-world forecasting tasks.The experimental results show that DCFS achieves lower prediction errors(RMSE and MAE)and higher R2 values in forecasting GDP growth.The selected key variables-including the Industrial Production Index(IP),Federal Funds Rate,Consumer Price Index(CPI),and Money Supply(M2)-possess clear economic interpretability,offering reliable support for economic forecasting and policy formulation.展开更多
SLAM(Simultaneous Localization and Mapping,即时定位与地图构建)在机器人、无人机等行业起着重要作用,ORB-SLAM是其中的一个热门应用。其中的特征提取与匹配是SLAM最终能够获得准确定位与精确地图构建的基础,也是耗时、耗能较多的环...SLAM(Simultaneous Localization and Mapping,即时定位与地图构建)在机器人、无人机等行业起着重要作用,ORB-SLAM是其中的一个热门应用。其中的特征提取与匹配是SLAM最终能够获得准确定位与精确地图构建的基础,也是耗时、耗能较多的环节。为了进一步提高特征匹配效果,在ORB-SLAM2算法的基础上,在特征提取与特征匹配后,通过引入对特征点的匹配筛选环节,同时为了保证其实时性,采用具有强大并行能力的FPGA技术进行设计。最终在增加适当硬件资源的代价上,实现了较小的位姿相对误差。展开更多
A large database is desired for machine learning(ML) technology to make accurate predictions of materials physicochemical properties based on their molecular structure.When a large database is not available,the develo...A large database is desired for machine learning(ML) technology to make accurate predictions of materials physicochemical properties based on their molecular structure.When a large database is not available,the development of proper featurization method based on physicochemical nature of target proprieties can improve the predictive power of ML models with a smaller database.In this work,we show that two new featurization methods,volume occupation spatial matrix and heat contribution spatial matrix,can improve the accuracy in predicting energetic materials' crystal density(ρ_(crystal)) and solid phase enthalpy of formation(H_(f,solid)) using a database containing 451 energetic molecules.Their mean absolute errors are reduced from 0.048 g/cm~3 and 24.67 kcal/mol to 0.035 g/cm~3 and 9.66 kcal/mol,respectively.By leave-one-out-cross-validation,the newly developed ML models can be used to determine the performance of most kinds of energetic materials except cubanes.Our ML models are applied to predict ρ_(crystal) and H_(f,solid) of CHON-based molecules of the 150 million sized PubChem database,and screened out 56 candidates with competitive detonation performance and reasonable chemical structures.With further improvement in future,spatial matrices have the potential of becoming multifunctional ML simulation tools that could provide even better predictions in wider fields of materials science.展开更多
In this paper,we propose a new correlation,called stable correlation,to measure the dependence between two random vectors.The new correlation is well defined without the moment condition and is zero if and only if the...In this paper,we propose a new correlation,called stable correlation,to measure the dependence between two random vectors.The new correlation is well defined without the moment condition and is zero if and only if the two random vectors are independent.We also study its other theoretical properties.Based on the new correlation,we further propose a robust model-free feature screening procedure for ultrahigh dimensional data and establish its sure screening property and rank consistency property without imposing the subexponential or sub-Gaussian tail condition,which is commonly required in the literature of feature screening.We also examine the finite sample performance of the proposed robust feature screening procedure via Monte Carlo simulation studies and illustrate the proposed procedure by a real data example.展开更多
In this paper, we study the sure independence screening of ultrahigh-dimensional censored data with varying coefficient single-index model. This general model framework covers a large number of commonly used survival ...In this paper, we study the sure independence screening of ultrahigh-dimensional censored data with varying coefficient single-index model. This general model framework covers a large number of commonly used survival models. The property that the proposed method is not derived for a specific model is appealing in ultrahigh dimensional regressions, as it is difficult to specify a correct model for ultrahigh dimensional predictors.Once the assuming data generating process does not meet the actual one, the screening method based on the model will be problematic. We establish the sure screening property and consistency in ranking property of the proposed method. Simulations are conducted to study the finite sample performances, and the results demonstrate that the proposed method is competitive compared with the existing methods. We also illustrate the results via the analysis of data from The National Alzheimers Coordinating Center(NACC).展开更多
文摘It is quite common that both categorical and continuous covariates appear in the data. But, most feature screening methods for ultrahigh-dimensional classification assume the covariates are continuous. And applicable feature screening method is very limited;to handle this non-trivial situation, we propose a model-free feature screening for ultrahigh-dimensional multi-classification with both categorical and continuous covariates. The proposed feature screening method will be based on Gini impurity to evaluate the prediction power of covariates. Under certain regularity conditions, it is proved that the proposed screening procedure possesses the sure screening property and ranking consistency properties. We demonstrate the finite sample performance of the proposed procedure by simulation studies and illustrate using real data analysis.
文摘It is common for datasets to contain both categorical and continuous variables. However, many feature screening methods designed for high-dimensional classification assume that the variables are continuous. This limits the applicability of existing methods in handling this complex scenario. To address this issue, we propose a model-free feature screening approach for ultra-high-dimensional multi-classification that can handle both categorical and continuous variables. Our proposed feature screening method utilizes the Maximal Information Coefficient to assess the predictive power of the variables. By satisfying certain regularity conditions, we have proven that our screening procedure possesses the sure screening property and ranking consistency properties. To validate the effectiveness of our approach, we conduct simulation studies and provide real data analysis examples to demonstrate its performance in finite samples. In summary, our proposed method offers a solution for effectively screening features in ultra-high-dimensional datasets with a mixture of categorical and continuous covariates.
基金supported by the Research Grant Council of Hong Kong (Grant Nos. 509413 and 14311916)Direct Grants for Research of The Chinese University of Hong Kong (Grant Nos. 3132754 and 4053235)+3 种基金the Natural Science Foundation of Jiangxi Province (Grant No. 20161BAB201024)the Key Science Fund Project of Jiangxi Province Eduction Department (Grant No. GJJ150439)National Natural Science Foundation of China (Grant Nos. 11461029, 11601197 and 61562030)the Canadian Institutes of Health Research (Grant No. 145546)
文摘With the rapid-growth-in-size scientific data in various disciplines, feature screening plays an important role to reduce the high-dimensionality to a moderate scale in many scientific fields. In this paper, we introduce a unified and robust model-free feature screening approach for high-dimensional survival data with censoring, which has several advantages: it is a model-free approach under a general model framework, and hence avoids the complication to specify an actual model form with huge number of candidate variables; under mild conditions without requiring the existence of any moment of the response, it enjoys the ranking consistency and sure screening properties in ultra-high dimension. In particular, we impose a conditional independence assumption of the response and the censoring variable given each covariate, instead of assuming the censoring variable is independent of the response and the covariates. Moreover, we also propose a more robust variant to the new procedure, which possesses desirable theoretical properties without any finite moment condition of the predictors and the response. The computation of the newly proposed methods does not require any complicated numerical optimization and it is fast and easy to implement. Extensive numerical studies demonstrate that the proposed methods perform competitively for various configurations. Application is illustrated with an analysis of a genetic data set.
文摘In ultra-high-dimensional data, it is common for the response variable to be multi-classified. Therefore, this paper proposes a model-free screening method for variables whose response variable is multi-classified from the point of view of introducing Jensen-Shannon divergence to measure the importance of covariates. The idea of the method is to calculate the Jensen-Shannon divergence between the conditional probability distribution of the covariates on a given response variable and the unconditional probability distribution of the covariates, and then use the probabilities of the response variables as weights to calculate the weighted Jensen-Shannon divergence, where a larger weighted Jensen-Shannon divergence means that the covariates are more important. Additionally, we also investigated an adapted version of the method, which is to measure the relationship between the covariates and the response variable using the weighted Jensen-Shannon divergence adjusted by the logarithmic factor of the number of categories when the number of categories in each covariate varies. Then, through both theoretical and simulation experiments, it was demonstrated that the proposed methods have sure screening and ranking consistency properties. Finally, the results from simulation and real-dataset experiments show that in feature screening, the proposed methods investigated are robust in performance and faster in computational speed compared with an existing method.
基金This work was supported by the Equipment Pre-Research Foundation of China(6140001020310).
文摘Three-dimensional(3D)reconstruction based on aerial images has broad prospects,and feature matching is an important step of it.However,for high-resolution aerial images,there are usually problems such as long time,mismatching and sparse feature pairs using traditional algorithms.Therefore,an algorithm is proposed to realize fast,accurate and dense feature matching.The algorithm consists of four steps.Firstly,we achieve a balance between the feature matching time and the number of matching pairs by appropriately reducing the image resolution.Secondly,to realize further screening of the mismatches,a feature screening algorithm based on similarity judgment or local optimization is proposed.Thirdly,to make the algorithm more widely applicable,we combine the results of different algorithms to get dense results.Finally,all matching feature pairs in the low-resolution images are restored to the original images.Comparisons between the original algorithms and our algorithm show that the proposed algorithm can effectively reduce the matching time,screen out the mismatches,and improve the number of matches.
基金supported by the National Key R&D Program of China(2022YFA1503103)the National Natural Science Foundation of China(22033002,92261112,22203046)+2 种基金the Natural Science Research Start-up Foundation of Recruiting Talents of Nanjing University of Posts and Telecommunications(Grant No.NY221128)the Six Talent Peaks Project in Jiangsu Province(XCL-104)the open research fund of Key Laboratory of Quantum Materials and Devices(Southeast University)
文摘Understanding the correlation between the fundamental descriptors and catalytic performance is meaningful to guide the design of high-performance electrochemical catalysts.However,exploring key factors that affect catalytic performance in the vast catalyst space remains challenging for people.Herein,to accurately identify the factors that affect the performance of N2 reduction,we apply interpretable machine learning(ML)to analyze high-throughput screening results,which is also suited to other surface reactions in catalysis.To expound on the paradigm,33 promising catalysts are screened from 168 carbon-supported candidates,specifically single-atom catalysts(SACs)supported by a BC_(3)monolayer(TM@V_(B/C)-N_(n)=_(0-3)-BC_(3))via high-throughput screening.Subsequently,the hybrid sampling method and XGBoost model are selected to classify eligible and non-eligible catalysts.Through feature interpretation using Shapley Additive Explanations(SHAP)analysis,two crucial features,that is,the number of valence electrons(N_(v))and nitrogen substitution(N_(n)),are screened out.Combining SHAP analysis and electronic structure calculations,the synergistic effect between an active center with low valence electron numbers and reasonable C-N coordination(a medium fraction of nitrogen substitution)can exhibit high catalytic performance.Finally,six superior catalysts with a limiting potential lower than-0.4 V are predicted.Our workflow offers a rational approach to obtaining key information on catalytic performance from high-throughput screening results to design efficient catalysts that can be applied to other materials and reactions.
基金supported partly by the National Natural Science Foundation of China(Grant No.11971324)supported by the State Key Program of National Natural Science Foundation of China(Grant No.12031016).
文摘In this paper,we mainly study the feature screening and error variance estimation in ultrahigh-dimensional linear model with errors-in-variables(EV).Given that sure independence screening(SIS)method by marginal Pearson’s correlation learning may omit some important observation variables due to measurement errors,a corrected SIS called EVSIS is proposed to rank the importance of features according to their corrected marginal correlation with the response variable.Also,a corrected error variance procedure is proposed to accurately estimate the error variance,which could greatly attenuate the influence of measurement errors and spurious correlations,simultaneously.Under some regularization conditions,the proposed EVSIS possesses sure screening property and consistency in ranking and the corrected error variance estimator is also proved to be asymptotically normal.The two methodologies are illustrated by some simulations and a real data example,which suggests that the proposed methods perform well.
文摘Current high-dimensional feature screening methods still face significant challenges in handling mixed linear and nonlinear relationships,controlling redundant information,and improving model robustness.In this study,we propose a Dynamic Conditional Feature Screening(DCFS)method tailored for high-dimensional economic forecasting tasks.Our goal is to accurately identify key variables,enhance predictive performance,and provide both theoretical foundations and practical tools for macroeconomic modeling.The DCFS method constructs a comprehensive test statistic by integrating conditional mutual information with conditional regression error differences.By introducing a dynamic weighting mechanism,DCFS adaptively balances the linear and nonlinear contributions of features during the screening process.In addition,a dynamic thresholding mechanism is designed to effectively control the false discovery rate(FDR),thereby improving the stability and reliability of the screening results.On the theoretical front,we rigorously prove that the proposed method satisfies the sure screening property and rank consistency,ensuring accurate identification of the truly important feature set in high-dimensional settings.Simulation results demonstrate that under purely linear,purely nonlinear,and mixed dependency structures,DCFS consistently outperforms classical screening methods such as SIS,CSIS,and IG-SIS in terms of true positive rate(TPR),false discovery rate(FDR),and rank correlation.These results highlight the superior accuracy,robustness,and stability of our method.Furthermore,an empirical analysis based on the U.S.FRED-MD macroeconomic dataset confirms the practical value of DCFS in real-world forecasting tasks.The experimental results show that DCFS achieves lower prediction errors(RMSE and MAE)and higher R2 values in forecasting GDP growth.The selected key variables-including the Industrial Production Index(IP),Federal Funds Rate,Consumer Price Index(CPI),and Money Supply(M2)-possess clear economic interpretability,offering reliable support for economic forecasting and policy formulation.
文摘SLAM(Simultaneous Localization and Mapping,即时定位与地图构建)在机器人、无人机等行业起着重要作用,ORB-SLAM是其中的一个热门应用。其中的特征提取与匹配是SLAM最终能够获得准确定位与精确地图构建的基础,也是耗时、耗能较多的环节。为了进一步提高特征匹配效果,在ORB-SLAM2算法的基础上,在特征提取与特征匹配后,通过引入对特征点的匹配筛选环节,同时为了保证其实时性,采用具有强大并行能力的FPGA技术进行设计。最终在增加适当硬件资源的代价上,实现了较小的位姿相对误差。
基金support from the Ministry of Education(MOE) Singapore Tier 1 (RG8/20)。
文摘A large database is desired for machine learning(ML) technology to make accurate predictions of materials physicochemical properties based on their molecular structure.When a large database is not available,the development of proper featurization method based on physicochemical nature of target proprieties can improve the predictive power of ML models with a smaller database.In this work,we show that two new featurization methods,volume occupation spatial matrix and heat contribution spatial matrix,can improve the accuracy in predicting energetic materials' crystal density(ρ_(crystal)) and solid phase enthalpy of formation(H_(f,solid)) using a database containing 451 energetic molecules.Their mean absolute errors are reduced from 0.048 g/cm~3 and 24.67 kcal/mol to 0.035 g/cm~3 and 9.66 kcal/mol,respectively.By leave-one-out-cross-validation,the newly developed ML models can be used to determine the performance of most kinds of energetic materials except cubanes.Our ML models are applied to predict ρ_(crystal) and H_(f,solid) of CHON-based molecules of the 150 million sized PubChem database,and screened out 56 candidates with competitive detonation performance and reasonable chemical structures.With further improvement in future,spatial matrices have the potential of becoming multifunctional ML simulation tools that could provide even better predictions in wider fields of materials science.
基金supported by National Natural Science Foundation of China(Grant No.11701034)supported by National Science Foundation of USA(Grant No.DMS1820702)。
文摘In this paper,we propose a new correlation,called stable correlation,to measure the dependence between two random vectors.The new correlation is well defined without the moment condition and is zero if and only if the two random vectors are independent.We also study its other theoretical properties.Based on the new correlation,we further propose a robust model-free feature screening procedure for ultrahigh dimensional data and establish its sure screening property and rank consistency property without imposing the subexponential or sub-Gaussian tail condition,which is commonly required in the literature of feature screening.We also examine the finite sample performance of the proposed robust feature screening procedure via Monte Carlo simulation studies and illustrate the proposed procedure by a real data example.
基金Supported by the National Natural Science Foundation of China(No.11801567)
文摘In this paper, we study the sure independence screening of ultrahigh-dimensional censored data with varying coefficient single-index model. This general model framework covers a large number of commonly used survival models. The property that the proposed method is not derived for a specific model is appealing in ultrahigh dimensional regressions, as it is difficult to specify a correct model for ultrahigh dimensional predictors.Once the assuming data generating process does not meet the actual one, the screening method based on the model will be problematic. We establish the sure screening property and consistency in ranking property of the proposed method. Simulations are conducted to study the finite sample performances, and the results demonstrate that the proposed method is competitive compared with the existing methods. We also illustrate the results via the analysis of data from The National Alzheimers Coordinating Center(NACC).