期刊文献+
共找到10篇文章
< 1 >
每页显示 20 50 100
Health Analytics, Economics and Medicine toward a 21st Century Health Care System
1
作者 Anna L. Choi David A. Lai Tze L. Lai 《Health》 CAS 2016年第5期428-443,共16页
After a review of recent developments in precision medicine, population health sciences and innovative clinical trial designs, and in health economics and policy, we show how innovations in health analytics can capita... After a review of recent developments in precision medicine, population health sciences and innovative clinical trial designs, and in health economics and policy, we show how innovations in health analytics can capitalize on the advances in biomedicine and health economics towards developing a data-driven and cost-effective 21<sup>st</sup> century health care system. In particular, we propose a mutually beneficial public-private partnership that combines individual responsibility with community solidarity in building this health care system. 展开更多
关键词 ANALYTICS Big Data Comparative Effectiveness Research Health Insurance Moral Hazards Population Health Sciences
在线阅读 下载PDF
Self-normalized moderate deviations for independent random variables 被引量:2
2
作者 JING BingYi LIANG HanYing ZHOU Wang 《Science China Mathematics》 SCIE 2012年第11期2297-2315,共19页
Let X1,X2,... be a sequence of independent random variables (r.v.s) belonging to the domain of attraction of a normal or stable law. In this paper, we study moderate deviations for the self-normalized sum n X ∑^n_i... Let X1,X2,... be a sequence of independent random variables (r.v.s) belonging to the domain of attraction of a normal or stable law. In this paper, we study moderate deviations for the self-normalized sum n X ∑^n_i=1Xi/Vm,p ,where Vn,p (∑^n_i=1|Xi|p)^1/p (P 〉 1).Applications to the self-normalized law of the iteratedlogarithm, Studentized increments of partial sums, t-statistic, and weighted sum of independent and identically distributed (i.i.d.) r.v.s are considered. 展开更多
关键词 self-normalized sum moderate deviation t-statistic LIL INCREMENT
原文传递
Discussion on “Gaining Effciency via Weight Estimators for Multivariate Failure Time Data” by Fan, Zhou and Chen
3
作者 KUK Anthony 《Science China Mathematics》 SCIE 2009年第6期1129-1130,共2页
The survival analysis literature has always lagged behind the categorical data literature in developing methods to analyze clustered or multivariate data. While estimators based on
关键词 Discussion on Gaining Effciency via Weight Estimators for Multivariate Failure Time Data Zhou and Chen by Fan
原文传递
KTI-RNN:Recognition of Heart Failure from Clinical Notes
4
作者 Dengao Li Huiting Ma +4 位作者 Wenjing Li Baofeng Zhao Jumin Zhao Yi Liu Jian Fu 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2023年第1期117-130,共14页
Although deep learning methods have recently attracted considerable attention in the medical field,analyzing large-scale electronic health record data is still a difficult task.In particular,the accurate recognition o... Although deep learning methods have recently attracted considerable attention in the medical field,analyzing large-scale electronic health record data is still a difficult task.In particular,the accurate recognition of heart failure is a key technology for doctors to make reasonable treatment decisions.This study uses data from the Medical Information Mart for Intensive Care database.Compared with structured data,unstructured data contain abundant patient information.However,this type of data has unsatisfactory characteristics,e.g.,many colloquial vocabularies and sparse content.To solve these problems,we propose the KTI-RNN model for unstructured data recognition.The proposed model overcomes sparse content and obtains good classification results.The term frequency-inverse word frequency(TF-IWF)model is used to extract the keyword set.The latent dirichlet allocation(LDA)model is adopted to extract the topic word set.These models enable the expansion of the medical record text content.Finally,we embed the global attention mechanism and gating mechanism between the bidirectional recurrent neural network(BiRNN)model and the output layer.We call it gated-attention-BiRNN(GA-BiRNN)and use it to identify heart failure from extensive medical texts.Results show that the F 1 score of the proposed KTI-RNN model is 85.57%,and the accuracy rate of the proposed KTI-RNN model is 85.59%. 展开更多
关键词 heart failure diagnosis text classification deep learning
原文传递
Balanced augmented jackknife empirical likelihood for two sample U-statistics
5
作者 Conghua Cheng Yiming Liu +1 位作者 Zhi Liu Wang Zhou 《Science China Mathematics》 SCIE CSCD 2018年第6期1129-1138,共10页
In this paper, we investigate the two sample U-statistics by jackknife empirical likelihood(JEL),a versatile nonparametric approach. More precisely, we propose the method of balanced augmented jackknife empirical like... In this paper, we investigate the two sample U-statistics by jackknife empirical likelihood(JEL),a versatile nonparametric approach. More precisely, we propose the method of balanced augmented jackknife empirical likelihood(BAJEL) by adding two artificial points to the original pseudo-value dataset, and we prove that the log likelihood ratio based on the expanded dataset tends to the χ~2 distribution. 展开更多
关键词 jackknife empirical likelihood U-STATISTICS ROC curves balanced augmented empirical likelihood
原文传递
On the k-sample Behrens-Fisher problem for high-dimensional data 被引量:3
6
作者 ZHANG JinTing XU JinFeng 《Science China Mathematics》 SCIE 2009年第6期1285-1304,共20页
For several decades, much attention has been paid to the two-sample Behrens-Fisher (BF) problem which tests the equality of the means or mean vectors of two normal populations with unequal variance/covariance structur... For several decades, much attention has been paid to the two-sample Behrens-Fisher (BF) problem which tests the equality of the means or mean vectors of two normal populations with unequal variance/covariance structures. Little work, however, has been done for the k-sample BF problem for high dimensional data which tests the equality of the mean vectors of several high-dimensional normal populations with unequal covariance structures. In this paper we study this challenging problem via extending the famous Scheffe’s transformation method, which reduces the k-sample BF problem to a one-sample problem. The induced one-sample problem can be easily tested by the classical Hotelling’s T 2 test when the size of the resulting sample is very large relative to its dimensionality. For high dimensional data, however, the dimensionality of the resulting sample is often very large, and even much larger than its sample size, which makes the classical Hotelling’s T 2 test not powerful or not even well defined. To overcome this difficulty, we propose and study an L 2-norm based test. The asymptotic powers of the proposed L 2-norm based test and Hotelling’s T 2 test are derived and theoretically compared. Methods for implementing the L 2-norm based test are described. Simulation studies are conducted to compare the L 2-norm based test and Hotelling’s T 2 test when the latter can be well defined, and to compare the proposed implementation methods for the L 2-norm based test otherwise. The methodologies are motivated and illustrated by a real data example. 展开更多
关键词 χ 2-approximation χ 2-type mixtures high-dimensional data analysis Hotelling’s T 2 test k-sample test L 2-norm based test Primary 62H15 Secondary 62E17 62E20
原文传递
A regression approach to ROC surface,with applications to Alzheimer's disease
7
作者 LI JiaLiang ZHOU XiaoHua FINE Jason P 《Science China Mathematics》 SCIE 2012年第8期1583-1595,共13页
We consider the estimation of three-dimensional ROC surfaces for continuous tests given covariates.Three way ROC analysis is important in our motivating example where patients with Alzheimer's disease are usually ... We consider the estimation of three-dimensional ROC surfaces for continuous tests given covariates.Three way ROC analysis is important in our motivating example where patients with Alzheimer's disease are usually classified into three categories and should receive different category-specific medical treatment.There has been no discussion on how covariates affect the three way ROC analysis.We propose a regression framework induced from the relationship between test results and covariates.We consider several practical cases and the corresponding inference procedures.Simulations are conducted to validate our methodology.The application on the motivating example illustrates clearly the age and sex effects on the accuracy for Mini-Mental State Examination of Alzheimer's disease. 展开更多
关键词 receiver operating characteristic surface volume under ROC surface rank regression transfor-mation model maximum likelihood estimation BOOTSTRAP
原文传递
Multi-category diagnostic accuracy based on logistic regression
8
作者 Jialiang Li Jason P.Fine Michael JPencina 《Statistical Theory and Related Fields》 2017年第2期143-158,共16页
We provide a detailed review for the statistical analysis of diagnostic accuracy in a multi-category classification task.For qualitative response variables with more than two categories,many traditional accuracy measu... We provide a detailed review for the statistical analysis of diagnostic accuracy in a multi-category classification task.For qualitative response variables with more than two categories,many traditional accuracy measures such as sensitivity,specificity and area under the ROC curve are no longer applicable.In recent literature,new diagnostic accuracy measures are introduced in medical research studies.In this paper,important statistical concepts for multi-category classification accuracy are reviewed and their utilities are demonstrated with real medical examples.We offer problem-based R code to illustrate how to perform these statistical computations step by step.We expect such analysis tools will become more familiar to practitioners and receive broader applications in biostatistics.Our program can be adapted to many classifiers among which logistic regression may be the most popular approach.We thus base our discussion and illustration completely on the logistic regression in this paper. 展开更多
关键词 Hypervolume under the ROC manifold multi-category classification correct classification probability net reclassification improvement integrated discrimination improvement marker evaluation R software
原文传递
On-chain analytics for sentiment-driven statistical causality in cryptocurrencies
9
作者 Ioannis Chalkiadakis Anna Zaremba +1 位作者 Gareth W.Peters Michael J.Chantler 《Blockchain(Research and Applications)》 2022年第2期39-63,共25页
This paper establishes a new framework for assessing multimodal statistical causality between cryptocurrency market(cryptomarket)sentiment and cryptocurrency price processes.In order to achieve this,we present an effi... This paper establishes a new framework for assessing multimodal statistical causality between cryptocurrency market(cryptomarket)sentiment and cryptocurrency price processes.In order to achieve this,we present an efficient algorithm for multimodal statistical causality analysis based on Multiple-Output Gaussian Processes.Signals from different information sources(modalities)are jointly modelled as a Multiple-Output Gaussian Process,and then using a novel approach to statistical causality based on Gaussian Processes(GPs),we study linear and non-linear causal effects between the different modalities.We demonstrate the effectiveness of our approach in a machine learning application by studying the relationship between cryptocurrency spot price dynamics and sentiment time-series data specific to the crypto sector,which we conjecture influences retail investor behaviour.The investor sentiment is extracted from cryptomarket news data via methods developed in the area of statistical machine learning known as Natural Language Processing(NLP).To capture sentiment,we present a novel framework for text to time-series embedding,which we then use to construct a sentiment index from publicly available news articles.We conduct a statistical analysis of our sentiment statistical index model and compare it to alternative state-of-the-art sentiment models popular in the NLP literature.In regard to the multimodal causality,the investor sentiment is our primary modality of exploration,in addition to price and a blockchain technologyrelated indicator(hash rate).Analysis shows that our approach is effective in modelling causal structures of variable degree of complexity between heterogeneous data sources and illustrates the impact that certain modelling choices for the different modalities can have on detecting causality.A solid understanding of these factors is necessary to gauge cryptocurrency adoption by retail investors and provide sentiment-and technologybased insights about the cryptocurrency market dynamics. 展开更多
关键词 Cryptocurrencies Statistical causality Blockchain regression Multiple-output Gaussian process Natural language processing Cryptonews sentiment
原文传递
On high-dimensional change point problem
10
作者 JIN BaiSuo PAN GuangMing +1 位作者 YANG Qing ZHOU Wang 《Science China Mathematics》 SCIE CSCD 2016年第12期2355-2378,共24页
New statistics are proposed to estimate and test the structural change when the data dimension is comparable to or larger than the sample size. Consistency of the new statistic in estimating the change point position ... New statistics are proposed to estimate and test the structural change when the data dimension is comparable to or larger than the sample size. Consistency of the new statistic in estimating the change point position is established under the alternative hypothesis. The asymptotic distribution of the new statistic in testing the existence of a change point is obtained under the null hypothesis. Some simulation results are presented which show that the numerical performance of our method is satisfactory. The method is illustrated via the analysis of the house price index of US. 展开更多
关键词 change point high-dimensional statistics inference Hotelling's T^2 statistic
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部