| 2,324 | 9 | 137 |
| 下载次数 | 被引频次 | 阅读次数 |
生物统计学是以解决生物学、医学、公共卫生学、农学等领域科学问题为目标的应用型学科,近年来在精准医疗的背景下得以快速发展。另一方面,生物统计研究面对的数据存在海量化、复杂化和异质化的大数据特征,对理论与应用研究者都提出了新的挑战。本文围绕生物统计研究中的流行病学研究、临床试验设计、生存数据分析和基因数据分析展开讨论,在介绍基本思路的基础上对最新挑战及前沿发展方向进行展望。
Abstract:Biostatistics is the application of statistics to the field of biological science,medical science,public health,and agriculture. The beginning of the era of precision medicine has witnessed the rapid advancement in the field of biostatistics. Yet the growing dimension,complexity,and heterogeneity in "big data"have brought about a lot of new challenges to biostatistics researchers in theory and application alike. The article focuses on the recent development in four different areas including epidemiology, clinical trials, survival analysis, and genomic data analysis, discusses the challenges with further development based on introducing the basic concepts.
[1]Wacholder S,Chanock S,Garcia-Closas M,et al.Assessing the probability that a positive report is false:an approach for molecular epidemiology studies[J].Journal of the National Cancer Institute,2004(6):434-442.
[2]March D,Susser E.The eco-in eco-epidemiology[J].International Journal of Epidemiology,2006(6):1379-1383.
[3]Restif O.Evolutionary epidemiology 20 years on:challenges and prospects[J].Infection,Genetics and Evolution,2009(1):108-123.
[4]Ostfeld R S,Glass G E,Keesing F.Spatial epidemiology:an emerging(or re-emerging)discipline[J].Trends in Ecology&Evolution,2005(6):328-336.
[5]Austin P C.The performance of different propensity score methods for estimating marginal hazard ratios[J].Statistics in Medicine,2013(16):2837-2849.
[6]Imai K,Ratkovic M.Covariate balancing propensity score[J].Journal of the Royal Statistical Society:Series B(Statistical Methodology),2014(1):243-263.
[7]Maeda I,Morita T,Yamaguchi T,et al.Effect of continuous deep sedation on survival in patients with advanced cancer(J-Proval):a propensity score-weighted analysis of a prospective cohort study[J].The Lancet Oncology,2016(1):115-122.
[8]Szpiro A A,Paciorek C J.Measurement error in two-stage analyses,with application to air pollution epidemiology[J].Environmetrics,2013(8):501-517.
[9]Szpiro A A,Sheppard L,Adar S D,et al.Estimating acute air pollution health effects from cohort study data[J].Biometrics,2014(1):164-174.
[10]Alexeeff S E,Carroll R J,Coull B.Spatial measurement error and correction by spatial SIMEX in linear regression models when using predicted air pollution exposures[J].Biostatistics,2015.
[11]Ehrhardt S,Appel L J,Meinert C L.Trends in National Institutes of Health Funding for Clinical Trials Registered in Clinical Trials.gov[J].Journal of the American Medical Association,2015(23):2566-2567.
[12]Machin D,Fayers P.Randomized clinical trials:design,practice and reporting[M].John Wiley&Sons,Oxford.2010.
[13]Chow S C,Chang M.Adaptive design methods in clinical trials-a review[J].Orphanet Journal of Rare Diseases,2008(11):169-90.
[14]Rosenberger W F,Lachin J L.Randomization in Clinical Trials:Theory and Practice[M].New York:Wiley.2002.
[15]Efron B.Forcing a sequential experiment to be balanced[J].Biometrika,1971(58):403-417.
[16]Markaryan T,Rosenberger W F.Exact properties of Efron's biased coin randomization procedure[J].Annals of Statistics,2010(38):1546-1567.
[17]Baldi Antognini A.A theoretical analysis of the power of biased coin designs[J].Journal of Statistical Planning and Inference,2008(138):1792-1798.
[18]Hu F,Zhang L X.Asymptotic properties of doubly-adaptive biased coin designs for multitreatment clinical trials[J].Annals of Statistics,2004(32):268-301.
[19]Brown S,Thorpe H,Hawkins K,et al.Minimization-reducing predictability for multi-centre trials whilst retaining balance within centre[J].Statistics in Medicine,2005(25):3715-3727.
[20]Shao J,Yu X,Zhong B.A theory for testing hypotheses under covariate-adaptive randomization[J].Biometrika,2010(97):347-360.
[21]Baldi Antognini A,Zagoraiou M.The covariate-adaptive biased coin design for balancing clinical trials in the presence of prognostic factors[J].Biometrika,2011(98):519-535.
[22]Yuan Y,Huang X,Liu S.A Bayesian response-adaptive covariatebalanced randomization design with application to a leukemia clinical trial[J].Statistics in Medicine,2011(30):1218-1229.
[23]Ma W,Hu F,Zhang L.Testing hypotheses of covariate-adaptive randomized clinical trials[J].Journal of the American Statistical Association,2015(110):669-680.
[24]Hu J,Zhu H,Hu F.A unified family of covariate-adjusted response-adaptive designs based on efficiency and ethics[J].Journal of the American Statistical Association,2015(509):357-367.
[25]Huang T,Zhu H.Longitudinal Covariate-Adjusted ResponseAdaptive Randomization:Impact of Missing Data[M].Boca Raton:CRC Press,2016.
[26]Hartley A M.A Bayesian adaptive blinded sample size adjustment method for risk differences[J].Pharmaceutical Statistics,2015(6):488-514.
[27]Fisher L D,Lin D Y.Time-dependent covariates in the Cox Proportional-hazardsregressionmodel[J].Annual Review of Public Health,1999(1):145-157.
[28]Tsiatis A A,Davidian M.Joint modeling of longitudinal and timeto-event data:an overview[J].Statistica Sinica,2004.809-834.
[29]Ibrahim J G,Chen M H,Sinha D.Bayesian methods for joint modeling of longitudinal and survival data with applications to cancer vaccine trialsp[J].Statistica Sinica,2004.863-883.
[30]Robertson T,Robertson T.Order restricted statistical inference[J].1988.
[31]Huang J.Efficient estimation for the proportional hazards model with interval censoring[J].The Annals of Statistics,1996(2):540-568.
[32]Ma S,Kosorok M R.Robust semiparametric M-estimation and the weighted bootstrap[J].Journal of Multivariate Analysis,2005(1):190-217.
[33]Ma S,Kosorok M R.Adaptive penalized M-estimation with current status data[J].Annals of the Institute of Statistical Mathematics2006,58(3):511-526.
[34]Genest C,Rivest L P.Statistical inference procedures for bivariate Archimedean copulas[J].Journal of the American statistical Association,1993(423):1034-1043.
[35]Hougaard P.Analysis of multivariate survival data[M].New York:Springer-Verlag,2000.
[36]Fine J P,Gray R J.A proportional hazards model for the subdistribution of a competing risk[J].Journal of the American statistical association,1999(446):496-509.
[37]Maller R A,Zhou X.Survival analysis with long-term survivors[M].Wiley New York,1996.
[38]Fang H B,Li G,Sun J.Maximum Likelihood Estimation in a Semiparametric Logistic/Proportional-Hazards Mixture Model[J].Scandinavian Journal of Statistics,2005(1):59-75.
[39]Ma S.Additive risk model for current status data with a cured subgroup[J].Annals of the Institute of Statistical Mathematics,2011(1):117-134.
[40]Ibrahim J G,Chen M H,Mac Eachern S N.Bayesian variable selection for proportional hazards models[J].Canadian Journal of Statistics,1999(4):701-717.
[41]Heagerty P J,Zheng Y.Survival model predictive accuracy and ROC curves[J].Biometrics,2005(1):92-105.
[42]Harrell F E,Califf R M,Pryor D B,et al.Evaluating the yield of medical tests[J].Jama,1982(18):2543-2546.
[43]Pencina M J,D'Agostino R B.Overall C as a measure of discrimination in survival analysis:model specific population value and confidence interval estimation[J].Statistics in medicine,2004(13):2109-2123.
[44]Ma S,Song X.Ranking prognosis markers in cancer genomic studies[J].Briefings in bioinformatics,2010.
[45]Lu Y,Fang J,Tian L,et al.Advanced medical statistics[J].World Scientific,2015.5.
[46]Donoho D L.High-dimensional data analysis:The curses and blessings of dimensionality.[J].AMS Math Challenges Lecture,2000.1-32.
[47]Tibshirani R.Regression shrinkage and selection via the lasso[J].Journal of the Royal Statistical Society.Series B(Methodological),1996.267-288.
[48]Fan J,Li R.Variable selection via nonconcave penalized likelihood and its oracle properties[J].Journal of the American statistical Association,2001(456):1348-1360.
[49]Zhang C H.Nearly unbiased variable selection under minimax concave penalty[J].The Annals of statistics,2010.894-942.
[50]Zou H.The adaptive lasso and its oracle properties[J].Journal of the American statistical association,2006(476):1418-1429.
[51]Zou H,Hastie T.Regularization and variable selection via the elastic net[J].Journal of the Royal Statistical Society:Series B(Statistical Methodology),2005(2):301-320.
[52]Yuan M,Lin Y.Model selection and estimation in regression with grouped variables[J].Journal of the Royal Statistical Society:Series B(Statistical Methodology),2006(1):49-67.
[53]Ma S,Song X,Huang J.Supervised group Lasso with applications to microarray data analysis[J].BMC bioinformatics,2007(1):1.
[54]Huang J,Ma S,Xie H,et al.A group bridge approach for variable selection[J].Biometrika,2009(2):339-355.
[55]Friedman J,Hastie T,Tibshirani,et al.A note on the group lasso and a sparse group lasso[J].ar Xiv preprint ar Xiv,2010.
[56]Huang J,Breheny P,Ma S,et al.A selective review of group selection in high-dimensional models[J].Statistical science:a review journal of the Institute of Mathematical Statistics,2012,27(4).
[57]Li C,Li H,et al.Network-constrained regularization and variable selection for analysis of genomic data[J].Bioinformatics,2008(9):1175-1182.
[58]Huang J,Ma S,Li H,et al.The sparse Laplacian shrinkage estimator for high-dimensional regression[J].Annals of statistics,2011(4):2021.
[59]Shi X,Zhao Q,Huang J,et al.Deciphering the associations between gene expression and copy number alteration using a sparse double Laplacian shrinkage approach[J].Bioinformatics,2015(24):3977-3983.
[60]Friedman J,Hastie T,Tibshirani R,et al.Sparse inverse covariance estimation with the graphical lasso[J].Biostatistics,2008(3):432-441.
[61]Peng J,Wang P,Zhou N,et al.Partial correlation estimation by joint sparse regression models[J].Journal of the American Statistical Association,2012.
[62]Witten D M,Tibshirani R.Covariance-regularized regression and classification for high dimensional problems[J].Journal of the Royal Statistical Society:Series B(Statistical Methodology),2009(3):615-636.
[63]Wang W,Baladandayuthapani V,Morris J S,et al.i BAG:integrative Bayesian analysis of high-dimensional multiplatform genomics data[J].Bioinformatics,2013(2):149-159.
[64]Benjamini Y,Hochberg Y.Controlling the false discovery rate:a practical and powerful approach to multiple testing[J].Journal of the Royal Statistical Society.Series B(Methodological),1995.289-300.
[65]Gilbert P B.A modified false discovery rate multiple-comparisons procedure for discrete data,applied to human immunodeficiency virus genetics[J].Journal of the Royal Statistical Society:Series C(Applied Statistics),2005(1):143-158.
[66]Storey J D.A direct approach to false discovery rates[J].Journal of the Royal Statistical Society:Series B(Statistical Methodology),2002(3):479-498.
[67]Efron B.Local false discovery rates[M].Division of Biostatistics,Stanford University.2005.
[68]Tsai C A,Hsueh H M,Chen J J.Estimation of false discovery rates in multiple testing:application to gene microarray data[J].Biometrics,2003(4):1071-1081.
[69]Benjamini Y,Yekutieli D.The control of the false discovery rate in multiple testing under dependency[J].Annals of statistics,2001.1165-1188.
[70]Wu W B.On false discovery control under dependence[J].The Annals of statistics,2008.364-380.
[71]Sun W,Tony Cai T.Large-scale multiple testing under dependence[J].Journal of the Royal Statistical Society:Series B(Statistical Methodology),2009(2):393-424.
[72]Liu J,Peissig P,Zhang C,et al.Graphical-model based multiple testing under dependence,with applications to genome-wide association studies[A].In Uncertainty in artificial intelligence:proceedings of the...Conference.Conference on Uncertainty in Artificial Intelligence[C].NIH Public Access,2012.511.
基本信息:
DOI:10.19343/j.cnki.11-1302/c.2016.06.001
中图分类号:Q-332
引用信息:
[1]李扬,赵青,马双鸽.生物统计的研究进展与挑战[J].统计研究,2016,33(06):3-12.DOI:10.19343/j.cnki.11-1302/c.2016.06.001.
基金信息:
中国人民大学科学研究基金(中央高校基本科研业务费专项资金资助)项目“生物医学大数据的统计方法基础研究”(15XNI011)的阶段性成果
2016-06-15
2016-06-15