Volume 52, Issue 1
Research Article

L1 Penalized Estimation in the Cox Proportional Hazards Model

Jelle J. Goeman

Corresponding Author

E-mail address: j.j.goeman@lumc.nl

Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, The Netherlands

Phone: +31‐71‐5269708Search for more papers by this author
First published: 17 February 2010
Citations: 111

Abstract

This article presents a novel algorithm that efficiently computes L1 penalized (lasso) estimates of parameters in high‐dimensional models. The lasso has the property that it simultaneously performs variable selection and shrinkage, which makes it very useful for finding interpretable prediction rules in high‐dimensional data. The new algorithm is based on a combination of gradient ascent optimization with the Newton–Raphson algorithm. It is described for a general likelihood function and can be applied in generalized linear models and other models with an L1 penalty. The algorithm is demonstrated in the Cox proportional hazards model, predicting survival of breast cancer patients using gene expression data, and its performance is compared with competing approaches. An R package, penalized, that implements the method, is available on CRAN.

Number of times cited according to CrossRef: 111

  • Cancer Biomarker Discovery for Precision Medicine: New Progress, Current Medicinal Chemistry, 10.2174/0929867325666180718164712, 26, 42, (7655-7671), (2020).
  • A 16‐mRNA signature optimizes recurrence‐free survival prediction of Stages II and III gastric cancer, Journal of Cellular Physiology, 10.1002/jcp.29511, 235, 7-8, (5777-5786), (2020).
  • Prenatal diagnosis and long‐term follow‐up of a Chinese patient with mosaic variegated aneuploidy and its molecular analysis, Clinical Case Reports, 10.1002/ccr3.2802, 8, 8, (1369-1375), (2020).
  • Identification and Validation of an Energy Metabolism-Related lncRNA-mRNA Signature for Lower-Grade Glioma, BioMed Research International, 10.1155/2020/3708231, 2020, (1-14), (2020).
  • Protein disulfide isomerases are promising targets for predicting the survival and tumor progression in glioma patients, Aging, 10.18632/aging.102748, 12, 3, (2347-2372), (2020).
  • A 12-immune cell signature to predict relapse and guide chemotherapy for stage II colorectal cancer, Aging, 10.18632/aging.103707, 12, 18, (18363-18383), (2020).
  • Catecholate‐Based Helicates, European Journal of Inorganic Chemistry, 10.1002/ejic.202000255, 2020, 23, (2227-2237), (2020).
  • Butterfly–plant interaction ‐ A dicey ménage à trois?, Plant, Cell & Environment, 10.1111/pce.13782, 43, 8, (1811-1814), (2020).
  • Synthesis of 2‐Aminoquinazoline‐ and 2‐Aminopyrimidine‐Fused Hybrid Scaffolds by Copper‐Catalyzed C(sp2)–N Coupling and Cyclization Followed by Oxidation, European Journal of Organic Chemistry, 10.1002/ejoc.202000427, 2020, 23, (3468-3474), (2020).
  • Value of robotic surgery in endometrial cancer by body mass index, International Journal of Gynecology & Obstetrics, 10.1002/ijgo.13258, 150, 3, (398-405), (2020).
  • Utilization rate of uncontrolled donors after circulatory death‐a 3‐year single‐center investigation, Clinical Transplantation, 10.1111/ctr.13896, 34, 8, (2020).
  • Anode Materials for Zinc‐Air Batteries, Zinc Batteries, 10.1002/9781119662433, (103-130), (2020).
  • Self‐constraining of low‐energy rivers explains low channel mobility and tortuous planforms, The Depositional Record, 10.1002/dep2.112, 6, 3, (648-669), (2020).
  • Impella RP as a bridge to cardiac transplant for refractory late right ventricular failure in setting of left ventricular assist device, ESC Heart Failure, 10.1002/ehf2.12685, 7, 4, (1972-1975), (2020).
  • Abolishing spontaneous epileptiform activity in human brain tissue through AMPA receptor inhibition, Annals of Clinical and Translational Neurology, 10.1002/acn3.51030, 7, 6, (883-890), (2020).
  • The relationship between computed tomography‐derived body composition and survival in colorectal cancer: the effect of image software, JCSM Rapid Communications, 10.1002/rco2.15, 3, 2, (81-90), (2020).
  • Locally discrete expanding groups of analytic diffeomorphisms of the circle, Journal of Topology, 10.1112/topo.12146, 13, 3, (1216-1229), (2020).
  • Modifications of medical treatment and outcome after percutaneous correction of secondary mitral regurgitation, ESC Heart Failure, 10.1002/ehf2.12737, 7, 4, (1753-1763), (2020).
  • FF‐QuantSC: accurate quantification of fetal fraction by a neural network model, Molecular Genetics & Genomic Medicine, 10.1002/mgg3.1232, 8, 6, (2020).
  • Recurrence-Associated Multi-RNA Signature to Predict Disease-Free Survival for Ovarian Cancer Patients, BioMed Research International, 10.1155/2020/1618527, 2020, (1-19), (2020).
  • Identification of key molecular targets that correlate with breast cancer through bioinformatic methods, The Journal of Gene Medicine, 10.1002/jgm.3141, 22, 3, (2020).
  • sn-spMF: matrix factorization informs tissue-specific genetic regulation of gene expression, Genome Biology, 10.1186/s13059-020-02129-6, 21, 1, (2020).
  • Development and Validation of a Novel DNA Methylation-Driven Gene Based Molecular Classification and Predictive Model for Overall Survival and Immunotherapy Response in Patients With Glioblastoma: A Multiomic Analysis, Frontiers in Cell and Developmental Biology, 10.3389/fcell.2020.576996, 8, (2020).
  • A 1p/19q Codeletion-Associated Immune Signature for Predicting Lower Grade Glioma Prognosis, Cellular and Molecular Neurobiology, 10.1007/s10571-020-00959-3, (2020).
  • Risk score based on expression of five novel genes predicts survival in soft tissue sarcoma, Aging, 10.18632/aging.102847, 12, 4, (3807-3827), (2020).
  • Systematic identification, development, and validation of prognostic biomarkers involving the tumor‐immune microenvironment for glioblastoma, Journal of Cellular Physiology, 10.1002/jcp.29878, 0, 0, (2020).
  • Development of a Nomogram With Alternative Splicing Signatures for Predicting the Prognosis of Glioblastoma: A Study Based on Large-Scale Sequencing Data, Frontiers in Oncology, 10.3389/fonc.2020.01257, 10, (2020).
  • The Potential of Five Immune-Related Prognostic Genes to Predict Survival and Response to Immune Checkpoint Inhibitors for Soft Tissue Sarcomas Based on Multi-Omic Study, Frontiers in Oncology, 10.3389/fonc.2020.01317, 10, (2020).
  • Identification of a Five-Gene Signature for Predicting Survival in Malignant Pleural Mesothelioma Patients, Frontiers in Genetics, 10.3389/fgene.2020.00899, 11, (2020).
  • A dose-finding approach for genomic patterns in phase I trials, Journal of Biopharmaceutical Statistics, 10.1080/10543406.2020.1744619, (1-20), (2020).
  • Stress and Self-Efficacy as Long-Term Predictors for Chronic Low Back Pain: A Prospective Longitudinal Study

    ,
    Journal of Pain Research, 10.2147/JPR.S223893, Volume 13, (613-621), (2020).
  • Malignant Evaluation and Clinical Prognostic Values of m6A RNA Methylation Regulators in Glioblastoma, Frontiers in Oncology, 10.3389/fonc.2020.00208, 10, (2020).
  • A Scalable Empirical Bayes Approach to Variable Selection in Generalized Linear Models, Journal of Computational and Graphical Statistics, 10.1080/10618600.2019.1706542, (1-12), (2020).
  • Development and Validation of a Tumor Mutation Burden–Related Immune Prognostic Model for Lower-Grade Glioma, Frontiers in Oncology, 10.3389/fonc.2020.01409, 10, (2020).
  • Pulse Wave Velocity and Machine Learning to Predict Cardiovascular Outcomes in Prediabetic and Diabetic Populations, Journal of Medical Systems, 10.1007/s10916-019-1479-y, 44, 1, (2019).
  • Uncertain times and early predictions of bank failure, Financial Review, 10.1111/fire.12213, 55, 4, (583-601), (2019).
  • Sunk Costs and Screening: Two‐Part Tariffs in Life Insurance, Journal of Risk and Insurance, 10.1111/jori.12283, 87, 3, (689-718), (2019).
  • A Potential Prognostic Gene Signature for Predicting Survival for Glioblastoma Patients, BioMed Research International, 10.1155/2019/9506461, 2019, (1-9), (2019).
  • Heterogeneity of Treatment Effects From an Intensive Lifestyle Weight Loss Intervention on Cardiovascular Events in Patients With Type 2 Diabetes: Data From the Look AHEAD Trial, Diabetes Care, 10.2337/dc19-0776, 42, 10, (1988-1994), (2019).
  • A Brief Neuropsychological Battery for Measuring Cognitive Functions Associated with Obesity, Obesity, 10.1002/oby.22644, 27, 12, (1988-1996), (2019).
  • Each p[ɚ]son does it th[εː] way: Rhoticity variation and the community grammar, Language Variation and Change, 10.1017/S0954394519000048, 31, 1, (91-117), (2019).
  • An immune infiltration signature to predict the overall survival of patients with colon cancer, IUBMB Life, 10.1002/iub.2124, 71, 11, (1760-1770), (2019).
  • Identification of a three‐miRNA signature as a novel potential prognostic biomarker in patients with clear cell renal cell carcinoma, Journal of Cellular Biochemistry, 10.1002/jcb.28648, 120, 8, (13751-13764), (2019).
  • CpG methylation signature predicts prognosis in breast cancer, Breast Cancer Research and Treatment, 10.1007/s10549-019-05417-3, (2019).
  • A Long Non-coding RNA Signature to Improve Prognostic Prediction of Pancreatic Ductal Adenocarcinoma, Frontiers in Oncology, 10.3389/fonc.2019.01160, 9, (2019).
  • Development and Validation of an IDH1-Associated Immune Prognostic Signature for Diffuse Lower-Grade Glioma, Frontiers in Oncology, 10.3389/fonc.2019.01310, 9, (2019).
  • A genomic-clinical nomogram predicting recurrence-free survival for patients diagnosed with hepatocellular carcinoma, PeerJ, 10.7717/peerj.7942, 7, (e7942), (2019).
  • Clinical-Pharmacogenetic Predictive Models for Time to Occurrence of Levodopa Related Motor Complications in Parkinson’s Disease, Frontiers in Genetics, 10.3389/fgene.2019.00461, 10, (2019).
  • Investigating Gene–Gene and Gene–Environment Interactions in the Association Between Overnutrition and Obesity-Related Phenotypes, Frontiers in Genetics, 10.3389/fgene.2019.00151, 10, (2019).
  • Estimation of variance components, heritability and the ridge penalty in high-dimensional generalized linear models, Communications in Statistics - Simulation and Computation, 10.1080/03610918.2019.1646760, (1-19), (2019).
  • Prognostic value of a gene signature in clear cell renal cell carcinoma, Journal of Cellular Physiology, 10.1002/jcp.27700, 234, 7, (10324-10335), (2018).
  • A four-microRNA classifier as a novel prognostic marker for tumor recurrence in stage II colon cancer, Scientific Reports, 10.1038/s41598-018-24519-4, 8, 1, (2018).
  • Development of the Behaviorally Anchored Rating Scales for the Skills Demonstration and Progression Guide, ETS Research Report Series, 10.1002/ets2.12210, 2018, 1, (1-36), (2018).
  • Examining the Applications and Opinions of the TOEFL ITP® Assessment Series Test Scores in Three Countries, ETS Research Report Series, 10.1002/ets2.12231, 2018, 1, (1-30), (2018).
  • Understanding the economic determinants of the severity of operational losses: A regularized generalized Pareto regression approach, Journal of Applied Econometrics, 10.1002/jae.2638, 33, 6, (898-935), (2018).
  • Automated Scoring of Nonnative Speech Using the SpeechRaterSM v. 5.0 Engine, ETS Research Report Series, 10.1002/ets2.12198, 2018, 1, (1-31), (2018).
  • Forecasting Airport Transfer Passenger Flow Using Real-Time Data and Machine Learning, SSRN Electronic Journal, 10.2139/ssrn.3245609, (2018).
  • A Prognostic Signature for Lower Grade Gliomas Based on Expression of Long Non-Coding RNAs, Molecular Neurobiology, 10.1007/s12035-018-1416-y, (2018).
  • Immune cell infiltration as a biomarker for the diagnosis and prognosis of stage I–III colon cancer, Cancer Immunology, Immunotherapy, 10.1007/s00262-018-2289-7, (2018).
  • Clinical Pharmacogenetic Models of Treatment Response to Methotrexate Monotherapy in Slovenian and Serbian Rheumatoid Arthritis Patients: Differences in Patient's Management May Preclude Generalization of the Models, Frontiers in Pharmacology, 10.3389/fphar.2018.00020, 9, (2018).
  • Restoration of Cognitive Performance in Mice Carrying a Deficient Allele of 8-Oxoguanine DNA Glycosylase by X-ray Irradiation, Neurotoxicity Research, 10.1007/s12640-017-9833-7, 33, 4, (824-836), (2017).
  • Assessment of listing and categorisation of animal diseases within the framework of the Animal Health Law (Regulation (EU) No 2016/429): ovine epididymitis (Brucella ovis), EFSA Journal, 10.2903/j.efsa.2017.4994, 15, 10, (2017).
  • Assessment of listing and categorisation of animal diseases within the framework of the Animal Health Law (Regulation (EU) No 2016/429): Trichomonosis, EFSA Journal, 10.2903/j.efsa.2017.4992, 15, 10, (2017).
  • Discriminating between positional and non-positional obstructive sleep apnea using some clinical characteristics, Sleep and Breathing, 10.1007/s11325-017-1499-0, 21, 4, (877-884), (2017).
  • Relation between postoperative ileus and anastomotic leakage after colorectal resection: a post hoc analysis of a prospective randomized controlled trial, Colorectal Disease, 10.1111/codi.13582, 19, 7, (667-674), (2017).
  • Selection of effects in Cox frailty models by regularization methods, Biometrics, 10.1111/biom.12637, 73, 3, (846-856), (2017).
  • Modelling imperfect presence data obtained by citizen science, Environmetrics, 10.1002/env.2446, 28, 5, (2017).
  • Quantitative EEG (QEEG) Measures Differentiate Parkinson's Disease (PD) Patients from Healthy Controls (HC), Frontiers in Aging Neuroscience, 10.3389/fnagi.2017.00003, 9, (2017).
  • A validated microRNA profile with predictive potential in glioblastoma patients treated with bevacizumab, Molecular Oncology, 10.1016/j.molonc.2016.06.004, 10, 8, (1296-1304), (2016).
  • Signaling and Two-Part Tariffs in Long-Term Insurance Contracts, SSRN Electronic Journal, 10.2139/ssrn.2863171, (2016).
  • The Myopia of Crowds: A Study of Collective Evaluation on Stack Exchange, SSRN Electronic Journal, 10.2139/ssrn.2736568, (2016).
  • Too many covariates and too few cases? – a comparative study, Statistics in Medicine, 10.1002/sim.7021, 35, 25, (4546-4558), (2016).
  • Structured fusion lasso penalized multi‐state models, Statistics in Medicine, 10.1002/sim.7017, 35, 25, (4637-4659), (2016).
  • Objective Bayesian model selection for Cox regression, Statistics in Medicine, 10.1002/sim.7089, 35, 29, (5376-5390), (2016).
  • Analysis of Red Sea fish species' introductions into the Mediterranean reveals shifts in introduction patterns, Journal of Biogeography, 10.1111/jbi.12793, 43, 9, (1797-1807), (2016).
  • An asymmetric logistic regression model for ecological data, Methods in Ecology and Evolution, 10.1111/2041-210X.12473, 7, 2, (249-260), (2015).
  • Better prediction by use of co‐data: adaptive group‐regularized ridge regression, Statistics in Medicine, 10.1002/sim.6732, 35, 3, (368-381), (2015).
  • Covariate dimension reduction for survival data via the Gaussian process latent variable model, Statistics in Medicine, 10.1002/sim.6784, 35, 8, (1340-1353), (2015).
  • Precinct or Prejudice? Understanding Racial Disparities in New York City's Stop-and-Frisk Policy, SSRN Electronic Journal, 10.2139/ssrn.2572718, (2015).
  • Nonidentical twins: Comparison of frequentist and Bayesian lasso for Cox models, Biometrical Journal, 10.1002/bimj.201400160, 57, 6, (959-981), (2015).
  • Joint Effect of Unlinked Genotypes: Application to Type 2 Diabetes in the EPIC‐Potsdam Case‐Cohort Study, Annals of Human Genetics, 10.1111/ahg.12115, 79, 4, (253-263), (2015).
  • Enhancing the Lasso Approach for Developing a Survival Prediction Model Based on Gene Expression Data, Computational and Mathematical Methods in Medicine, 10.1155/2015/259474, 2015, (1-7), (2015).
  • undefined, Proceedings of the 26th ACM Conference on Hypertext & Social Media - HT '15, 10.1145/2700171.2791247, (139-148), (2015).
  • Generalized t‐statistic for two‐group classification, Biometrics, 10.1111/biom.12265, 71, 2, (404-416), (2014).
  • Application of LASSO to the Eigenvector Selection Problem in Eigenvector‐based Spatial Filtering, Geographical Analysis, 10.1111/gean.12054, 47, 3, (284-299), (2014).
  • Fused lasso algorithm for Cox′ proportional hazards and binomial logit models with application to copy number profiles, Biometrical Journal, 10.1002/bimj.201200241, 56, 3, (477-492), (2014).
  • Least angle regression for model selection, WIREs Computational Statistics , 10.1002/wics.1288, 6, 2, (116-123), (2014).
  • High-Dimensional Additive Hazards Regression for Oral Squamous Cell Carcinoma Using Microarray Data: A Comparative Study, BioMed Research International, 10.1155/2014/393280, 2014, (1-7), (2014).
  • Including network knowledge into Cox regression models for biomarker signature discovery, Biometrical Journal, 10.1002/bimj.201300035, 56, 2, (287-306), (2014).
  • Internet-Based Motivation Program for Women With Eating Disorders: Eating Disorder Pathology and Depressive Mood Predict Dropout, Journal of Medical Internet Research, 10.2196/jmir.3104, 16, 3, (e92), (2014).
  • Large‐scale parametric survival analysis, Statistics in Medicine, 10.1002/sim.5817, 32, 23, (3955-3971), (2013).
  • Gain of 1q is a marker of poor prognosis in Wilms' tumors, Genes, Chromosomes and Cancer, 10.1002/gcc.22101, 52, 11, (1065-1074), (2013).
  • A Simple Tool to Predict End‐Stage Renal Disease within 1 Year in Elderly Adults with Advanced Chronic Kidney Disease, Journal of the American Geriatrics Society, 10.1111/jgs.12223, 61, 5, (762-768), (2013).
  • Increased Metallothionein Expression Reflects Steroid Resistance in Renal Allograft Recipients, American Journal of Transplantation, 10.1111/ajt.12314, 13, 8, (2106-2118), (2013).
  • Network‐based regularization for matched case‐control analysis of high‐dimensional DNA methylation data, Statistics in Medicine, 10.1002/sim.5694, 32, 12, (2127-2139), (2012).
  • Analysis of multiple exposures in the case‐crossover design via sparse conditional likelihood, Statistics in Medicine, 10.1002/sim.5344, 31, 21, (2290-2302), (2012).
  • An evaluation of penalised survival methods for developing prognostic models with rare events, Statistics in Medicine, 10.1002/sim.4371, 31, 11-12, (1150-1161), (2011).
  • Pseudosibship methods in the case‐parents design, Statistics in Medicine, 10.1002/sim.4397, 30, 27, (3236-3251), (2011).
  • An overview of techniques for linking high‐dimensional molecular data to time‐to‐event endpoints by risk prediction models, Biometrical Journal, 10.1002/bimj.201000152, 53, 2, (170-189), (2011).
  • Use of pretransformation to cope with extreme values in important candidate features, Biometrical Journal, 10.1002/bimj.201000189, 53, 4, (673-688), (2011).
  • See more

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.