Volume 7, Issue 5 pp. 385-403
Original Article

Survival analysis with electronic health record data: Experiments with chronic kidney disease

Yolanda Hagar

Yolanda Hagar

Department of Applied Mathematics, University of Colorado at Boulder, Boulder, CO, USA

Search for more papers by this author
David Albers

David Albers

Department of Biomedical Informatics, Columbia University, New York, NY, USA

Search for more papers by this author
Rimma Pivovarov

Rimma Pivovarov

Department of Biomedical Informatics, Columbia University, New York, NY, USA

Search for more papers by this author
Herbert Chase

Herbert Chase

Department of Biomedical Informatics, Columbia University, New York, NY, USA

Search for more papers by this author
Vanja Dukic

Corresponding Author

Vanja Dukic

Department of Applied Mathematics, University of Colorado at Boulder, Boulder, CO, USA

Joint senior authors for this work.Vanja Dukic ([email protected]) and Noémie Elhadad ([email protected])Search for more papers by this author
Noémie Elhadad

Corresponding Author

Noémie Elhadad

Department of Biomedical Informatics, Columbia University, New York, NY, USA

Joint senior authors for this work.Vanja Dukic ([email protected]) and Noémie Elhadad ([email protected])Search for more papers by this author
First published: 19 August 2014
Citations: 35

Abstract

This article presents a detailed survival analysis for chronic kidney disease (CKD). The analysis is based on the electronic health record (EHR) data comprising almost two decades of clinical observations collected at New York-Presbyterian, a large hospital in New York City with one of the oldest electronic health records in the United States. Our survival analysis approach centers around Bayesian multiresolution hazard modeling, with an objective to capture the changing hazard of CKD over time, adjusted for patient clinical covariates and kidney-related laboratory tests. Special attention is paid to statistical issues common to all EHR data, such as cohort definition, missing data and censoring, variable selection, and potential for joint survival and longitudinal modeling, all of which are discussed alone and within the EHR CKD context.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.