Volume 7, Issue 5 pp. 337-351
Review

Integrating database knowledge and epidemiological design to improve the implementation of data mining methods that evaluate vaccine safety in large healthcare databases

Jennifer C. Nelson

Corresponding Author

Jennifer C. Nelson

Biostatistics Unit, Group Health Research Institute, Seattle, WA 98101, USA

Department of Biostatistics, University of Washington, Seattle, WA 98195, USA

Jennifer Clark Nelson ([email protected])Search for more papers by this author
Susan M. Shortreed

Susan M. Shortreed

Biostatistics Unit, Group Health Research Institute, Seattle, WA 98101, USA

Department of Biostatistics, University of Washington, Seattle, WA 98195, USA

Search for more papers by this author
Onchee Yu

Onchee Yu

Biostatistics Unit, Group Health Research Institute, Seattle, WA 98101, USA

Search for more papers by this author
Do Peterson

Do Peterson

Biostatistics Unit, Group Health Research Institute, Seattle, WA 98101, USA

Search for more papers by this author
Roger Baxter

Roger Baxter

Vaccine Study Center and Division of Research, Northern California Kaiser Permanente, Oakland, CA 94612, USA

Search for more papers by this author
Bruce Fireman

Bruce Fireman

Vaccine Study Center and Division of Research, Northern California Kaiser Permanente, Oakland, CA 94612, USA

Search for more papers by this author
Ned Lewis

Ned Lewis

Vaccine Study Center and Division of Research, Northern California Kaiser Permanente, Oakland, CA 94612, USA

Search for more papers by this author
Dave McClure

Dave McClure

Epidemiology Research Center, Marshfield Clinic Research Foundation, Marshfield, WI 54449, USA

Search for more papers by this author
Eric Weintraub

Eric Weintraub

Centers for Disease Control and Prevention, Atlanta, GA 30333, USA

Search for more papers by this author
Stan Xu

Stan Xu

Kaiser Permanente Institute for Health Research, Denver, CO 80231, USA

Search for more papers by this author
Lisa A. Jackson

Lisa A. Jackson

Biostatistics Unit, Group Health Research Institute, Seattle, WA 98101, USA

Department of Epidemiology, University of Washington, Seattle, WA 98195, USA

Search for more papers by this author
on behalf of the Vaccine Safety Datalink project

on behalf of the Vaccine Safety Datalink project

Search for more papers by this author
First published: 19 August 2014
Citations: 5

Abstract

Large healthcare databases maintained by health plans have been widely used to conduct customized protocol-based epidemiological safety studies as well as targeted routine sequential monitoring of suspected adverse events for newly licensed vaccines. These databases also offer a rich data source to discover vaccine-related adverse events not known prior to licensure using data mining methods, but they remain relatively under-utilized for this purpose. Initial safety applications of data mining methods using ‘big healthcare data’ are promising, but stronger integration of database expertize, epidemiological design, and statistical analysis strategies are needed to better leverage the available information, reduce bias, and improve reporting transparency. We enumerate major methodological challenges in mining large healthcare databases for vaccine safety research, describe existing strategies that have been used to address these issues, and identify opportunities for methodological advancements that emphasize the importance of adapting techniques used in customized protocol-based vaccine safety assessments. Investment in such research methods and in the development of deeper collaborations between database safety experts and data mining methodologists has great potential to improve existing safety surveillance programs and further increase public confidence in the safety of newly licensed vaccines.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.