Volume 20, Issue 20 pp. 3123-3157
Tutorial in Biostatistics

The applications of capture-recapture models to epidemiological data

Anne Chao

Corresponding Author

Anne Chao

Institute of Statistics, National Tsing Hua University, Hsin-Chu, Taiwan

Institute of Statistics, National Tsing Hua University, Hsin-Chu 30043, TaiwanSearch for more papers by this author
P. K. Tsay

P. K. Tsay

Institute of Statistics, National Tsing Hua University, Hsin-Chu, Taiwan

Search for more papers by this author
Sheng-Hsiang Lin

Sheng-Hsiang Lin

Institute of Statistics, National Tsing Hua University, Hsin-Chu, Taiwan

Search for more papers by this author
Wen-Yi Shau

Wen-Yi Shau

Graduate Institute of Clinical Medicine, National Taiwan University, Taipei, Taiwan

Search for more papers by this author
Day-Yu Chao

Day-Yu Chao

Graduate Institute of Epidemiology, National Taiwan University, Taipei, Taiwan

Search for more papers by this author
First published: 03 October 2001
Citations: 240

Abstract

Capture-recapture methodology, originally developed for estimating demographic parameters of animal populations, has been applied to human populations. This tutorial reviews various closed capture-recapture models which are applicable to ascertainment data for estimating the size of a target population based on several incomplete lists of individuals. Most epidemiological approaches merging different lists and eliminating duplicate cases are likely to be biased downwards. That is, the final merged list misses those who are in the population but were not ascertained in any of the lists. If there are no matching errors, then the duplicate information collected from a capture-recapture experiment can be used to estimate the number of missed under proper assumptions. Three approaches and their associated estimation procedures are introduced: ecological models; log-linear models, and the sample coverage approach. Each approach has its unique way of incorporating two types of source dependencies: local (list) dependence and dependence due to heterogeneity. An interactive program, CARE (for capture-recapture) developed by the authors is demonstrated using four real data sets. One set of data deals with infection by the acute hepatitis A virus in an outbreak in Taiwan; the other three sets are ascertainment data on diabetes, spina bifida and infants' congenital anomaly discussed in the literature. These data sets provide examples to show the usefulness of the capture-recapture method in correcting for under-ascertainment. The limitations of the methodology and some cautionary remarks are also discussed. Copyright © 2001 John Wiley & Sons, Ltd.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.