Volume 86, Issue 2 pp. 111-115
Outstanding Observation

Many human immunoglobulin heavy-chain IGHV gene polymorphisms have been reported in error

Yan Wang

Yan Wang

School of Biotechnology and Biomolecular Sciences, University of New South Wales, Kensington, New South Wales, Australia

Search for more papers by this author
Katherine J L Jackson

Katherine J L Jackson

School of Biotechnology and Biomolecular Sciences, University of New South Wales, Kensington, New South Wales, Australia

Search for more papers by this author
William A Sewell

William A Sewell

Institute of Laboratory Medicine, St Vincent's Hospital, Sydney, New South Wales, Australia

St Vincent's Clinical School, University of New South Wales, Darlinghurst, New South Wales, Australia

Search for more papers by this author
Andrew M Collins

Corresponding Author

Andrew M Collins

School of Biotechnology and Biomolecular Sciences, University of New South Wales, Kensington, New South Wales, Australia

Dr AM Collins, School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales 2052, Australia. E-mail: [email protected]Search for more papers by this author
First published: 27 November 2007
Citations: 46

Abstract

The identification of the genes that make up rearranged immunoglobulin genes is critical to many studies. For example, the enumeration of mutations in immunoglobulin genes is important for the prognosis of chronic lymphocytic leukemia, and this requires the accurate identification of the germline genes from which a particular sequence is derived. The immunoglobulin heavy-chain variable (IGHV) gene repertoire is generally considered to be highly polymorphic. In this report, we describe a bioinformatic analysis of germline and rearranged immunoglobulin gene sequences which casts doubt on the existence of a substantial proportion of reported germline polymorphisms. We report a five-level classification system for IGHV genes, which indicates the likelihood that the genes have been reported accurately. The classification scheme also reflects the likelihood that germline genes could be incorrectly identified in mutated VDJ rearrangements, because of similarities to other alleles. Of the 226 IGHV alleles that have previously been reported, our analysis suggests that 104 of these alleles almost certainly include sequence errors, and should be removed from the available repertoire. The analysis also highlights the presence of common mismatches, with respect to the germline, in many rearranged heavy-chain sequences, suggesting the existence of twelve previously unreported alleles. Sequencing of IGHV genes from six individuals in this study confirmed the existence of three of these alleles, which we designate IGHV3-49*04, IGHV3-49*05 and IGHV4-39*07. We therefore present a revised repertoire of expressed IGHV genes, which should substantially improve the accuracy of immunoglobulin gene analysis.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.