|
PMID |
|
|
TITLE |
|
A simple and fast two-locus quality control test to detect false positives due to batch effects in genome-wide association studies. |
|
ABSTRACT |
|
|
|
The impact of erroneous genotypes having passed standard quality control (QC) can be severe in genome-wide association studies, genotype imputation, and estimation of heritability and prediction of genetic risk based on single nucleotide polymorphisms (SNP). To detect such genotyping errors, a simple two-locus QC method, based on the difference in test statistic of association between single SNPs and pairs of SNPs, was developed and applied. The proposed approach could detect many problematic SNPs with statistical significance even when standard single SNP QC analyses fail to detect them in real data. Depending on the data set used, the number of erroneous SNPs that were not filtered out by standard single SNP QC but detected by the proposed approach varied from a few hundred to thousands. Using simulated data, it was shown that the proposed method was powerful and performed better than other tested existing methods. The power of the proposed approach to detect erroneous genotypes was ∼80% for a 3% error rate per SNP. This novel QC approach is easy to implement and computationally efficient, and can lead to a better quality of genotypes for subsequent genotype-phenotype investigations. |
© 2010 Wiley-Liss, Inc. |
|
DATE PUBLISHED |
|
|
HISTORY |
|
PUBSTATUS |
PUBSTATUSDATE |
entrez |
2010/11/25 06:00 |
pubmed |
2010/11/26 06:00 |
medline |
2011/03/05 06:00 |
|
AUTHORS |
|
NAME |
COLLECTIVENAME |
LASTNAME |
FORENAME |
INITIALS |
AFFILIATION |
AFFILIATIONINFO |
Lee SH |
|
Lee |
Sang Hong |
SH |
|
Queensland Institute of Medical Research, Herston, Queensland, Australia. hong.lee@qimr.edu.au |
Nyholt DR |
|
Nyholt |
Dale R |
DR |
|
|
Macgregor S |
|
Macgregor |
Stuart |
S |
|
|
Henders AK |
|
Henders |
Anjali K |
AK |
|
|
Zondervan KT |
|
Zondervan |
Krina T |
KT |
|
|
Montgomery GW |
|
Montgomery |
Grant W |
GW |
|
|
Visscher PM |
|
Visscher |
Peter M |
PM |
|
|
|
INVESTIGATORS |
|
|
JOURNAL |
|
VOLUME: 34 |
ISSUE: 8 |
TITLE: Genetic epidemiology |
ISOABBREVIATION: Genet. Epidemiol. |
YEAR: 2010 |
MONTH: Dec |
DAY: |
MEDLINEDATE: |
SEASON: |
CITEDMEDIUM: Internet |
ISSN: 1098-2272 |
ISSNTYPE: Electronic |
|
MEDLINE JOURNAL |
|
MEDLINETA: Genet Epidemiol |
COUNTRY: United States |
ISSNLINKING: 0741-0395 |
NLMUNIQUEID: 8411723 |
|
PUBLICATION TYPE |
|
PUBLICATIONTYPE TEXT |
Journal Article |
Research Support, Non-U.S. Gov't |
|
COMMENTS AND CORRECTIONS |
|
REFTYPE |
REFSOURCE |
REFPMID |
NOTE |
Cites |
Fertil Steril. 2002 Oct;78(4):679-85 |
12372440 |
|
Cites |
PLoS Genet. 2008 Oct;4(10):e1000231 |
18949033 |
|
Cites |
Am J Hum Genet. 2005 Sep;77(3):365-76 |
16080113 |
|
Cites |
Eur J Hum Genet. 2006 Apr;14(4):450-8 |
16435001 |
|
Cites |
Am J Hum Genet. 2006 May;78(5):737-46 |
16642430 |
|
Cites |
Hum Hered. 2006;61(1):31-44 |
16557026 |
|
Cites |
PLoS Genet. 2006 Mar;2(3):e41 |
16565746 |
|
Cites |
Bioinformatics. 2007 Jan 15;23(2):255-6 |
17118959 |
|
Cites |
Nature. 2007 Jun 7;447(7145):661-78 |
17554300 |
|
Cites |
Am J Hum Genet. 2007 Sep;81(3):559-75 |
17701901 |
|
Cites |
Genome Res. 2007 Oct;17(10):1520-8 |
17785532 |
|
Cites |
Bioinformatics. 2002 Feb;18(2):337-8 |
11847089 |
|
Cites |
JAMA. 2008 Mar 19;299(11):1335-44 |
18349094 |
|
Cites |
Nat Genet. 2008 May;40(5):489-90 |
18443579 |
|
Cites |
PLoS Genet. 2008;4(7):e1000130 |
18654633 |
|
Cites |
BMC Genet. 2009;10:3 |
19178712 |
|
Cites |
BMC Genomics. 2009;10:106 |
19284636 |
|
Cites |
PLoS Genet. 2009 Jul;5(7):e1000572 |
19629167 |
|
Cites |
Nature. 2009 Oct 8;461(7265):747-53 |
19812666 |
|
Cites |
Am J Hum Genet. 2010 Jan;86(1):88-92 |
20045101 |
|
Cites |
Am J Hum Genet. 2010 Apr 9;86(4):519-25 |
20303062 |
|
Cites |
Am J Hum Genet. 2003 Mar;72(3):598-610 |
12587097 |
|
|
GRANTS |
|
GRANTID |
AGENCY |
COUNTRY |
076113 |
Wellcome Trust |
United Kingdom |
084766 |
Wellcome Trust |
United Kingdom |
085235 |
Wellcome Trust |
United Kingdom |
|
GENERAL NOTE |
|
|
KEYWORDS |
|
|
MESH HEADINGS |
|
DESCRIPTORNAME |
QUALIFIERNAME |
Genetic Loci |
|
Genome, Human |
|
Genome-Wide Association Study |
standards |
Genotype |
standards |
Humans |
standards |
Models, Genetic |
standards |
Polymorphism, Single Nucleotide |
genetics |
Quality Control |
genetics |
|
SUPPLEMENTARY MESH |
|
|
GENE SYMBOLS |
|
|
CHEMICALS |
|
|
OTHER ID's |
|
OTHERID |
SOURCE |
PMC3674525 |
NLM |
|
|