Skip Navigation


Biostatistics Advance Access originally published online on December 22, 2006
Biostatistics 2007 8(2):485-499; doi:10.1093/biostatistics/kxl042
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Supplementary Material
Right arrow All Versions of this Article:
8/2/485    most recent
kxl042v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Disclaimer
Google Scholar
Right arrow Articles by Carvalho, B.
Right arrow Articles by Irizarry, R. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Carvalho, B.
Right arrow Articles by Irizarry, R. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2006. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org.

Exploration, normalization, and genotype calls of high-density oligonucleotide SNP array data

Benilton Carvalho

Department of Biostatistics, Johns Hopkins University, Baltimore, MD 21205, USA

Henrik Bengtsson

Department of Statistics, University of California, Berkeley, CA, USA

Terence P. Speed

Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute, Melbourne, Australia and Department of Statistics, University of California, Berkeley, CA, USA

Rafael A. Irizarry*

Department of Biostatistics, Johns Hopkins University, Baltimore, MD 21205, USA ririzarr{at}jhsph.edu

* To whom correspondence should be addressed.

In most microarray technologies, a number of critical steps are required to convert raw intensity measurements into the data relied upon by data analysts, biologists, and clinicians. These data manipulations, referred to as preprocessing, can influence the quality of the ultimate measurements. In the last few years, the high-throughput measurement of gene expression is the most popular application of microarray technology. For this application, various groups have demonstrated that the use of modern statistical methodology can substantially improve accuracy and precision of the gene expression measurements, relative to ad hoc procedures introduced by designers and manufacturers of the technology. Currently, other applications of microarrays are becoming more and more popular. In this paper, we describe a preprocessing methodology for a technology designed for the identification of DNA sequence variants in specific genes or regions of the human genome that are associated with phenotypes of interest such as disease. In particular, we describe a methodology useful for preprocessing Affymetrix single-nucleotide polymorphism chips and obtaining genotype calls with the preprocessed data. We demonstrate how our procedure improves existing approaches using data from 3 relatively large studies including the one in which large numbers of independent calls are available. The proposed methods are implemented in the package oligo available from Bioconductor.

Keywords: Affymetrix; Genotyping; High-throughput; Microarrays

Received June 27, 2006; revised September 18, 2006; accepted for publication October 12, 2006.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
M. Xie, J. Wang, and J. Chen
A model of higher accuracy for the individual haplotyping problem based on weighted SNP fragments and genotype with errors
Bioinformatics, July 1, 2008; 24(13): i105 - i113.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
R. F. Thompson, M. Reimers, B. Khulan, M. Gissot, T. A. Richmond, Q. Chen, X. Zheng, K. Kim, and J. M. Greally
An analytical pipeline for genomic representations used for cytosine methylation studies
Bioinformatics, May 1, 2008; 24(9): 1161 - 1167.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. Bengtsson, R. Irizarry, B. Carvalho, and T. P. Speed
Estimation and assessment of raw copy numbers at the single locus level
Bioinformatics, March 15, 2008; 24(6): 759 - 767.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Y. Y. Teo, M. Inouye, K. S. Small, R. Gwilliam, P. Deloukas, D. P. Kwiatkowski, and T. G. Clark
A genotype calling algorithm for the Illumina BeadArray platform
Bioinformatics, October 15, 2007; 23(20): 2741 - 2746.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.