Skip Navigation



Biostatistics Advance Access published online on December 22, 2006

Biostatistics, doi:10.1093/biostatistics/kxl042
This Article
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary Material
Right arrow All Versions of this Article:
8/2/485    most recent
kxl042v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Disclaimer
Google Scholar
Right arrow Articles by Carvalho, B.
Right arrow Articles by Irizarry, R. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Carvalho, B.
Right arrow Articles by Irizarry, R. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2006. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org

Exploration, Normalization, and Genotype Calls of High Density Oligonucleotide SNP Array Data

Benilton Carvalho

Department of Biostatistics, Johns Hopkins University, Baltimore MD 21205

Henrik Bengtsson

Department of Statistics, UC Berkeley, CA

Terence P. Speed

Division of Genetics and Bioinformatics, WEHI, Melbourne, Australia
Department of Statistics, UC Berkeley, CA

Rafael A. Irizarry*

Department of Biostatistics, Johns Hopkins University, Baltimore MD 21205, rafa{at}jhu.edu

* To whom correspondence should be addressed

In most microarray technologies, a number of critical steps are required to convert raw intensity measurements into the data relied upon by data analysts, biologists and clinicians. These data manipulations, referred to as preprocessing, can influence the quality of the ultimate measurements. In the last few years, the high-throughput measurement of gene expression is the most popular application of microarray technology. For this application, various groups have demonstrated that the use of modern statistical methodology can substantially improve accuracy and precision of gene expression measurements, relative to ad-hoc procedures introduced by designers and manufacturers of the technology. Currently, other applications of microarrays are becoming more and more popular. In this paper we describe a preprocessing methodology for a technology designed for the identification of DNA sequence variants in specific genes or regions of the human genome that are associated with phenotypes of interest such as disease. In particular we describe methodology useful for preprocessing Affymetrix SNP chips and obtaining genotype calls with the preprocessed data. We demonstrate how our procedure improves existing approaches using data from three relatively large studies including one in which large numbers of independent calls are available. The proposed methods are implemented in the package oligo available from Bioconductor.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BiostatisticsHome page
C. D. Greenman, G. Bignell, A. Butler, S. Edkins, J. Hinton, D. Beare, S. Swamy, T. Santarius, L. Chen, S. Widaa, et al.
PICNIC: an algorithm to predict absolute allelic copy number variation with microarray cancer data
Biostat., October 15, 2009; (2009) kxp045v1.
[Abstract] [Full Text] [PDF]


Home page
Am J EpidemiolHome page
T. J. Jorgensen, I. Ruczinski, B. Kessing, M. W. Smith, Y. Y. Shugart, and A. J. Alberg
Hypothesis-Driven Candidate Gene Association Studies: Practical Design and Analytical Considerations
Am. J. Epidemiol., October 15, 2009; 170(8): 986 - 993.
[Abstract] [Full Text] [PDF]


Home page
BiostatisticsHome page
Y. Y. Teo
Exploratory data analysis in large-scale genetic studies
Biostat., October 14, 2009; (2009) kxp038v1.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. E. Ritchie, B. S. Carvalho, K. N. Hetrick, S. Tavare, and R. A. Irizarry
R/Bioconductor software for Illumina's Infinium whole-genome genotyping BeadChips
Bioinformatics, October 1, 2009; 25(19): 2621 - 2623.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
L. Wan, K. Sun, Q. Ding, Y. Cui, M. Li, Y. Wen, R. C. Elston, M. Qian, and W. J Fu
Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation
Nucleic Acids Res., September 1, 2009; 37(17): e117 - e117.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. Bengtsson, P. Wirapati, and T. P. Speed
A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6
Bioinformatics, September 1, 2009; 25(17): 2149 - 2156.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. Bahlo and C. J. Bromhead
Generating linkage mapping files from Affymetrix SNP chip data
Bioinformatics, August 1, 2009; 25(15): 1961 - 1962.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Y. Lin, G. C. Tseng, S. Y. Cheong, L. J. H. Bean, S. L. Sherman, and E. Feingold
Smarter clustering methods for SNP genotype calling
Bioinformatics, December 1, 2008; 24(23): 2665 - 2671.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. Xie, J. Wang, and J. Chen
A model of higher accuracy for the individual haplotyping problem based on weighted SNP fragments and genotype with errors
Bioinformatics, July 1, 2008; 24(13): i105 - i113.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
R. F. Thompson, M. Reimers, B. Khulan, M. Gissot, T. A. Richmond, Q. Chen, X. Zheng, K. Kim, and J. M. Greally
An analytical pipeline for genomic representations used for cytosine methylation studies
Bioinformatics, May 1, 2008; 24(9): 1161 - 1167.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. Bengtsson, R. Irizarry, B. Carvalho, and T. P. Speed
Estimation and assessment of raw copy numbers at the single locus level
Bioinformatics, March 15, 2008; 24(6): 759 - 767.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Y. Y. Teo, M. Inouye, K. S. Small, R. Gwilliam, P. Deloukas, D. P. Kwiatkowski, and T. G. Clark
A genotype calling algorithm for the Illumina BeadArray platform
Bioinformatics, October 15, 2007; 23(20): 2741 - 2746.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.