Biostatistics Advance Access originally published online on April 21, 2006
Biostatistics 2007 8(1):118-127; doi:10.1093/biostatistics/kxj037

© The Author 2006. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org.

Adjusting batch effects in microarray expression data using empirical Bayes methods

W. Evan Johnson and Cheng Li*

Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, USA and Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA cli@hsph.harvard.edu

Ariel Rabinovic

Department of Genetics and Complex Diseases, Harvard School of Public Health, Boston, MA, USA

* To whom correspondence should be addressed.

Non-biological experimental variation, or "batch effects", is commonly observed across multiple batches of microarray experiments, often making it difficult to combine data from these batches. The ability to combine microarray data sets is advantageous to researchers because it increases statistical power to detect biological phenomena in studies where logistical considerations restrict sample size or in studies that require the sequential hybridization of arrays. In general, it is inappropriate to combine data sets without adjusting for batch effects. Methods have been proposed to filter batch effects from data, but these are often complicated and require large batch sizes (Formula) to implement. Because the majority of microarray studies are conducted using much smaller sample sizes, existing methods are not sufficient. We propose parametric and non-parametric empirical Bayes frameworks for adjusting data for batch effects that are robust to outliers in small sample sizes and perform comparably to existing methods for large samples. We illustrate our methods using two example data sets and show that they are justifiable, easy to apply, and useful in practice. Software for our method is freely available at http://biosun1.harvard.edu/complab/batch/.

Keywords: Batch effects; Empirical Bayes; Microarrays; Monte Carlo
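
The abstract does not spell out the model, so the following is only a rough illustration of what empirical Bayes shrinkage of batch effects can look like, not the authors' software. The Python sketch assumes a location-only batch shift per gene, a normal prior on those shifts within each batch, and method-of-moments estimates of the prior mean and variance taken across genes. The function name `eb_location_adjust` and all variable names are hypothetical.

```python
import numpy as np

def eb_location_adjust(data, batches):
    """Shrink per-gene batch shifts toward the batch-wide mean, then remove them.

    data    : (genes, samples) array of log-expression values
    batches : length-samples sequence of batch labels
    Assumes at least two genes and at least two samples per batch.
    """
    data = np.asarray(data, dtype=float)
    batches = np.asarray(batches)
    labels = np.unique(batches)

    # Per-gene grand mean: the average of the batch means, so batches count equally.
    batch_means = np.stack(
        [data[:, batches == b].mean(axis=1) for b in labels], axis=1
    )
    grand_mean = batch_means.mean(axis=1, keepdims=True)

    # Pooled within-batch standard deviation per gene.
    resid = data.copy()
    for j, b in enumerate(labels):
        resid[:, batches == b] -= batch_means[:, [j]]
    dof = data.shape[1] - len(labels)
    pooled_sd = np.sqrt((resid ** 2).sum(axis=1, keepdims=True) / dof)

    # Standardize so that batch shifts are comparable across genes.
    z = (data - grand_mean) / pooled_sd
    adjusted = np.empty_like(z)
    for b in labels:
        cols = batches == b
        n = cols.sum()
        gamma_hat = z[:, cols].mean(axis=1)          # raw per-gene batch shift
        delta2_hat = z[:, cols].var(axis=1, ddof=1)  # per-gene variance in this batch
        gamma_bar = gamma_hat.mean()                 # prior mean (assumed normal prior)
        tau2 = gamma_hat.var(ddof=1)                 # prior variance, moment estimate
        # Posterior mean under the assumed normal-normal model: a precision-weighted
        # compromise between the gene's own estimate and the batch-wide mean.
        gamma_star = (n * tau2 * gamma_hat + delta2_hat * gamma_bar) / (
            n * tau2 + delta2_hat
        )
        adjusted[:, cols] = z[:, cols] - gamma_star[:, None]

    # Return to the original scale.
    return adjusted * pooled_sd + grand_mean
```

With few samples per batch, the per-gene variance term dominates the weight, so a noisy per-gene estimate is pulled strongly toward the batch-wide mean. This pooling of information across genes is the general idea behind robustness in small batches, under the assumptions stated above.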


This article has been cited by other articles:


A. V. Rao, P. J. M. Valk, K. H. Metzeler, C. R. Acharya, S. A. Tuchman, M. M. Stevenson, D. A. Rizzieri, R. Delwel, C. Buske, S. K. Bohlander, et al.
Age-Specific Differences in Oncogenic Pathway Dysregulation and Anthracycline Sensitivity in Patients With Acute Myeloid Leukemia
J. Clin. Oncol., November 20, 2009; 27(33): 5580-5586.


D. R. Friedman, J. B. Weinberg, W. T. Barry, B. K. Goodman, A. D. Volkheimer, K. M. Bond, Y. Chen, N. Jiang, J. O. Moore, J. P. Gockerman, et al.
A Genomic Approach to Improve Prognosis and Predict Therapeutic Response in Chronic Lymphocytic Leukemia
Clin. Cancer Res., November 15, 2009; 15(22): 6947-6955.


S. L. Vidal-Cardenas and C. W. Greider
Comparing effects of mTR and mTERT deletion on gene expression and DNA damage response: a critical examination of telomere length maintenance-independent roles of telomerase
Nucleic Acids Res., October 22, 2009; (2009) gkp855v1.


A. H. Sims
Bioinformatics and breast cancer: what can high-throughput genomic approaches actually tell us?
J. Clin. Pathol., October 1, 2009; 62(10): 879-885.


A. Anguiano, S. A. Tuchman, C. Acharya, K. Salter, C. Gasparetto, F. Zhan, M. Dhodapkar, J. Nevins, B. Barlogie, J. D. Shaughnessy Jr, et al.
Gene Expression Profiles of Tumor Biology Provide a Novel Approach to Prognosis and May Guide the Selection of Therapeutic Targets in Multiple Myeloma
J. Clin. Oncol., September 1, 2009; 27(25): 4197-4203.


T. Zhang, J. G. Berrocal, K. M. Frizzell, M. J. Gamble, M. E. DuMond, R. Krishnakumar, T. Yang, A. A. Sauve, and W. L. Kraus
Enzymes in the NAD+ Salvage Pathway Regulate SIRT1 Activity at Target Gene Promoters
J. Biol. Chem., July 24, 2009; 284(30): 20408-20417.


O. Dakhova, M. Ozen, C. J. Creighton, R. Li, G. Ayala, D. Rowley, and M. Ittmann
Global Gene Expression Analysis of Reactive Stroma in Prostate Cancer
Clin. Cancer Res., June 15, 2009; 15(12): 3979-3989.


A. Berchuck, E. S. Iversen, J. Luo, J. P. Clarke, H. Horne, D. A. Levine, J. Boyd, M. A. Alonso, A. A. Secord, M. Q. Bernardini, et al.
Microarray Analysis of Early Stage Serous Ovarian Cancers Shows Profiles Predictive of Favorable Outcome
Clin. Cancer Res., April 1, 2009; 15(7): 2448-2455.


K. S. Garman, C. R. Acharya, E. Edelman, M. Grade, J. Gaedcke, S. Sud, W. Barry, A. M. Diehl, D. Provenzale, G. S. Ginsburg, et al.
A genomic approach to colon cancer risk stratification yields biologic insights into therapeutic opportunities
PNAS, December 9, 2008; 105(49): 19432-19437.


H. M. Kang, C. Ye, and E. Eskin
Accurate Discovery of Expression Quantitative Trait Loci Under Confounding From Spurious and Genuine Regulatory Hotspots
Genetics, December 1, 2008; 180(4): 1909-1925.


K. Owzar, W. T. Barry, S.-H. Jung, I. Sohn, and S. L. George
Statistical Challenges in Preprocessing in Microarray Experiments in Cancer
Clin. Cancer Res., October 1, 2008; 14(19): 5959-5966.


J. C. Cushman, R. L. Tillett, J. A. Wood, J. M. Branco, and K. A. Schlauch
Large-scale mRNA expression profiling in the common ice plant, Mesembryanthemum crystallinum, performing C3 photosynthesis and Crassulacean acid metabolism (CAM)
J. Exp. Bot., May 1, 2008; 59(7): 1875-1894.


A. A. Shabalin, H. Tjelmeland, C. Fan, C. M. Perou, and A. B. Nobel
Merging two gene-expression studies via cross-platform normalization
Bioinformatics, May 1, 2008; 24(9): 1154-1160.


C. R. Acharya, D. S. Hsu, C. K. Anders, A. Anguiano, K. H. Salter, K. S. Walters, R. C. Redman, S. A. Tuchman, C. A. Moylan, S. Mukherjee, et al.
Gene Expression Signatures, Clinicopathological Features, and Individualized Therapy in Breast Cancer
JAMA, April 2, 2008; 299(13): 1574-1587.


B. S. Abrahams, D. Tentler, J. V. Perederiy, M. C. Oldham, G. Coppola, and D. H. Geschwind
Genome-wide analyses of human perisylvian cerebral cortical patterning
PNAS, November 6, 2007; 104(45): 17849-17854.


