Skip Navigation


Biostatistics Advance Access originally published online on August 23, 2006
Biostatistics 2007 8(2):414-432; doi:10.1093/biostatistics/kxl019
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary Material
Right arrow All Versions of this Article:
8/2/414    most recent
kxl019v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Disclaimer
Google Scholar
Right arrow Articles by Storey, J. D.
Right arrow Articles by Leek, J. T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Storey, J. D.
Right arrow Articles by Leek, J. T.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2006. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org.

The optimal discovery procedure for large-scale significance testing, with applications to comparative microarray experiments

John D. Storey*, James Y. Dai and Jeffrey T. Leek

Department of Biostatistics, University of Washington, Seattle, Washington, 98195, USA jstorey{at}u.washington.edu

* To whom correspondence should be addressed.

As much of the focus of genetics and molecular biology has shifted toward the systems level, it has become increasingly important to accurately extract biologically relevant signal from thousands of related measurements. The common property among these high-dimensional biological studies is that the measured features have a rich and largely unknown underlying structure. One example of much recent interest is identifying differentially expressed genes in comparative microarray experiments. We propose a new approach aimed at optimally performing many hypothesis tests in a high-dimensional study. This approach estimates the optimal discovery procedure (ODP), which has recently been introduced and theoretically shown to optimally perform multiple significance tests. Whereas existing procedures essentially use data from only one feature at a time, the ODP approach uses the relevant information from the entire data set when testing each feature. In particular, we propose a generally applicable estimate of the ODP for identifying differentially expressed genes in microarray experiments. This microarray method consistently shows favorable performance over five highly used existing methods. For example, in testing for differential expression between two breast cancer tumor types, the ODP provides increases from 72% to 185% in the number of genes called significant at a false discovery rate of 3%. Our proposed microarray method is freely available to academic users in the open-source, point-and-click EDGE software package.

Keywords: Differential expression; Multiple hypothesis testing; q-value; Systems biology

Received December 20, 2005; revised April 16, 2006; revised August 4, 2006; accepted for publication August 22, 2006.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Stat Methods Med ResHome page
D. M Witten and R. Tibshirani
Survival analysis with high-dimensional covariates
Statistical Methods in Medical Research, February 1, 2010; 19(1): 29 - 51.
[Abstract] [PDF]


Home page
J. Bacteriol.Home page
A. E. Cozen, M. T. Weirauch, K. S. Pollard, D. L. Bernick, J. M. Stuart, and T. M. Lowe
Transcriptional Map of Respiratory Versatility in the Hyperthermophilic Crenarchaeon Pyrobaculum aerophilum
J. Bacteriol., February 1, 2009; 191(3): 782 - 794.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
L. E. Bauman, J. S. Sinsheimer, E. M. Sobel, and K. Lange
Mixed Effects Models for Quantitative Trait Loci Mapping With Inbred Strains
Genetics, November 1, 2008; 180(3): 1743 - 1761.
[Abstract] [Full Text] [PDF]


Home page
JDRHome page
M. Handfield, H.V. Baker, and R.J. Lamont
Beyond Good and Evil in the Oral Cavity: Insights into Host-Microbe Relationships Derived from Transcriptional Profiling of Gingival Cells
Journal of Dental Research, March 1, 2008; 87(3): 203 - 223.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Y. Lai
Genome-wide co-expression based prediction of differential expressions
Bioinformatics, March 1, 2008; 24(5): 666 - 673.
[Abstract] [Full Text] [PDF]


Home page
DiabetesHome page
P. White, C. Lee May, R. N. Lamounier, J. E. Brestelli, and K. H. Kaestner
Defining Pancreatic Endocrine Precursors and Their Descendants
Diabetes, March 1, 2008; 57(3): 654 - 668.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
M. Le Gac, M. D. Brazas, M. Bertrand, J. G. Tyerman, C. C. Spencer, R. E. W. Hancock, and M. Doebeli
Metabolic Changes Associated With Adaptive Diversification in Escherichia coli
Genetics, February 1, 2008; 178(2): 1049 - 1060.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.