Biostatistics Advance Access published online on August 23, 2006
Biostatistics, doi:10.1093/biostatistics/kxl019
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Department of Biostatistics, University of Washington
* To whom correspondence should be addressed. As much of the focus of genetics and molecular biology has shifted toward the systems level, it has become increasingly important to accurately extract biologically relevant signal from thousands of related measurements. The common property among these high-dimensional biological studies is that the measured features have a rich and largely unknown underlying structure. One example of much recent interest is identifying differentially expressed genes in comparative microarray experiments. We propose a new approach aimed at optimally performing many hypothesis tests in a high-dimensional study. This approach estimates the Optimal Discovery Procedure (ODP), which has recently been introduced and theoretically shown to optimally perform multiple significance tests. Whereas existing procedures essentially use data from only one feature at a time, the ODP approach uses the relevant information from the entire data set when testing each feature. In particular, we propose a generally applicable estimate of the ODP for identifying differentially expressed genes in microarray experiments. This microarray method consistently shows favorable performance over five highly-used existing methods. For example, in testing for differential expression between two breast cancer tumor types, the ODP provides increases from 72% to 185% in the number of genes called significant at a false discovery rate of 3%. Our proposed microarray method is freely available to academic users in the open-source, point-and-click EDGE software package.
Received December 20, 2005
Revised August 4, 2006
Accepted August 22, 2006
Article
The optimal discovery procedure for large-scale significance testing, with applications to comparative microarray experiments
John D. Storey 1 *, James Y. Dai 1, and Jeffrey T. Leek 1
John D. Storey, E-mail: jstorey{at}u.washington.edu
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. E. Cozen, M. T. Weirauch, K. S. Pollard, D. L. Bernick, J. M. Stuart, and T. M. Lowe Transcriptional Map of Respiratory Versatility in the Hyperthermophilic Crenarchaeon Pyrobaculum aerophilum J. Bacteriol., February 1, 2009; 191(3): 782 - 794. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. E. Bauman, J. S. Sinsheimer, E. M. Sobel, and K. Lange Mixed Effects Models for Quantitative Trait Loci Mapping With Inbred Strains Genetics, November 1, 2008; 180(3): 1743 - 1761. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Handfield, H.V. Baker, and R.J. Lamont Beyond Good and Evil in the Oral Cavity: Insights into Host-Microbe Relationships Derived from Transcriptional Profiling of Gingival Cells Journal of Dental Research, March 1, 2008; 87(3): 203 - 223. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Lai Genome-wide co-expression based prediction of differential expressions Bioinformatics, March 1, 2008; 24(5): 666 - 673. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. White, C. Lee May, R. N. Lamounier, J. E. Brestelli, and K. H. Kaestner Defining Pancreatic Endocrine Precursors and Their Descendants Diabetes, March 1, 2008; 57(3): 654 - 668. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Le Gac, M. D. Brazas, M. Bertrand, J. G. Tyerman, C. C. Spencer, R. E. W. Hancock, and M. Doebeli Metabolic Changes Associated With Adaptive Diversification in Escherichia coli Genetics, February 1, 2008; 178(2): 1049 - 1060. [Abstract] [Full Text] [PDF] |
||||




