Biostatistics Advance Access originally published online on December 15, 2008
Biostatistics 2009 10(2):297-309; doi:10.1093/biostatistics/kxn036
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Optimal multistage designs—a general framework for efficient genome-wide association studies
Institute of Medical Biometry and Epidemiology, Philipps University, Marburg, Germany muellerh{at}med.uni-marburg.de
Genome-wide association studies (GWAS) have become increasingly affordable but they are still costly. Therefore, cost saving 2-stage designs were proposed in the literature. The restriction to 2 stages, however, seems artificial and does not exploit the full potential of the underlying methods. We extend the 2-stage approach to the general framework of any number of stages. Based on the theory of group sequential methods, we derive optimal multistage designs. With current genotyping cost structures, our results suggest that up to 4 stages are sufficient in order to get feasible and efficient designs. Furthermore, we consider the problem of choosing the optimal number of stages depending on the costs of the statistical interim analysis at each stage and provide guidelines for planning the number of stages in practice. In particular, we found that in the majority of cases both 3-stage designs and 4-stage designs are more efficient than 2-stage designs. Although prices for marker panels are showing a continuing downward trend, we still recommend implementing and using optimal multistage designs in practice. In addition to the immediate benefit, it will be necessary to acquire know-how regarding the application of multistage designs in order to be able to adapt the general framework of multistage designs to upcoming technologies in the area of GWAS.
Keywords: Group sequential methods; Maximal power; Minimal study costs; Optimal 2-stage design
* To whom correspondence should be addressed.
Received November 30, 2007; revised August 22, 2008; accepted for publication September 16, 2008.