Skip Navigation



Biostatistics Advance Access published online on January 20, 2006

Biostatistics, doi:10.1093/biostatistics/kxj018
This Article
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
7/3/456    most recent
kxj018v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Disclaimer
Google Scholar
Right arrow Articles by Janes, H.
Right arrow Articles by Pepe, M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Janes, H.
Right arrow Articles by Pepe, M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2006. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org.
Received March 2, 2005
Revised January 17, 2006
Accepted January 17, 2006

Article

The optimal ratio of cases to controls for estimating the classification accuracy of a biomarker

Holly Janes 1 * and Margaret Pepe 2

1 Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205, U.S.A.
2 Department of Biostatistics, University of Washington, Seattle, WA 98195, U.S.A.; Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA 98195, U.S.A.

* To whom correspondence should be addressed.
Holly Janes, E-mail: hjanes{at}jhsph.edu


   Abstract

The case-control design is frequently used to study the discriminatory accuracy of a screening or diagnostic biomarker. Yet, the appropriate ratio in which to sample cases and controls has never been determined. It is common for researchers to sample equal numbers of cases and controls, a strategy that can be optimal for studies of association. However, considerations are quite different when the biomarker is to be used for classification. In this paper, we provide an expression for the optimal case-control ratio, when the accuracy of the biomarker is quantified by the ROC curve. We show how it can be integrated with choosing the overall sample size to yield an efficient study design with specified power and type-I error. We also derive the optimal case-control ratios for estimating the area under the ROC curve and the area under part of the ROC curve. Our methods are applied to a study of a new marker for adenocarcinoma in patients with Barrett's esophagus.

Keywords: sensitivity; specificity; ROC curve; efficiency; sample size; case-control design; power.
Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
J. Clin. Pathol.Home page
K. Soreide
Receiver-operating characteristic curve analysis in diagnostic, prognostic and predictive biomarker research
J. Clin. Pathol., January 1, 2009; 62(1): 1 - 5.
[Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.