Biostatistics Advance Access originally published online on November 3, 2006
Biostatistics 2007 8(2):474-484; doi:10.1093/biostatistics/kxl038
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Insights into latent class analysis of diagnostic test performance
Department of Biostatistics, University of Washington and Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, M2-B500, Seattle, WA 98109, USA mspepe{at}u.washington.edu
Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205, USA
* To whom correspondence should be addressed.
Latent class analysis is used to assess diagnostic test accuracy when a gold standard assessment of disease is not available but results of multiple imperfect tests are. We consider the simplest setting, where 3 tests are observed and conditional independence (CI) is assumed. Closed-form expressions for maximum likelihood parameter estimates are derived. They show explicitly how observed 2- and 3-way associations between test results are used to infer disease prevalence and test true- and false-positive rates. Although interesting and reasonable under CI, the estimators clearly have no basis when it fails. Intuition for bias induced by conditional dependence follows from the analytic expressions. Further intuition derives from an Expectation Maximization (EM) approach to calculating the estimates. We discuss implications of our results and related work for settings where more than 3 tests are available. We conclude that careful justification of assumptions about the dependence between tests in diseased and nondiseased subjects is necessary in order to ensure unbiased estimates of prevalence and test operating characteristics and to provide these estimates clinical interpretations. Such justification must be based in part on a clear clinical definition of disease and biological knowledge about mechanisms giving rise to test results.
Keywords: Errors in variables; Factor analysis; Imperfect reference test; Item response theory; Latent variables; Sensitivity; Specificity
Received June 29, 2006; revised October 18, 2006; accepted for publication October 27, 2006.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
H. Chu, S. R. Cole, Y. Wei, and J. G. Ibrahim Estimation and inference for case-control studies with multiple non-gold standard exposure assessments: with an occupational health application Biostat., October 1, 2009; 10(4): 591 - 602. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Soreide Receiver-operating characteristic curve analysis in diagnostic, prognostic and predictive biomarker research J. Clin. Pathol., January 1, 2009; 62(1): 1 - 5. [Full Text] [PDF] |
||||
![]() |
A. K. Formann and D. Bohning Letter to the editor Biostat., October 1, 2008; 9(4): 777 - 778. [Full Text] [PDF] |
||||

