Denoising array-based comparative genomic hybridization data using wavelets
Biostatistics Program, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue N, M2-B500, Seattle, WA 98109, USA
Biostatistics Program, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue N, PO Box 19024, Seattle, WA 98109, USA
Department of Biostatistics, University of Washington, Box 357232, Seattle, WA 98195, USA
Department of Biomedical Informatics, Columbia University, 622 West 168th Street, New York, NY 10032, USA
Genomics, Shared Resources, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue N, DE-740, Seattle, WA 98109, USA
Human Biology Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue N, C1-015, Seattle, WA 98109, USA
* To whom correspondence should be addressed. lih{at}fhcrc.org
Array-based comparative genomic hybridization (array-CGH) provides a high-throughput, high-resolution method to measure relative changes in DNA copy number simultaneously at thousands of genomic loci. Typically, these measurements are reported and displayed linearly on chromosome maps, and gains and losses are detected as deviations from normal diploid cells. We propose that one may consider denoising the data to uncover the true copy number changes before drawing inferences on the patterns of aberrations in the samples. Nonparametric techniques are particularly suitable for data denoising as they do not impose a parametric model in finding structures in the data. In this paper, we employ wavelets to denoise the data as wavelets have sound theoretical properties and a fast computational algorithm, and are particularly well suited for handling the abrupt changes seen in array-CGH data. A simulation study shows that denoising data prior to testing can achieve greater power in detecting the aberrant spot than using the raw data without denoising. Finally, we illustrate the method on two array-CGH data sets.
Keywords: Array-CGH; Denoising; Hidden Markov models; Nonparametric modeling; Threshold; Wavelet analysis
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. Liu, N. Bandyopadhyay, S. Ranka, M. Baudis, and T. Kahveci Inferring progression models for CGH data Bioinformatics, September 1, 2009; 25(17): 2208 - 2215. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. G. Paulson, C. C. Maley, X. Li, H. Li, C. A. Sanchez, D. L. Chao, R. D. Odze, T. L. Vaughan, P. L. Blount, and B. J. Reid Chromosomal Instability and Copy Number Alterations in Barrett's Esophagus and Esophageal Adenocarcinoma Clin. Cancer Res., May 15, 2009; 15(10): 3305 - 3314. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Budinska, E. Gelnarova, and M. G. Schimek MSMAD: a computationally efficient method for the analysis of noisy array CGH data Bioinformatics, March 15, 2009; 25(6): 703 - 713. [Abstract] [Full Text] [PDF] |
||||
![]() |
L.-y. Wang, A. Abyzov, J. O. Korbel, M. Snyder, and M. Gerstein MSB: A mean-shift-based approach for the analysis of structural variation in the genome Genome Res., January 1, 2009; 19(1): 106 - 117. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Li, P. C. Galipeau, C. A. Sanchez, P. L. Blount, C. C. Maley, J. Arnaudo, D. A. Peiffer, D. Pokholok, K. L. Gunderson, and B. J. Reid Single Nucleotide Polymorphism-Based Genome-Wide Chromosome Copy Change, Loss of Heterozygosity, and Aneuploidy in Barrett's Esophagus Neoplastic Progression Cancer Prevention Research, November 1, 2008; 1(6): 413 - 423. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Erdman and J. W. Emerson A fast Bayesian change point analysis for the segmentation of microarray data Bioinformatics, October 1, 2008; 24(19): 2143 - 2148. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Ben-Yaacov and Y. C. Eldar A fast and flexible method for the segmentation of aCGH data Bioinformatics, August 15, 2008; 24(16): i139 - i145. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. L. Lai, H. Xing, and N. Zhang Stochastic segmentation models for array-based comparative genomic hybridization data analysis Biostat., April 1, 2008; 9(2): 290 - 307. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Andersson, C. E. G. Bruder, A. Piotrowski, U. Menzel, H. Nord, J. Sandgren, T. R. Hvidsten, T. Diaz de Stahl, J. P. Dumanski, and J. Komorowski A segmental maximum a posteriori approach to genome-wide copy number profiling Bioinformatics, March 15, 2008; 24(6): 751 - 758. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Pique-Regi, J. Monso-Varona, A. Ortega, R. C. Seeger, T. J. Triche, and S. Asgharzadeh Sparse representation and Bayesian detection of genome copy number alterations from microarray data Bioinformatics, February 1, 2008; 24(3): 309 - 318. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Li and J. Zhu Analysis of array CGH data for cancer studies using fused quantile regression Bioinformatics, September 15, 2007; 23(18): 2470 - 2476. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Huang, A. Gusnanto, K. O'Sullivan, J. Staaf, A. Borg, and Y. Pawitan Robust smooth segmentation approach for array CGH data analysis Bioinformatics, September 15, 2007; 23(18): 2463 - 2469. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Diaz-Uriarte, A. Alibes, E. R. Morrissey, A. Canada, O. M. Rueda, and M. L. Neves Asterias: integrated analysis of expression and aCGH data using an open-source, web-based, parallelized software suite Nucleic Acids Res., July 13, 2007; 35(suppl_2): W75 - W80. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Xing, C. M. T. Greenwood, and S. B. Bull A hierarchical clustering method for estimating copy number variation Biostat., July 1, 2007; 8(3): 632 - 653. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Stjernqvist, T. Ryden, M. Skold, and J. Staaf Continuous-index hidden Markov modelling of array CGH copy number data Bioinformatics, April 15, 2007; 23(8): 1006 - 1014. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Y. Hehir-Kwa, M. Egmont-Petersen, I. M. Janssen, D. Smeets, A. G. van Kessel, and J. A. Veltman Genome-wide Copy Number Profiling on High-density Bacterial Artificial Chromosomes, Single-nucleotide Polymorphisms, and Oligonucleotide Microarrays: A Platform Comparison based on Statistical Power Analysis DNA Res, March 15, 2007; (2007) dsm002v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hu, J.-B. Gao, Y. Cao, E. Bottinger, and W. Zhang Exploiting noise in array CGH data to improve detection of DNA copy number change Nucleic Acids Res., March 12, 2007; 35(5): e35 - e35. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Willenbrock and J. Fridlyand A comparison study: applying segmentation to array CGH data for downstream analyses Bioinformatics, November 15, 2005; 21(22): 4084 - 4091. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. R. Lai, M. D. Johnson, R. Kucherlapati, and P. J. Park Comparative analysis of algorithms for identifying amplifications and deletions in array CGH data Bioinformatics, October 1, 2005; 21(19): 3763 - 3770. [Abstract] [Full Text] [PDF] |
||||






