Biostatistics 4:249-264 (2003)
© 2003 Oxford University Press
Exploration, normalization, and summaries of high density oligonucleotide array probe level data
Department of Biostatistics, Johns Hopkins University, Baltimore MD 21205, USA rafa{at}jhu.edu
Division of Genetics and Bioinformatics, WEHI, Melbourne, Australia
Gene Logic Inc., Berkeley, CA, USA
Gene Logic Inc., Gaithersburg, MD, USA
Gene Logic Inc., Gaithersburg, MD, USA
Gene Logic Inc., Gaithersburg, MD, USA
Division of Genetics and Bioinformatics, WEHI, Melbourne, Australia. Department of Statistics, University of California at Berkeley
*To whom correspondence should be addressed
In this paper we report exploratory analyses of high-density oligonucleotide array data from the Affymetrix GeneChip® system with the objective of improving upon currently used measures of gene expression. Our analyses make use of three data sets: a small experimental study consisting of five MGU74A mouse GeneChip® arrays, part of the data from an extensive spike-in study conducted by Gene Logic and Wyeth's Genetics Institute involving 95 HG-U95A human GeneChip® arrays; and part of a dilution study conducted by Gene Logic involving 75 HG-U95A GeneChip® arrays. We display some familiar features of the perfect match and mismatch probe (PM and MM) values of these data, and examine the variancemean relationship with probe-level data from probes believed to be defective, and so delivering noise only. We explain why we need to normalize the arrays to one another using probe level intensities. We then examine the behavior of the PM and MM using spike-in data and assess three commonly used summary measures: Affymetrix's (i) average difference (AvDiff) and (ii) MAS 5.0 signal, and (iii) the Li and Wong multiplicative model-based expression index (MBEI). The exploratory data analyses of the probe level data motivate a new summary measure that is a robust multi-array average (RMA) of background-adjusted, normalized, and log-transformed PM values. We evaluate the four expression summary measures using the dilution study data, assessing their behavior in terms of bias, variance and (for MBEI and RMA) model fit. Finally, we evaluate the algorithms in terms of their ability to detect known levels of differential expression using the spike-in data. We conclude that there is no obvious downside to using RMA and attaching a standard error (SE) to this quantity using a linear model which removes probe-specific affinities.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
H. Frank, N. Groger, M. Diener, C. Becker, T. Braun, and T. Boettger Lactaturia and Loss of Sodium-dependent Lactate Uptake in the Colon of SLC5A8-deficient Mice J. Biol. Chem., September 5, 2008; 283(36): 24729 - 24737. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. F. Alice, H. Naka, and J. H. Crosa Global Gene Expression as a Function of the Iron Status of the Bacterial Cell: Influence of Differentially Expressed Genes in the Virulence of the Human Pathogen Vibrio vulnificus Infect. Immun., September 1, 2008; 76(9): 4019 - 4037. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. G. Lamarche, S.-H. Kim, S. Crepin, M. Mourez, N. Bertrand, R. E. Bishop, J. D. Dubreuil, and J. Harel Modulation of Hexa-Acyl Pyrophosphate Lipid A Population under Escherichia coli Phosphate (Pho) Regulon Activation J. Bacteriol., August 1, 2008; 190(15): 5256 - 5264. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Diez, C. Grijota-Martinez, P. Agretti, G. De Marco, M. Tonacchera, A. Pinchera, G. Morreale de Escobar, J. Bernal, and B. Morte Thyroid Hormone Action in the Adult Brain: Gene Expression Profiling of the Effects of Single and Multiple Doses of Triiodo-L-Thyronine in the Rat Striatum Endocrinology, August 1, 2008; 149(8): 3989 - 4000. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Purdom, K. M. Simpson, M. D. Robinson, J. G. Conboy, A. V. Lapuk, and T.P. Speed FIRMA: a method for detection of alternative splicing from exon array data Bioinformatics, August 1, 2008; 24(15): 1707 - 1714. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Thimon, E. Calvo, O. Koukoui, C. Legare, and R. Sullivan Effects of Vasectomy on Gene Expression Profiling along the Human Epididymis Biol Reprod, August 1, 2008; 79(2): 262 - 273. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Schrenk-Siemens, S. Perez-Alcala, J. Richter, E. Lacroix, J. Rahuel, M. Korte, U. Muller, Y.-A. Barde, and M. Bibel Embryonic Stem Cell-Derived Neurons as a Cellular System to Study Gene Function: Lack of Amyloid Precursor Proteins APP and APLP2 Leads to Defective Synaptic Transmission Stem Cells, August 1, 2008; 26(8): 2153 - 2163. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Wan, F. Luo, S. E. Wert, L. Zhang, Y. Xu, M. Ikegami, Y. Maeda, S. M. Bell, and J. A. Whitsett Kruppel-like factor 5 is required for perinatal lung morphogenesis and function Development, August 1, 2008; 135(15): 2563 - 2572. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. P. Haberman, H. J. Lee, C. Colantuoni, M. T. Koh, and M. Gallagher Rapid encoding of new information alters the profile of plasticity-related mRNA transcripts in the hippocampal CA3 region PNAS, July 29, 2008; 105(30): 10601 - 10606. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. D. Russell, P. L. Bhalla, and M. B. Singh Transcriptome-Based Examination of Putative Pollen Allergens of Rice (Oryza sativa ssp. japonica) Mol Plant, July 21, 2008; (2008) ssn036v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
Gene expression patterns in visual cortex during the critical period: Synaptic stabilization and reversal by visual deprivation PNAS, July 8, 2008; 105(27): 9409 - 9414. |
||||
![]() |
G. Huang, R. Eisenberg, M. Yan, S. Monti, E. Lawrence, P. Fu, J. Walbroehl, E. Lowenberg, T. Golub, J. Merchan, et al. 15-Hydroxyprostaglandin Dehydrogenase is a Target of Hepatocyte Nuclear Factor 3{beta} and a Tumor Suppressor in Lung Cancer Cancer Res., July 1, 2008; 68(13): 5040 - 5048. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. L. Kanies, J. J. Smith, C. Kis, C. Schmidt, S. Levy, K. S.A. Khabar, J. Morrow, N. Deane, D. A. Dixon, and R. D. Beauchamp Oncogenic Ras and Transforming Growth Factor-{beta} Synergistically Regulate AU-Rich Element-Containing mRNAs during Epithelial to Mesenchymal Transition Mol. Cancer Res., July 1, 2008; 6(7): 1124 - 1136. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. J. Chapman, G. Kelly, and M. A. Knowles Genes Involved in Differentiation, Stem Cell Renewal, and Tumorigenesis Are Modulated in Telomerase-Immortalized Human Urothelial Cells Mol. Cancer Res., July 1, 2008; 6(7): 1154 - 1168. [Abstract] [Full Text] [PDF] |
||||
![]() |
B.C. Koo, B.S. Bushman, and I.W. Mott Transcripts Associated with Non-Acclimated Freezing Response in Two Barley Cultivars The Plant Genome, July 1, 2008; 1(1): 21 - 32. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. L. Bayne, T. Forster, S. T. G. Burgess, M. Craigon, M. J. Walton, D. T. Baird, P. Ghazal, and R. A. Anderson Molecular Profiling of the Human Testis Reveals Stringent Pathway-Specific Regulation of RNA Expression Following Gonadotropin Suppression and Progestogen Treatment J Androl, July 1, 2008; 29(4): 389 - 403. [Abstract] [Full Text] [PDF] |
||||
![]() |
T.A. Knijnenburg, L.F.A. Wessels, and M.J.T. Reinders Combinatorial influence of environmental parameters on transcription factor activity Bioinformatics, July 1, 2008; 24(13): i172 - i181. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Srinivasasainagendra, G. P. Page, T. Mehta, I. Coulibaly, and A. E. Loraine CressExpress: A Tool For Large-Scale Mining of Expression Data from Arabidopsis Plant Physiology, July 1, 2008; 147(3): 1004 - 1016. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. C. Hassane, M. L. Guzman, C. Corbett, X. Li, R. Abboud, F. Young, J. L. Liesveld, M. Carroll, and C. T. Jordan Discovery of agents that eradicate leukemia stem cells using an in silico screen of public gene expression data Blood, June 15, 2008; 111(12): 5654 - 5662. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. D. Hoopfer, A. Penton, R. J. Watts, and L. Luo Genomic Analysis of Drosophila Neuronal Remodeling: A Role for the RNA-Binding Protein Boule as a Negative Regulator of Axon Pruning J. Neurosci., June 11, 2008; 28(24): 6092 - 6103. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Cvijanovich, T. P. Shanley, R. Lin, G. L. Allen, N. J. Thomas, P. Checchia, N. Anas, R. J. Freishtat, M. Monaco, K. Odoms, et al. Validating the genomic signature of pediatric septic shock Physiol Genomics, June 10, 2008; 34(1): 127 - 134. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Calderon-Vazquez, E. Ibarra-Laclette, J. Caballero-Perez, and L. Herrera-Estrella Transcript profiling of Zea mays roots reveals gene responses to phosphate deficiency at the plant- and species-specific levels J. Exp. Bot., June 6, 2008; (2008) ern115v2. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Rajasingh, E. Lambers, H. Hamada, E. Bord, T. Thorne, I. Goukassian, P. Krishnamurthy, K. M. Rosen, D. Ahluwalia, Y. Zhu, et al. Cell-Free Embryonic Stem Cell Extract-Mediated Derivation of Multipotent Stem Cells From NIH3T3 Fibroblasts for Functional and Anatomical Ischemic Tissue Repair Circ. Res., June 6, 2008; 102(11): e107 - e117. [Abstract] [Full Text] [PDF] |
||||
![]() |
L.-H. Ding, Y. Xie, S. Park, G. Xiao, and M. D. Story Enhanced identification and biological validation of differential gene expression via Illumina whole-genome expression arrays through the use of the model-based background correction methodology Nucleic Acids Res., June 1, 2008; 36(10): e58 - e58. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. D. Palmer, N. L. Barbosa-Morais, E. L. Gooding, B. Muralidhar, C. M. Thornton, M. R. Pett, I. Roberts, D. T. Schneider, N. Thorne, S. Tavare, et al. Pediatric Malignant Germ Cell Tumors Show Characteristic Transcriptome Profiles Cancer Res., June 1, 2008; 68(11): 4239 - 4247. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. W. Horne, S. van den Driesche, A. E. King, S. Burgess, M. Myers, H. Ludlow, P. Lourenco, P. Ghazal, A. R. Williams, H. O. D. Critchley, et al. Endometrial Inhibin/Activin {beta}-B Subunit Expression Is Related to Decidualization and Is Reduced in Tubal Ectopic Pregnancy J. Clin. Endocrinol. Metab., June 1, 2008; 93(6): 2375 - 2382. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Lopez-Martin, M. Becana, L. C. Romero, and C. Gotor Knocking Out Cytosolic Cysteine Synthesis Compromises the Antioxidant Capacity of the Cytosol to Maintain Discrete Concentrations of Hydrogen Peroxide in Arabidopsis Plant Physiology, June 1, 2008; 147(2): 562 - 572. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. J. Belmont, A. Tadimalla, W. J. Chen, J. J. Martindale, D. J. Thuerauf, M. Marcinko, N. Gude, M. A. Sussman, and C. C. Glembotski Coordination of Growth and Endoplasmic Reticulum Stress Signaling by Regulator of Calcineurin 1 (RCAN1), a Novel ATF6-inducible Gene J. Biol. Chem., May 16, 2008; 283(20): 14012 - 14021. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Ono, S. Suzuki, C. Furusawa, T. Agata, A. Kashiwagi, H. Shimizu, and T. Yomo An improved physico-chemical model of hybridization on high-density oligonucleotide microarrays Bioinformatics, May 15, 2008; 24(10): 1278 - 1285. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Sperandio, B. Regnault, J. Guo, Z. Zhang, S. L. Stanley Jr., P. J. Sansonetti, and T. Pedron Virulent Shigella flexneri subverts the host innate immune response through manipulation of antimicrobial peptide gene expression J. Exp. Med., May 12, 2008; 205(5): 1121 - 1132. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Grundberg, H. Brandstrom, K. C. L. Lam, S. Gurd, B. Ge, E. Harmsen, A. Kindmark, O. Ljunggren, H. Mallmin, O. Nilsson, et al. Systematic assessment of the human osteoblast transcriptome in resting and induced primary cells Physiol Genomics, May 9, 2008; 33(3): 301 - 311. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. T. Flowers, M. P. Keller, Y. Choi, H. Lan, C. Kendziorski, J. M. Ntambi, and A. D. Attie Liver gene expression analysis reveals endoplasmic reticulum stress and metabolic dysfunction in SCD1-deficient mice fed a very low-fat diet Physiol Genomics, May 9, 2008; 33(3): 361 - 372. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. J. Norton, D. E. Lou-Hing, A. A. Meharg, and A. H. Price Rice-arsenate interactions in hydroponics: whole genome transcriptional analysis J. Exp. Bot., May 2, 2008; (2008) ern097v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Wei, P. F. Kuan, S. Tian, C. Yang, J. Nie, S. Sengupta, V. Ruotti, G. A. Jonsdottir, S. Keles, J. A. Thomson, et al. A study of the relationships between oligonucleotide properties and hybridization signal intensities from NimbleGen microarray datasets Nucleic Acids Res., May 1, 2008; 36(9): 2926 - 2938. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. L. Bogan, M. J. Murphy, R. L. Stouffer, and J. D. Hennebold Systematic Determination of Differential Gene Expression in the Primate Corpus Luteum during the Luteal Phase of the Menstrual Cycle Mol. Endocrinol., May 1, 2008; 22(5): 1260 - 1273. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Pew, M. Zou, D. R. Brickley, and S. D. Conzen Glucocorticoid (GC)-Mediated Down-Regulation of Urokinase Plasminogen Activator Expression via the Serum and GC Regulated Kinase-1/Forkhead Box O3a Pathway Endocrinology, May 1, 2008; 149(5): 2637 - 2645. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Pachot, M.-A. Cazalis, F. Venet, F. Turrel, C. Faudot, N. Voirin, J. Diasparra, N. Bourgoin, F. Poitevin, B. Mougin, et al. Decreased Expression of the Fractalkine Receptor CX3CR1 on Circulating Monocytes as New Feature of Sepsis-Induced Immunosuppression J. Immunol., May 1, 2008; 180(9): 6421 - 6429. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Cushman, R. L. Tillett, J. A. Wood, J. M. Branco, and K. A. Schlauch Large-scale mRNA expression profiling in the common ice plant, Mesembryanthemum crystallinum, performing C3 photosynthesis and Crassulacean acid metabolism (CAM) J. Exp. Bot., May 1, 2008; 59(7): 1875 - 1894. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Li and H. Li Network-constrained regularization and variable selection for analysis of genomic data Bioinformatics, May 1, 2008; 24(9): 1175 - 1182. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. A. Shabalin, H. Tjelmeland, C. Fan, C. M. Perou, and A. B. Nobel Merging two gene-expression studies via cross-platform normalization Bioinformatics, May 1, 2008; 24(9): 1154 - 1160. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. F. Thompson, M. Reimers, B. Khulan, M. Gissot, T. A. Richmond, Q. Chen, X. Zheng, K. Kim, and J. M. Greally An analytical pipeline for genomic representations used for cytosine methylation studies Bioinformatics, May 1, 2008; 24(9): 1161 - 1167. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Dierlamm, E. M. Murga Penas, S. Bentink, S. Wessendorf, H. Berger, M. Hummel, W. Klapper, D. Lenze, A. Rosenwald, E. Haralambieva, et al. Gain of chromosome region 18q21 including the MALT1 gene is associated with the activated B-cell-like gene expression subtype and increased BCL2 gene dosage and protein expression in diffuse large B-cell lymphoma Haematologica, May 1, 2008; 93(5): 688 - 696. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Greco, A. Kotronen, J. Westerbacka, O. Puig, P. Arkkila, T. Kiviluoto, S. Laitinen, M. Kolak, R. M. Fisher, A. Hamsten, et al. Gene expression in human NAFLD Am J Physiol Gastrointest Liver Physiol, May 1, 2008; 294(5): G1281 - G1287. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Stoecklin, S. A. Tenenbaum, T. Mayo, S. V. Chittur, A. D. George, T. E. Baroni, P. J. Blackshear, and P. Anderson Genome-wide Analysis Identifies Interleukin-10 mRNA as Target of Tristetraprolin J. Biol. Chem., April 25, 2008; 283(17): 11689 - 11699. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Xu and X. Cui Robustified MANOVA with applications in detecting differentially expressed genes from oligonucleotide arrays Bioinformatics, April 15, 2008; 24(8): 1056 - 1062. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Vicent, D. Luis-Ravelo, I. Anton, I. Garcia-Tunon, F. Borras-Cuesta, J. Dotor, J. De Las Rivas, and F. Lecanda A Novel Lung Cancer Signature Mediates Metastatic Bone Colonization by a Dual Mechanism Cancer Res., April 1, 2008; 68(7): 2275 - 2285. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Ambrosini, S. L. Seelman, L.-X. Qin, and G. K. Schwartz The Cyclin-Dependent Kinase Inhibitor Flavopiridol Potentiates the Effects of Topoisomerase I Poisons by Suppressing Rad51 Expression in a p53-Dependent Manner Cancer Res., April 1, 2008; 68(7): 2312 - 2320. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. G. Anderson, S. Moreau-Marquis, B. A. Stanton, and G. A. O'Toole In Vitro Analysis of Tobramycin-Treated Pseudomonas aeruginosa Biofilms on Cystic Fibrosis-Derived Airway Epithelial Cells Infect. Immun., April 1, 2008; 76(4): 1423 - 1433. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Koltai and C. Weingarten-Baror Specificity of DNA microarray hybridization: characterization, effectors and approaches for data correction Nucleic Acids Res., April 1, 2008; 36(7): 2395 - 2405. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.-I. Fernandez, B. Regnault, C. Mulet, M. Tanguy, P. Jay, P. J. Sansonetti, and T. Pedron Maturation of Paneth Cells Induces the Refractory State of Newborn Mice to Shigella Infection J. Immunol., April 1, 2008; 180(7): 4924 - 4930. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. F. Collins, Z. Hu, P. N. Ranganathan, D. Feng, L. M. Garrick, M. D. Garrick, and R. W. Browne Induction of arachidonate 12-lipoxygenase (Alox15) in intestine of iron-deficient rats correlates with the production of biologically active lipid mediators Am J Physiol Gastrointest Liver Physiol, April 1, 2008; 294(4): G948 - G962. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Inada, A. Follenzi, K. Cheng, M. Surana, B. Joseph, D. Benten, S. Bandi, H. Qian, and S. Gupta Phenotype reversion in fetal human liver epithelial cells identifies the role of an intermediate meso-endodermal stage before hepatic maturation J. Cell Sci., April 1, 2008; 121(7): 1002 - 1013. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. G. Woo, E. S. Park, J. H. Cheon, J. H. Kim, J.-S. Lee, B. J. Park, W. Kim, S. C. Park, Y. J. Chung, B. G. Kim, et al. Gene Expression-Based Recurrence Prediction of Hepatitis B Virus-Related Human Hepatocellular Carcinoma Clin. Cancer Res., April 1, 2008; 14(7): 2056 - 2064. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. J. Wohlbach, B. F. Quirino, and M. R. Sussman Analysis of the Arabidopsis Histidine Kinase ATHK1 Reveals a Connection between Vegetative Osmotic Stress Sensing and Seed Maturation PLANT CELL, April 1, 2008; 20(4): 1101 - 1117. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Chang Milbauer, P. Wei, J. Enenstein, A. Jiang, C. A. Hillery, J. P. Scott, S. C. Nelson, V. Bodempudi, J. N. Topper, R.-B. Yang, et al. Genetic endothelial systems biology of sickle stroke risk Blood, April 1, 2008; 111(7): 3872 - 3879. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Cossegal, P. Chambrier, S. Mbelo, S. Balzergue, M.-L. Martin-Magniette, A. Moing, C. Deborde, V. Guyon, P. Perez, and P. Rogowsky Transcriptional and Metabolic Adjustments in ADP-Glucose Pyrophosphorylase-Deficient bt2 Maize Kernels Plant Physiology, April 1, 2008; 146(4): 1553 - 1570. [Abstract] [Full Text] [PDF] |
||||





























