HomeLarge Type Edition
HOME ARCHIVE SEARCH TABLE OF CONTENTS

This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Services
Right arrow Download to citation manager
PubMed
Right arrow PubMed Citation
The Journals of Gerontology Series A: Biological Sciences and Medical Sciences 59:B306-B315 (2004)
© 2004 The Gerontological Society of America

Reproducibility, Sources of Variability, Pooling, and Sample Size: Important Considerations for the Design of High-Density Oligonucleotide Array Experiments

Eun-Soo Han1,2,, Yimin Wu2, Roger McCarter2, James F. Nelson2, Arlan Richardson2,3 and Susan G. Hilsenbeck4

1 Department of Biological Science, The University of Tulsa, Oklahoma.
2 Department of Physiology, The University of Texas Health Science Center, San Antonio.
3 South Texas Veterans Health Care System at San Antonio.
4 Breast Center at Baylor College of Medicine, Houston, Texas.

Address correspondence to Eun-Soo Han, Department of Biological Science, University of Tulsa, 600 S. College Ave., Tulsa, OK 74104. E-mail: eun-han{at}utulsa.edu

We have undertaken a series of experiments to examine several issues that directly affect design of gene expression studies using Affymetrix GeneChip arrays: probe-level analysis, need for technical replication, relative contribution of various sources of variability, and utility of pooling RNA from different samples. Probe-level data were analyzed by Affymetrix MAS 5.0, and three model-based methods, PM-MM and PM-only models by dChip, and the RMA model by Bioconductor, with the latter two providing the best performance. We found that replicate chips of the same RNA have limited value in reducing total variability, and for relatively highly expressed genes in this biologically homogeneous animal model of aging, about 11% of total variation is due to day effects and the remainder is approximately equally split between sample and residual sources. We also found that pooling samples is neither advantageous nor detrimental. Finally we suggest a strategy for sample size calculations using formulas appropriate when coefficients of variation are known, target effects are expressed as fold changes, and data can be assumed to be approximately lognormally distributed.







HOME ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2004 by The Gerontological Society of America.