Abstract
Correspondence analysis (CA) is a statistical method that is widely used in multiple disciplines to reveal relationships amongst variables. Among others, CA has been successfully applied for microarray data analysis. One of CA’s strengths is its ability to help visualize the complex relationships that may be present in the data. In this sense, CA is a powerful exploratory tool that takes advantage of human pattern analysis abilities. The power of CA can, however, be diluted, if the patterns are embedded in data clutter. This is because CA is a dimensionality reduction approach and not a data reduction method; thus, is powerless to remove clutter. Unfortunately, our visual analysis abilities can be overwhelmed in such conditions causing failures in identifying relationships. In this paper, we propose a solution to this problem by combining CA with one-way analysis of variance (ANOVA) and subsequently by clustering in the low-dimensional space obtained from CA. We investigate the proposed approach using microarray data from 6200 S. cerevisiae genes and demonstrate how visual analysis is facilitated by removal of unnecessary clutter as well as facilitating the discernment of complex relationships that may be missed through application of CA alone.
This research was funded in part by the NSF grant IIS-064418 (CAREER).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Cui, X.Q., Churchill, G.A.: Statistical tests for differential expression in cDNA microarray experiments. Genome Biol. 4, article 210 (2003)
Fellenberg, K., Hauser, N., Brors, B., Neutzner, A., Hoheisel, J., Vingron, M.: Correspondence Analysis Applied to Microarray Data. Proc. Nat. Acad. Sci. 98, 10781–10786 (2001)
Greenacre, M.: Correspondence Analysis in Practice. Chapman & Hall/CRC, Taylor & Francis Group, Boca Raton (2007)
Oba, S., Sato, M., Takemasa, I., Monden, M., Matsubara, K., Ishii, S.: A Bayesian Missing Value Estimation Method For Gene Expression Profile Data. Bioinformatics 19(16), 2088–2096 (2003)
Raychaudhuri, S., Stuart, J.M., Altman, R.B.: Principal Components Analysis to Summarize Microarray Experiments: Application to Sporulation Time Series. In: Pacific Symposium on Biocomputing, pp. 455–466 (2000)
Spellman, P., Sherlock, G., Zhang, M., Iyer, V., Anders, K., Eisen, M., Brown, P., Botstein, D., Futcher, B.: Comprehensive Identification of Cell Cycle–regulated Genes of the Yeast Saccharomyces cerevisiae by Microarray Hybridization. Molecular Biology of the Cell 9, 3273–3297 (1998)
Tai, C., Speed, C.: Statistical Analysis of Microarray Time Course Data, Walter and Eliza Hall Institute of Medical Research. In: DNA Microarrays, vol. ch. 20, Taylor and Francis, New York (2005)
Tzeng, J., Lu, H-S., Li, W-H.: Multidimensional Scaling For Large Genomic Data Sets. BMC Bioinformatics 9 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sasho, A., Zhu, S., Singh, R. (2011). Identification and Analysis of Cell Cycle Phase Genes by Clustering in Correspondence Subspaces. In: Abraham, A., Lloret Mauri, J., Buford, J.F., Suzuki, J., Thampi, S.M. (eds) Advances in Computing and Communications. ACC 2011. Communications in Computer and Information Science, vol 190. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22709-7_35
Download citation
DOI: https://doi.org/10.1007/978-3-642-22709-7_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22708-0
Online ISBN: 978-3-642-22709-7
eBook Packages: Computer ScienceComputer Science (R0)