Conference proceeding
Identification and Analysis of Cell Cycle Phase Genes by Clustering in Correspondence Subspaces
ADVANCES IN COMPUTING AND COMMUNICATIONS, PT I, Vol.190(1), pp.340-350
Communications in Computer and Information Science
01/01/2011
DOI: 10.1007/978-3-642-22709-7_35
Abstract
Correspondence analysis (CA) is a statistical method that is widely used in multiple disciplines to reveal relationships amongst variables. Among others, CA has been successfully applied for microarray data analysis. One of CA's strengths is its ability to help visualize the complex relationships that may be present in the data. In this sense, CA is a powerful exploratory tool that takes advantage of human pattern analysis abilities. The power of CA can, however. be diluted, if the patterns are embedded in data clutter. This is because CA is a dimensionality reduction approach and not a data reduction method; thus. is powerless to remove clutter. Unfortunately, our visual analysis abilities can be overwhelmed in such conditions causing failures in identifying relationships. In this paper, we propose a solution to this problem by combining CA with one-way analysis of variance (ANOVA) and subsequently by clustering in the low-dimensional space obtained from CA. We investigate the proposed approach using microarray data from 6200 S. cerevisiae genes and demonstrate how visual analysis is facilitated by removal of unnecessary clutter as well as facilitating the discernment of complex relationships that may be missed through application of CA alone.
Details
- Title: Subtitle
- Identification and Analysis of Cell Cycle Phase Genes by Clustering in Correspondence Subspaces
- Creators
- Ai Sasho - San Francisco State UniversityShenhaochen Zhu - San Francisco State UniversityRahul Singh - San Francisco State University
- Contributors
- A Abraham (Editor)J L Mauri (Editor)J F Buford (Editor)J Suzuki (Editor)S M Thampi (Editor)
- Resource Type
- Conference proceeding
- Publication Details
- ADVANCES IN COMPUTING AND COMMUNICATIONS, PT I, Vol.190(1), pp.340-350
- Publisher
- Springer Nature
- Series
- Communications in Computer and Information Science
- DOI
- 10.1007/978-3-642-22709-7_35
- ISSN
- 1865-0929
- eISSN
- 1865-0937
- Number of pages
- 2
- Grant note
- IIS-064418 / NSF; National Science Foundation (NSF)
- Language
- English
- Date published
- 01/01/2011
- Academic Unit
- Computer Science
- Record Identifier
- 9984446067602771
Metrics
5 Record Views