Skip to Main Content
Frequently Asked Questions
Submit an ETD
Global Search Box
Need Help?
Keyword Search
Participating Institutions
Advanced Search
School Logo
Files
File List
12742.pdf (4.56 MB)
ETD Abstract Container
Abstract Header
An Efficient Algorithm for Clustering Genomic Data
Author Info
Zhou, Xuan
Permalink:
http://rave.ohiolink.edu/etdc/view?acc_num=ucin1418910389
Abstract Details
Year and Degree
2014, MS, University of Cincinnati, Engineering and Applied Science: Computer Science.
Abstract
In this thesis, we investigated an efficient framework for clustering analysis of gene expression profiles by discretizing continuous genomic data and adopting the 1D-jury approach for fast clustering that was previously used for protein model quality assessment. We demonstrated, through an empirical analysis of multiple data sets from independent studies, that the loss of information due to discretization of genomic data is limited. Patterns observed using the original data can largely be recovered from discretized expression profiles, while enabling efficient identification of genomic signatures and clustering of expression profiles. We further studied the application of 1D-Jury approach in reducing the dimensionality of genomic data. We demonstrated that discretization and 1D-Jury score projection efficiently reduced the dimensionality of feature space. More importantly, the proposed discretization-projection heuristic enhanced the discovery of cluster structure and patterns in the data. Therefore, the proposed discretization-projection method can be a valuable tool for the analysis of gene expression data.
Committee
Jaroslaw Meller, Ph.D. (Committee Chair)
Raj Bhatnagar, Ph.D. (Committee Member)
Yizong Cheng, Ph.D. (Committee Member)
Pages
70 p.
Subject Headings
Computer Science
Keywords
genomic data
;
clustering
;
discretization
;
1D-Jury
;
dimension reduction
Recommended Citations
Refworks
EndNote
RIS
Mendeley
Citations
Zhou, X. (2014).
An Efficient Algorithm for Clustering Genomic Data
[Master's thesis, University of Cincinnati]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=ucin1418910389
APA Style (7th edition)
Zhou, Xuan.
An Efficient Algorithm for Clustering Genomic Data.
2014. University of Cincinnati, Master's thesis.
OhioLINK Electronic Theses and Dissertations Center
, http://rave.ohiolink.edu/etdc/view?acc_num=ucin1418910389.
MLA Style (8th edition)
Zhou, Xuan. "An Efficient Algorithm for Clustering Genomic Data." Master's thesis, University of Cincinnati, 2014. http://rave.ohiolink.edu/etdc/view?acc_num=ucin1418910389
Chicago Manual of Style (17th edition)
Abstract Footer
Document number:
ucin1418910389
Download Count:
666
Copyright Info
© 2014, all rights reserved.
This open access ETD is published by University of Cincinnati and OhioLINK.