Application of Committee k-NN Classifiers for Gene Expression Profile Classification
Author Info
Dhawan, Manik
Permalink:
http://rave.ohiolink.edu/etdc/view?acc_num=akron1227547457
Abstract Details
Year and Degree
2008, Master of Science, University of Akron, Computer Science.
Abstract
This thesis presents the design of a stable classification system for categorizing microarray gene expression profiles. High-throughput microarray technology is now widely used to probe the expression values of thousands of genes in a biological sample simultaneously. However, owing to the nature of DNA hybridization, the expression profiles are highly noisy and demand specialized data mining methods for analysis. This study focuses on developing an effective and stable sample classification system using gene expression data. The system comprises a sequence of data preprocessing steps and a committee of k-nearest neighbor (k-NN) classifiers that have different architectures and use different sets of features. A case study using a real microarray dataset, the MIT leukemia cancer dataset, illustrates the effectiveness of the committee approach. The expression profiles were first subjected to the sequence of preprocessing steps, which removed about 38% of the genes. The remaining informative genes were then ranked and used to construct k-NN classifiers. The k-NN classifiers that gave the best results were then recruited to form a decision-making committee. The performance of the committee was later evaluated on a new dataset. The results of the case study indicate that the developed system consistently outperforms individual k-NN classifiers in terms of both accuracy and stability.
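The committee idea described in the abstract can be sketched in a few lines. This is a minimal illustration only, not the thesis's implementation: the abstract does not state the distance metric, the voting rule, or how committee members were chosen, so Euclidean distance, simple majority voting, the function names `knn_predict` and `committee_predict`, the toy data, and the class labels are all assumptions made for this example.

```python
from collections import Counter
import math


def knn_predict(train_X, train_y, x, k, features):
    """Classify x by majority vote among its k nearest training samples.

    Distance is Euclidean (an assumption) and is computed only over the
    feature (gene) indices listed in `features`.
    """
    dists = []
    for row, label in zip(train_X, train_y):
        d = math.sqrt(sum((row[f] - x[f]) ** 2 for f in features))
        dists.append((d, label))
    dists.sort(key=lambda pair: pair[0])
    votes = Counter(label for _, label in dists[:k])
    return votes.most_common(1)[0][0]


def committee_predict(train_X, train_y, x, members):
    """Combine several k-NN classifiers by simple majority vote.

    `members` is a list of (k, feature_indices) pairs, one per classifier,
    so members can differ in both k and the feature set they use.
    """
    ballots = [knn_predict(train_X, train_y, x, k, feats) for k, feats in members]
    return Counter(ballots).most_common(1)[0][0]


# Toy data invented for illustration: three "genes" per sample, two classes.
train_X = [
    [1.0, 0.9, 5.0],
    [1.2, 1.1, 0.1],
    [0.8, 1.0, 4.0],
    [3.0, 3.2, 0.2],
    [3.1, 2.9, 5.1],
    [2.9, 3.0, 4.2],
]
train_y = ["class_a", "class_a", "class_a", "class_b", "class_b", "class_b"]

# Hypothetical committee: three members with different k and feature sets.
members = [(1, [0, 1]), (3, [0, 1]), (3, [0, 1, 2])]
prediction = committee_predict(train_X, train_y, [1.1, 1.0, 4.8], members)
```

Using an odd number of members and odd values of k avoids tied votes in a two-class problem; with ties, this sketch simply returns the label counted first.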
Committee
Zhong-Hui Duan, PhD (Advisor)
Pages
62 p.
Subject Headings
Bioinformatics; Computer Science; Mining
Keywords
committee k-NN classification based on gene expression data
Recommended Citations
Dhawan, M. (2008). Application of Committee k-NN Classifiers for Gene Expression Profile Classification [Master's thesis, University of Akron]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=akron1227547457
APA Style (7th edition)
Dhawan, Manik. Application of Committee k-NN Classifiers for Gene Expression Profile Classification. 2008. University of Akron, Master's thesis. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=akron1227547457.
MLA Style (8th edition)
Dhawan, Manik. "Application of Committee k-NN Classifiers for Gene Expression Profile Classification." Master's thesis, University of Akron, 2008. http://rave.ohiolink.edu/etdc/view?acc_num=akron1227547457
Chicago Manual of Style (17th edition)
Document number: akron1227547457
Download Count: 1,498
© 2008, all rights reserved.
This open access ETD is published by University of Akron and OhioLINK.