Skip to Main Content
 

Global Search Box

 
 
 
 

ETD Abstract Container

Abstract Header

Application of Committee k-NN Classifiers for Gene Expression Profile Classification

Dhawan, Manik

Abstract Details

2008, Master of Science, University of Akron, Computer Science.
The study of this thesis was an effort to design a stable classification system to categorize microarray gene expression profiles. Currently, high-throughput microarray technology has been widely used to simultaneously probe the expression values of thousands genes in a biological sample. However, due to the nature of DNA hybridization, the expression profiles are highly noisy and demand specialized data mining methods for analysis. This study focuses on developing an effective and stable sample classification system using gene expression data. The system includes a sequence of data preprocessing steps and a committee of k-nearest neighbor (k-NN) classifiers that are of different architectures and use different sets of features. A case study of the system was performed to illustrate the effectiveness of the committee approach. A real microarray dataset, the MIT leukemia cancer dataset, was used in the study. The expression profiles were first subjected to the sequence of preprocessing steps. About 38% of the genes were removed. The remaining informative genes were then ranked and used for constructing k-NN classifiers. The k-NN classifiers that gave the best results were further recruited to form a decision-making committee. The performance of the committee of k-NN classifiers were later evaluated using a new dataset. The results of the case study indicate that the system developed consistently outperforms individual k-NN classifiers in terms of both accuracy and stability.
Zhong-Hui Duan, PhD (Advisor)
62 p.

Recommended Citations

Citations

  • Dhawan, M. (2008). Application of Committee k-NN Classifiers for Gene Expression Profile Classification [Master's thesis, University of Akron]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=akron1227547457

    APA Style (7th edition)

  • Dhawan, Manik. Application of Committee k-NN Classifiers for Gene Expression Profile Classification. 2008. University of Akron, Master's thesis. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=akron1227547457.

    MLA Style (8th edition)

  • Dhawan, Manik. "Application of Committee k-NN Classifiers for Gene Expression Profile Classification." Master's thesis, University of Akron, 2008. http://rave.ohiolink.edu/etdc/view?acc_num=akron1227547457

    Chicago Manual of Style (17th edition)