Skip to Main Content
Frequently Asked Questions
Submit an ETD
Global Search Box
Need Help?
Keyword Search
Participating Institutions
Advanced Search
School Logo
Files
File List
ucin1353342433.pdf (621.28 KB)
ETD Abstract Container
Abstract Header
Exploratory Study of Fuzzy Clustering and Set-Distance Based Validation Indexes
Author Info
Pangaonkar, Manali
Permalink:
http://rave.ohiolink.edu/etdc/view?acc_num=ucin1353342433
Abstract Details
Year and Degree
2012, MS, University of Cincinnati, Engineering and Applied Science: Computer Science.
Abstract
This thesis is concerned with issues related to clustering. In particular, it addresses the con- vergence speed of fuzzy c-means family of algorithms and cluster validation. The fuzzy c-means clustering algorithm and its objective function is studied along with a literature review of the speed of clustering algorithms. After careful examination, several objective functions are derived by modifying the fuzzy c-means’ objective function. In addition, cluster validation is examined and new set distance based cluster validation indexes (CVI) are proposed which are the ratio of separation between clusters to compactness within a cluster. To this end, a new measure of compactness, compactness of a fuzzy partition is presented and fuzzy derivative of Pompeiu-Hausdorff distance is used as separation. The convergence of fuzzy c-means clustering algorithm is tested on real classification and clus- tering datasets. Under classification datasets, Iris, Breast Cancer Wisconsin and Wine Recognition datasets are used. Water Treatment Plant and Libras Movement datasets are used as clustering datasets. In classification datasets, the class labels in the data set are used to measure the per- formance. For clustering datasets, Rand index and Jaccard index are used to evaluate clustering results. The new set distance based validation indexes are tested on both synthetic and real datasets. Datasets with three, four, five and six clusters are generated by using Gaussian distributions. The above mentioned real datasets, Iris, Breast Cancer Wisconsin and Wine Recognition are also used to evaluate the performance of set distance based validation indexes. The result (number of clusters) obtained from the set distance based validation indexes are compared with those obtained from [50] to demonstrate efficiency of set distance based validation indexes and how it considers the structure of underlying data unlike others, [50] in particular.
Committee
Anca Ralescu, Ph.D. (Committee Chair)
Kenneth Berman, Ph.D. (Committee Member)
Dan Ralescu, Ph.D. (Committee Member)
Pages
73 p.
Subject Headings
Computer Science
Keywords
Fuzzy Clustering
;
Cluster Validation
;
Compactness
;
Separation
;
Set Distance
;
Cluster Comparison
;
Recommended Citations
Refworks
EndNote
RIS
Mendeley
Citations
Pangaonkar, M. (2012).
Exploratory Study of Fuzzy Clustering and Set-Distance Based Validation Indexes
[Master's thesis, University of Cincinnati]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=ucin1353342433
APA Style (7th edition)
Pangaonkar, Manali.
Exploratory Study of Fuzzy Clustering and Set-Distance Based Validation Indexes.
2012. University of Cincinnati, Master's thesis.
OhioLINK Electronic Theses and Dissertations Center
, http://rave.ohiolink.edu/etdc/view?acc_num=ucin1353342433.
MLA Style (8th edition)
Pangaonkar, Manali. "Exploratory Study of Fuzzy Clustering and Set-Distance Based Validation Indexes." Master's thesis, University of Cincinnati, 2012. http://rave.ohiolink.edu/etdc/view?acc_num=ucin1353342433
Chicago Manual of Style (17th edition)
Abstract Footer
Document number:
ucin1353342433
Download Count:
559
Copyright Info
© 2012, all rights reserved.
This open access ETD is published by University of Cincinnati and OhioLINK.