Classification of Patterns in Streaming Data Using Clustering Signatures

2017, MS, University of Cincinnati, Engineering and Applied Science: Electrical Engineering.
Streaming datasets pose many challenges for machine learning algorithms, including insufficient storage and changes in the underlying distribution of the data across time intervals. This thesis proposes a method based on hierarchical clustering (unsupervised learning) that determines a signature for the data in each time window and builds a classifier from the match between the observed clusters and known clustering patterns. A dendrogram is created for each time window, and its clusters are compared with a global list of clusters. When new clusters are observed, they are added to the global list, which is used to generate a signature for the data in a time window. The global list is updated only when none of the existing global clusters can model the data points in a later time window. In the testing phase, the global clusters are used to classify novel data chunks according to their Tanimoto similarities. Although the training samples were taken from only 20% of the KDD Cup 99 dataset, we validated the approach using test data from different regions of the dataset at multiple intervals, and the classifier's performance was comparable to that of other methods that used the entire dataset for training.
Raj Bhatnagar, Ph.D. (Committee Chair)
Gowtham Atluri (Committee Member)
Nan Niu, Ph.D. (Committee Member)
70 p.
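
The matching step described in the abstract can be illustrated with a minimal Python sketch. It assumes that each time window is summarized as a binary signature with one position per global cluster (1 if the window's data falls in that cluster) and that a window receives the label of the most similar known signature. The abstract does not specify the signature encoding, so that encoding, the function names, and the labels and values below are hypothetical.

```python
def tanimoto(a, b):
    """Tanimoto similarity between two equal-length binary signatures."""
    if len(a) != len(b):
        raise ValueError("signatures must have the same length")
    # Positions set in both signatures, and positions set in at least one.
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    # Two all-zero signatures are treated as identical (a convention chosen here).
    return both / either if either else 1.0


def classify(signature, known):
    """Return the label whose known signature is most similar to `signature`."""
    return max(known, key=lambda label: tanimoto(signature, known[label]))


# Hypothetical signatures over a global list of five clusters.
known_signatures = {
    "normal": [1, 1, 0, 0, 0],
    "attack": [0, 0, 1, 1, 1],
}
window_signature = [1, 1, 0, 1, 0]
label = classify(window_signature, known_signatures)
print(label)
```

In this toy case the window shares two of three occupied clusters with the "normal" signature and one of five with the "attack" signature, so it is labeled "normal".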

Recommended Citations


  • Awodokun, O. (2017). Classification of Patterns in Streaming Data Using Clustering Signatures [Master's thesis, University of Cincinnati]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=ucin1504880155623189

    APA Style (7th edition)

  • Awodokun, Olugbenga. Classification of Patterns in Streaming Data Using Clustering Signatures. 2017. University of Cincinnati, Master's thesis. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=ucin1504880155623189.

    MLA Style (8th edition)

  • Awodokun, Olugbenga. "Classification of Patterns in Streaming Data Using Clustering Signatures." Master's thesis, University of Cincinnati, 2017. http://rave.ohiolink.edu/etdc/view?acc_num=ucin1504880155623189

    Chicago Manual of Style (17th edition)