Skip to Main Content
 

Global Search Box

 
 
 
 

ETD Abstract Container

Abstract Header

A Tree-based Summarization Framework For Differences Between Two Data Sets

Abstract Details

2009, MS, Kent State University, College of Arts and Sciences / Department of Computer Science.
This work addresses the issue of describing the difference between two data sets. A framework is developed to quantify the difference between two data sets, given that the difference is induced by the different statistical distributions of the two data sets. Besides the quantification, this framework also provides an intuitive explanation of difference: a decision tree like structure is built to interpret the interesting point(s) of the difference. A dynamic programming algorithm is developed to give the global optimal solution. However, it has high computational complexity. To improve the efficiency, a greedy algorithm is proposed. Both algorithms are tested against the synthetic data sets and the real data sets.
Ruoming Jin (Advisor)
Yuri Breitbart (Committee Member)
Feodor Dragan (Committee Member)
51 p.

Recommended Citations

Citations

  • Wang, D. (2009). A Tree-based Summarization Framework For Differences Between Two Data Sets [Master's thesis, Kent State University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=kent1232503910

    APA Style (7th edition)

  • Wang, Dong. A Tree-based Summarization Framework For Differences Between Two Data Sets. 2009. Kent State University, Master's thesis. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=kent1232503910.

    MLA Style (8th edition)

  • Wang, Dong. "A Tree-based Summarization Framework For Differences Between Two Data Sets." Master's thesis, Kent State University, 2009. http://rave.ohiolink.edu/etdc/view?acc_num=kent1232503910

    Chicago Manual of Style (17th edition)