Skip to Main Content
Frequently Asked Questions
Submit an ETD
Global Search Box
Need Help?
Keyword Search
Participating Institutions
Advanced Search
School Logo
Files
File List
kent1232503910.pdf (1.07 MB)
ETD Abstract Container
Abstract Header
A Tree-based Summarization Framework For Differences Between Two Data Sets
Author Info
Wang, Dong
Permalink:
http://rave.ohiolink.edu/etdc/view?acc_num=kent1232503910
Abstract Details
Year and Degree
2009, MS, Kent State University, College of Arts and Sciences / Department of Computer Science.
Abstract
This work addresses the issue of describing the difference between two data sets. A framework is developed to quantify the difference between two data sets, given that the difference is induced by the different statistical distributions of the two data sets. Besides the quantification, this framework also provides an intuitive explanation of difference: a decision tree like structure is built to interpret the interesting point(s) of the difference. A dynamic programming algorithm is developed to give the global optimal solution. However, it has high computational complexity. To improve the efficiency, a greedy algorithm is proposed. Both algorithms are tested against the synthetic data sets and the real data sets.
Committee
Ruoming Jin (Advisor)
Yuri Breitbart (Committee Member)
Feodor Dragan (Committee Member)
Pages
51 p.
Subject Headings
Computer Science
Keywords
describing difference data sets
Recommended Citations
Refworks
EndNote
RIS
Mendeley
Citations
Wang, D. (2009).
A Tree-based Summarization Framework For Differences Between Two Data Sets
[Master's thesis, Kent State University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=kent1232503910
APA Style (7th edition)
Wang, Dong.
A Tree-based Summarization Framework For Differences Between Two Data Sets.
2009. Kent State University, Master's thesis.
OhioLINK Electronic Theses and Dissertations Center
, http://rave.ohiolink.edu/etdc/view?acc_num=kent1232503910.
MLA Style (8th edition)
Wang, Dong. "A Tree-based Summarization Framework For Differences Between Two Data Sets." Master's thesis, Kent State University, 2009. http://rave.ohiolink.edu/etdc/view?acc_num=kent1232503910
Chicago Manual of Style (17th edition)
Abstract Footer
Document number:
kent1232503910
Download Count:
819
Copyright Info
© 2009, all rights reserved.
This open access ETD is published by Kent State University and OhioLINK.