Mining Shared Decision Trees between Datasets

2010, Master of Science in Computer Engineering (MSCE), Wright State University, Computer Engineering.
This thesis studies the problem of mining models, patterns and structures (MPS) shared by two datasets (applications): a well understood dataset, denoted WD, and a poorly understood one, denoted PD. Combined with users' familiarity with WD, the shared MPS can help users better understand PD, since they capture similarities between WD and PD. Moreover, knowledge of such similarities lets users focus their attention on analyzing the unique behavior of PD. Technically, this thesis focuses on the shared decision tree mining problem. To provide a view of the similarities between WD and PD, it proposes to mine a high-quality shared decision tree that satisfies two properties: the tree has (1) highly similar data distributions and (2) high classification accuracy in both datasets. The thesis proposes an algorithm, SDT-Miner, for mining such a shared decision tree. The algorithm differs significantly from traditional decision tree mining because it must address the challenges caused by the presence of two datasets, by the data distribution similarity requirement, and by the tree accuracy requirement. The effectiveness of the algorithm is verified by experiments.
Guozhu Dong, PhD (Advisor)
Keke Chen, PhD (Committee Member)
Pascal Hitzler, PhD (Committee Member)
52 p.
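
The two properties named in the abstract (similar data distribution and high accuracy in both datasets) can be illustrated with a small sketch of how a single candidate split might be scored against WD and PD together. This is not SDT-Miner itself, whose details the abstract does not give. The function names, the row format (a feature dict paired with a class label), the use of information gain as the accuracy proxy, and the way the two criteria are multiplied together are all assumptions made for illustration.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def split_stats(rows, attr, threshold):
    """Return (information gain, fraction of rows sent left) for one dataset.

    Assumes rows is a non-empty list of (features_dict, label) pairs.
    """
    left = [y for x, y in rows if x[attr] <= threshold]
    right = [y for x, y in rows if x[attr] > threshold]
    n = len(rows)
    remainder = (len(left) / n) * entropy(left) + (len(right) / n) * entropy(right)
    gain = entropy([y for _, y in rows]) - remainder
    return gain, len(left) / n

def shared_split_score(wd_rows, pd_rows, attr, threshold):
    """Hypothetical score: reward splits that are informative in BOTH datasets
    and that route a similar share of records to each branch."""
    gain_wd, frac_wd = split_stats(wd_rows, attr, threshold)
    gain_pd, frac_pd = split_stats(pd_rows, attr, threshold)
    similarity = 1.0 - abs(frac_wd - frac_pd)  # 1.0 means identical branch shares
    return min(gain_wd, gain_pd) * similarity  # the weaker dataset bounds the score

def best_shared_split(wd_rows, pd_rows, candidates):
    """Pick the (attr, threshold) candidate with the highest shared score."""
    return max(candidates, key=lambda c: shared_split_score(wd_rows, pd_rows, *c))
```

Taking the minimum of the two gains means a split that separates classes well in WD but poorly in PD scores low, which is the sense in which such a split would not be "shared". A full tree miner would apply a criterion of this kind recursively at each node; how SDT-Miner actually combines the requirements is described in the thesis, not here.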

Recommended Citations

  • Han, Q. (2010). Mining Shared Decision Trees between Datasets [Master's thesis, Wright State University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=wright1274807201

    APA Style (7th edition)

  • Han, Qian. Mining Shared Decision Trees between Datasets. 2010. Wright State University, Master's thesis. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=wright1274807201.

    MLA Style (8th edition)

  • Han, Qian. "Mining Shared Decision Trees between Datasets." Master's thesis, Wright State University, 2010. http://rave.ohiolink.edu/etdc/view?acc_num=wright1274807201

    Chicago Manual of Style (17th edition)