Skip to Main Content
 

Global Search Box

 
 
 
 

Files

ETD Abstract Container

Abstract Header

Integration of Heterogeneous Web-based Information into a Uniform Web-based Presentation

Janga, Prudhvi

Abstract Details

2014, PhD, University of Cincinnati, Engineering and Applied Science: Computer Science and Engineering.
With the continuing explosive growth of the world wide web, a wealth of information has become available online. The web has become one of the major sources of information for both individual users and large organizations. To find the information, individual users can either use search engines or navigate to a particular website following links. The former method returns links to vast amounts of data in seconds while the latter one could be tedious and time consuming. The presentation of results using the former method is usually a web page with links to actual web data sources (or websites). The latter method takes the user to the actual web data source itself. Using the two most popular forms of web data presentation/retrieval, web data can hardly be queried, manipulated and analyzed easily even though it is publicly and readily available. Many companies also use web for information whose challenge is to build web-based analytical and decision support systems, often referred to as web data warehouses. However, the information present on the web is extremely complex and heterogeneous which brings along with it a challenge in integrating and presenting retrieved web data in a uniform format. Hence, there is a need for different web data integration frameworks that can integrate and present web data in a uniform format. To achieve a homogeneous representation of web data we need a framework that extracts relevant structured and semi-structured web data from different web data sources, generates schemas from structured as well as semi-structured web data, and integrates schemas generated from different structured and semi-structured web data sources into a merged schema, populates it with data and presents it to the end user in a uniform format. We propose a modular framework for homogeneous presentation of web data. This framework consists of different standalone modules that can also be used to create independent systems that solve other schema unification problems. To extract, transform and integrate web data we propose new techniques and also improve on existing techniques for tabular web data integration, XML web data integration, schema mapping, data mapping and uniform presentation of web data.
Karen Davis, Ph.D. (Committee Chair)
Raj Bhatnagar, Ph.D. (Committee Member)
Hsiang-Li Chiang, Ph.D. (Committee Member)
Ali Minai, Ph.D. (Committee Member)
Carla Purdy, Ph.D. (Committee Member)
404 p.

Recommended Citations

Citations

  • Janga, P. (2014). Integration of Heterogeneous Web-based Information into a Uniform Web-based Presentation [Doctoral dissertation, University of Cincinnati]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=ucin1397467105

    APA Style (7th edition)

  • Janga, Prudhvi. Integration of Heterogeneous Web-based Information into a Uniform Web-based Presentation. 2014. University of Cincinnati, Doctoral dissertation. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=ucin1397467105.

    MLA Style (8th edition)

  • Janga, Prudhvi. "Integration of Heterogeneous Web-based Information into a Uniform Web-based Presentation." Doctoral dissertation, University of Cincinnati, 2014. http://rave.ohiolink.edu/etdc/view?acc_num=ucin1397467105

    Chicago Manual of Style (17th edition)