Files
GruenbergThesis.pdf (3.46 MB)
Multi-Model Snowflake Schema Creation
Author Info
Gruenberg, Rebecca
ORCID® Identifier
http://orcid.org/0000-0002-1895-2830
Permalink: http://rave.ohiolink.edu/etdc/view?acc_num=miami1650650300150613
Abstract Details
Year and Degree
2022, Master of Computer Science, Miami University, Computer Science and Software Engineering.
Abstract
Big Data's three V's--volume, velocity, and variety--have continually presented a problem for storing and querying large, diverse data efficiently. Data lakes represent a growing field of study to store large volumes of data in a variety of formats. Multi-model star schemas support analytical processing of data stored in native formats and are an emerging area in data warehousing. Using multi-model snowflake schemas in place of star schemas gives the user a bigger picture of the data lake and the relationships within. In this work, we extend and implement a meta-model for data lakes and provide an algorithm to semi-automatically perform mappings between the data lake and multi-model snowflake schema for structured and semi-structured data. Our algorithm recommends candidate multi-model snowflake schemas derived from a meta-model of a data lake. The algorithm is the basis for a tool to assist analysts in understanding the contents of a data lake and in creating views that support analytical processing to better make business decisions when querying a large data repository. We implement this basis for a tool and demonstrate its functionality using a variety of case studies.
Committee
Karen Davis (Advisor)
Dhananjai Rao (Committee Member)
Daniela Inclezan (Committee Member)
Pages
129 p.
Subject Headings
Computer Science
Keywords
data lakes; multi-model database; snowflake schema; star schema; multi-model snowflake schema; meta-model; data lake meta-model; computer science; database; graph database
Recommended Citations
Citations
Gruenberg, R. (2022). Multi-Model Snowflake Schema Creation [Master's thesis, Miami University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=miami1650650300150613
APA Style (7th edition)
Gruenberg, Rebecca. Multi-Model Snowflake Schema Creation. 2022. Miami University, Master's thesis. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=miami1650650300150613.
MLA Style (8th edition)
Gruenberg, Rebecca. "Multi-Model Snowflake Schema Creation." Master's thesis, Miami University, 2022. http://rave.ohiolink.edu/etdc/view?acc_num=miami1650650300150613
Chicago Manual of Style (17th edition)
Document number: miami1650650300150613
Download Count: 410
Copyright Info
© 2022, all rights reserved.
This open access ETD is published by Miami University and OhioLINK.