Skip to Main Content
Frequently Asked Questions
Submit an ETD
Global Search Box
Need Help?
Keyword Search
Participating Institutions
Advanced Search
School Logo
Files
File List
Final.pdf (2.01 MB)
ETD Abstract Container
Abstract Header
Advanced Concurrency Control Algorithm Design and GPU System Support for High Performance In-Memory Data Management
Author Info
Yuan, Yuan
Permalink:
http://rave.ohiolink.edu/etdc/view?acc_num=osu1480123978759034
Abstract Details
Year and Degree
2016, Doctor of Philosophy, Ohio State University, Computer Science and Engineering.
Abstract
The design and implementation of data management systems have been significantly affected by application demands and hardware advancements. On one hand, with the emerging of various new applications, the traditional one-size-fits-all data management system has evolved into domain specific systems optimized for each application (e.g., OLTP, OLAP, streaming, etc.). On the other hand, with increasing memory capacity, and advancements of multi-core CPUs and massive parallel co-processors (e.g., GPUs), the performance bottleneck of data management systems have shifted from I/O to memory accesses, which has led a constructive re-design of data management systems for memory resident data. Although many in-memory systems have been developed to deliver much better performance than that of disk-based systems, they all face the challenge of how to maximize the system’s performance by massive parallelism. In this Ph.D. dissertation, we explore how to design high performance in-memory data management systems for massive parallel processors. We have identified three critical issues of in-memory data processing. First, Optimistic Concurrency Control (OCC) method has been commonly used for in-memory databases to ensure transaction serializability. Although OCC can achieve high performance at low contention, it causes large number of unnecessary transaction aborts at high contention, which wastes system resources and significantly degrades database throughput. To solve the problem, we propose a new concurrency control method named Balanced Concurrency Control (BCC) that can more accurately abort transactions while maintaining OCC’s merits at low contention. Second, we study how to use the massive parallel co-processor GPUs to improve the performance of in-memory analytical systems. Existing works have demonstrated GPU’s performance advantage over CPU on simple analytical operations (e.g., join), but it is unclear how to optimize complex queries with various optimizations. To address the issue, we comprehensively examine analytical query behaviors on GPUs and design a new GPU in-memory analytical system to efficiently execute complex analytical workloads. Third, we investigate how to use GPUs to accelerate the performance of various analytical applications on production-level distributed in-memory data processing systems. Most of existing GPU works adopt a GPU-centric design, which completely redesigns a system for GPUs without considering the performance of CPU operations. It is unclear how much a CPU-optimized, distributed in-memory data processing system can benefit from GPUs. To answer the question, we use Apache Spark as a platform and design Spark-GPU that has addressed a set of real-world challenges incurred by the mismatches between Spark and GPU. Our research includes both algorithm design and system design and implementation in the form of open source software.
Committee
Xiaodong Zhang (Advisor)
Pages
156 p.
Subject Headings
Computer Engineering
;
Computer Science
Keywords
Advanced Concurrency Control Algorithm Design, GPU System Support, High Performance In-Memory Data Management
Recommended Citations
Refworks
EndNote
RIS
Mendeley
Citations
Yuan, Y. (2016).
Advanced Concurrency Control Algorithm Design and GPU System Support for High Performance In-Memory Data Management
[Doctoral dissertation, Ohio State University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=osu1480123978759034
APA Style (7th edition)
Yuan, Yuan.
Advanced Concurrency Control Algorithm Design and GPU System Support for High Performance In-Memory Data Management.
2016. Ohio State University, Doctoral dissertation.
OhioLINK Electronic Theses and Dissertations Center
, http://rave.ohiolink.edu/etdc/view?acc_num=osu1480123978759034.
MLA Style (8th edition)
Yuan, Yuan. "Advanced Concurrency Control Algorithm Design and GPU System Support for High Performance In-Memory Data Management." Doctoral dissertation, Ohio State University, 2016. http://rave.ohiolink.edu/etdc/view?acc_num=osu1480123978759034
Chicago Manual of Style (17th edition)
Abstract Footer
Document number:
osu1480123978759034
Download Count:
1,015
Copyright Info
© 2016, all rights reserved.
This open access ETD is published by The Ohio State University and OhioLINK.