Skip to Main Content
Frequently Asked Questions
Submit an ETD
Global Search Box
Need Help?
Keyword Search
Participating Institutions
Advanced Search
School Logo
Files
File List
6779.pdf (1.45 MB)
ETD Abstract Container
Abstract Header
The Integration of LlamaOS for Fine-Grained Parallel Simulation
Author Info
Gideon, John
Permalink:
http://rave.ohiolink.edu/etdc/view?acc_num=ucin1377872370
Abstract Details
Year and Degree
2013, MS, University of Cincinnati, Engineering and Applied Science: Computer Engineering.
Abstract
LlamaOS is a custom operating system that provides much of the basic functionality needed for low latency applications. It is designed to run in a Xen-based virtual machine on a Beowulf cluster of multi/many-core processors. The software architecture of llamaOS is decomposed into two main components, namely: the llamaNET driver and llamaApps. The llamaNET driver contains Ethernet drivers and manages all node-to-node communications between user application programs that are contained within a llamaApp instance. Typically, each node of the Beowulf cluster will run one instance of the llamaNET driver with one or more llamaApps bound to parallel applicaitons. These capabilities provide a solid foundation for the deployment of MPI applications as evidenced by our initial benchmarks and case studies. However, a message passing standard still needed to be either ported or implemented in llamaOS. To minimize latency, llamaMPI was developed as a new implementation of the Message Passing Interface (MPI), which is compliant with the core MPI functionality. This provides a standardized and easy way to develop for this new system. Performance assessment of llamaMPI was achieved using both standard parallel computing benchmarks and a locally (but independently) developed program that executes parallel discrete event-driven simulations. In particular, the NAS Parallel Benchmarks are used to show the performance characteristics of llamaMPI. In the experiments, most of the NAS Parallel Benchmarks ran faster than, or equal to their native performance. The benefit of llamaMPI was also shown with the fine-grained parallel application WARPED. The order of magnitude lower communication latency in llamaMPI greatly reduced the amount of time that the simulation spent in rollbacks. This resulted in an overall faster and more efficient computation, because less time was spent off the critical path due to causality errors.
Committee
Philip Wilsey, Ph.D. (Committee Chair)
Fred Beyette, Ph.D. (Committee Member)
Carla Purdy, Ph.D. (Committee Member)
Pages
87 p.
Subject Headings
Computer Engineering
Keywords
Parallel Computing
;
Time Warp Simulation
;
MPI
;
Operating Systems
;
Beowulf Cluster
;
Parallel Discrete Event Simulation
Recommended Citations
Refworks
EndNote
RIS
Mendeley
Citations
Gideon, J. (2013).
The Integration of LlamaOS for Fine-Grained Parallel Simulation
[Master's thesis, University of Cincinnati]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=ucin1377872370
APA Style (7th edition)
Gideon, John.
The Integration of LlamaOS for Fine-Grained Parallel Simulation.
2013. University of Cincinnati, Master's thesis.
OhioLINK Electronic Theses and Dissertations Center
, http://rave.ohiolink.edu/etdc/view?acc_num=ucin1377872370.
MLA Style (8th edition)
Gideon, John. "The Integration of LlamaOS for Fine-Grained Parallel Simulation." Master's thesis, University of Cincinnati, 2013. http://rave.ohiolink.edu/etdc/view?acc_num=ucin1377872370
Chicago Manual of Style (17th edition)
Abstract Footer
Document number:
ucin1377872370
Download Count:
488
Copyright Info
© 2013, some rights reserved.
The Integration of LlamaOS for Fine-Grained Parallel Simulation by John Gideon is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. Based on a work at etd.ohiolink.edu.
This open access ETD is published by University of Cincinnati and OhioLINK.