Techniques for Storing and Processing Next-Generation DNA Sequencing Data

2014, Master of Science, Ohio State University, Biophysics.
Genomics is undergoing an unprecedented transformation due to rapid improvements in genetic sequencing technology, which have lowered the cost of sequencing experiments while increasing the amount of data generated in a typical experiment (McKinsey Global Institute, May 2013, pp. 86-94). The increase in data has shifted the burden from analysis and research to expertise in IT hardware and network support for distributed and efficient processing. Bioinformaticians, in response to a data-rich environment, are challenged to develop better and faster algorithms to solve problems in genomics and molecular biology research. This thesis examines the storage and data processing issues inherent in next-generation DNA sequencing (NGS). It details the design and implementation of a software prototype that exemplifies current approaches to the efficient storage of NGS data. The software library is used within the context of a previous software project that accompanies the publication related to the HT_SOSA assay. The software for HT_SOSA, called NGSPositionCounter, demonstrates a workflow that is common in a molecular biology research lab. To scale beyond the research institute, the software library’s architecture takes into account the data storage and processing demands that are more likely to be encountered in a clinical or commercial enterprise.
Kun Huang, Ph.D (Advisor)
Alvarez Carlos, Ph.D (Committee Member)
Machiraju Raghu, Ph.D (Committee Member)
98 p.

Recommended Citations

  • Camerlengo, T. L. (2014). Techniques for Storing and Processing Next-Generation DNA Sequencing Data [Master's thesis, Ohio State University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=osu1388502159

    APA Style (7th edition)

  • Camerlengo, Terry. Techniques for Storing and Processing Next-Generation DNA Sequencing Data. 2014. Ohio State University, Master's thesis. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=osu1388502159.

    MLA Style (8th edition)

  • Camerlengo, Terry. "Techniques for Storing and Processing Next-Generation DNA Sequencing Data." Master's thesis, Ohio State University, 2014. http://rave.ohiolink.edu/etdc/view?acc_num=osu1388502159

    Chicago Manual of Style (17th edition)