Bioinformatics Tools for Finding the Vocabularies of Genomes

Petri, Eric D.C.

2008, Master of Science (MS), Ohio University, Computer Science (Engineering and Technology).

More organisms are having their genomes sequenced now than in the past, creating greater demand from the biological community for a better understanding of the biological mechanisms encoded in each organism's genomic blueprint. While biologists continue to analyze genomes and identify new functional elements, several genomic regions are often overlooked, such as non-protein-coding regions, introns, and intergenic regions. Several bioinformatics algorithms exist to discover functional elements (referred to in this thesis as words) in these regions.

This thesis presents and describes a functional genomics toolkit for finding the functional words of genomes (vocabularies). Currently available vocabulary-based tools run into limitations when analyzing large input sequences. To overcome these limitations, a scalable word-searching approach is presented and tested on genomic sequence files of up to 2 gigabytes (GB). In addition, the toolkit is used to provide a genome-wide characterization of the Arabidopsis thaliana genome in terms of over- and under-represented repeats within specific genome regions, and to search for similarities between putative functional elements in the human genome and Arabidopsis thaliana, thereby producing a putative vocabulary. The difficulties encountered during the research and suggestions for future work are also discussed.
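
The abstract does not describe the toolkit's algorithms, so the following is only an illustrative sketch of the general idea of word counting over a large sequence, not the thesis's method. It assumes fixed-length words (k-mers), a sequence delivered in chunks so that the whole file never has to sit in memory, and a simple observed/expected ratio under an independent-base model as the measure of over- or under-representation. All function names (`fasta_lines`, `iter_words`, `count_words`, `representation_scores`) are hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable, Iterator


def fasta_lines(path: str) -> Iterator[str]:
    """Stream sequence lines from a FASTA file, skipping header lines.

    Assumption: the file holds a single record; with several records,
    words would wrongly span record boundaries.
    """
    with open(path) as handle:
        for line in handle:
            if not line.startswith(">"):
                yield line.strip()


def iter_words(chunks: Iterable[str], k: int) -> Iterator[str]:
    """Yield every length-k word of a sequence supplied chunk by chunk."""
    tail = ""
    for chunk in chunks:
        data = tail + chunk.upper()
        for i in range(len(data) - k + 1):
            yield data[i:i + k]
        # Keep the last k-1 characters so that words spanning a chunk
        # boundary are produced when the next chunk arrives.
        tail = data[-(k - 1):] if k > 1 else ""


def count_words(chunks: Iterable[str], k: int) -> Counter:
    """Count words made only of A, C, G, T (others, such as N, are skipped)."""
    counts: Counter = Counter()
    for word in iter_words(chunks, k):
        if set(word) <= set("ACGT"):
            counts[word] += 1
    return counts


def representation_scores(counts: Counter) -> Dict[str, float]:
    """Observed/expected ratio per word under an independent-base model.

    Base frequencies are estimated from the counted words themselves,
    which is an approximation. Scores above 1 suggest over-representation,
    scores below 1 under-representation.
    """
    total_words = sum(counts.values())
    base_counts: Counter = Counter()
    for word, count in counts.items():
        for base in word:
            base_counts[base] += count
    total_bases = sum(base_counts.values())

    scores = {}
    for word, count in counts.items():
        probability = 1.0
        for base in word:
            probability *= base_counts[base] / total_bases
        scores[word] = count / (total_words * probability)
    return scores


counts = count_words(["ACGTAC", "GTACGT"], k=2)
scores = representation_scores(counts)
```

For a file on disk, `count_words(fasta_lines("genome.fa"), k=8)` would process the sequence line by line; the file name and word length here are placeholders. Memory use is then bounded by the number of distinct words rather than by the input size, which is one plausible route to handling multi-gigabyte inputs.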

Lonnie R. Welch, PhD (Advisor)
Frank Drews, PhD (Committee Member)
Klaus Ecker, PhD (Committee Member)
Sarah Wyatt, PhD (Committee Member)
96 p.

Recommended Citations

  • Petri, E. D.C. (2008). Bioinformatics Tools for Finding the Vocabularies of Genomes [Master's thesis, Ohio University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=ohiou1213730223

    APA Style (7th edition)

  • Petri, Eric. Bioinformatics Tools for Finding the Vocabularies of Genomes. 2008. Ohio University, Master's thesis. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=ohiou1213730223.

    MLA Style (8th edition)

  • Petri, Eric. "Bioinformatics Tools for Finding the Vocabularies of Genomes." Master's thesis, Ohio University, 2008. http://rave.ohiolink.edu/etdc/view?acc_num=ohiou1213730223

    Chicago Manual of Style (17th edition)