Bioinformatics Tools for Finding the Vocabularies of Genomes

Petri, Eric D.C.

ohiou1213730223.pdf (470.5 KB)

Permalink:

http://rave.ohiolink.edu/etdc/view?acc_num=ohiou1213730223

Year and Degree

2008, Master of Science (MS), Ohio University, Computer Science (Engineering and Technology).

Abstract

More organisms are having their genomes sequenced now than in the past, creating greater demand from the biological community to understand the biological mechanisms encoded within the genomic blueprint of each organism. While biologists continue to analyze genomes and identify new functional elements within organisms, several regions of the genome are often overlooked, such as non-protein-coding regions, introns, and intergenic regions. Several bioinformatics algorithms exist to discover functional elements (referred to here as words) in these regions.

In this thesis, a functional genomics toolkit for finding the functional words of genomes (their vocabularies) is presented and described. Currently available vocabulary-based tools run into limitations when analyzing large input sequences. To overcome this limitation, a scalable word searching approach is presented and tested on genomic sequence files of up to 2 gigabytes (GB). In addition, the toolkit is used to provide a genome-wide characterization of the Arabidopsis thaliana genome in terms of over- and under-represented repeats within specific genome regions, and to search for similarities between putative functional elements in the human and Arabidopsis thaliana genomes, thereby producing a putative vocabulary. The difficulties encountered during the research process and suggestions for future work are also discussed.

Committee

Lonnie R. Welch, PhD (Advisor)
Frank Drews, PhD (Committee Member)
Klaus Ecker, PhD (Committee Member)
Sarah Wyatt, PhD (Committee Member)

Pages

96 p.

Subject Headings

Bioinformatics; Computer Science

Keywords

Bioinformatics; Functional Genomics; Vocabulary Generation; Scalable Word Searching

Recommended Citations

APA Style (7th edition): Petri, E. D.C. (2008). Bioinformatics Tools for Finding the Vocabularies of Genomes [Master's thesis, Ohio University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=ohiou1213730223
MLA Style (8th edition): Petri, Eric. Bioinformatics Tools for Finding the Vocabularies of Genomes. 2008. Ohio University, Master's thesis. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=ohiou1213730223.
Chicago Manual of Style (17th edition): Petri, Eric. "Bioinformatics Tools for Finding the Vocabularies of Genomes." Master's thesis, Ohio University, 2008. http://rave.ohiolink.edu/etdc/view?acc_num=ohiou1213730223

Document number:

ohiou1213730223

Download Count:

951
