Publication Date

5-2016

Date of Final Oral Examination (Defense)

3-4-2016

Type of Culminating Activity

Thesis

Degree Title

Master of Science in Computer Science

Department

Computer Science

Major Advisor

Timothy Andersen, Ph.D.

Advisor

Amit Jain, Ph.D.

Advisor

Gregory Hampikian, Ph.D.

Abstract

A parallel string matching software architecture has been developed (incorporating several algorithms) to identify small genetic sequences in large genomes. Endogenous viral elements (EVEs) are sequences originating in the genomes of viruses that have become integrated into the chromosomes of sperm or egg cells of infected hosts, and passed to subsequent generations. EVEs have been identified in all seven classes of viruses and in the species of all kingdoms of life. Viruses from groups V and VI are considered in this thesis, including HIV and Ebola, within host genomes ranging from bacteria to humans. This database of small endogenous viral elements (SEVEs) contains homology between the viruses and every chromosome of the ten multicellular organisms in this study, including human, chimpanzee, gorilla, mouse, fruitbat, nematode, and thale cress.

Share

COinS