A Data Mining Approach for Detecting Evolutionary Divergence in Transcriptomic Data

dc.contributor.advisorMcConkey, Brendan
dc.contributor.authorWoody, Owen
dc.date.accessioned2019-11-19T15:18:35Z
dc.date.available2019-11-19T15:18:35Z
dc.date.issued2019-11-19
dc.date.submitted2019-11-15
dc.description.abstractIt has become common to produce genome sequences for organisms of scientific or popular interest. Although these genome projects provide insight into the gene and protein complements of a species including their evolutionary relationships, it remains challenging to determine gene regulatory behavior from genome sequence alone. It has also become common to produce “expression atlas” transcriptomic data sets. These atlases employ high-throughput transcript assays to survey an assortment of tissues, developmental states, and responses to stimuli that each may individually elicit or inhibit the transcription of genes. Although genomic and transcriptomic data sets are both routinely collected, they are seldom analyzed in tandem. Here I present a novel approach to combining these complementary data with a software package called BranchOut. BranchOut uses genomic information to construct gene family phylogenies, and then attempts to map gene expression activity onto this phylogeny to allow estimation of ancestral expression states. This allows the identification of specific innovations due to gene duplications that resulted in fundamental diversification in the roles of otherwise closely related genes. As a proof of concept, the BranchOut technique is first applied to a tangible small-scale example in Apis mellifera. Subsequently, the power of BranchOut to analyze complete genomes is shown for two mammalian genomes, Sus scrofa and Bos taurus. The transcriptomic data sets for these two mammals employ microarray and RNAseq platforms, respectively, for expression analysis, demonstrating BranchOut’s applicability to both future and historic expression atlases. Potential refinements to the approach are also discussed.en
dc.identifier.urihttp://hdl.handle.net/10012/15257
dc.language.isoenen
dc.pendingfalse
dc.publisherUniversity of Waterlooen
dc.relation.urihttps://github.com/owoody/BranchOuten
dc.subjectevolutionen
dc.subjectgene expressionen
dc.subjectbioinformaticsen
dc.subjectdata miningen
dc.subjectphylogeneticsen
dc.titleA Data Mining Approach for Detecting Evolutionary Divergence in Transcriptomic Dataen
dc.typeDoctoral Thesisen
uws-etd.degreeDoctor of Philosophyen
uws-etd.degree.departmentBiologyen
uws-etd.degree.disciplineBiologyen
uws-etd.degree.grantorUniversity of Waterlooen
uws.comment.hiddenGitHub is not accessible yet, but will become public once associated publication is accepted.en
uws.contributor.advisorMcConkey, Brendan
uws.contributor.affiliation1Faculty of Scienceen
uws.peerReviewStatusUnrevieweden
uws.published.cityWaterlooen
uws.published.countryCanadaen
uws.published.provinceOntarioen
uws.scholarLevelGraduateen
uws.typeOfResourceTexten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Woody_Owen.pdf
Size:
2.33 MB
Format:
Adobe Portable Document Format
Description:
Ph.D. dissertation

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
6.08 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections