RNA Homology Searches Using Pair Seeding

dc.contributor.authorDarbha, Sriramen
dc.date.accessioned2006-08-22T14:25:22Z
dc.date.available2006-08-22T14:25:22Z
dc.date.issued2005en
dc.date.submitted2005en
dc.description.abstractDue to increasing numbers of non-coding RNA (ncRNA) being discovered recently, there is interest in identifying homologs of a given structured RNA sequence. Exhaustive homology searching for structured RNA molecules using covariance models is infeasible on genome-length sequences. Hence, heuristic methods are employed, but they largely ignore structural information in the query. We present a novel method, which uses secondary structure information, to perform homology searches for a structured RNA molecule. We define the concept of a <em>pair seed</em> and theoretically model alignments of random and related paired regions to compute expected sensitivity and specificity. We show that our method gives theoretical gains in sensitivity and specificity compared to a BLAST-based heuristic approach. We provide experimental verification of this gain. <br /><br /> We also show that pair seeds can be effectively combined with the spaced seeds approach to nucleotide homology search. The hybrid search method has theoretical specificity superior to that of the BLAST seed. We provide experimental evaluation of our hypotheses. Finally, we note that our method is easily modified to process pseudo-knotted regions in the query, something outside the scope of covariance model based methods.en
dc.formatapplication/pdfen
dc.format.extent885315 bytes
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/10012/1172
dc.language.isoenen
dc.pendingfalseen
dc.publisherUniversity of Waterlooen
dc.rightsCopyright: 2005, Darbha, Sriram. All rights reserved.en
dc.subjectComputer Scienceen
dc.subjectbioinformaticsen
dc.subjecthomologyen
dc.subjectRNAen
dc.subjectseedingen
dc.subjectpairwise sequence alignmenten
dc.subjectalgorithmsen
dc.titleRNA Homology Searches Using Pair Seedingen
dc.typeMaster Thesisen
uws-etd.degreeMaster of Mathematicsen
uws-etd.degree.departmentSchool of Computer Scienceen
uws.peerReviewStatusUnrevieweden
uws.scholarLevelGraduateen
uws.typeOfResourceTexten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
s2darbha2005.pdf
Size:
864.57 KB
Format:
Adobe Portable Document Format