Loop Modeling in Proteins Using a Database Approach with Multi-Dimensional Scaling

dc.comment.hiddenThis work was previously published in the Journal of Computational Biology. They only require permission to publish the work in the same form. I have used two public domain figures and a figure from a paper licensed under the creative commons attribution license. I have made notes of this in the Copyright Permissions section.en
dc.contributor.authorHoltby, Daniel James
dc.date.accessioned2013-09-26T20:08:16Z
dc.date.available2013-09-26T20:08:16Z
dc.date.issued2013-09-26T20:08:16Z
dc.date.submitted2013-09
dc.description.abstractModeling loops is an often necessary step in protein structure and function determination, even with experimental X-ray and NMR data. It is well known to be difficult. Database techniques have the advantage of producing a higher proportion of predictions with sub-angstrom accuracy when compared with ab initio techniques, but the disadvantage of often being able to produce usable results as they depend entirely on the loop already being represented within the database. My contribution is the LoopWeaver protocol, a database method that uses multidimensional scaling to rapidly achieve better clash-free, low energy placement of loops obtained from a database of protein structures. This maintains the above- mentioned advantage while avoiding the disadvantage by permitting the use of lower quality matches that would not otherwise fit. Test results show that this method achieves significantly better results than all other methods, including Modeler, Loopy, SuperLooper, and Rapper before refinement. With refinement, the results (LoopWeaver and Loopy combined) are better than ROSETTA's, with 0.53Å RMSD on average for 206 loops of length 6, 0.75Å local RMSD for 168 loops of length 7, 0.93Å RMSD for 117 loops of length 8, and 1.13Å RMSD loops of length 9, while ROSETTA scores 0.66Å , 0.93Å , 1.23Å , 1.56Å , respectively, at the same average time limit (3 hours on a 2.2 GHz Opteron). When ROSETTA is allowed to run for over a week against LoopWeaver's and Loopy's combined 3 hours, it approaches, but does not surpass, this accuracy.en
dc.identifier.urihttp://hdl.handle.net/10012/7935
dc.language.isoenen
dc.pendingfalseen
dc.publisherUniversity of Waterlooen
dc.subjectloop modelingen
dc.subjectprotein structureen
dc.subject.programComputer Scienceen
dc.titleLoop Modeling in Proteins Using a Database Approach with Multi-Dimensional Scalingen
dc.typeDoctoral Thesisen
uws-etd.degreeDoctor of Philosophyen
uws-etd.degree.departmentSchool of Computer Scienceen
uws.peerReviewStatusUnrevieweden
uws.scholarLevelGraduateen
uws.typeOfResourceTexten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Holtby_Daniel.pdf
Size:
8.09 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
247 B
Format:
Item-specific license agreed upon to submission
Description: