Attaching Social Interactions Surrounding Software Changes to the Release History of an Evolving Software System

dc.comment.hiddenI would appreciate if you could review the thesis for any required formatting changes at your earliest convenience. Thank you!en
dc.contributor.authorBaysal, Olga
dc.date.accessioned2006-12-21T19:44:19Z
dc.date.available2006-12-21T19:44:19Z
dc.date.issued2006-12-21T19:44:19Z
dc.date.submitted2006
dc.description.abstractOpen source software is designed, developed and maintained by means of electronic media. These media include discussions on a variety of issues reflecting the evolution of a software system, such as reports on bugs and their fixes, new feature requests, design change, refactoring tasks, test plans, etc. Often this valuable information is simply buried as plain text in the mailing archives. We believe that email interactions collected prior to a product release are related to its source code modifications, or if they do not immediately correlate to change events of the current release, they might affect changes happening in future revisions. In this work, we propose a method to reason about the nature of software changes by mining and correlating electronic mailing list archives. Our approach is based on the assumption that developers use meaningful names and their domain knowledge in defining source code identifiers, such as classes and methods. We employ natural language processing techniques to find similarity between source code change history and history of public interactions surrounding these changes. Exact string matching is applied to find a set of common concepts between discussion vocabulary and changed code vocabulary. We apply our correlation method on two software systems, LSEdit and Apache Ant. The results of these exploratory case studies demonstrate the evidence of similarity between the content of free-form text emails among developers and the actual modifications in the code. We identify a set of correlation patterns between discussion and changed code vocabularies and discover that some releases referred to as minor should instead fall under the major category. These patterns can be used to give estimations about the type of a change and time needed to implement it.en
dc.format.extent636939 bytes
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/10012/2628
dc.language.isoenen
dc.pendingfalseen
dc.publisherUniversity of Waterlooen
dc.subjectSoftware architectureen
dc.subjectSoftware changesen
dc.subjectCorrelation patternsen
dc.subjectEmail interactionsen
dc.subject.programComputer Scienceen
dc.titleAttaching Social Interactions Surrounding Software Changes to the Release History of an Evolving Software Systemen
dc.typeMaster Thesisen
uws-etd.degreeMaster of Mathematicsen
uws-etd.degree.departmentSchool of Computer Scienceen
uws.peerReviewStatusUnrevieweden
uws.scholarLevelGraduateen
uws.typeOfResourceTexten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
main.pdf
Size:
622.01 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
248 B
Format:
Item-specific license agreed upon to submission
Description: