An Experimental Study of Selected Methods towards Achieving 100% Recall of Synonyms in Software Requirements Documents

dc.contributor.authorLan, Xiaoye
dc.date.accessioned2015-12-01T17:36:36Z
dc.date.available2015-12-01T17:36:36Z
dc.date.issued2015
dc.date.submitted2015
dc.description.abstractSoftware requirements documents written in natural language need to avoid the use of synonyms to reduce unnecessary confusion and ambiguity. In practice, synonyms are still common and are widely used in requirements documents. Lots of tools to identify synonyms have been developed. To evaluate these tools, two metrics are often used: recall and precision. Recall is the ratio of the number of relevant records retrieved to the total number of relevant records in the document. Precision is the fraction of retrieved records that are relevant. Industry practice leads us to believe that 100% recall is preferred over 100% precision for such tools. Available tools never actually achieve 100% recall. The goal of this thesis is to explore computational methods that could reach 100% recall in extracting synonyms from software requirements documents. This thesis compares six WordNet-based methods and two context-based algorithmic approaches to extract synonyms from two different types of requirement documents. The eight methods were compared by their recall. The experiments results showed that the word co-occurrence-based method achieved the best recall in identifying synonyms of the software requirements documents. Further experiments showed that setting the parameters of the word co-occurrence-based method impacts the results of the experiments as well. The thesis also discusses potential issues of the word co-occurrence-based method in the design of the experiments. The document author's personal factors could influence the experiment results, but this influence can be avoided with careful design.en
dc.identifier.urihttp://hdl.handle.net/10012/10019
dc.language.isoenen
dc.pendingfalse
dc.publisherUniversity of Waterlooen
dc.subject.lcshNatural language processing (Computer science)en
dc.subject.lcshsemantic computingen
dc.subject.lcshSynonymsen
dc.subject.lcshRequirements engineeringen
dc.subject.programComputer Scienceen
dc.titleAn Experimental Study of Selected Methods towards Achieving 100% Recall of Synonyms in Software Requirements Documentsen
dc.typeMaster Thesisen
uws-etd.degreeMaster of Mathematicsen
uws-etd.degree.departmentSchool of Accounting and Financeen
uws-etd.degree.disciplineAccountingen
uws.contributor.advisorBerry, Daniel
uws.contributor.affiliation1Faculty of Applied Health Sciencesen
uws.typeOfResourceTexten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Lan_Xiaoye.pdf
Size:
342.02 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
6.17 KB
Format:
Item-specific license agreed upon to submission
Description: