The Library will be performing maintenance on UWSpace on October 2nd, 2024. UWSpace will be offline for all UW community members during this time.
 

Determining the Utility of Key-term Highlighting for High Recall Information Retrieval Systems

dc.contributor.authorWang, Xue Jun
dc.date.accessioned2021-09-28T20:20:39Z
dc.date.available2021-09-28T20:20:39Z
dc.date.issued2021-09-28
dc.date.submitted2021-09-14
dc.description.abstractHigh-recall information retrieval (HRIR) is an important tool used in tasks such as electronic discovery ("eDiscovery") and systematic review of medical research. Applications of HRIR often uses a human as its oracle to determine the relevance of immense numbers of documents, which is expensive in both time and money. Various methods for reducing the amount of time spent per assessment and improving the quality of assessors have been proposed to improve these systems. For this thesis, we examine the method of presenting documents where key-terms are highlighted in place of plain-text document. This is commonly accepted as a positive feature which achieves both of the previously mentioned improvements, but there is currently a lack of empirical evidence to support its effectiveness. We describe an user study in which participants are assigned to one of two variations of a HRIR system (key-term highlighting vs plain-text) with a post task questionnaire. Our results failed to show statistically significant improvement for labelling documents with key-term highlighting over plain-text for any of the measures recall, precision, and F1, but may negatively affect retention of concepts. Our study provides empirical evidence for how the use of key-term highlighting affects an assessor's abilities to label documents and provides insight into when including this feature may be harmful rather than helpful.en
dc.identifier.urihttp://hdl.handle.net/10012/17573
dc.language.isoenen
dc.pendingfalse
dc.publisherUniversity of Waterlooen
dc.relation.urihttps://www.semanticscholar.org/cord19en
dc.subjectinformation retrievalen
dc.subjecthigh recallen
dc.subjecthighlightingen
dc.titleDetermining the Utility of Key-term Highlighting for High Recall Information Retrieval Systemsen
dc.typeMaster Thesisen
uws-etd.degreeMaster of Mathematicsen
uws-etd.degree.departmentDavid R. Cheriton School of Computer Scienceen
uws-etd.degree.disciplineComputer Scienceen
uws-etd.degree.grantorUniversity of Waterlooen
uws-etd.embargo.terms0en
uws.contributor.advisorGrossman, Maura
uws.contributor.affiliation1Faculty of Mathematicsen
uws.peerReviewStatusUnrevieweden
uws.published.cityWaterlooen
uws.published.countryCanadaen
uws.published.provinceOntarioen
uws.scholarLevelGraduateen
uws.typeOfResourceTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Wang_XueJun.pdf.pdf
Size:
5.01 MB
Format:
Adobe Portable Document Format
Description:
MMath Thesis
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
6.4 KB
Format:
Item-specific license agreed upon to submission
Description: