Measuring the Stability of Query Term Collocations and Using it in Document Ranking

dc.contributor.authorAlshaar, Rana
dc.date.accessioned2009-01-26T19:34:34Z
dc.date.available2009-01-26T19:34:34Z
dc.date.issued2009-01-26T19:34:34Z
dc.date.submitted2008
dc.description.abstractDelivering the right information to the user is fundamental in information retrieval system. Many traditional information retrieval models assume word independence and view a document as bag-of-words, however getting the right information requires a deep understanding of the content of the document and the relationships that exist between words in the text. This study focuses on developing two new document ranking techniques, which are based on a lexical cohesive relationship of collocation. Collocation relationship is a semantic relationship that exists between words that co-occur in the same lexical environment. Two types of collocation relationship have been considered; collocation in the same grammatical structure (such as a sentence), and collocation in the same semantic structure where query terms occur in different sentences but they co-occur with the same words. In the first technique, we only considered the first type of collocation to calculate the document score; where the positional frequency of query terms co-occurrence have been used to identify collocation relationship between query terms and calculating query term’s weight. In the second technique, both types of collocation have been considered; where the co-occurrence frequency distribution within a predefined window has been used to determine query terms collocations and computing query term’s weight. Evaluation of the proposed techniques show performance gain in some of the collocations over the chosen baseline runs.en
dc.identifier.urihttp://hdl.handle.net/10012/4256
dc.language.isoenen
dc.pendingfalseen
dc.publisherUniversity of Waterlooen
dc.subjectCollocationen
dc.subjectTerm proximityen
dc.subjectDocument rankingen
dc.subject.programManagement Sciencesen
dc.titleMeasuring the Stability of Query Term Collocations and Using it in Document Rankingen
dc.typeMaster Thesisen
uws-etd.degreeMaster of Applied Scienceen
uws-etd.degree.departmentManagement Sciencesen
uws.peerReviewStatusUnrevieweden
uws.scholarLevelGraduateen
uws.typeOfResourceTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Alshaar_Rana.pdf
Size:
292.28 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
250 B
Format:
Item-specific license agreed upon to submission
Description: