
dc.contributor.author: Cen, Kun
dc.date.accessioned: 2008-10-01 16:26:20 (GMT)
dc.date.available: 2008-10-01 16:26:20 (GMT)
dc.date.issued: 2008-10-01T16:26:20Z
dc.date.submitted: 2008-09-24
dc.identifier.uri: http://hdl.handle.net/10012/4081
dc.description.abstract: With the huge amount of subjective content in on-line documents, there is a clear need for an information retrieval system that supports retrieval of documents containing opinions about the topic expressed in a user’s query. In recent years, blogs, a new publishing medium, have attracted a large number of people who express personal opinions on all kinds of topics in response to real-world events. The opinionated nature of blogs makes them an interesting new research area for opinion retrieval. Identification and extraction of subjective content from blogs has become the subject of several research projects. In this thesis, four novel methods are proposed to retrieve blog posts that express opinions about given topics. The first method utilizes the Kullback-Leibler divergence (KLD) to weight the lexicon of subjective adjectives around query terms. The second method takes into account the distances between query terms and subjective adjectives, using distance-based KLD scores of the adjectives for document re-ranking. The third method calculates KLD scores of subjective adjectives for predefined query categories. In the fourth method, collocates (words that co-occur with query terms in the corpus) are used to construct the subjective lexicon automatically. The KLD scores of the collocates are then calculated and used for document ranking. Four groups of experiments are conducted to evaluate the proposed methods on the TREC test collections. The results of the experiments are compared with those of the baseline systems to determine the effectiveness of using KLD in opinion retrieval. Further studies are recommended to explore more sophisticated approaches to identifying subjectivity and promising techniques for extracting opinions. [en]
dc.language.iso: en [en]
dc.publisher: University of Waterloo [en]
dc.subject: Opinion Retrieval [en]
dc.subject: Kullback-Leibler Divergence [en]
dc.title: The Use Of Kullback-Leibler Divergence In Opinion Retrieval [en]
dc.type: Master Thesis [en]
dc.pending: false [en]
dc.subject.program: Management Sciences [en]
uws-etd.degree.department: Management Sciences [en]
uws-etd.degree: Master of Applied Science [en]
uws.typeOfResource: Text [en]
uws.peerReviewStatus: Unreviewed [en]
uws.scholarLevel: Graduate [en]





All items in UWSpace are protected by copyright, with all rights reserved.
