dc.contributor.author: Fu, Chengyao
dc.date.accessioned: 2020-09-28 13:58:20 (GMT)
dc.date.available: 2020-09-28 13:58:20 (GMT)
dc.date.issued: 2020-09-28
dc.date.submitted: 2020-09-24
dc.identifier.uri: http://hdl.handle.net/10012/16382
dc.description.abstract: Sentiment analysis is widely used in finance. The two most common textual sentiment analysis methods in finance are the dictionary-based approach and the machine learning approach. The dictionary-based method is the most convenient and efficient way to extract sentiment from text, but the words in a dictionary are limited and cannot capture the full scope of a particular domain. Additionally, it is expensive and unsustainable to manually create and maintain a domain-specific dictionary using expert opinions. Deep learning models have become mainstream in sentiment analysis because they perform better by exploiting additional information from larger corpora and more complex model structures. However, deep learning models often suffer from poor interpretability. This thesis attempts to address the issues of both methods. It proposes a machine learning method for corpus-based sentiment lexicon induction, which extends a sentiment dictionary customized to analyze corporate conference calls. The extended dictionary is shown to perform better than the original dictionary in terms of the three-day returns of companies in the MSCI universe. The thesis also proposes a highly interpretable attention-based multiple-instance learning model for sentiment classification and shows that this model achieves accuracy comparable to state-of-the-art sequential models while offering better interpretability. The model generates a keyword ranking as a by-product. A new sentiment dictionary generated by the deep learning method performs even better than both the extended dictionary and the original dictionary. (en)
dc.language.iso: en (en)
dc.publisher: University of Waterloo (en)
dc.subject: sentiment analysis (en)
dc.subject: natural language processing (en)
dc.subject: finance (en)
dc.subject: sentiment dictionary (en)
dc.subject: sentiment lexicon induction (en)
dc.subject: multiple-instance learning (en)
dc.subject: stocks (en)
dc.title: Sentiment Lexicon Induction and Interpretable Multiple-instance Learning in Financial Markets (en)
dc.type: Master Thesis (en)
dc.pending: false
uws-etd.degree.department: David R. Cheriton School of Computer Science (en)
uws-etd.degree.discipline: Computer Science (en)
uws-etd.degree.grantor: University of Waterloo (en)
uws-etd.degree: Master of Mathematics (en)
uws.contributor.advisor: Huang, Alan
uws.contributor.advisor: Li, Yuying
uws.contributor.affiliation1: Faculty of Mathematics (en)
uws.published.city: Waterloo (en)
uws.published.country: Canada (en)
uws.published.province: Ontario (en)
uws.typeOfResource: Text (en)
uws.peerReviewStatus: Unreviewed (en)
uws.scholarLevel: Graduate (en)


UWSpace

University of Waterloo Library
200 University Avenue West
Waterloo, Ontario, Canada N2L 3G1
519 888 4883

All items in UWSpace are protected by copyright, with all rights reserved.
