Query Similarity for Community Question Answering System Based on Recurrent Encoder Decoder

dc.contributor.advisorLi, Ming
dc.contributor.authorYe, Borui
dc.date.accessioned2017-01-18T20:11:35Z
dc.date.available2017-01-18T20:11:35Z
dc.date.issued2017-01-18
dc.date.submitted2016-12-12
dc.description.abstractThe measurement of sentence similarity is a fundamental task in natural language processing. Traditionally, it is measured either from word-level or sentence-level (such as paraphrasing), which requires many lexical and syntactic resources. In order to solve the problem of lacking labelled data and Chinese language resources, we propose a novel sentence similarity framework based on a recurrent neural network (RNN) Encoder-Decoder architecture. This RNN is pre-trained with a large set of question-question pairs, which is weakly labelled automatically and heuristically. Though less accurate, the pre-training greatly improve the performance of the model, also better than other traditional methods. Our proposed model is capable of both classification and candidate ranking. In addition, we release our evaluation dataset -- a finely annotated question similarity dataset, which will be the first public dataset under this purpose in Chinese to the best of our knowledge.en
dc.identifier.urihttp://hdl.handle.net/10012/11201
dc.language.isoenen
dc.pendingfalse
dc.publisherUniversity of Waterlooen
dc.subjectquestion answeringen
dc.subjectsentence similarityen
dc.subjectneural networken
dc.subjectchinese corpusen
dc.titleQuery Similarity for Community Question Answering System Based on Recurrent Encoder Decoderen
dc.typeMaster Thesisen
uws-etd.degreeMaster of Mathematicsen
uws-etd.degree.departmentDavid R. Cheriton School of Computer Scienceen
uws-etd.degree.disciplineComputer Scienceen
uws-etd.degree.grantorUniversity of Waterlooen
uws.contributor.advisorLi, Ming
uws.contributor.affiliation1Faculty of Mathematicsen
uws.peerReviewStatusUnrevieweden
uws.published.cityWaterlooen
uws.published.countryCanadaen
uws.published.provinceOntarioen
uws.scholarLevelGraduateen
uws.typeOfResourceTexten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Ye_Borui.pdf
Size:
1.77 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
6.17 KB
Format:
Item-specific license agreed upon to submission
Description: