UWSpace is currently experiencing technical difficulties resulting from its recent migration to a new version of its software. These technical issues are not affecting the submission and browse features of the site. UWaterloo community members may continue submitting items to UWSpace. We apologize for the inconvenience, and are actively working to resolve these technical issues.
 

A Semantic Distance of Natural Language Queries Based on Question-Answer Pairs

dc.contributor.authorXiong, Kun
dc.date.accessioned2014-08-21T12:30:34Z
dc.date.available2014-08-21T12:30:34Z
dc.date.issued2014-08-21
dc.date.submitted2014
dc.description.abstractMany Natural Language Processing (NLP) techniques have been applied in the field of Question Answering (QA) for understanding natural language queries. Practical QA systems classify a natural language query into vertical domains, and determine whether it is similar to a question with known or latent answers. Current mobile personal assistant applications process queries, recognized from voice input or translated from cross-lingual queries. Theoretically speaking, all these problems rely on an intuitive notion of semantic distance. However, it is neither definable nor computable. Many studies attempt to approximate such a semantic distance in heuristic ways, for instance, distances based on synonym dictionaries. In this paper, we propose a unified algorithm to approximate the semantic distance by a well-defined information distance theory. The algorithm depends on a pre-constructed data structure - semantic clusters, which is built from 35 million question-answer pairs automatically. From the semantic measurement of questions, we implement two practical NLP systems, including a question classifier and a translation corrector. Then a series of comparison experiments have been conducted on both implementations. Experimental results demonstrate that our distance based approach produces fewer errors in classification, compared with other academic works. Also, our translation correction system achieves significant improvements on the Google translation results.en
dc.identifier.urihttp://hdl.handle.net/10012/8664
dc.language.isoenen
dc.pendingfalse
dc.publisherUniversity of Waterlooen
dc.subjectsemantic distanceen
dc.subjectquestion classificationen
dc.subjectquestion translationen
dc.subjectquestion-answer pairsen
dc.subject.programComputer Scienceen
dc.titleA Semantic Distance of Natural Language Queries Based on Question-Answer Pairsen
dc.typeMaster Thesisen
uws-etd.degreeMaster of Mathematicsen
uws-etd.degree.departmentSchool of Computer Scienceen
uws.peerReviewStatusUnrevieweden
uws.scholarLevelGraduateen
uws.typeOfResourceTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Xiong_Kun.pdf
Size:
1.57 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.67 KB
Format:
Item-specific license agreed upon to submission
Description: