Dowsing for Math Answers: Exploring MathCQA with a Math-aware Search Engine

dc.contributor.authorNg, Yin Ki
dc.date.accessioned2021-11-09T20:28:30Z
dc.date.available2021-11-09T20:28:30Z
dc.date.issued2021-11-09
dc.date.submitted2021-11-01
dc.description.abstractSolving math problems can be challenging. It is so challenging that one might wish to seek insights from the internet, looking for related references to understand more about the problems. Even more, one might wish to actually search for the answer, believing that some wise people have already solved the problem and shared their intelligence selflessly. However, searching for relevant answers for a math problem effectively from those sites is itself not trivial. This thesis details how a math-aware search engine Tangent-L---which adopts a traditional text retrieval model (Bag-of-Words scored by BM25+ using formulas' symbol pairs and other features as "words''---tackles the challenge of finding answers to math questions. Various adaptations for Tangent-L to this challenge are explored, including query conversion, weighting scheme of math features, and result re-ranking. In a recent workshop series named Answer Retrieval for Questions on Math (ARQMath), and with math problems from Math StackExchange, the submissions based on these adaptations of Tangent-L achieved the best participant run for two consecutive years, performing better than many participating models designed with machine learning and deep learning models. The major contributions of this thesis are the design and implementation of the three-stage approach to adapting Tangent-L to the challenge, and the detailed analyses of many variants to understand which aspects are most beneficial. The code repository is available, as is a data exploration too built for interested participants to view the math questions in this ARQMath challenge and check the performance of their answer rankings.en
dc.identifier.urihttp://hdl.handle.net/10012/17696
dc.language.isoenen
dc.pendingfalse
dc.publisherUniversity of Waterlooen
dc.relation.urihttps://github.com/kiking0501/MathDowsers-ARQMathen
dc.subjectCommunity Question Answering (CQA)en
dc.subjectMathematical Information Retrieval (MathIR)en
dc.subjectMathematics Stack Exchange (MSE)en
dc.subjectARQMath Laben
dc.subjectTangent-Len
dc.subjectformula matchingen
dc.subjectproximityen
dc.titleDowsing for Math Answers: Exploring MathCQA with a Math-aware Search Engineen
dc.typeMaster Thesisen
uws-etd.degreeMaster of Mathematicsen
uws-etd.degree.departmentDavid R. Cheriton School of Computer Scienceen
uws-etd.degree.disciplineComputer Scienceen
uws-etd.degree.grantorUniversity of Waterlooen
uws-etd.embargo.terms0en
uws.contributor.advisorTompa, Frank
uws.contributor.affiliation1Faculty of Mathematicsen
uws.peerReviewStatusUnrevieweden
uws.published.cityWaterlooen
uws.published.countryCanadaen
uws.published.provinceOntarioen
uws.scholarLevelGraduateen
uws.typeOfResourceTexten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Ng_YinKi.pdf
Size:
10.15 MB
Format:
Adobe Portable Document Format
Description:
updated 20211109

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
6.4 KB
Format:
Item-specific license agreed upon to submission
Description: