UWSpace is currently experiencing technical difficulties resulting from its recent migration to a new version of its software. These technical issues are not affecting the submission and browse features of the site. UWaterloo community members may continue submitting items to UWSpace. We apologize for the inconvenience, and are actively working to resolve these technical issues.
 

The Limited Effectiveness of Neural Networks for Simple Question Answering on Knowledge Graphs

Loading...
Thumbnail Image

Date

2017-12-08

Authors

Mohammed, Salman

Journal Title

Journal ISSN

Volume Title

Publisher

University of Waterloo

Abstract

Simple factoid question answering (QA) is a task, where the questions can be answered by looking up a single fact in the knowledge base (KB). However, this QA task is difficult, since retrieving a single supporting fact involves searching many alternatives given a query expressed in natural language. We use a retrieval-based approach to QA. We decompose the problem into four sub-problems: entity detection, entity linking, relation prediction, and evidence integration. Entity detection and linking rely on detecting the entities in a question and linking them to the candidate entities in the KB. Relation prediction classifies a question as one of the relation types in the KB. Finally, evidence integration combines scores from entity linking and relation prediction to predict an (entity, relation) pair that answers the question. Most of the research community has explored complex neural network architectures for this task without establishing baselines to compare the results with `non-neural-network' approaches. We explore several different models for entity detection and relation prediction; a few different scoring functions for entity linking and evidence integration. Our findings show that deep learning does help for the QA task, but not as much as the research community has portrayed it to be. We also present two simple yet very competitive baselines: one based on a simple neural network architecture and one that does not use any neural networks.

Description

Keywords

LC Keywords

Citation