
Multilingual Grammatical Error Detection And Its Applications to Prompt-Based Correction

Date

2024-01-05

Authors

Sutter Pessurno de Carvalho, Gustavo

Publisher

University of Waterloo

Abstract

Grammatical Error Correction (GEC) and Grammatical Error Detection (GED) are two important tasks in the study of writing-assistant technologies. Given an input sentence, the former aims to output a corrected version of the sentence, while the latter's goal is to indicate which words of the sentence contain errors. Both tasks are relevant for real-world applications that help native speakers and language learners write better. Naturally, these two areas have attracted the attention of the research community and have been studied in the context of modern neural networks. This work focuses on the study of multilingual GED models and how they can be used to improve GEC performed by large language models (LLMs). We study the difference in performance between GED models trained on a single language and models that undergo multilingual training. We expand the list of datasets used for multilingual GED to further experiment with the cross-dataset and cross-lingual generalization of detection models. Our results go against previous findings and indicate that multilingual GED models are as good as monolingual ones when evaluated on in-domain languages. Furthermore, multilingual models show better generalization to novel languages seen only at test time. Making use of the GED models we study, we propose two methods to improve the corrections of prompt-based GEC using LLMs. The first method aims to mitigate overcorrection by using a detection model to determine whether a sentence has any mistakes before feeding it to the LLM. The second method uses the sequence of GED tags to select the in-context examples provided in the prompt. We perform experiments in English, Czech, German and Russian, using Llama2 and GPT3.5. The results show that both methods increase the performance of prompt-based GEC and point to a promising direction of using GED models as part of the correction pipeline performed by LLMs.
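The two GED-assisted strategies described above can be sketched as follows. This is a minimal illustration, not the thesis implementation: the detector and LLM calls are hypothetical stand-ins (a real system would use a trained multilingual GED model and a prompted LLM such as Llama2 or GPT3.5), and the binary token tags and overlap-based example ranking are simplifying assumptions.

```python
# Hypothetical sketch of GED-assisted prompt-based GEC (not the thesis code).

def ged_tags(sentence):
    # Stand-in detector: tag each token "c" (correct) or "i" (incorrect).
    # A real pipeline would use a trained multilingual GED model here.
    known_errors = {"goed"}  # toy error lexicon for illustration only
    return ["i" if tok in known_errors else "c" for tok in sentence.split()]

def correct_with_llm(sentence):
    # Stand-in for a prompt-based LLM correction call.
    return sentence.replace("goed", "went")

def gec_pipeline(sentence):
    """Method 1: send the sentence to the LLM only if GED flags an error,
    mitigating overcorrection of already-correct input."""
    if "i" not in ged_tags(sentence):
        return sentence  # no detected errors: keep the input unchanged
    return correct_with_llm(sentence)

def select_examples(sentence, example_bank):
    """Method 2: rank candidate in-context examples by how closely their
    GED tag sequences match the input's tag sequence."""
    tags = ged_tags(sentence)

    def overlap(example):
        return sum(a == b for a, b in zip(tags, ged_tags(example)))

    return sorted(example_bank, key=overlap, reverse=True)
```

Under these assumptions, `gec_pipeline` leaves clean sentences untouched while routing flagged ones to the LLM, and `select_examples` surfaces demonstrations whose error positions mirror the input's.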

Keywords

grammatical error correction, grammatical error detection, natural language processing, machine learning, deep learning
