
Prompt-tuning in Controlled Dialogue Generation


Date

2022-12-22

Authors

Liu, Runcheng

Publisher

University of Waterloo

Abstract

Recent years have witnessed rapid progress in dialogue response generation since the advent of the Transformer. Fine-tuning pretrained language models for downstream tasks has become the dominant paradigm in Natural Language Processing (NLP). However, fine-tuning requires storing a full copy of the parameters for every task, which is memory-intensive and expensive to serve with large-scale models such as GPT-3 that have billions of parameters. Meanwhile, prompt-tuning has become an increasingly popular parameter-efficient method for steering large pretrained language models toward various tasks. Most prompting techniques, however, have been applied to language understanding and assume fixed prompts for all data samples within a task. There is therefore a pressing need to exploit the ability of prompt-tuning in open-domain dialogue generation, where data samples may vary greatly within a task. In this thesis, we present a novel instance-specific prompt-tuning algorithm for dialogue generation. Specifically, we generate prompts from an instance-level control code, rather than from the conversation context, to explore their impact on controlled dialogue generation. Experiments on popular open-domain dialogue datasets, evaluated with both automatic metrics and human evaluation, demonstrate that our method outperforms prompting baselines and other lightweight controlled generation methods, and is comparable to fine-tuning while using less than 10% of the total parameters.
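
The thesis itself is not reproduced on this page, but the core idea the abstract describes (generating soft prompts from an instance-level control code and prepending them to a frozen pretrained language model) can be sketched as follows. This is a minimal illustration, not the thesis implementation: the GPT-2 backbone, the prompt length, the control-code vocabulary size, and the MLP prompt generator are all assumptions made for the sake of the example.

    # Hypothetical sketch of instance-specific prompt-tuning (not the thesis code).
    # A frozen pretrained LM receives soft prompt vectors generated from an
    # instance-level control code; only the prompt generator is trained.
    import torch
    import torch.nn as nn
    from transformers import GPT2LMHeadModel, GPT2Tokenizer

    PROMPT_LEN = 10          # number of soft prompt tokens (assumed)
    NUM_CONTROL_CODES = 20   # e.g. one code per dialogue attribute (assumed)

    class ControlCodePromptGenerator(nn.Module):
        """Maps a discrete control code to a sequence of soft prompt embeddings."""
        def __init__(self, hidden_size: int):
            super().__init__()
            self.code_embedding = nn.Embedding(NUM_CONTROL_CODES, hidden_size)
            # Small MLP that expands one code vector into PROMPT_LEN prompt vectors.
            self.mlp = nn.Sequential(
                nn.Linear(hidden_size, hidden_size),
                nn.Tanh(),
                nn.Linear(hidden_size, PROMPT_LEN * hidden_size),
            )
            self.hidden_size = hidden_size

        def forward(self, code_ids: torch.LongTensor) -> torch.Tensor:
            code_vec = self.code_embedding(code_ids)               # (B, H)
            prompts = self.mlp(code_vec)                           # (B, PROMPT_LEN*H)
            return prompts.view(-1, PROMPT_LEN, self.hidden_size)  # (B, PROMPT_LEN, H)

    tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
    model = GPT2LMHeadModel.from_pretrained("gpt2")
    for p in model.parameters():          # freeze the backbone LM
        p.requires_grad = False

    prompt_gen = ControlCodePromptGenerator(model.config.n_embd)

    # One training step on a (control code, dialogue text) example.
    code_ids = torch.tensor([3])                                    # instance-level control code
    text = "A: How was your weekend? B: Great, I went hiking!"
    inputs = tokenizer(text, return_tensors="pt")
    token_embeds = model.transformer.wte(inputs["input_ids"])       # (1, T, H)
    prompt_embeds = prompt_gen(code_ids)                            # (1, PROMPT_LEN, H)
    inputs_embeds = torch.cat([prompt_embeds, token_embeds], dim=1)

    # Labels: ignore the loss on prompt positions (-100), predict the text tokens.
    labels = torch.cat(
        [torch.full((1, PROMPT_LEN), -100), inputs["input_ids"]], dim=1
    )
    out = model(inputs_embeds=inputs_embeds, labels=labels)
    out.loss.backward()   # gradients flow only into prompt_gen

Since the backbone is frozen, only the prompt generator's parameters receive gradients, which is what makes this style of method parameter-efficient relative to full fine-tuning.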

Keywords

machine learning, natural language processing, controlled dialogue generation, parameter efficient fine-tuning
