Harnessing Generalist LLMs for Diverse Objective and Subjective NLP Tasks

dc.contributor.author: Sahu, Gaurav
dc.date.accessioned: 2024-12-17T14:42:11Z
dc.date.available: 2024-12-17T14:42:11Z
dc.date.issued: 2024-12-17
dc.date.submitted: 2024-12-13
dc.description.abstract: Recent advances in natural language processing (NLP), particularly in large language modeling, have led to a major paradigm shift. Large language models (LLMs), such as the GPT and LLaMA families, are trained on massive Internet corpora spanning a gamut of diverse domains. Moreover, the billions of parameters in these models give rise to emergent capabilities, yielding strong improvements across diverse NLP tasks with little task-specific tuning. However, effectively harnessing the knowledge of these generalist models for real-world data remains a major challenge, as LLMs can produce inconsistent, biased, and unsatisfactory outputs. In this thesis, we propose task-specific strategies for effectively leveraging LLMs on a number of challenging NLP tasks: (low-resource) text classification, text summarization, modeling the artistic preferences of creative individuals, and automated data analysis. Our results suggest that LLMs can serve as excellent data generators and data labelers for well-defined single-step tasks such as classification and summarization, especially in data-scarce settings, where models trained on LLM-generated data achieve performance competitive with oracle models trained on much larger labeled datasets. For more subjective tasks, such as modeling artistic preferences among creative individuals, we demonstrate that while LLMs may not reliably discern an artist's likes and dislikes, they are effective at extracting key linguistic and poetic properties from text, which can later be employed to infer the artistic preferences of different individuals. Lastly, we evaluate the effectiveness of LLMs on multi-step tasks that require the LLM to perform multiple subtasks in tandem without compromising performance on any individual subtask. Overall, our work offers critical insights into the strengths and shortcomings of LLMs for a wide range of objective and subjective NLP tasks, along with concrete suggestions for how the research community can harness LLMs for these tasks effectively.
dc.identifier.uri: https://hdl.handle.net/10012/21255
dc.language.iso: en
dc.pending: false
dc.publisher: University of Waterloo
dc.subject: large language models (LLMs)
dc.subject: natural language processing (NLP)
dc.subject: text classification
dc.subject: intent classification
dc.subject: few-shot text classification
dc.subject: text summarization
dc.subject: extractive text summarization
dc.subject: semi-supervised text summarization
dc.subject: data augmentation
dc.subject: zero-shot text classification
dc.subject: artistic preference modeling
dc.subject: LLM-based exploratory data analysis
dc.subject: abstractive text summarization
dc.subject: GPT
dc.subject: LLaMA-3
dc.subject: LLaMA-2
dc.subject: BERT
dc.subject: DistilBERT
dc.subject: DistilBART
dc.subject: PreSumm
dc.subject: PromptMix
dc.subject: MixSumm
dc.title: Harnessing Generalist LLMs for Diverse Objective and Subjective NLP Tasks
dc.type: Doctoral Thesis
uws-etd.degree: Doctor of Philosophy
uws-etd.degree.department: David R. Cheriton School of Computer Science
uws-etd.degree.discipline: Computer Science
uws-etd.degree.grantor: University of Waterloo
uws-etd.embargo.terms: 0
uws.contributor.advisor: Vechtomova, Olga
uws.contributor.affiliation1: Faculty of Mathematics
uws.peerReviewStatus: Unreviewed
uws.published.city: Waterloo
uws.published.country: Canada
uws.published.province: Ontario
uws.scholarLevel: Graduate
uws.typeOfResource: Text

Files

Original bundle

Name: Sahu_Gaurav.pdf
Size: 8.38 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 6.4 KB
Description: Item-specific license agreed upon to submission