DSCode Comparator: An Interactive Interface for Comparing Models and Evaluating Code for Data Science Tasks

dc.contributor.author: Yu, Xinxin
dc.date.accessioned: 2026-04-17T15:24:00Z
dc.date.available: 2026-04-17T15:24:00Z
dc.date.issued: 2026-04-17
dc.date.submitted: 2026-03-20
dc.description.abstract: Code-generating models are increasingly used to support data science tasks. However, reviewing and validating their outputs remains largely manual and time-consuming, requiring users to understand how generated code works and to assess its quality and correctness. Rather than eliminating effort, these models often shift user work from writing code to verifying it. This challenge is compounded because different models frequently produce diverse solutions of varying effectiveness, making systematic comparison and evaluation difficult. To address these challenges, this thesis presents DSCode Comparator, an interactive system designed to support code understanding, evaluation, refinement, and comparison in data science workflows. The system enables users to examine code at multiple levels of granularity, ranging from individual lines of code to complete solutions across different prompts and tasks. DSCode Comparator incorporates an automated annotation pipeline that analyzes generated code and provides structured, line-level explanations to facilitate rapid comprehension. In addition, the system evaluates code quality along multiple functional and pragmatic dimensions, including efficiency, readability, usability, and resource usage. Beyond individual code inspection, DSCode Comparator supports comparative analysis across models by aggregating annotations and evaluation results into compact summaries that highlight key differences in behavior and performance. Through a combination of empirical evaluation and user studies with data science practitioners, this thesis demonstrates that the proposed approach improves users’ ability to understand, compare, and refine code generated by large language models, reducing verification effort while supporting more informed decision-making in model-assisted programming.
dc.identifier.uri: https://hdl.handle.net/10012/23013
dc.language.iso: en
dc.pending: false
dc.publisher: University of Waterloo
dc.subject: large language models
dc.subject: code generation
dc.subject: human-AI interaction
dc.subject: code evaluation
dc.subject: data science workflows
dc.title: DSCode Comparator: An Interactive Interface for Comparing Models and Evaluating Code for Data Science Tasks
dc.type: Master Thesis
uws-etd.degree: Master of Mathematics
uws-etd.degree.department: David R. Cheriton School of Computer Science
uws-etd.degree.discipline: Computer Science
uws-etd.degree.grantor: University of Waterloo
uws-etd.embargo.terms: 0
uws.contributor.advisor: Crisan, Anamaria
uws.contributor.affiliation1: Faculty of Mathematics
uws.peerReviewStatus: Unreviewed
uws.published.city: Waterloo
uws.published.country: Canada
uws.published.province: Ontario
uws.scholarLevel: Graduate
uws.typeOfResource: Text

Files

Original bundle

Name: Yu_Xinxin.pdf
Size: 18.18 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 6.4 KB
Format: Item-specific license agreed upon to submission
