Evaluating Deep Learning-based Vulnerability Detection Models on Realistic Datasets

Arumugam, Krishna Kanth

Evaluating Deep Learning-based Vulnerability Detection Models on Realistic Datasets

dc.contributor.advisor	Nagappan, Meiyappan
dc.contributor.author	Arumugam, Krishna Kanth
dc.date.accessioned	2023-05-23T20:00:28Z
dc.date.available	2023-05-23T20:00:28Z
dc.date.issued	2023-05-23
dc.date.submitted	2023-05-16
dc.description.abstract	The impact of software vulnerabilities on daily-used software systems is alarming. Despite numerous proposed deep learning-based models to automate vulnerability detection, the detection of software vulnerabilities remains a significant issue. While some techniques report high precision/recall scores of up to 99%, our experience leads us to believe that these models may underperform in realistic settings, specifically when evaluating vulnerability detection models on the entire source code repository of a project. Therefore, in this thesis, we create a more comprehensive vulnerability detection dataset (i.e., Comp-Vul), which aims to accurately represent the realistic settings where vulnerability detection models are deployed. Then, we evaluate the performance of two state-of-the-art deep learning-based models, LineVul and DeepWukong, on the Comp-Vul dataset. Our results show that the performance of both models drops drastically, with precision dropping by 86% - 95% and F1 score dropping by 88% - 91%. Our further investigation shows that the ratio of vulnerable to non-vulnerable samples in the evaluation dataset significantly impacts the performance metrics of these models. When we visualize the embeddings produced by the models, we find that there is a substantial overlap between vulnerable and non-vulnerable samples. This shows that these models have difficulty distinguishing between vulnerable and non-vulnerable samples in the Comp-Vul dataset, resulting in a high number of false positives. We introduce a new program slice-level vulnerability detection technique named SliceVul, which leverages the powerful capabilities of Transformers and incorporates the semantic properties of source code programs such as data and control flow information. Our approach outperforms the existing state-of-the-art program slice-level vulnerability detection model, DeepWukong when evaluated on the Comp-Vul dataset. Our study argues that accurately identifying vulnerabilities using deep learning remains a challenging task that requires improved approaches to model evaluation and design. Further research and development, complemented by realistic evaluation datasets, is required to enhance the performance of these methods.	en
dc.identifier.uri	http://hdl.handle.net/10012/19471
dc.language.iso	en	en
dc.pending	false
dc.publisher	University of Waterloo	en
dc.subject	vulnerability detection	en
dc.subject	vulnerability datasets	en
dc.subject	deep learning	en
dc.title	Evaluating Deep Learning-based Vulnerability Detection Models on Realistic Datasets	en
dc.type	Master Thesis	en
uws-etd.degree	Master of Mathematics	en
uws-etd.degree.department	David R. Cheriton School of Computer Science	en
uws-etd.degree.discipline	Computer Science	en
uws-etd.degree.grantor	University of Waterloo	en
uws-etd.embargo.terms	0	en
uws.contributor.advisor	Nagappan, Meiyappan
uws.contributor.affiliation1	Faculty of Mathematics	en
uws.peerReviewStatus	Unreviewed	en
uws.published.city	Waterloo	en
uws.published.country	Canada	en
uws.published.province	Ontario	en
uws.scholarLevel	Graduate	en
uws.typeOfResource	Text	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Arumugam_Krishna-Kanth.pdf
Size:: 534.62 KB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 6.4 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Theses
Computer Science