
Analyzing Threats of Large-Scale Machine Learning Systems

dc.contributor.author: Lukas, Nils
dc.date.accessioned: 2024-02-22T15:03:31Z
dc.date.available: 2024-02-22T15:03:31Z
dc.date.issued: 2024-02-22
dc.date.submitted: 2024-01-29
dc.description.abstract: Large-scale machine learning systems such as ChatGPT are rapidly transforming how we interact with and trust digital media. However, such a powerful technology faces a dual-use dilemma: while ML systems can have many positive societal impacts, such as providing equitable access to information, they can also be misused by untrustworthy entities to cause intentional harm. For example, a system could unintentionally disclose private information memorized during training, jeopardizing the privacy of the individuals represented in its training data. The system's generated content could also be misused for unethical purposes, such as eroding trust in digital media by misrepresenting generated content as authentic. Providing untrustworthy users with these new capabilities could amplify the potential negative consequences of this technology, such as a proliferation of deepfakes or disinformation. I analyze these threats from two perspectives: (i) data leakage, when the model cannot be trusted because it has memorized private information during training, and (ii) misuse, when users cannot be trusted to use the system for its intended purposes. This thesis presents five projects that assess these risks to the privacy and security of ML systems and evaluate the reliability of known countermeasures. To do so, I assess the privacy risk of extracting Personally Identifiable Information from language models trained with differential privacy and, as a method of controlling unintended use, I study the effectiveness and robustness of fingerprinting and watermarking methods for detecting the provenance of models and their generated content.
dc.identifier.uri: http://hdl.handle.net/10012/20355
dc.language.iso: en
dc.pending: false
dc.publisher: University of Waterloo
dc.relation.uri: https://github.com/dnn-security/Watermark-Robustness-Toolbox
dc.relation.uri: https://github.com/microsoft/analysing_pii_leakage
dc.relation.uri: https://github.com/nilslukas/gan-watermark
dc.subject: machine learning
dc.subject: security
dc.subject: watermark
dc.subject: differential privacy
dc.subject: data poisoning
dc.subject: generative AI
dc.title: Analyzing Threats of Large-Scale Machine Learning Systems
dc.type: Doctoral Thesis
uws-etd.degree: Doctor of Philosophy
uws-etd.degree.department: David R. Cheriton School of Computer Science
uws-etd.degree.discipline: Computer Science
uws-etd.degree.grantor: University of Waterloo
uws-etd.embargo.terms: 0
uws.contributor.advisor: Kerschbaum, Florian
uws.contributor.affiliation1: Faculty of Mathematics
uws.peerReviewStatus: Unreviewed
uws.published.city: Waterloo
uws.published.country: Canada
uws.published.province: Ontario
uws.scholarLevel: Graduate
uws.typeOfResource: Text
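
The abstract mentions two defenses that short sketches can make concrete. First, training language models with differential privacy is typically done with DP-SGD: every per-example gradient is clipped to a fixed norm and calibrated Gaussian noise is added before the parameter update. The following PyTorch sketch is purely illustrative; the function and parameter names (dp_sgd_step, clip_norm, noise_mult) are hypothetical, and the code is not taken from the thesis or its linked repositories.

import torch

def dp_sgd_step(model, loss_fn, xs, ys, lr=0.1, clip_norm=1.0, noise_mult=1.0):
    # One DP-SGD update (Abadi et al.): clip each per-example gradient
    # to clip_norm, sum the clipped gradients, add Gaussian noise with
    # standard deviation noise_mult * clip_norm, then apply the average.
    params = [p for p in model.parameters() if p.requires_grad]
    summed = [torch.zeros_like(p) for p in params]
    for x, y in zip(xs, ys):
        model.zero_grad()
        loss = loss_fn(model(x.unsqueeze(0)), y.unsqueeze(0))
        loss.backward()
        grads = [p.grad.detach().clone() for p in params]
        total_norm = torch.sqrt(sum(g.pow(2).sum() for g in grads))
        scale = min(1.0, clip_norm / (total_norm.item() + 1e-12))
        for s, g in zip(summed, grads):
            s.add_(g, alpha=scale)  # accumulate the clipped gradient
    with torch.no_grad():
        for p, s in zip(params, summed):
            noise = torch.randn_like(s) * (noise_mult * clip_norm)
            p.add_(s + noise, alpha=-lr / len(xs))  # noisy averaged step

Second, the fingerprinting and watermarking methods studied in the thesis aim to detect the provenance of models and their outputs. One common black-box scheme plants a secret trigger set during training and later verifies ownership by checking whether a suspect model reproduces the planted labels. The sketch below assumes that generic setting and is likewise hypothetical, not the thesis's own method.

import torch

def verify_trigger_watermark(model, trigger_x, trigger_y, threshold=0.9):
    # Query the suspect model on the secret trigger inputs and measure
    # how often it returns the planted labels; a match rate above the
    # threshold is taken as evidence of ownership.
    with torch.no_grad():
        preds = model(trigger_x).argmax(dim=1)
    match_rate = (preds == trigger_y).float().mean().item()
    return match_rate >= threshold, match_rate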

Files

Original bundle
Name: Lukas_Nils.pdf
Size: 9.82 MB
Format: Adobe Portable Document Format

License bundle
Name: license.txt
Size: 6.4 KB
Format: Item-specific license agreed to upon submission