AdaptPrompt with Diffusion Set: A Unified Framework for Generalizable Deepfake Detection

dc.contributor.author: Jiang, Yichen
dc.date.accessioned: 2025-04-30T16:56:44Z
dc.date.available: 2025-04-30T16:56:44Z
dc.date.issued: 2025-04-30
dc.date.submitted: 2025-04-29
dc.description.abstract: Deepfake detection focuses on identifying synthetic media. It has critical applications in cybersecurity, misinformation mitigation, digital forensics, and media authentication. Recent developments in deepfake detection have achieved impressive performance, leveraging deep learning models to distinguish between real and synthetic content. However, recent advances in diffusion-based generative models and publicly available tools such as Stable Diffusion and DALL-E pose immediate challenges to existing detection techniques. Diffusion models generate photorealistic, high-resolution content with fewer observable or detectable artifacts and are therefore more difficult to detect using traditional deepfake detection techniques. In this thesis, we present a comprehensive study of existing deepfake detection techniques and of adapting large vision-language models, i.e., CLIP, to generalizable deepfake detection. We introduce the Diffusion Set, a new dataset of 100k diffusion-generated fake images and 100k real images. Our experiments reveal that detectors trained on the Diffusion Set outperform detectors trained on GAN-based datasets. To further enhance deepfake detection, we introduce a new transfer learning strategy that learns randomly initialized prompts and a lightweight adapter network while keeping CLIP frozen. Extensive experiments confirm its efficiency, and we also investigate the impact of dropping specific CLIP layers on detection accuracy. Our study uses the Diffusion Set for training and evaluates models on 25 unseen test sets, covering images synthesized by GAN-based models, diffusion-based models, and commercially available tools. Beyond large-scale training, we assess model performance in few-shot settings, where models are trained with only a small fraction of the dataset (e.g., 320 real and 320 fake images), providing insights into their adaptability under data constraints. Additionally, we extend our analysis beyond classification by exploring image attribution, training models in a few-shot setting to attribute images to specific generators such as BigGAN, StarGAN, and Stable Diffusion. Our findings showcase the robustness of CLIP-based models in deepfake detection and their ability to generalize across unseen generative techniques. We also investigate the feasibility of using the same transfer learning strategy for attribution, with experimental results demonstrating its high effectiveness in closed-set attribution.
dc.identifier.uri: https://hdl.handle.net/10012/21687
dc.language.iso: en
dc.pending: false
dc.publisher: University of Waterloo
dc.subject: deepfake detection
dc.subject: transfer learning
dc.subject: vision-language model
dc.title: AdaptPrompt with Diffusion Set: A Unified Framework for Generalizable Deepfake Detection
dc.type: Master Thesis
uws-etd.degree: Master of Applied Science
uws-etd.degree.department: Electrical and Computer Engineering
uws-etd.degree.discipline: Electrical and Computer Engineering
uws-etd.degree.grantor: University of Waterloo
uws-etd.embargo.terms: 4 months
uws.contributor.advisor: Karray, Fakhri
uws.contributor.affiliation1: Faculty of Engineering
uws.peerReviewStatus: Unreviewed
uws.published.city: Waterloo
uws.published.country: Canada
uws.published.province: Ontario
uws.scholarLevel: Graduate
uws.typeOfResource: Text
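
As a concrete illustration of the transfer learning strategy described in the abstract (learnable prompts plus a lightweight adapter over a frozen CLIP backbone), here is a minimal sketch assuming PyTorch and HuggingFace transformers. The class and variable names (DeepfakeAdapter, prompts) are illustrative, and the class-level prompt vectors placed directly in CLIP's joint embedding space are a simplification of token-level prompt tuning; this is not the thesis's actual AdaptPrompt implementation.

import torch
import torch.nn as nn
from transformers import CLIPModel

clip = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
for p in clip.parameters():          # CLIP stays frozen throughout
    p.requires_grad_(False)

class DeepfakeAdapter(nn.Module):
    """Lightweight residual adapter over frozen CLIP image features."""
    def __init__(self, dim=512, hidden=128, n_classes=2):
        super().__init__()
        self.adapter = nn.Sequential(
            nn.Linear(dim, hidden), nn.ReLU(), nn.Linear(hidden, dim)
        )
        # Randomly initialized "prompt" embeddings, one per class
        # (real / fake), living directly in CLIP's embedding space --
        # a simplification of token-level prompt learning.
        self.prompts = nn.Parameter(torch.randn(n_classes, dim) * 0.02)

    def forward(self, image_features):
        x = image_features + self.adapter(image_features)   # residual
        x = x / x.norm(dim=-1, keepdim=True)
        t = self.prompts / self.prompts.norm(dim=-1, keepdim=True)
        return 100.0 * x @ t.t()     # cosine-similarity logits

head = DeepfakeAdapter(dim=clip.config.projection_dim)

# One training step (sketch): only the adapter and the prompt
# embeddings receive gradients; the CLIP encoder is never updated.
optimizer = torch.optim.AdamW(head.parameters(), lr=1e-3)
pixel_values = torch.randn(4, 3, 224, 224)     # placeholder batch
labels = torch.tensor([0, 1, 0, 1])            # 0 = real, 1 = fake
with torch.no_grad():
    feats = clip.get_image_features(pixel_values=pixel_values)
loss = nn.functional.cross_entropy(head(feats), labels)
optimizer.zero_grad()
loss.backward()
optimizer.step()

Because only the adapter and prompt parameters are trainable, this kind of setup keeps the trainable parameter count small, which is consistent with the abstract's few-shot experiments (e.g., 320 real and 320 fake training images).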

Files

Original bundle

Name: Jiang_Yichen.pdf
Size: 1.46 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 6.4 KB
Description: Item-specific license agreed upon to submission