AdvEx: Interactive Visual Explorations of Adversarial Evasion Attacks

You, Yuzhe

AdvEx: Interactive Visual Explorations of Adversarial Evasion Attacks

Files

You_Yuzhe.pdf (17.01 MB)

Date

2023-06-28

Authors

You, Yuzhe

Advisor

Zhao, Jian

Publisher

University of Waterloo

Abstract

Adversarial machine learning (AML) focuses on studying attacks that can fool machine learning algorithms into generating incorrect outcomes as well as the defenses against worst-case attacks to strengthen the adversarial robustness of machine learning models. Specifically for image classification tasks, it is difficult to comprehend the underlying logic behind adversarial attacks due to two key challenges: 1) the attacks exploiting “non-robust” features that are not human-interpretable and 2) the perturbations applied being almost imperceptible to human eyes. We propose an interactive visualization system, AdvEx, that presents the properties and consequences of evasion attacks as well as provides data and model performance analytics on both instance and population levels. We quantitatively and qualitatively assessed AdvEx in a two-part evaluation including user studies and expert interviews. Our results show that AdvEx is effective both as an educational tool for understanding AML mechanisms and a visual analytics tool for inspecting machine learning models, which can benefit both AML learners and experienced practitioners.