
Adversarial Examples Are Not Bugs, They Are Features
Andrew Ilyas*, Shibani Santurkar*, Dimitris Tsipras*, Logan Engstrom*, Brandon Tran and Aleksander Madry

Massachusetts Institute of Technology madry-lab.ml

Adversarial Examples: A Challenge for ML Systems

Why are ML models so sensitive to small perturbations?

Prevailing theme: They stem from bugs/aberrations

A Simple Experiment

1. Make an adversarial example towards the other class (e.g., perturb a "dog" image towards "cat")
2. Relabel the image as the target class ("cat")
3. Train with the new dataset, but test on the original test set (a code sketch of steps 1-3 appears below)

[Figure: a "dog" training image is adversarially perturbed towards "cat" and relabeled "cat" to form the new training set; the test set (dog, cat, car, ship, ...) is left unchanged.]
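A minimal sketch of the construction, assuming a standard pretrained PyTorch classifier `model` and image batches in [0, 1]; the helper names (`pgd_towards`, `build_mislabeled_dataset`), the L2 budget, and the deterministic choice of target class (y + 1) are illustrative, not the authors' exact implementation:

    import torch
    import torch.nn.functional as F

    def pgd_towards(model, x, target, eps=0.5, step=0.1, iters=100):
        """Targeted L2 PGD: perturb x within an L2 ball of radius eps so that
        `model` classifies the result as `target`."""
        delta = torch.zeros_like(x, requires_grad=True)
        for _ in range(iters):
            loss = F.cross_entropy(model(x + delta), target)
            grad, = torch.autograd.grad(loss, delta)
            # Descend on the targeted loss, with a per-example normalized step.
            g_norm = grad.flatten(1).norm(dim=1).clamp_min(1e-12).view(-1, 1, 1, 1)
            delta = delta - step * grad / g_norm
            # Project back onto the L2 ball of radius eps.
            d_norm = delta.flatten(1).norm(dim=1).clamp_min(1e-12).view(-1, 1, 1, 1)
            delta = (delta * (eps / d_norm).clamp(max=1.0)).detach().requires_grad_(True)
        return torch.clamp(x + delta, 0, 1).detach()

    def build_mislabeled_dataset(model, loader, num_classes=10):
        """Steps 1-3: perturb each image towards another class, then relabel it."""
        xs, ys = [], []
        for x, y in loader:
            t = (y + 1) % num_classes          # pick the "other" (target) class
            xs.append(pgd_towards(model, x, t))
            ys.append(t)                        # relabel as the target class
        return torch.cat(xs), torch.cat(ys)    # train on this, test on the original set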

So: we train on a totally "mislabeled" dataset but measure performance on the "correct" (original) test set

Test accuracy (%) on the original test set:

Training Dataset         CIFAR-10    Restricted ImageNet
Standard Dataset         95.3%       96.6%
"Mislabeled" Dataset     43.7%       64.4%

Result: nontrivial accuracy on the original task

The Robust Features Model

Robust features (RFs) stay predictive of the label under adversarial perturbation; non-robust features (NRFs) are predictive but brittle.

From the "maximize accuracy" view: all predictive features are good features.
If NRFs are (often) predictive: models want to use them.

Thus: Models use NRFs → adversarial examples
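The paper (arXiv:1905.02175) makes this precise; in slightly simplified LaTeX notation, for a feature f mapping inputs to the reals, labels y in {-1, +1}, data distribution D, and allowed perturbations Delta(x):

    % f is rho-useful: it is correlated with the true label.
    \mathbb{E}_{(x,y)\sim\mathcal{D}}\big[\, y \cdot f(x) \,\big] \;\ge\; \rho

    % f is gamma-robustly useful: it stays correlated under any allowed perturbation.
    \mathbb{E}_{(x,y)\sim\mathcal{D}}\Big[\, \inf_{\delta \in \Delta(x)} y \cdot f(x+\delta) \,\Big] \;\ge\; \gamma

    % A useful non-robust feature (NRF) is rho-useful for some rho > 0,
    % but not gamma-robustly useful for any gamma >= 0.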

The Simple Experiment: A Second Look

[Figure: the original "dog" training image has robust features: dog and non-robust features: dog. After the adversarial perturbation towards "cat" and relabeling, the new training image has robust features: dog but non-robust features: cat.]

In the new training set, RFs are misleading but NRFs suffice for generalization

Directly Manipulating Features

"Robust" Data: Standard training → robust models

Robust Optimization: Makes NRFs useless for learning
→ Need more data to learn from only RFs (cf. [Schmidt et al., 2018])
→ Trade-off between robustness and accuracy (cf. [Tsipras et al., 2019])
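The "robust" dataset referenced above is built by inverting a robustly trained model's learned representation. A minimal sketch, assuming `robust_feats` returns the robust model's penultimate-layer features and `x_seed` is a randomly chosen starting image; the optimizer, step count, and learning rate here are illustrative choices, not the authors' exact recipe:

    import torch

    def robustify(robust_feats, x_target, x_seed, steps=1000, lr=0.1):
        """Gradient descent in input space: find an image whose robust-model
        features match those of x_target, starting from x_seed."""
        with torch.no_grad():
            target_rep = robust_feats(x_target)
        x = x_seed.clone().requires_grad_(True)
        opt = torch.optim.SGD([x], lr=lr)
        for _ in range(steps):
            opt.zero_grad()
            loss = (robust_feats(x) - target_rep).pow(2).sum(dim=1).mean()
            loss.backward()
            opt.step()
            with torch.no_grad():
                x.clamp_(0, 1)    # keep a valid image
        return x.detach()

    # Each robustified image keeps the original label; standard training on the
    # resulting "robust" dataset already yields nontrivially robust models.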

Implications

ML models do not work the way we expect them to

Adversarial examples: A "human-based" phenomenon?

Transfer Attacks: Models rely on similar NRFs

[Figure: transfer success rate (%) vs. test accuracy (%) when trained on D_{y+1}, for VGG-16, Inception-v3, ResNet-18, DenseNet, and ResNet-50.]
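One way to read the plot: craft adversarial examples against a single source model and check how often the same inputs fool each target architecture. A minimal sketch of that measurement; `pgd_attack` is a hypothetical untargeted attack analogous to the targeted one sketched earlier, and `targets` is an assumed dict of pretrained models:

    import torch

    @torch.no_grad()
    def transfer_success_rate(adv_x, y_true, target_model):
        """Fraction of source-model adversarial examples that also flip
        the prediction of `target_model` (untargeted transfer)."""
        preds = target_model(adv_x).argmax(dim=1)
        return (preds != y_true).float().mean().item()

    # Usage sketch:
    # adv = pgd_attack(source_model, x, y)      # hypothetical attack helper
    # for name, m in targets.items():           # e.g. VGG-16, ResNet-50, ...
    #     print(name, transfer_success_rate(adv, y, m))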

Interpretability: May need to be enforced at training time

A Theoretical Framework
→ We consider (robust) MLE classification between Gaussians
→ Vulnerability is misalignment between the data geometry and the adversary's (ℓ2) geometry
→ Shows that robust optimization better aligns these geometries
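A sketch of the setup in LaTeX notation, following the description above: samples are drawn from a two-class Gaussian model, ℓ is the Gaussian negative log-likelihood, and ε is the adversary's ℓ2 budget (the exact constants and regularity conditions are in the paper):

    % Data model: y uniform on {-1, +1}, x ~ N(y * mu_star, Sigma_star).

    % Standard maximum likelihood estimate:
    \Theta \;=\; \arg\min_{\mu,\Sigma} \; \mathbb{E}_{x,y}\big[\, \ell(x;\, y\cdot\mu,\ \Sigma) \,\big]

    % Robust counterpart (adversary with an l2 budget epsilon):
    \Theta_r \;=\; \arg\min_{\mu,\Sigma} \; \mathbb{E}_{x,y}\Big[\, \max_{\|\delta\|_2 \le \varepsilon} \ell(x+\delta;\, y\cdot\mu,\ \Sigma) \,\Big]

    % Vulnerability: the Sigma^{-1}-induced (Mahalanobis) metric is misaligned
    % with the adversary's l2 metric; larger epsilon pushes the learned Sigma
    % toward alignment with the l2 ball (cf. the plots below).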

[Figure: four panels plotting Feature x2 vs. Feature x1. (a) Maximum likelihood estimate, showing the ℓ2 unit ball, the Σ^{-1}-induced metric unit ball, and samples from N(0, Σ). (b) True parameters (ε = 0), with samples from the two classes N(μ*, Σ*) and N(-μ*, Σ*). (c) Robust parameters, ε = 1.0. (d) Robust parameters, ε = 10.0.]

Moving Forward
→ Do we want our models to rely on NRFs?
→ How should we think of interpretability?

Robustness as a goal beyond security/reliability?

Paper: arXiv:1905.02175

Blog: gradsci.org/adv

Python Library: MadryLab/robustness