Project 18 · Advanced Deep Learning

Saliency-based Analysis of
Shortcut Learning in CNNs

We train a CNN on the deliberately biased Waterbirds dataset and use Grad-CAM, a foreground/background attention-bias score, and four inference-time interventions to test whether the model is actually looking at the bird — or just at the background.

See the methodology →Jump to results Try the live demo

The shortcut, in one screen

The model looks excellent on average — but the gap to worst-group accuracy and the dramatic effect of intervening on the image make the shortcut undeniable.

Overall test accuracy

83.9%

ResNet18, balanced test split

Worst-group accuracy

59.5%

Waterbird on land · the conflict case

Accuracy after foreground mask

53.4%

31.8% of predictions flip

Accuracy after background mask

86.0%

Removing the background helps — bias confirmed

The pipeline

Following the project brief: train, evaluate by subgroup, run Grad-CAM, score foreground vs. background attention, intervene at inference time, then compare classification and saliency-based metrics.

Load Waterbirds

HF grodino/waterbirds · 4 subgroups

→

Train ResNet18

Save best by worst-group acc

→

Subgroup eval

Acc · P · R · F1 · CM · WG

→

Grad-CAM

Class-conditional saliency on layer4[-1]

→