Volkan Cevher, Grigorios Chrysos, Fanghui Liu, Elias Abad Rocamora
Catastrophic overfitting (CO) in single-step adversarial training (AT) results in abrupt drops in the adversarial test accuracy (even down to 0%). For models trained with multi-step AT, it has been observed that the loss function behaves locally linearly w ...
2024