ExplorerArtificial IntelligenceAI
Research PaperResearchia:202604.22019

Benign Overfitting in Adversarial Training for Vision Transformers

Jiaming Zhang

Abstract

Despite the remarkable success of Vision Transformers (ViTs) across a wide range of vision tasks, recent studies have revealed that they remain vulnerable to adversarial examples, much like Convolutional Neural Networks (CNNs). A common empirical defense strategy is adversarial training, yet the theoretical underpinnings of its robustness in ViTs remain largely unexplored. In this work, we present the first theoretical analysis of adversarial training under simplified ViT architectures. We show ...

Submitted: April 22, 2026Subjects: AI; Artificial Intelligence

Description / Details

Despite the remarkable success of Vision Transformers (ViTs) across a wide range of vision tasks, recent studies have revealed that they remain vulnerable to adversarial examples, much like Convolutional Neural Networks (CNNs). A common empirical defense strategy is adversarial training, yet the theoretical underpinnings of its robustness in ViTs remain largely unexplored. In this work, we present the first theoretical analysis of adversarial training under simplified ViT architectures. We show that, when trained under a signal-to-noise ratio that satisfies a certain condition and within a moderate perturbation budget, adversarial training enables ViTs to achieve nearly zero robust training loss and robust generalization error under certain regimes. Remarkably, this leads to strong generalization even in the presence of overfitting, a phenomenon known as \emph{benign overfitting}, previously only observed in CNNs (with adversarial training). Experiments on both synthetic and real-world datasets further validate our theoretical findings.


Source: arXiv:2604.19724v1 - http://arxiv.org/abs/2604.19724v1 PDF: https://arxiv.org/pdf/2604.19724v1 Original Link: http://arxiv.org/abs/2604.19724v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Apr 22, 2026
Topic:
Artificial Intelligence
Area:
AI
Comments:
0
Bookmark
Benign Overfitting in Adversarial Training for Vision Transformers | Researchia