Document Type
Article
Publication Title
arXiv
Abstract
Transferable adversarial attacks optimize adversaries from a pretrained surrogate model and known label space to fool the unknown black-box models. Therefore, these attacks are restricted by the availability of an effective surrogate model. In this work, we relax this assumption and propose Adversarial Pixel Restoration as a self-supervised alternative to train an effective surrogate model from scratch under the condition of no labels and few data samples. Our training approach is based on a min-max objective which reduces overfitting via an adversarial objective and thus optimizes for a more generalizable surrogate model. Our proposed attack is complimentary to our adversarial pixel restoration and is independent of any task specific objective as it can be launched in a self-supervised manner. We successfully demonstrate the adversarial transferability of our approach to Vision Transformers as well as Convolutional Neural Networks for the tasks of classification, object detection, and video segmentation. Our codes & pre-trained surrogate models are available at: https://github.com/HashmatShadab/APR. © 2022, CC BY.
DOI
10.48550/arXiv.2207.08803
Publication Date
7-18-2022
Keywords
Convolutional neural networks, Image segmentation, Object detection, Pixels
Recommended Citation
H.S. Malik, S.K. Kunhimon, M. Naseer, S. Khan and F.S. Khan, "Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations", 2022, arXiv:2207.08803
Comments
Preprint: arXiv
Archived with thanks to arXiv
Preprint License: CC by 4.0
Uploaded 30 August 2022