Los puntos clave no están disponibles para este artículo en este momento.
The goal of a decision-based adversarial attack on a trained model is to generate adversarial examples based solely on observing output labels returned by the targeted model. We develop HopSkipJumpAttack, a family of algorithms based on a novel estimate of the gradient direction using binary information at the decision boundary. The proposed family includes both untargeted and targeted attacks optimized for ℓ and ℓ ∞ similarity metrics respectively. Theoretical analysis is provided for the proposed algorithms and the gradient direction estimate. Experiments show HopSkipJumpAttack requires significantly fewer model queries than several state-of-the-art decision-based adversarial attacks. It also achieves competitive performance in attacking several widely-used defense mechanisms.
Building similarity graph...
Analyzing shared references across papers
Loading...
Jianbo Chen
Michael I. Jordan
Martin J. Wainwright
University of California, Berkeley
Building similarity graph...
Analyzing shared references across papers
Loading...
Chen et al. (Fri,) studied this question.
www.synapsesocial.com/papers/69dbcd6b498b35d3e6a3d307 — DOI: https://doi.org/10.1109/sp40000.2020.00045