Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

Combating Adversaries with Anti-adversaries

  • Motasem Alfarra
  • , Juan C. Pérez
  • , Ali Thabet
  • , Adel Bibi
  • , Philip H.S. Torr
  • , Bernard Ghanem
  • King Abdullah University of Science and Technology
  • Meta

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

21 Citas (Scopus)

Resumen

Deep neural networks are vulnerable to small input perturbations known as adversarial attacks. Inspired by the fact that these adversaries are constructed by iteratively minimizing the confidence of a network for the true class label, we propose the anti-adversary layer, aimed at countering this effect. In particular, our layer generates an input perturbation in the opposite direction of the adversarial one and feeds the classifier a perturbed version of the input. Our approach is training-free and theoretically supported. We verify the effectiveness of our approach by combining our layer with both nominally and robustly trained models and conduct large-scale experiments from black-box to adaptive attacks on CIFAR10, CIFAR100, and ImageNet. Our layer significantly enhances model robustness while coming at no cost on clean accuracy.

Idioma originalInglés
Título de la publicación alojadaAAAI-22 Technical Tracks 6
EditorialAssociation for the Advancement of Artificial Intelligence
Páginas5992-6000
Número de páginas9
ISBN (versión digital)1577358767, 9781577358763
DOI
EstadoPublicada - 30 jun 2022
Publicado de forma externa
Evento36th AAAI Conference on Artificial Intelligence, AAAI 2022 - Virtual, Online
Duración: 22 feb 20221 mar 2022

Serie de la publicación

NombreProceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022
Volumen36

Conferencia

Conferencia36th AAAI Conference on Artificial Intelligence, AAAI 2022
CiudadVirtual, Online
Período22/02/221/03/22

Huella

Profundice en los temas de investigación de 'Combating Adversaries with Anti-adversaries'. En conjunto forman una huella única.

Citar esto