CA-Stream: Attention-based pooling for interpretable image recognition - Laboratoire d'Informatique et Systèmes Accéder directement au contenu
Communication Dans Un Congrès Année : 2024

CA-Stream: Attention-based pooling for interpretable image recognition

Felipe Torres
  • Fonction : Auteur
  • PersonId : 1376426
Hanwei Zhang
  • Fonction : Auteur
  • PersonId : 1376427
Ronan Sicre
  • Fonction : Auteur
  • PersonId : 1067988
Stéphane Ayache

Résumé

Explanations obtained from transformer-based architectures in the form of raw attention, can be seen as a class-agnostic saliency map. Additionally, attention-based pooling serves as a form of masking the in feature space. Motivated by this observation, we design an attention-based pooling mechanism intended to replace Global Average Pooling (GAP) at inference. This mechanism, called Cross-Attention Stream (CA-Stream), comprises a stream of cross attention blocks interacting with features at different network depths. CA-Stream enhances interpretability in models, while preserving recognition performance.
Fichier principal
Vignette du fichier
WACV_ICCVround2-4.pdf (2.29 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-04551613 , version 1 (18-04-2024)

Identifiants

  • HAL Id : hal-04551613 , version 1

Citer

Felipe Torres, Hanwei Zhang, Ronan Sicre, Stéphane Ayache, Yannis Avrithis. CA-Stream: Attention-based pooling for interpretable image recognition. XAI4CV workshop (CVPR), Jun 2024, Seatle, WA, United States. ⟨hal-04551613⟩
0 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More