Paper published in a journal (Scientific congresses and symposiums)
Explaining through Transformer Input Sampling
Englebert, Alexandre; Stassin, Sédrick; Nanfack, Géraldin et al.
2023, in ICCV Workshop on New Ideas in Vision Transformers
Peer reviewed
 

Files


Full Text
_Englebert_Explaining_Through_Transformer_Input_Sampling_ICCVW_2023_paper.pdf
Publisher postprint (1.86 MB)
Annexes
TiS__Transformer_Input_Sampling (3).pdf
(18.87 MB)


Details



Keywords :
Vision Transformers; Explainable Artificial Intelligence; XAI; Deep Learning; Post-hoc methods
Abstract :
[en] Vision Transformers are becoming more and more the preferred solution to many computer vision problems, which has motivated the development of dedicated explainability methods. Among them, perturbation-based methods offer an elegant way to build saliency maps by analyzing how perturbations of the input image affect the network prediction. However, those methods suffer from the drawback of introducing outlier image features that might mislead the explainability process, e.g. by affecting the output classes independently of the initial image content. To overcome this issue, this paper introduces Transformer Input Sampling (TIS), a perturbation-based explainability method for Vision Transformers, which computes a saliency map based on perturbations induced by a sampling of the input tokens. TIS utilizes the natural property of Transformers which permits a variable input number of tokens, thereby preventing the use of replacement values to generate perturbations. Using standard models such as ViT and DeiT for benchmarking, TIS demonstrates superior performance on several metrics including Insertion, Deletion, and Pointing Game compared to state-of-the-art explainability methods for Transformers. The code for TIS is publicly available at https://github.com/aenglebert/Transformer_Input_Sampling.
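The core idea in the abstract, perturbing the input by dropping tokens instead of overwriting pixels, can be illustrated with a minimal sketch. This is not the authors' implementation (see the linked repository for that): it assumes purely random keep-masks and a simple score-weighted average, and `token_sampling_saliency`, `toy_score`, `n_masks`, and `keep_prob` are hypothetical names introduced here for illustration. The toy scorer stands in for a Vision Transformer class score and only needs to accept a variable number of tokens.

```python
import random


def token_sampling_saliency(tokens, score_fn, n_masks=500, keep_prob=0.5, seed=0):
    """Score-weighted average of random token-keep masks (illustrative only)."""
    rng = random.Random(seed)
    n = len(tokens)
    score_sum = [0.0] * n   # sum of scores over masks that kept token i
    kept_count = [0] * n    # number of masks that kept token i
    for _ in range(n_masks):
        keep = [rng.random() < keep_prob for _ in range(n)]
        if not any(keep):
            continue  # an empty token sequence cannot be scored
        # Perturb by dropping tokens: the model sees a shorter sequence,
        # so no replacement value (gray, blur, noise) is ever introduced.
        subset = [tok for tok, k in zip(tokens, keep) if k]
        score = score_fn(subset)
        for i, k in enumerate(keep):
            if k:
                score_sum[i] += score
                kept_count[i] += 1
    # Saliency of token i: mean score of the sampled subsets that contained it.
    return [s / c if c else 0.0 for s, c in zip(score_sum, kept_count)]


def toy_score(subset):
    """Hypothetical stand-in for a class score; accepts any number of tokens."""
    return sum(subset) / 2.0


# Eight toy "tokens"; only positions 2 and 5 carry evidence for the class.
tokens = [0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0]
saliency = token_sampling_saliency(tokens, toy_score)
print([round(s, 2) for s in saliency])
```

In this toy setting the two evidence-carrying positions should receive the highest saliency, since subsets that contain them score higher on average. With a real model, `score_fn` would run the Transformer on the retained patch tokens (with their original position embeddings) and return the target-class probability.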
Disciplines :
Computer science
Author, co-author :
Englebert, Alexandre; UCL - Université Catholique de Louvain [BE] > ELEN/ICTEAM
Stassin, Sédrick; Université de Mons - UMONS > Faculté Polytechnique > Service Informatique, Logiciel et Intelligence artificielle
Nanfack, Géraldin; Concordia - Concordia University [CA]
Mahmoudi, Sidi; Université de Mons - UMONS > Faculté Polytechnique > Service Informatique, Logiciel et Intelligence artificielle
Siebert, Xavier; Université de Mons - UMONS > Faculté Polytechnique > Service de Mathématique et Recherche opérationnelle
Cornu, Olivier; UCL - Université Catholique de Louvain [BE] > NMSK/IREC
De Vleeschouwer, Christophe; UCL - Université Catholique de Louvain [BE] > ELEN/ICTEAM
Language :
English
Title :
Explaining through Transformer Input Sampling
Publication date :
2023
Event name :
ICCV Workshop on New Ideas in Vision Transformers
Event place :
Paris, France
Event date :
2-6 October 2023
Audience :
International
Journal title :
ICCV Workshop on New Ideas in Vision Transformers
Peer review/Selection committee :
Peer reviewed
Research unit :
F114 - Informatique, Logiciel et Intelligence artificielle
Research institute :
Infortech
Funding text :
The Research Foundation for Industry and Agriculture, National Scientific Research Foundation (FRIA-FNRS) funded this research through grants awarded to Alexandre Englebert for Ph.D. financing. Sédrick Stassin acknowledges the support of the E-origin project, funded by the Walloon Region within the logistics pole in Wallonia. Christophe De Vleeschouwer is funded by the FNRS (National Scientific Research Foundation).
Available on ORBi UMONS :
since 13 January 2024

Statistics


Number of views
84 (8 by UMONS)
Number of downloads
133 (2 by UMONS)

Scopus citations®
19
Scopus citations® without self-citations
17
OpenCitations
4
OpenAlex citations
13
