![](images/LeGrad_main_figure.jpg) |
LeGrad: An Explainability Method for Vision Transformers via Feature Formation Sensitivity
Walid Bousselham,
Angie Boggust,
Sofian Chaybouti,
Hendrik Strobelt,
Hilde Kuehne
arXiv, 2024
Project Page / Code / arXiv / Demo
|
![](images/GEM_main_figure.jpg) |
Grounding Everything: Emerging Localization Properties in Vision-Language Transformers
Walid Bousselham,
Felix Petersen,
Vittorio Ferrari,
Hilde Kuehne
CVPR, 2024
Code / arXiv / Demo
|
![](images/HGQA_architecture.png) |
Learning Situation Hyper-Graphs for Video Question Answering
Aisha Urooj,
Hilde Kuehne,
Bo Wu,
Kim Chheu,
Walid Bousselham,
Chuang Gan,
Niels Lobo,
Mubarak Shah
CVPR, 2023
Code / arXiv
|
![](images/SenFormer_main_figure.png) |
Efficient Self-Ensemble for Semantic Segmentation
Walid Bousselham,
Guillaume Thibault,
Lucas Pagano,
Archana Machireddy,
Joe Gray,
Young Hwan Chang,
Xubo Song
BMVC, 2022
Code / arXiv / Video
|
Design and source code borrowed from Jon Barron's website.