All Things ViTs (CVPR'23 Tutorial)

By: Hila Chefer (Tel-Aviv University and Google) and Sayak Paul (Hugging Face) (with Ron Mokady as a guest speaker)

Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision. We leverage 🤗 transformers, 🧨 diffusers, timm, and PyTorch for the code samples.

Using the code samples

We provide all the code samples as Colab Notebooks, so they can be run without any local setup.

We divide our tutorial into the following logical sections:

  • explainability: has the notebooks that show how to generate explanations for the predictions of attention-based models (such as Vision Transformers). A hedged attention-rollout sketch follows this list.
    • CLIP_explainability.ipynb
    • Comparative_Transformer_explainability.ipynb
    • Transformer_explainability.ipynb
  • probing: has notebooks that probe the representations learned by attention-based models (such as Vision Transformers). A sketch of mean attention distance also follows this list.
    • dino_attention_maps.ipynb
    • mean_attention_distance.ipynb
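
To give a flavor of what the explainability notebooks cover, the sketch below computes attention rollout (Abnar & Zuidema, 2020), a simple baseline for attention-based explanations; the notebooks themselves implement more sophisticated relevance-propagation methods. This is a minimal illustration, not the tutorial's exact code: the checkpoint name, image path, and 14x14 grid size are assumptions matching a standard ViT-Base/16 model at 224x224 resolution.

```python
import torch
from PIL import Image
from transformers import ViTForImageClassification, ViTImageProcessor

# Load a ViT classifier and its preprocessor (checkpoint is an assumption).
processor = ViTImageProcessor.from_pretrained("google/vit-base-patch16-224")
model = ViTForImageClassification.from_pretrained("google/vit-base-patch16-224")
model.eval()

image = Image.open("cat.png")  # hypothetical input image
inputs = processor(images=image, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs, output_attentions=True)

# Attention rollout: fuse heads, add the residual (identity) connection,
# row-normalize, then multiply the per-layer attention matrices together.
rollout = None
for attn in outputs.attentions:  # each layer: (1, heads, tokens, tokens)
    attn = attn.mean(dim=1)                       # fuse heads: (1, tokens, tokens)
    attn = attn + torch.eye(attn.size(-1))        # residual connection
    attn = attn / attn.sum(dim=-1, keepdim=True)  # rows sum to 1 again
    rollout = attn if rollout is None else attn @ rollout

# Row 0 is the [CLS] query; its weights over the 196 patch tokens form a
# coarse 14x14 explanation map for the model's prediction.
cls_attention = rollout[0, 0, 1:].reshape(14, 14)
```

Upsampling cls_attention to the input resolution and overlaying it on the image gives the familiar heatmap-style visualization.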
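
Similarly, for the probing section, here is a minimal sketch of mean attention distance, the quantity studied in mean_attention_distance.ipynb: for each head, the average image-space distance between a query patch and the patches it attends to, weighted by the attention weights. It reuses outputs.attentions from the previous snippet; the 14x14 grid and 16-pixel patches again assume the ViT-Base/16 checkpoint above.

```python
import torch

def mean_attention_distance(attn, grid_size=14, patch_size=16):
    """attn: (num_heads, tokens, tokens) attention for one layer, [CLS] included."""
    attn = attn[:, 1:, 1:]                    # keep patch-token rows/columns only
    attn = attn / attn.sum(-1, keepdim=True)  # re-normalize after dropping [CLS]
    # Pixel coordinates of each patch on the grid (distances are
    # translation-invariant, so corners work as well as centers).
    ys, xs = torch.meshgrid(
        torch.arange(grid_size), torch.arange(grid_size), indexing="ij")
    coords = torch.stack([ys, xs], dim=-1).reshape(-1, 2).float() * patch_size
    distances = torch.cdist(coords, coords)   # (tokens, tokens) pairwise distances
    # Attention-weighted distance per query, averaged over queries: one value per head.
    return (attn * distances).sum(-1).mean(-1)

# One (num_heads,) tensor per layer, reusing `outputs` from the rollout snippet.
per_layer_distances = [mean_attention_distance(a[0]) for a in outputs.attentions]
```

Small values indicate heads that attend locally, akin to small convolutional receptive fields; values that grow with depth indicate increasingly global attention.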

Below we provide links to all the Colab Notebooks:

| Section        | Notebook Name                                | Colab Notebook |
|----------------|----------------------------------------------|----------------|
| explainability | CLIP_explainability.ipynb                    | Open In Colab  |
| explainability | Comparative_Transformer_explainability.ipynb | Open In Colab  |
| explainability | Transformer_explainability.ipynb             | Open In Colab  |
| probing        | dino_attention_maps.ipynb                    | Open In Colab  |
| probing        | mean_attention_distance.ipynb                | Open In Colab  |

Disclaimer

The following notebooks were taken from their original repositories, with the knowledge of their authors:

  • CLIP_explainability.ipynb (original)
  • Transformer_explainability.ipynb (original)