A PyTorch implementation of Speech Transformer, an End-to-End ASR with T...
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Tr...
Universal Graph Transformer Self-Attention Networks (TheWebConf WWW 2022...
A Pytorch Implementation of "Attention is All You Need" and "Weighted Tr...
A Structured Self-attentive Sentence Embedding
Official PyTorch implementation of Fully Attentional Networks
Implementing Stand-Alone Self-Attention in Vision Models using Pytorch
Keras, PyTorch, and NumPy Implementations of Deep Learning Architectures...
[ICML 2023] Official PyTorch implementation of Global Context Vision Tra...
DSMIL: Dual-stream multiple instance learning networks for tumor detecti...
[NeurIPS'22] Tokenized Graph Transformer (TokenGT), in PyTorch
[NeurIPS 2021 Spotlight] & [IJCV 2024] SOFT: Softmax-free Transformer wi...
Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Tex...
Important paper implementations for Question Answering using PyTorch
Representation learning on dynamic graphs using self-attention networks