Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
Collect some Transformer with Computer-Vision (CV) papers.
If you find some overlooked papers, please open issues or pull requests (recommended).
TPAMI
ECCV
CVPR
WACV
ICLR
[RelViT] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning [paper] [code]
[CrossFormer] CrossFormer: A Versatile Vision Transformer Based on Cross-scale Attention [paper] [code]
Uniformer: Unified Transformer for Efficient Spatiotemporal Representation Learning [paper] [code]
[DAB-DETR] DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR [paper] [code]
NeurIPS
ICCV
CVPR
ICML
ICRA
ICLR
ACM MM
MICCAI
BMVC
ISIE
CORL
IJCAI
IROS
WACV
ICDAR
Thanks the template from Awesome-Crowd-Counting