An ultimately comprehensive paper list of Vision Transformer/Attention, ...
Towhee is a framework that is dedicated to making neural data processing...
[CVPR 2021] Official PyTorch implementation for Transformer Interpretabi...
solo-learn: a library of self-supervised methods for visual representati...
:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models fo...
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch...
A fast, easy-to-use, production-ready inference server for computer visi...
A paper list of some recent Transformer-based CV works.
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Mo...
A PyTorch implementation of "MobileViT: Light-weight, General-purpose, a...
Extract video features from raw videos using multiple GPUs. We support R...
Open-source evaluation toolkit of large vision-language models (LVLMs), ...
FFCS course registration made hassle free for VITians. Search courses an...
PASSL包含 SimCLR,MoCo v1/v2,BYOL,CLIP,PixPro,simsiam, SwAV, BEiT,M...
Official Code of Paper "Reversible Column Networks" "RevColv2"