
MAC

Masked Contrastive Pre-Training for Efficient Video-Text Retrieval, arXiv 2022.

We present a simple yet effective Masked Contrastive Video-and-Language Pre-training framework (MAC) for efficient video-text retrieval. Instead of blindly applying the mask-then-prediction paradigm from MAE, we propose a mask-then-alignment paradigm: random masking is applied to both video and text, and the remaining visible tokens are aligned contrastively. MAC enables efficient end-to-end pre-training: it reduces FLOPs by 60%, accelerates pre-training by 3x, and improves performance.

[Figure: overview of the MAC framework]
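
To make the mask-then-alignment idea concrete, below is a minimal PyTorch sketch, not the official implementation: random subsets of video patch tokens and text tokens are dropped before encoding, and the pooled embeddings of the visible tokens are aligned with a symmetric InfoNCE loss. The stand-in encoders, masking ratios, embedding dimension, and the random_keep helper are all illustrative assumptions.

# Minimal sketch of mask-then-alignment; NOT the official MAC code.
import torch
import torch.nn as nn
import torch.nn.functional as F


def random_keep(tokens: torch.Tensor, keep_ratio: float) -> torch.Tensor:
    """Randomly keep a subset of tokens along the sequence dimension.

    tokens: (batch, seq_len, dim). Dropping tokens before the encoder is
    what saves FLOPs: the transformer only processes the kept tokens.
    """
    b, n, _ = tokens.shape
    n_keep = max(1, int(n * keep_ratio))
    # Independent random permutation per sample; keep the first n_keep indices.
    idx = torch.rand(b, n, device=tokens.device).argsort(dim=1)[:, :n_keep]
    return tokens.gather(1, idx.unsqueeze(-1).expand(-1, -1, tokens.size(-1)))


class MaskedVideoTextContrastive(nn.Module):
    # keep ratios and dim are illustrative assumptions, not the paper's values
    def __init__(self, dim=256, video_keep=0.3, text_keep=0.85, temperature=0.05):
        super().__init__()
        # Stand-in encoders; the real model uses full video/text transformer towers.
        self.video_encoder = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(dim, nhead=4, batch_first=True), num_layers=2)
        self.text_encoder = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(dim, nhead=4, batch_first=True), num_layers=2)
        self.video_keep, self.text_keep = video_keep, text_keep
        self.temperature = temperature

    def forward(self, video_tokens, text_tokens):
        # 1) Mask: drop a random subset of video patches and text tokens.
        v = random_keep(video_tokens, self.video_keep)
        t = random_keep(text_tokens, self.text_keep)
        # 2) Encode only the visible tokens; mean-pool to a global embedding.
        v = F.normalize(self.video_encoder(v).mean(dim=1), dim=-1)
        t = F.normalize(self.text_encoder(t).mean(dim=1), dim=-1)
        # 3) Align: symmetric InfoNCE over in-batch video-text pairs.
        logits = v @ t.t() / self.temperature
        labels = torch.arange(v.size(0), device=v.device)
        return (F.cross_entropy(logits, labels) +
                F.cross_entropy(logits.t(), labels)) / 2


if __name__ == "__main__":
    model = MaskedVideoTextContrastive()
    video = torch.randn(4, 196, 256)  # e.g. 196 patch tokens per clip
    text = torch.randn(4, 32, 256)    # e.g. 32 word tokens per caption
    print(model(video, text).item())

Because the encoders only process the kept tokens, compute scales roughly with the keep ratios, which is where the FLOPs and wall-clock savings come from.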

Pre-Training

  1. Download WebVid2M (see https://github.com/m-bain/webvid)

  2. Download CC3M (see https://ai.google.com/research/ConceptualCaptions/download)

Finetune

  1. Download MSRVTT (see https://www.robots.ox.ac.uk/~maxbain/frozen-in-time/data/MSRVTT.zip)
  2. Download DiDeMo (see https://github.com/LisaAnne/TemporalLanguageRelease)
  3. Download ActivityNet (see https://github.com/activitynet/ActivityNet)

Results

We achieve state-of-the-art results on multiple video-text retrieval benchmarks, including MSR-VTT, DiDeMo, and ActivityNet. Results on MSR-VTT are shown below; more details can be found in our paper.

[Figure: retrieval results on MSR-VTT]
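
For reference, retrieval metrics such as Recall@K are computed from the query-item similarity matrix produced by the two encoders. The following is a minimal sketch of that computation, not the repository's evaluation code; the matrix layout (ground-truth pair on the diagonal) is an assumption for illustration.

# Minimal Recall@K sketch; NOT the repository's evaluation code.
import numpy as np

def recall_at_k(sim: np.ndarray, k: int) -> float:
    """Fraction of queries whose ground-truth item ranks in the top k.

    sim: (n_queries, n_items) similarity matrix where sim[i, i] is the
    score of query i against its ground-truth pair.
    """
    # Rank of the ground-truth item for each query (0 = best).
    order = np.argsort(-sim, axis=1)
    ranks = np.where(order == np.arange(len(sim))[:, None])[1]
    return float(np.mean(ranks < k))

sim = np.random.randn(100, 100)  # e.g. text-to-video similarity scores
for k in (1, 5, 10):
    print(f"R@{k}: {recall_at_k(sim, k):.3f}")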

Citation

If you find our paper helpful in your research, please cite:

@article{shu2022masked,
  title={Masked Contrastive Pre-Training for Efficient Video-Text Retrieval},
  author={Shu, Fangxun and Chen, Biaolong and Liao, Yue and Xiao, Shuwen and Sun, Wenyu and Li, Xiaobo and Zhu, Yousong and Wang, Jinqiao and Liu, Si},
  journal={arXiv preprint arXiv:2212.00986},
  year={2022}
}

License

This project is licensed under the MIT License. See the LICENSE file for more details.

Acknowledgements

This code is built on Frozen in Time and MAE. We thank the authors for their awesome projects.
