🏄 Scalable embedding, reasoning, ranking for images and sentences with ...
X-modaler is a versatile and high-performance codebase for cross-modal a...
TOMM2020 Dual-Path Convolutional Image-Text Embedding :feet: https://a...
[AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Te...
Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey
Code for "Learning the Best Pooling Strategy for Visual Semantic Embeddi...
Deep Supervised Cross-modal Retrieval (CVPR 2019, PyTorch Code)
Official Pytorch implementation of "Probabilistic Cross-Modal Embedding"...
PyTorch code for BagFormer: Better Cross-Modal Retrieval via bag-wise in...
Official implementation of "Contrastive Audio-Language Learning for Musi...
[CVPR 2020, Oral] "Sketch Less for More: On-the-Fly Fine-Grained Sketch...
Learning Cross-Modal Retrieval with Noisy Labels (CVPR 2021, PyTorch Code)
Source code for paper "Adversary Guided Asymmetric Hashing for Cross-Mod...
Unsupervised Contrastive Cross-modal Hashing (IEEE TPAMI 2023, PyTorch C...
Scalable deep multimodal learning for cross-modal retrieval (SIGIR 2019,...