Best 17 Cross Modal Open Source Projects

🪩 Create Disco Diffusion artworks in one line

Represent, send, store and search multimodal data

A collection of research on knowledge graphs

Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA...

Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA...

PyTorch source code for "Stacked Cross Attention for Image-Text Matching...

Analyze the unstructured data with Towhee, such as reverse image search,...

[CVPR 2023] Referring Image Matting

[NAACL 2022]Mobile Text-to-Image search powered by multimodal semantic r...

Weakly Supervised 3D Object Detection from Point Clouds (VS3D), ACM MM 2020

Unofficial Implementation of Google Deepmind's paper `Objects that Sound`

DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal K...

Code for journal paper "Learning Dual Semantic Relations with Graph Atte...

The official implementation of Achieving Cross Modal Generalization with...

This repository provides a comprehensive collection of research papers f...