🪩 Create Disco Diffusion artworks in one line
Represent, send, store and search multimodal data
A collection of research on knowledge graphs
Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA...
PyTorch source code for "Stacked Cross Attention for Image-Text Matching...
Analyze unstructured data with Towhee, such as reverse image search,...
[CVPR 2023] Referring Image Matting
[NAACL 2022] Mobile Text-to-Image search powered by multimodal semantic r...
Weakly Supervised 3D Object Detection from Point Clouds (VS3D), ACM MM 2020
Unofficial Implementation of Google DeepMind's paper `Objects that Sound`
DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal K...
Code for journal paper "Learning Dual Semantic Relations with Graph Atte...
The official implementation of Achieving Cross Modal Generalization with...
This repository provides a comprehensive collection of research papers f...