A family of lightweight multimodal models.
The Cradle framework is a first attempt at General Computer Control (GCC...
A reading list for large model safety, security, and privacy.
GeoChat, the first grounded Large Vision Language Model for Remote Sensi...
Custom ComfyUI nodes for Vision Language Models, Large Language Models, ...
Ptera Software is a fast, easy-to-use, and open-source software package ...
MATLAB implementation to simulate the non-linear dynamics of a fixed-win...
Famous Vision Language Models and Their Architectures
Towards the World's Most Comprehensive Curated List of LLM Related Papers & ...
Official code for the paper "Mantis: Multi-Image Instruction Tuning"
M3DBench introduces a comprehensive 3D instruction-following dataset wit...
[ICRA 2024] Dream2Real: Zero-Shot 3D Object Rearrangement with Vision-La...
PsyDI: An MBTI agent that helps you understand your personality type thro...