InternGPT (iGPT) is an open source demo platform where you can easily sh...
[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundati...
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and...
Official implementation of SEED-LLaMA (ICLR 2024).
Visual Med-Alpaca is an open-source, multi-modal foundation model design...
Code for the paper "LLark: A Multimodal Foundation Model for Music" by J...
3D Occupancy Prediction Benchmark in Autonomous Driving
一款文心一言&文心千帆大模型的高性能springboot-starter,支持连续对话(流...
[MICCAI'23] Foundation Model for Endoscopy Video Analysis via Large-scal...
RETFound - A foundation model for retinal image
Pre-training and Lifelong learning for User Embedding and Recommender S...