Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural...
FreeInit: Bridging Initialization Gap in Video Diffusion Models
ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with ...
Codes for ID-Specific Video Customized Diffusion
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
Papers and resources on Controllable Generation using Diffusion Models, ...
The official implementation for "Gen-L-Video: Multi-Text to Long Video G...
Papers and Book to look at when starting AGI 📚
Official PyTorch implementation of TATS: A Long Video Generation Framewo...
Official implement code of LAMP: Learn a Motion Pattern by Few-Shot Tuni...
Paddle Multimodal Integration and eXploration, supporting mainstream mul...
Implementation of Lumiere, SOTA text-to-video generation from Google Dee...
[CVPR 2022] Show Me What and Tell Me How: Video Synthesis via Multimodal...
The most powerful and modular Sora WebUI, api and backend with OpenAI's ...