LUMIERE Save

Implementation of the text to video model LUMIERE from the paper: "A Space-Time Diffusion Model for Video Generation" by Google Research

Project README

Multi-Modality

Lumiere

Implementation of the text to video model LUMIERE from the paper: "A Space-Time Diffusion Model for Video Generation" by Google Research. I will mostly be implementing the modules from the diagram a and b in figure 4

Install

pip install lumiere

Usage

import torch
from lumiere.model import AttentionBasedInflationBlock

# B, T, H, W, D
x = torch.randn(1, 4, 224, 224, 512)

# Model
model = AttentionBasedInflationBlock(dim=512, heads=4, dropout=0.1)

# Forward pass
out = model(x)

# print
print(out.shape)  # Expected shape: [1, 4, 224, 224, 512]

Test

poetry run pytest

# or
pytest tests/

License

MIT

Open Source Agenda is not affiliated with "LUMIERE" Project. README Source: kyegomez/LUMIERE
Stars
44
Open Issues
1
Last Commit
1 month ago
Repository
License
MIT

Open Source Agenda Badge

Open Source Agenda Rating