frame-predict

Predicting image frames using autoencoder and LSTM

The idea of this project is to try to predict the next n frames after seeing only the first few frames (3 in the example). I took a UNet and removed the skip connections; this architecture is used only to build the encoder and decoder models.
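A minimal sketch of what such a skip-less UNet-style encoder/decoder pair could look like in PyTorch. The channel counts, 64x64 input size, and z_vector dimension are illustrative assumptions, not the repo's exact values:

```python
import torch
import torch.nn as nn

class Encoder(nn.Module):
    # Downsampling half of a UNet-style network, without skip connections:
    # each block halves spatial resolution; a final linear layer maps to z.
    def __init__(self, z_size=16):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(1, 32, 3, stride=2, padding=1), nn.ReLU(),    # 64 -> 32
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),   # 32 -> 16
            nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.ReLU(),  # 16 -> 8
        )
        self.fc = nn.Linear(128 * 8 * 8, z_size)

    def forward(self, x):
        h = self.conv(x).flatten(1)
        return self.fc(h)

class Decoder(nn.Module):
    # Upsampling half: mirrors the encoder with transposed convolutions.
    def __init__(self, z_size=16):
        super().__init__()
        self.fc = nn.Linear(z_size, 128 * 8 * 8)
        self.deconv = nn.Sequential(
            nn.ConvTranspose2d(128, 64, 4, stride=2, padding=1), nn.ReLU(),   # 8 -> 16
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(),    # 16 -> 32
            nn.ConvTranspose2d(32, 1, 4, stride=2, padding=1), nn.Sigmoid(),  # 32 -> 64
        )

    def forward(self, z):
        h = self.fc(z).view(-1, 128, 8, 8)
        return self.deconv(h)
```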

Between the encoder and decoder I am using an LSTM, which acts as a time encoder. The time encoder's goal is to encode information about the frames it has seen (like acceleration and position) so it can predict the following ones later. The LSTMs here are used similarly to seq2seq models.
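A rough sketch of how such an LSTM time encoder could roll out future latent codes in a seq2seq fashion. The class name, hidden size, and rollout loop are assumptions for illustration, not the repo's exact code:

```python
import torch
import torch.nn as nn

class TimeEncoder(nn.Module):
    # seq2seq-style rollout: the LSTM consumes the latent codes of the
    # observed frames, then feeds its own predictions back in to produce
    # latent codes for the future frames (decoded to images elsewhere).
    def __init__(self, z_size=16, hidden=128):
        super().__init__()
        self.lstm = nn.LSTM(z_size, hidden, batch_first=True)
        self.to_z = nn.Linear(hidden, z_size)

    def forward(self, z_seen, n_future):
        # z_seen: (batch, n_seen, z_size) latent codes of the observed frames
        out, state = self.lstm(z_seen)
        z_next = self.to_z(out[:, -1:])            # first predicted latent
        preds = [z_next]
        for _ in range(n_future - 1):
            out, state = self.lstm(z_next, state)  # feed prediction back in
            z_next = self.to_z(out)
            preds.append(z_next)
        return torch.cat(preds, dim=1)             # (batch, n_future, z_size)
```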

Because sequence lengths are variable, all sequences are stacked into one batch; I'm using indices to later split them apart and pick only the frames I need, after running the packed LSTM sequences, to calculate the loss.
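A hedged sketch of how packed sequences and an index mask could be combined so the loss only covers real (non-padded) frames. Padded tensors, a `lengths` tensor, and an LSTM whose output feature size matches the target (e.g. `nn.LSTM(z_size, z_size, batch_first=True)`) are assumed here; the repo's actual indexing may differ:

```python
import torch
import torch.nn.functional as F
from torch.nn.utils.rnn import pack_padded_sequence, pad_packed_sequence

def lstm_forward_and_loss(lstm, z_padded, target_padded, lengths):
    # z_padded, target_padded: (batch, max_len, z_size) zero-padded tensors
    # lengths: (batch,) true length of each sequence
    packed = pack_padded_sequence(z_padded, lengths.cpu(),
                                  batch_first=True, enforce_sorted=False)
    out_packed, _ = lstm(packed)
    out_padded, _ = pad_packed_sequence(out_packed, batch_first=True,
                                        total_length=z_padded.size(1))

    # Boolean mask that is True only for real time steps, so padding
    # never contributes to the loss.
    max_len = out_padded.size(1)
    mask = (torch.arange(max_len, device=out_padded.device)[None, :]
            < lengths.to(out_padded.device)[:, None])
    return F.mse_loss(out_padded[mask], target_padded[mask])
```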

The dataset is quite simple, so I wouldn't be surprised if it overfits.

The dataset is a sequence of a falling dot.

(image: falling dot sequence)

Architecture design

(image: architecture diagram)

Results:

How to interpret the results: (image)

Vanilla autoencoder

  • Batch size 32
  • lr 0.001
  • z_vector size 16
  • beta 1

(image: results)

Vanilla autoencoder z_vector size 2

  • Batch size 32
  • lr 0.001
  • z_vector size 2
  • beta 1

(image: results)

VAE autoencoder

  • Batch size 32
  • lr 0.001
  • z_vector size 16
  • beta 1

(image: results)

Beta VAE autoencoder

  • Batch size 32
  • lr 0.001
  • z_vector size 3
  • beta 150

(image: results)
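For reference, the only difference between the VAE and beta-VAE runs above is the weight on the KL term. A minimal sketch of a beta-weighted VAE loss (the function name and reduction choices are illustrative, not taken from the repo):

```python
import torch
import torch.nn.functional as F

def vae_loss(recon, target, mu, logvar, beta=1.0):
    # Reconstruction term plus beta-weighted KL divergence to N(0, I);
    # beta=1 is a standard VAE, beta=150 matches the last run above.
    recon_loss = F.mse_loss(recon, target, reduction="sum") / target.size(0)
    kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp()) / target.size(0)
    return recon_loss + beta * kl
```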