A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA...
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Human Emotion Understanding using a multimodal dataset.