1D Speech Emotion Recognition Save

Speech Emotion Recognition from raw speech signals using 1D CNN-LSTM

Project README

#1D Speech Emotion Recognition

Speech Emotion Recognition using raw speech signals from the EmoDB database using 1D CNN-LSTM architecture as given in the following paper.

Zhao, Jianfeng, Xia Mao, and Lijiang Chen. "Speech emotion recognition using deep 1D & 2D CNN LSTM networks." Biomedical Signal Processing and Control 47 (2019): 312-323.

EmoDB database can be downloaded from the following website

http://www.emodb.bilderbar.info/download/

There are 7 emotional classes and the validation accuracy obtained is ~61% as mentioned in the paper.

Developed and tested on the following:

Python 2.7 keras 2.2.4 Librosa 0.6.2

Update: 07-05-2021

Modifed the cnn1d.py architecture with attention mechanism (cnn1d_attn.py), shows better performance in terms of accuracy (66%-70% with lr 0.01).

##################################

#For another computational paralinguistic task, verbal conflict intensity estimation, see the repo https://github.com/smartcameras/ConflictNET

Open Source Agenda is not affiliated with "1D Speech Emotion Recognition" Project. README Source: vandana-rajan/1D-Speech-Emotion-Recognition

Open Source Agenda Badge

Open Source Agenda Rating