SpeakerIdentificationNeuralNetworks Save

⇨ The Speaker Recognition System consists of two phases, Feature Extraction and Recognition. ⇨ In the Extraction phase, the Speaker's voice is recorded and typical number of features are extracted to form a model. ⇨ During the Recognition phase, a speech sample is compared against a previously created voice print stored in the database. ⇨ The highlight of the system is that it can identify the Speaker's voice in a Multi-Speaker Environment too. Multi-layer Perceptron (MLP) Neural Network based on error back propagation training algorithm was used to train and test the system. ⇨ The system response time was 74 µs with an average efficiency of 95%.

Project README

Speaker Identification in Multispeaker Environment using Deep Neural Networks

Abstract

Human beings are capable of performing unfathomable tasks. A human being is able to focus on a single person’s voice in an environment of simultaneous conversations. We have tried to emulate this particular skill through an artificial intelligence system. Our system identifies an audio file as a single or multi-speaker file as the first step and then recognizes the speaker(s). Our approach towards the desired solution was to first conduct pre-processing of the audio (input) file where it is subjected to reduction and silence removal, framing, windowing and DCT calculation, all of which is used to extract its features. Mel Frequency Cepstral Coefficients (MFCC) technique was used for feature extraction. The extracted features are then used to train the system via neural networks using the Error Back Propagation Training Algorithm (EBPTA). One of the many applications of our model is in biometric systems such as telephone banking, authentication and surveillance.

Keywords: Speaker identification, neural network, Multi- Speaker, Mel Frequency Cepstral Coefficients (MFCC).

Research Paper published in Springer Journal.

For more details: download file ResearchPaper.pdf, projectreport

Open Source Agenda is not affiliated with "SpeakerIdentificationNeuralNetworks" Project. README Source: manthanthakker/speakerIdentificationNeuralNetworks

Stars

Open Issues

Last Commit

4 years ago

Repository

manthanthakker/speakerIdentificationNeuralNetworks

License

MIT

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/speakeridentificationneuralnetworks"><img src="https://www.opensourceagenda.com/projects/speakeridentificationneuralnetworks/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog

From the blog

Dec 11, 2022

How to Choose Which Programming Language to Learn First?

From the blog

Dec 11, 2022