Soheil Mpg Speech Recognition

End-to-End Speech Recognition using Neural Networks.

Automatic Speech Recognition (ASR)

Project Overview

In this project we build a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline. The completed pipeline accepts raw audio as input and returns a predicted transcription of the spoken language. The full pipeline is summarized in the figure below and consists of three steps.

STEP 1 is a pre-processing step that converts raw audio to one of two feature representations that are commonly used for ASR.
STEP 2 is an acoustic model that accepts audio features as input and returns a probability distribution over all potential transcriptions. After reviewing the basic types of neural networks that are often used for acoustic modeling, we carry out our own investigations and design our own acoustic model.
STEP 3 takes the output of the acoustic model and returns a predicted transcription.
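
To make STEP 3 concrete, here is a minimal sketch of one common way to turn per-frame acoustic-model outputs into text: greedy (best-path) decoding with a blank symbol, as used with CTC-trained models. The README does not say which decoding method the project uses, so the CTC-style blank, the toy `ALPHABET`, and the function name `greedy_decode` are all assumptions made for illustration.

```python
# Hypothetical alphabet for illustration; index 0 is assumed to be the
# CTC-style "blank" symbol that separates repeated characters.
ALPHABET = ["_", " ", "a", "c", "t"]
BLANK = 0


def greedy_decode(frame_probs):
    """Pick the most likely symbol per frame, merge repeats, drop blanks."""
    # Index of the highest-probability symbol in each frame.
    best = [max(range(len(p)), key=p.__getitem__) for p in frame_probs]

    chars = []
    prev = None
    for idx in best:
        # Emit a character only when it differs from the previous frame
        # and is not the blank symbol.
        if idx != prev and idx != BLANK:
            chars.append(ALPHABET[idx])
        prev = idx
    return "".join(chars)


# Toy stand-in for acoustic-model output: one distribution per frame.
frame_probs = [
    [0.1, 0.0, 0.1, 0.7, 0.1],  # "c"
    [0.1, 0.0, 0.1, 0.7, 0.1],  # "c" again (repeat, merged)
    [0.8, 0.0, 0.1, 0.1, 0.0],  # blank
    [0.1, 0.0, 0.8, 0.1, 0.0],  # "a"
    [0.1, 0.0, 0.1, 0.1, 0.7],  # "t"
]
print(greedy_decode(frame_probs))  # prints "cat"
```

Greedy decoding is the simplest option; a beam search, possibly combined with a language model, usually gives better transcriptions at higher cost.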

Dataset

We begin by investigating the LibriSpeech dataset, which is used to train and evaluate our models. The pipeline first converts raw audio to feature representations that are commonly used for ASR. We then move on to building neural networks that map these audio features to transcribed text.
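
As an illustration of the feature-conversion step, the sketch below computes a log power spectrogram with NumPy. The README does not name the two feature representations it uses, so the choice of a spectrogram, the function name `log_spectrogram`, and the 20 ms window and 10 ms step are assumptions. The 16 kHz default matches the sampling rate of LibriSpeech audio.

```python
import numpy as np


def log_spectrogram(signal, sample_rate=16000, window_ms=20, step_ms=10, eps=1e-10):
    """Return a (frames, frequency_bins) log power spectrogram.

    Window and step sizes are illustrative defaults, not project settings.
    """
    window = int(sample_rate * window_ms / 1000)  # samples per frame
    step = int(sample_rate * step_ms / 1000)      # hop between frames
    if len(signal) < window:
        raise ValueError("signal is shorter than one analysis window")

    # Slice the signal into overlapping frames.
    n_frames = 1 + (len(signal) - window) // step
    frames = np.stack([signal[i * step : i * step + window] for i in range(n_frames)])

    # Taper each frame with a Hann window to reduce spectral leakage.
    frames = frames * np.hanning(window)

    # Power spectrum of each frame; rfft keeps the non-negative frequencies.
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2

    # Log compression; eps avoids log(0) on silent frames.
    return np.log(power + eps)


# One second of a synthetic 440 Hz tone stands in for a real utterance.
t = np.arange(16000) / 16000
tone = np.sin(2 * np.pi * 440 * t)
feats = log_spectrogram(tone)
print(feats.shape)  # (99, 161): 99 frames, 161 frequency bins
```

Each row of `feats` is the kind of per-frame feature vector that an acoustic model could take as input.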

Open Source Agenda is not affiliated with "Soheil Mpg Speech Recognition" Project. README Source: soheil-mp/Speech-Recognition