Neural building blocks for speaker diarization: speech activity detectio...
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACA...
CNN-based audio segmentation toolkit. Detects speech, music, no...
Repository for our Interspeech 2020 general-purpose voice activity detect...
The codebase for Data-driven general-purpose voice activity detection.
Speaker change detection using SincNet and an LSTM/Transformer