Resource Speech Save

语音算法相关资源汇总 Resource for Speech Processing || NEWS: official link of VoxCeleb fails recently and an external link is added for download

Project README

resource_speech

通用 General

教程 Tutorials

  • Deep Learning for Human Lagnguage Processsing (DLHLP) @ National Taiwan University
    • 李宏毅博士在国立台湾大学开的课程 Lecture by Dr. by Hung-yi Lee at National Taiwan University
    • website@NTU

语料库 Corpus

数据增强 Data Augmentation

  • RIR_NOISES
  • MUSAN
    • 包含音乐、语音、噪声三类录音, 可用于语音识别、说话人识别中的数据增强 MUsic, Speech And Noise recordings for data augmentation in ASR, Speaker Recognition...
    • OpenSLR (官方链接 official link)

活动音检测(VAD) Voice Activity Detection

多用途开发工具箱 General-purpose Development Toolkits

语音工具 Speech Tools

语音前端 Speech Front-end

语音前端 - 数据集 Dataset

语音前端 - 代码 Codebase

说话人识别(SR) Speaker Recognition

说话人识别 - SOTA

说话人识别 - 数据集

说话人识别 - 代码 Codebase

说话人分割(SD) Speaker Diarization

说话人分割 - Awesome

说话人分割 - SOTA

说话人分割 - 数据集 Dataset

说话人分割 - 代码 Codebase

说话人分割 - 评测 Evaluation

音频标注(AT) Audio Tagging

音频标注 - 数据集 Dataset

语音预训练(PTM) Pre-Training for Speech

语音预训练 - 代码 Codebase

  • S3PRL toolkits
    • 语音预训练和SUPERB基准工具包 Toolkits for Pre-Training in Speech and the SUPERB benchmark
    • GitHub (官方代码)

关键词识别(KWS) Keyword Spotting

语音合成(TTS) Text-To-Speech

语音合成 - 教程 Tutorials

  • cnlinxi / book-text-to-speech
    • 一个比较全面的TTS概览性教程 A comprehensive TTS tutorial
    • GitHub

语音合成 - 代码 Codebase

Open Source Agenda is not affiliated with "Resource Speech" Project. README Source: ZhaZhaFon/resource_speech
Stars
42
Open Issues
0
Last Commit
1 year ago
License

Open Source Agenda Badge

Open Source Agenda Rating