Zac2022 Lyric Alignment Save

Solution for Zalo AI Challenge 2022 - Lyrics Alignment

Project README

Solution for Zalo AI Challenge 2022 - Lyrics Alignment

Requirements

pip install -r requirements.txt

Overview

Using Demucs to extract the music and lyrics in the original audio.
Resampling original audio to 16K audio.
Creating new vocab dictionary for Wav2Vec2.
Selecting segments from labels randomly and merge them to create new pair of audio/lyric.
Fine-tuning Wav2Vec2 model with original CTC loss with all training data with the new vocab dictionary.
Using forced-alignment (dynamic programming) to find the best alignment path between audio and lyric.
Merging character durations to obtain words segment index from the audio.

Reproduce

Prepare Dataset

Download data here and prepare a dataset in the following format:

|- data/
|   |- public_test/
|       |- lyrics/
|       |- new_labels_json/
|       |- songs/
|   |- train/
|       |- labels/
|       |- songs/

Training

sh reproduce.sh

you can also download, extract our checkpoints here and will obtain the following format:

|- checkpoints/
|   |- dragonSwing/
|       |- wav2vec2-base-vietnamese/
|           |- checkpoint-5500/
|               |- pytorch_model.bin

Make A Submission

python submission.py submission --saved_path ./result
zip -r submit.zip result/*.json

Open Source Agenda is not affiliated with "Zac2022 Lyric Alignment" Project. README Source: Telegram-Zalo/zac2022-lyric-alignment

Stars

Open Issues

Last Commit

1 year ago

Repository

Telegram-Zalo/zac2022-lyric-alignment

License

Apache-2.0

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/zac2022-lyric-alignment"><img src="https://www.opensourceagenda.com/projects/zac2022-lyric-alignment/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog

From the blog

Dec 11, 2022

How to Choose Which Programming Language to Learn First?

From the blog

Dec 11, 2022