RefWhisper

This is a repository for our paper on DLfM 2023, "Aligning Incomplete Lyrics of Korean Folk Song Dataset using Whisper".

The original dataset is available at here

Requirements

We recommend pipenv for installing the requirements. The requirements are given in Pipfile.

Install pipenv and requirements

pip install pipenv
pipenv install
pipenv shell

Model

The pre-trained model is provided in OneDrive

Anthology of Korean Traditional Folksongs

Official website

The metadata we collected from the website is available in Finding Tori metadata.csv, including the url for each song. You can download the audio files from the website.

Lyric Transcribed Result

The transcribed result is provided in transcription_result_csvs.tar.gz. You can untar the file by running tar -xvzf transcription_result_csvs.tar.gz.

The transcription is given in three files:

{song_id}_transcribed.txt
- The transcribed lyric of the song
{song_id}_word_align.csv
- Word-level alignment between the transcribed lyric and the reference lyric
{song_id}_ref_align.csv
- Line-level alignment between the reference lyric and the transcribed lyric
- In this file, there are two alignment.
  - 1. Serial alignment between the transcribed and the reference. In this alignment, the two lyrics are aligned in serial order from the beginning to the end.
  - 1. Open-beginning/open-end alignment between each line of the reference lyric and the entire corresponding transcription. This trys to find the optimal alignment for that specific lyric line, not considering the alignment for entire lyrics.

Citation

If you use this code or the dataset, please cite our paper.

Name		Name	Last commit message	Last commit date
Latest commit History 131 Commits
ref_whisper		ref_whisper
.gitignore		.gitignore
LICENSE		LICENSE
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
transcription_result_csvs.tar.gz		transcription_result_csvs.tar.gz

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RefWhisper

Requirements

Install pipenv and requirements

Model

Anthology of Korean Traditional Folksongs

Lyric Transcribed Result

Citation

About

Releases

Packages

Languages

License

daewoung/RefWhisper

Folders and files

Latest commit

History

Repository files navigation

RefWhisper

Requirements

Install pipenv and requirements

Model

Anthology of Korean Traditional Folksongs

Lyric Transcribed Result

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages