An open API service indexing awesome lists of open source software.

https://github.com/eleutherai/aria-amt

Efficient and robust implementation of seq-to-seq automatic piano transcription.
https://github.com/eleutherai/aria-amt

Last synced: 8 months ago
JSON representation

Efficient and robust implementation of seq-to-seq automatic piano transcription.

Awesome Lists containing this project

README

          

# aria-amt

Efficient and robust implementation of seq-to-seq automatic piano transcription.

## Install

Requires Python 3.11

```
git clone https://github.com/EleutherAI/aria-amt.git
cd aria-amt
pip install -e .
```

Download the preliminary model weights:

Piano (v1)

```
wget https://storage.googleapis.com/aria-checkpoints/amt/piano-medium-double-1.0.safetensors
```

## Usage

You can download mp3s from youtube using [yt-dlp](https://github.com/yt-dlp/yt-dlp):

```
yt-dlp --audio-format mp3 --extract-audio --no-playlist --audio-quality 0 -o
```

You can then transcribe using the cli:

```
aria-amt transcribe \
medium-double \
\
-load_path \
-save_dir \
-bs 1 \
-compile
```

If you want to do batch transcription, use the `-load_dir` flag and adjust `-bs` accordingly. Compiling and may take some time, but provides a significant speedup. Quantizing (`-q8` flag) further speeds up inference when the `-compile` flag is also used.

NOTE: Int8 quantization is only supported on GPUs that support BF16.