https://github.com/voidful/speechmix

Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together
https://github.com/voidful/speechmix

Last synced: 10 months ago
JSON representation

Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together

Host: GitHub
URL: https://github.com/voidful/speechmix
Owner: voidful
Created: 2021-10-29T15:35:41.000Z (over 4 years ago)
Default Branch: main
Last Pushed: 2025-07-03T06:31:43.000Z (12 months ago)
Last Synced: 2025-08-17T02:19:32.504Z (10 months ago)
Language: Python
Size: 94.7 KB
Stars: 47
Watchers: 4
Forks: 10
Open Issues: 3
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# SpeechMix

Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together.

Implementation of:
[Large-Scale Self- and Semi-Supervised Learning for Speech Translation](https://arxiv.org/abs/2104.06678) - ACL2021
[Multilingual Speech Translation with Efficient Finetuning of Pretrained Models](https://arxiv.org/abs/2010.12829) - ACL2021
[Lightweight Adapter Tuning for Multilingual Speech Translation](https://arxiv.org/abs/2106.01463) - Interspeech 2021
[Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task](https://arxiv.org/abs/2107.05782) - ACL2021
[A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks](https://arxiv.org/abs/2010.11338) - ICASSP 2021

## Installation

### pip install

```bash
pip install speechmix
```

### Build from source

git clone and cd into this project.

```shell
pip install -e .
```

## Name the project(!important)
WANDB_PROJECT=amazing

## base
```shell
python train.py --speech_model_config wav2vec2 \
--nlp_model_config facebook/bart-base \
--SpeechMixEED \
--dataset librispeech_asr \
--field clean \
--train_split train.100 \
--test_split validation \
--batch 3 \
--grad_accum 20 \
--epoch 30 \
--worker 15 \
--share_layer_ratio 0 \
--down_scale 2 \
--lr 4e-5 \
--warmup_steps 500 \
--wandb \
--notes base
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/voidful/speechmix

Awesome Lists containing this project

README