https://github.com/hlasse/multidiagnosis-speech

Host: GitHub
URL: https://github.com/hlasse/multidiagnosis-speech
Owner: HLasse
License: mit
Created: 2021-09-06T07:10:15.000Z (over 3 years ago)
Default Branch: main
Last Pushed: 2023-06-23T10:07:36.000Z (over 1 year ago)
Last Synced: 2024-11-08T20:07:19.729Z (about 2 months ago)
Language: Jupyter Notebook
Size: 14.7 MB
Stars: 11
Watchers: 4
Forks: 3
Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE

# Multidiagnosis-speech

Code for training feature-based baselines and fine-tuning Transformer models for classifying mental disorders from speech.

```
# create and activate a virtual environment, then install dependencies
pip install -r requirements.txt

mkdir data
# download and unzip data to 'data/multi_diagnosis'
wget ...
unzip ..
# make sure the metadata file (CleanData4.csv) is in 'data/multidiagnosis'.
# if you only have access to CleanData.csv, run preprocessing/merge_participant_metadata.py

# run preprocessing scripts
python run_preprocessing.py

# train baseline models
python train_baseline_models.py

# train wav2vec models
bash train_wav2vec.sh

# evaluate models
bash evaluate_baselines.sh
python evaluate_wav2vec_models.py

```
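Before running the preprocessing step, it can help to check which metadata file is actually present. The following is a minimal sketch, not part of the repository: `check_metadata` is a hypothetical helper, and the data directory is passed in by the caller rather than hard-coded. Only the file names `CleanData4.csv` and `CleanData.csv` come from the steps above.

```python
from pathlib import Path


def check_metadata(data_dir: str) -> str:
    """Report which metadata file, if any, is present in data_dir.

    Hypothetical helper (not part of the repository). Returns:
      "ready"       if CleanData4.csv exists,
      "needs_merge" if only CleanData.csv exists,
      "missing"     if neither file is found.
    """
    root = Path(data_dir)
    if (root / "CleanData4.csv").is_file():
        return "ready"
    if (root / "CleanData.csv").is_file():
        # Per the setup steps, preprocessing/merge_participant_metadata.py
        # is the script to run in this case.
        return "needs_merge"
    return "missing"
```

Calling `check_metadata("path/to/your/data")` with the directory you unzipped the data into returns one of the three status strings; `"needs_merge"` corresponds to the case where `preprocessing/merge_participant_metadata.py` should be run first.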

## Audio augmentation
- Uses torch-audiomentations.
- Impulse responses (IRs) are taken from the [MIT Acoustical Reverberation Scene Statistics Survey](https://mcdermottlab.mit.edu/Reverb/IR_Survey.html). Place them in 'augmentation_files/ir' and specify the path in 'augmentation_config.yml'.
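
To illustrate what impulse-response augmentation does conceptually, here is a minimal pure-Python sketch that convolves a mono signal with an impulse response. It is not the repository's code and not the torch-audiomentations API: `apply_impulse_response` is a hypothetical function, and truncating to the input length and rescaling to the input peak are assumptions made for this example.

```python
def apply_impulse_response(signal, ir):
    """Convolve a mono signal with an impulse response (lists of floats).

    Illustrative sketch only. The output is truncated to the input length
    and rescaled so that its peak matches the peak of the input; both
    choices are assumptions made for this example.
    """
    if not signal or not ir:
        raise ValueError("signal and ir must be non-empty")

    out = [0.0] * len(signal)
    for n in range(len(signal)):
        acc = 0.0
        # Direct-form convolution: y[n] = sum_k ir[k] * x[n - k]
        for k in range(min(n + 1, len(ir))):
            acc += ir[k] * signal[n - k]
        out[n] = acc

    # Rescale so the reverberated signal has the same peak as the input.
    peak_in = max(abs(x) for x in signal)
    peak_out = max(abs(y) for y in out)
    if peak_out > 0.0:
        scale = peak_in / peak_out
        out = [y * scale for y in out]
    return out
```

For example, a single click `[1.0, 0.0, 0.0, 0.0]` convolved with the impulse response `[1.0, 0.5]` yields the click followed by an echo at half amplitude. Real impulse responses from the survey are far longer, so in practice an FFT-based convolution, as provided by audio libraries, is preferable to this direct loop.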