Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/hlasse/multidiagnosis-speech
https://github.com/hlasse/multidiagnosis-speech
Last synced: about 2 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/hlasse/multidiagnosis-speech
- Owner: HLasse
- License: mit
- Created: 2021-09-06T07:10:15.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2023-06-23T10:07:36.000Z (over 1 year ago)
- Last Synced: 2024-11-08T20:07:19.729Z (about 2 months ago)
- Language: Jupyter Notebook
- Size: 14.7 MB
- Stars: 11
- Watchers: 4
- Forks: 3
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Multidiagnosis-speech
Code for finetuning of feature-based baselines and Transformer models for classification of mental disorders from speech.
```
# make virtual env
pip install -r requirements.txtmkdir data
# download and unzip data to 'data/multi_diagnosis'
wget ...
unzip ..
# make sure metadata file (CleanData4.csv) is in 'data/multidiagnosis.
# if only access to CleanData.csv, run preprocessing/merge_participant_metadata.py# run preprocessing scripts
bash run_preprocessing.py# train baseline models
python train_baseline_models.py# train wav2vec models
bash train_wav2vec.sh#evaluate models
bash evaluate_baselines.sh
python evaluate_wav2vec_models.py```
## Audio augmentation
- Using torch-audiomentation
- IR taking from [MIT Acoustical Reverberation Scene Statistics Survey](https://mcdermottlab.mit.edu/Reverb/IR_Survey.html). Placed in 'augmentation_files/ir'. Specificy path in 'augmentation_config.yml'