https://github.com/tabahi/ser_datasets_import
Python module for importing SER datasets
https://github.com/tabahi/ser_datasets_import
Last synced: 3 months ago
JSON representation
Python module for importing SER datasets
- Host: GitHub
- URL: https://github.com/tabahi/ser_datasets_import
- Owner: tabahi
- License: mit
- Created: 2020-05-14T01:07:36.000Z (about 5 years ago)
- Default Branch: master
- Last Pushed: 2020-09-16T09:12:23.000Z (over 4 years ago)
- Last Synced: 2025-01-13T14:52:30.511Z (5 months ago)
- Language: Python
- Size: 20.5 KB
- Stars: 1
- Watchers: 1
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: Readme.md
- License: LICENSE.md
Awesome Lists containing this project
README
# SER corpus import module
This module reads the wav files in DB folders to create clip file class objects
All import functions read the DB folders in their original (downloaded as is from their sources) format.`create_DB_file_objects((string)db_name, (string)db_path)` creates a list of file objected of all the wav files present in the db_path, as long as they are according to the formant of relevant 'db_name's original source and returns a list of `Clip_file_Class` objects.
```python
# in example.py
import SER_DB_Parsing.SER_DB as SER_DBlist_of_clips = SER_DB.create_DB_file_objects("EmoDB", "path\\EMO-DB\\wav\\")
```Each item in the list is created by the class: `Clip_file_Class(db_id, filepath, speaker_id, scenario, sex, emotion_cat=None, intensity_cat=None, valance=None, arousal=None, dominance=None, naturalness=None, statement=None, repetition=None, n_raters=None, n_possible_emotions=None)`
Get clip properties as:
```python
print("First clip path:", list_of_clips[0].filepath)
print("First clip emotion category:", list_of_clips[0].emotion_cat)
```Currently supports these DBs
```python
db_name="EmoDB", db_path="C:\\DB\\EMO-DB\\wav\\"
# First file path: C:\DB\EMO-DB\wav\03a01Fa.wavdb_name="RAVDESS", db_path="C:\\DB\\RAVDESS\\"
# First file: C:\DB\RAVDESS\Speech\Actor_01\03-01-01-01-01-01-01.wavdb_name="IEMOCAP", db_path="C:\\DB\\IEMOCAP_noVideo\\"
# First file: C:\DB\IEMOCAP_noVideo\Session1\sentences\wav\Ses01F_script01_1\Ses01F_script01_1_F001.wav
# Evaluation file: C:\DB\IEMOCAP_noVideo\Session1\dialog\EmoEvaluation\Ses01F_impro01.txtdb_name="ShemoDB", db_path="C:\\DB\\shemo\\"
# First file: C:\DB\shemo\F\F01A01.wavdb_name="DEMoS", db_path="C:\\DB\\wav_DEMoS\\DEMOS\\"
# First file: C:\DB\wav_DEMoS\DEMOS\NP_f_01_col07b.wavdb_name="MSIMPROV", db_path="C:\\DB\\MSP-IMPROV\\"
# First file: C:\DB\MSP-IMPROV\session2\S01A\R\MSP-IMPROV-S01A-F02-R-FF01.wav
# Evaluation file: C:\DB\MSP-IMPROV\Evalution.txt
```
Change the `db_path` according to wherever you have stored the databases.> Standardized emotional category labels (single char) `emotion_cat: {'N':'neutral', 'H':'happy', 'S':'sad', 'A':'anger', 'F':'fear', 'D':'disgust', 'U':'surprise', 'C':'calm', 'R':'frustuated', 'E':'excited', 'Y':'happy-excited', 'G':'guilty', 'X': 'unknown'}`
> Standardized scenario labels (int) : `0=unknown, 1=script, 2=improv, 3=radio/TV, 4=elicited, 5=natural, 6=script-in-improv`
> Standardized sexes (single char) : `'M'=males, 'F'=females`
EmoDB
RAVDESS
IEMOCAP
ShEMO-DB
DEMoS
MSP-IMPROV