https://github.com/mpariente/pystoi

Python implementation of the Short Term Objective Intelligibility measure
https://github.com/mpariente/pystoi

Last synced: about 1 year ago
JSON representation

Python implementation of the Short Term Objective Intelligibility measure

Host: GitHub
URL: https://github.com/mpariente/pystoi
Owner: mpariente
License: mit
Created: 2018-04-18T12:01:22.000Z (over 8 years ago)
Default Branch: master
Last Pushed: 2023-12-29T16:47:47.000Z (over 2 years ago)
Last Synced: 2025-05-08T22:34:20.749Z (about 1 year ago)
Language: MATLAB
Homepage:
Size: 796 KB
Stars: 339
Watchers: 12
Forks: 58
Open Issues: 5
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

awesome-python-audio-science - pystoi - Short-Time Objective Intelligibility measure. (Loudness, Metering & Perceptual Audio / IO router / deconstructed loops anchor)
awesome-python-scientific-audio - pystoi - Short Term Objective Intelligibility measure (STOI). (Audio Related Packages)

README

# Python implementation of STOI

Implementation of the classical and extended Short Term Objective Intelligibility measures

Intelligibility measure which is highly correlated with the intelligibility of degraded speech signals, e.g., due to additive noise, single/multi-channel noise reduction, binary masking and vocoded speech as in CI simulations. The STOI-measure is intrusive, i.e., a function of the clean and degraded speech signals. STOI may be a good alternative to the speech intelligibility index (SII) or the speech transmission index (STI), when you are interested in the effect of nonlinear processing to noisy speech, e.g., noise reduction, binary masking algorithms, on speech intelligibility.
Description taken from [Cees Taal's website](http://www.ceestaal.nl/code/)

### Install

`pip install pystoi` or
`pip3 install pystoi`

### Usage
```
import soundfile as sf
from pystoi import stoi

clean, fs = sf.read('path/to/clean/audio')
denoised, fs = sf.read('path/to/denoised/audio')

# Clean and den should have the same length, and be 1D
d = stoi(clean, denoised, fs, extended=False)
```

### Running the Octave tests

```bash
sudo apt update
sudo apt install octave octave-signal
pip install oct2py
```

```bash
python -m pytest tests/test_python_octave.py
python -m pytest tests/test_stoi_octave.py
```

### Matlab code & Testing

All the Matlab code in this repo is taken from or adapted from the code available [here](http://www.ceestaal.nl/code/) (STOI – Short-Time Objective Intelligibility Measure – ) written by Cees Taal.

Thanks to Cees Taal who open-sourced his Matlab implementation and enabled thorough testing of this python code.

If you want to run the tests, you will need Matlab, `matlab.engine` (install instructions [here](https://fr.mathworks.com/help/matlab/matlab_external/install-the-matlab-engine-for-python.html)) and `matlab_wrapper` (install with `pip install matlab_wrapper`).
The tests can only be ran under Python 2.7 as `matlab.engine` and `matlab_wrapper` are only compatible with Python2.7
Tests are passing at relative and absolute tolerance of `1e-3`, which is enough for the considered application (all the variability is coming from the resampling method when signals are not natively sampled at 10kHz).

Very big thanks to @gauss256 who translated all the matlab scripts to Octave, and wrote all the tests for it!

### Contribute

Any contribution are welcome~, specially to improve the execution speed of the code~ (thank you Przemek Pobrotyn for a 4x speed-up!) :

* ~Improve the resampling method to match Matlab's resampling in `tests/`.~ This can be considered a solved issue thanks to @gauss256 !
* Write tests for Python 3 (with [`transplant`](https://github.com/bastibe/transplant) for example)

### References
* [1] C.H.Taal, R.C.Hendriks, R.Heusdens, J.Jensen 'A Short-Time
Objective Intelligibility Measure for Time-Frequency Weighted Noisy Speech',
ICASSP 2010, Texas, Dallas.
* [2] C.H.Taal, R.C.Hendriks, R.Heusdens, J.Jensen 'An Algorithm for
Intelligibility Prediction of Time-Frequency Weighted Noisy Speech',
IEEE Transactions on Audio, Speech, and Language Processing, 2011.
* [3] J. Jensen and C. H. Taal, 'An Algorithm for Predicting the
Intelligibility of Speech Masked by Modulated Noise Maskers',
IEEE Transactions on Audio, Speech and Language Processing, 2016.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/mpariente/pystoi

Awesome Lists containing this project

README