https://github.com/bagustris/thesis-bss

List of MATLAB/Octave scripts used in the master's thesis (S2) "binaural" by bagustris

binaural-hearing matlab octave source-separation speech-enhancement

Last synced: 7 months ago

Host: GitHub
URL: https://github.com/bagustris/thesis-bss
Owner: bagustris
Created: 2013-06-26T12:01:09.000Z (over 12 years ago)
Default Branch: master
Last Pushed: 2022-04-27T03:38:46.000Z (over 3 years ago)
Last Synced: 2023-06-01T23:22:51.713Z (over 2 years ago)
Topics: binaural-hearing, matlab, octave, source-separation, speech-enhancement
Language: MATLAB
Homepage: https://bagustris.github.io
Size: 5.15 MB
Stars: 8
Watchers: 2
Forks: 3
Open Issues: 0
Metadata Files:
- Readme: README.md

README

thesis-bss
==========
MATLAB/Octave scripts used in the master's thesis by bagustris.
The thesis topic is binaural sound source separation. A PDF of the thesis is available [here](https://www.dropbox.com/s/5wjsrrhxjw5oby3/bta_tesis_en_v16.pdf?dl=0).

In this thesis, I evaluated some common methods in binaural sound separation: ICA (with maximum likelihood estimation), ICA with Binary Mask (ICABM), a binaural model using phase difference channel weighting (PDCW) [4], and my proposed method, FastICA with Binary Mask (FastICABM).
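
ICABM and FastICABM both pair an ICA stage with binary time-frequency masking [1, 2]. The snippet below is only a rough sketch of the masking idea, not the code in `icabm.m` or `fasticabm.m`. It assumes a bin is assigned to one ICA output when its magnitude exceeds the other output's magnitude by a factor `th`. The matrices `Y1`, `Y2`, `X` and the value of `th` are made up for illustration.

```octave
% Toy magnitude spectrograms of two ICA outputs
% (rows: frequency bins, columns: time frames). Values are hypothetical.
Y1 = [4.0 1.0 3.0;
      0.5 2.0 6.0];
Y2 = [1.0 2.0 3.0;
      2.0 0.5 1.0];

th = 1.5;   % illustrative mask threshold, not a value taken from the thesis

% A bin belongs to an output only if that output dominates by the factor th.
BM1 = abs(Y1) > th * abs(Y2);
BM2 = abs(Y2) > th * abs(Y1);

% Apply the binary masks to a (hypothetical) mixture spectrogram.
X  = Y1 + Y2;
X1 = BM1 .* X;   % bins kept for the first estimate
X2 = BM2 .* X;   % bins kept for the second estimate
```

With `th` greater than 1, a bin where neither output clearly dominates is dropped from both masks, so the two masks never overlap.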

This is the source code for underdetermined separation of instantaneous speech mixtures with FastICA and a binary mask, together with the benchmark methods used for comparison.
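
To make "underdetermined" and "instantaneous" concrete, here is a minimal sketch of such a mixture: three toy sources combined into two channels by a delay-free mixing matrix. The signals, the matrix `A`, and the variable names are all assumptions for illustration and do not come from the thesis data.

```octave
% Instantaneous (delay-free) mixing of N = 3 sources into 2 channels.
N = 3;                            % number of sources
T = 1000;                         % number of samples
t = (0:T-1) / T;                  % time axis in seconds

s = [sin(2*pi*5*t);               % toy source 1: sine
     sign(sin(2*pi*11*t));        % toy source 2: square wave
     2*mod(7*t, 1) - 1];          % toy source 3: sawtooth

A = [1.0 0.6 0.2;                 % hypothetical 2-by-N mixing matrix
     0.3 0.7 1.0];

x = A * s;                        % two-channel mixture, size 2-by-T

% Fewer mixtures than sources: A cannot be inverted to recover all of s.
is_underdetermined = rank(A) < N;
```

Because `A` has only two rows, a linear unmixing matrix alone can recover at most two components, which is why a masking stage is combined with ICA.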

The algorithms are described in:

1. Michael Syskind Pedersen, DeLiang Wang, Jan Larsen and Ulrik Kjems, "Two-microphone Separation of Speech Mixtures," 2006, submitted for publication.
2. Michael Syskind Pedersen, DeLiang Wang, Jan Larsen and Ulrik Kjems, "Overcomplete Blind Source Separation by Combining ICA and Binary Time-Frequency Masking," IEEE International Workshop on Machine Learning for Signal Processing, pp. 15-20, 2005.
3. Aapo Hyvärinen and Erkki Oja, "Independent Component Analysis: Algorithms and Applications," Neural Networks, 13(4-5):411-430, 2000.
4. C. Kim, K. Kumar, B. Raj, and R. M. Stern, "Signal separation for robust speech recognition based on phase difference information obtained in the frequency domain," INTERSPEECH, pp. 2495–2498, 2009.

All files should be in the same directory.
To run the algorithms, call `icabm.m` and `fasticabm.m` respectively.
The ICA algorithm can be run directly from the workspace; the PDCW code can be obtained from the [source](http://www.cs.cmu.edu/~chanwook/MyAlgorithms/PDCW_IS2009/INTERSPEECH2009Package.zip).

A number of parameters can be specified in those files.

- N : Number of sources in mixture
- NFFT : DFT length
- winnumber : Selects window function
- k : Window length is NFFT/k
- noverlapfactor : Overlap between consecutive windows
- th : Mask threshold?
- TC1 : Merge finalstereo signals if correlation is above TC1
- TC2 : Merge finalstereo and enerstereo if correlation is above TC2
- stopthresholdini : One source if condition number is above this value
- thepow : tau_E (see [1])
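
As a rough illustration of how the framing parameters relate, the sketch below derives the window length from `NFFT` and `k` as stated above. It assumes `noverlapfactor` is a fraction of the window length, which the list does not confirm. The values, the signal length `L`, and the derived names `winlength`, `noverlap`, `hop`, and `nframes` are hypothetical and may differ from those in `icabm.m` and `fasticabm.m`.

```octave
% Illustrative values only; the defaults in the scripts may differ.
NFFT = 2048;             % DFT length
k = 2;                   % window length is NFFT/k
noverlapfactor = 0.75;   % ASSUMPTION: overlap as a fraction of the window

winlength = NFFT / k;                            % samples per analysis window
noverlap  = round(noverlapfactor * winlength);   % overlapping samples (assumed meaning)
hop       = winlength - noverlap;                % frame advance in samples

% Number of complete frames for a signal of L samples.
L = 16000;                                       % hypothetical signal length
nframes = floor((L - noverlap) / hop);
```

When `k` is greater than 1 the window is shorter than the DFT, so each frame would be zero-padded up to `NFFT` before the transform.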

All code is copyrighted by its respective author. My own code is licensed under the GNU LGPL v2.
Run `main.m` for a demo. You can replace the input file with your own data.
