Auralisation of learned features in CNN (for audio)
https://github.com/keunwoochoi/Auralisation
- Host: GitHub
- URL: https://github.com/keunwoochoi/Auralisation
- Owner: keunwoochoi
- Created: 2015-12-04T15:00:53.000Z (almost 10 years ago)
- Default Branch: master
- Last Pushed: 2017-03-11T14:53:29.000Z (over 8 years ago)
- Last Synced: 2024-04-22T12:32:37.672Z (over 1 year ago)
- Language: Python
- Size: 68.8 MB
- Stars: 39
- Watchers: 5
- Forks: 10
- Open Issues: 0
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-python-data-science - Auralisation - Auralisation of learned features in CNN (for audio). (Model Explanation / NLP)
- awesome-python-data-science - Auralisation - Auralisation of learned features in CNN (for audio). (Model Explanation / Others)
README
# AuralisationCNN
This repo contains an example of auralisation of CNNs, as demonstrated at ISMIR 2015.
## Files
auralise.py: includes all required functions for deconvolution.
example.py: includes the whole pipeline - just clone the repo and run it with `python example.py`.
You might need an older version of Keras, e.g. [this commit](https://github.com/fchollet/keras/commit/06a1545645d974350d13425246eec53a08cb6ab8) (ver 0.3.x).
## Folders
src_songs: includes the three songs that I used in my blog posts.
## Usage
Load the weights that you want to auralise. I'm using this function,
`W = load_weights()`
to load my Keras model, but it can be anything else.
`W` is a list of the convnet's weights. (TODO: more details) A sketch of what such a loader might look like follows.
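If the model's weights were saved with Keras's `model.save_weights('weights.hdf5')`, a loader along these lines could work. This is only a sketch under that assumption; the repo's actual `load_weights()` may read the file differently, and the file name here is hypothetical.

```python
# Minimal sketch: flatten every weight dataset in an HDF5 file into a list.
import h5py
import numpy as np

def load_weights(path='weights.hdf5'):  # hypothetical file name
    W = []
    def collect(name, obj):
        # visititems walks the whole file; keep only the weight arrays
        if isinstance(obj, h5py.Dataset):
            W.append(np.array(obj))
    with h5py.File(path, 'r') as f:
        f.visititems(collect)
    return W
```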
Then load the source files and compute their STFTs. I'm using `librosa`.
Then deconvolve them with `get_deconve_mask`; see the end-to-end sketch below.
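Putting the steps together, a minimal end-to-end run might look like this. It's only a sketch: the import path and the exact signature of `get_deconve_mask` are assumptions, and the song filename is hypothetical, so check `auralise.py` and `example.py` for the real interface.

```python
# A hedged end-to-end sketch of the usage above; not the repo's exact API.
import librosa
from auralise import load_weights, get_deconve_mask  # assumed import path

W = load_weights()                                  # list of convnet weights
src, sr = librosa.load('src_songs/song.mp3', mono=True)  # hypothetical file
SRC = librosa.stft(src, n_fft=512, hop_length=256)  # complex spectrogram

# Deconvolve the spectrogram through the learned filters
deconved = get_deconve_mask(W, SRC)                 # assumed call signature

# To actually 'listen' to a feature, invert the result back to audio
y_out = librosa.istft(deconved)
```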
## Citation
Please cite [this paper](https://scholar.google.co.kr/citations?view_op=view_citation&hl=en&user=ZrqdSu4AAAAJ&sortby=pubdate&citation_for_view=ZrqdSu4AAAAJ:8k81kl-MbHgC), or simply:
```
@inproceedings{choi2015auralisation,
title={Auralisation of Deep Convolutional Neural Networks: Listening to Learned Features},
author={Choi, Keunwoo and Kim, Jeonghee and Fazekas, George and Sandler, Mark},
booktitle={International Society of Music Information Retrieval (ISMIR), Late-Breaking/Demo Session, New York, USA},
year={2015},
organization={International Society of Music Information Retrieval}
}
```
## External links
* [The second blog post](https://keunwoochoi.wordpress.com/2016/03/23/what-cnns-see-when-cnns-see-spectrograms/) has a more extensive demo. A detailed description will follow after paper submission.
* [The first blog post](http://keunwoochoi.blogspot.co.uk/2015/10/ismir-2015-lbd.html) that explains [my ISMIR 2015 Late-Breaking session paper](http://ismir2015.uma.es/LBD/LBD24.pdf).
#### Credits
* [Keras](https://github.com/fchollet/keras), [librosa](https://bmcfee.github.io/librosa/index.html), [Matt's deconvolution paper](http://arxiv.org/abs/1311.2901), [Naver Labs](http://labs.naver.com)