An open API service indexing awesome lists of open source software.

https://github.com/keunwoochoi/Auralisation

Auralisation of learned features in CNN (for audio)
https://github.com/keunwoochoi/Auralisation

Last synced: 4 days ago
JSON representation

Auralisation of learned features in CNN (for audio)

Awesome Lists containing this project

README

          

# AuralisationCNN
This repo is for an example of auralisastion of CNNs that is demonstrated on ISMIR 2015.

## Files
auralise.py: includes all required function for deconvolution.
example.py: includes the whole code - just clone and run it by `python example.py`
You might need to use older version of Keras, e.g. [this](https://github.com/fchollet/keras/commit/06a1545645d974350d13425246eec53a08cb6ab8) (ver 0.3.x)

## Folders
src_songs: includes three songs that I used in my blog posting.

## Usage
Load weights that you want to auralise. I'm using this function
`W = load_weights()`
to load my keras model, it can be anything else.
`W` is a list of weights for the convnet. (TODO: more details)

Then load source files, get STFT of it. I'm using `librosa`.

Then deconve it with `get_deconve_mask`.

## Citation
[This paper](https://scholar.google.co.kr/citations?view_op=view_citation&hl=en&user=ZrqdSu4AAAAJ&sortby=pubdate&citation_for_view=ZrqdSu4AAAAJ:8k81kl-MbHgC), or simply,

```
@inproceedings{choi2015auralisation,
title={Auralisation of Deep Convolutional Neural Networks: Listening to Learned Features},
author={Choi, Keunwoo and Kim, Jeonghee and Fazekas, George and Sandler, Mark},
booktitle={International Society of Music Information Retrieval (ISMIR), Late-Breaking/Demo Session, New York, USA},
year={2015},
organization={International Society of Music Information Retrieval}
}
```

## External links
* [The second blog post](https://keunwoochoi.wordpress.com/2016/03/23/what-cnns-see-when-cnns-see-spectrograms/) has more extensive demo. Detailed description will follow after paper submission.
* [The first blog post](http://keunwoochoi.blogspot.co.uk/2015/10/ismir-2015-lbd.html) that explains [my ISMIR 2015 Late-Breaking session paper](http://ismir2015.uma.es/LBD/LBD24.pdf).

#### Credits
* [Keras](https://github.com/fchollet/keras), [librosa](https://bmcfee.github.io/librosa/index.html), [Matt's deconvolution] paper(http://arxiv.org/abs/1311.2901), [Naver Labs](http://labs.naver.com)