Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/jooohyunpark/Uncanny-Forest

A virtual world synthesized with the latent representation of forest sounds.
https://github.com/jooohyunpark/Uncanny-Forest

Last synced: 3 months ago
JSON representation

A virtual world synthesized with the latent representation of forest sounds.

Host: GitHub
URL: https://github.com/jooohyunpark/Uncanny-Forest
Owner: jooohyunpark
Created: 2018-12-18T05:39:18.000Z (almost 6 years ago)
Default Branch: master
Last Pushed: 2023-03-04T05:56:49.000Z (over 1 year ago)
Last Synced: 2024-07-18T05:40:21.459Z (4 months ago)
Language: JavaScript
Homepage:
Size: 20.2 MB
Stars: 4
Watchers: 1
Forks: 0
Open Issues: 5
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

        # Uncanny-Forest

Uncanny forest is an alpha version of my thesis project. This project is designed as an immersive web VR experience combined with machine learning. All the sounds here are LATENT REPRESENTATION of real forests encoded and decoded through latent space with deep neural networks. You can see how neural networks interpret and extract meaningful features of forest sounds and reconstruct them. 

* [View Project](https://www.uncannyforest.parkjoohyun.com/) 


* [Demo(old)](https://vimeo.com/307056674)

## Generation

#### WaveNet 
 

The dataset for training WaveNet model consists of 922 30-sec wav files(5.27 GB). It is 8 hour length that has even amount for 4 different categories: birds, creek, wind, bugs&frogs. Training process took 3 days with NVIDIA GPU showing 3.6 sec/step and stopped at 38244 steps with 1.531 loss. Command I used was:

```

python train.py  --data_dir=/ --num_steps=200000 --silence_threshold=0 --batch_size=4 --checkpoint_every=1000 

-- max_checkpoints=100 

```

#### NSynth 


Generation process took 61 hours for reconstructing 35-40 10-sec wav files using pre-trained model with following command. 

```

nsynth_generate --checkpoint_path=//wavenet-ckpt/model.ckpt-200000 --source_path=/ --save_path=/ --batch_size=1 

```

## Tech Stack

* [WaveNet](https://github.com/ibab/tensorflow-wavenet)

* [NSynth(Magenta)](https://github.com/tensorflow/magenta/tree/master/magenta/models/nsynth)

* [A-frame](https://aframe.io/)

## Sound Reference

* [BBC Sound Library](http://bbcsfx.acropolis.org.uk/)

* [NPS Sound Library](https://www.nps.gov/romo/learn/photosmultimedia/soundlibrary.htm)

* [Wildlife Sound Recording Society](https://www.wildlife-sound.org/sounds-of-nature/radio-wsrs)

* [Free To Use Sounds](https://freetousesounds.com/complete-library/)

* [Personal Recordings](https://recordingsofnature.wordpress.com/)