https://github.com/tirthasheshpatel/generative-models

A repository containing code for implementing Deep Learning Generative Models using Python
https://github.com/tirthasheshpatel/generative-models

Last synced: 1 day ago
JSON representation

A repository containing code for implementing Deep Learning Generative Models using Python

Host: GitHub
URL: https://github.com/tirthasheshpatel/generative-models
Owner: tirthasheshpatel
License: mit
Created: 2020-03-31T08:27:28.000Z (over 5 years ago)
Default Branch: master
Last Pushed: 2023-03-24T23:59:53.000Z (over 2 years ago)
Last Synced: 2025-05-29T11:59:15.890Z (about 2 months ago)
Language: Jupyter Notebook
Size: 8.5 MB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

        # Generative-Models

A repository containing code for implementing Deep Learning Generative Models using Python

### Restricted Boltmann Machines

The model is persent in [rbm.py](rbm.py) and its coressponding notebook in [RMBs.ipynb](RBMs.ipynb)

The usage is shown below.

Data must of the shape ``(n_samples, n_features)``. For example, the ``n_features`` of images are its number of pixels and ``n_samples`` are the number of samples present in your training dataset. If there are 60000 images of size 28x28, then the input shape will be ``(60000, 784)``. The ``fit(X, lr=1., epochs=10, method="contrastive_divergence", burn_in=1000, tune=2000, verbose=False)`` method is used to train the RBM on given dataset ``X``. Once trained ``encode(training_instance)`` is used to encode a training instance onto the latent space learned by the RBM. You can use ``decode()`` method to generate new random images or ``decode(encoded)`` method to deocde an already encoded image. This is not limited to images! Any type of data can be used!

The following example shows how to train the RBMs on mnist dataset consisting of the digit 5 only.

Example Code

```python

import numpy as np

import matplotlib.pyplot as plt

from rbm import BinaryRestrictedBoltzmannMachine

from keras.datasets import mnist

(X_train, y), (_, _) = mnist.load_data()

# Normalize and reshape

X_train = X_train.reshape(60000, -1)

X_train = 1. * ((X_train[y == 5] / 255.) >= 0.5)

# Plot some training isntances

fig, ax = plt.subplots(nrows=3, ncols=3, figsize=(10, 10))

ax[0, 0].imshow(X_train[10].reshape(28, 28))

ax[0, 1].imshow(X_train[11].reshape(28, 28))

ax[0, 2].imshow(X_train[12].reshape(28, 28))

ax[1, 0].imshow(X_train[13].reshape(28, 28))

ax[1, 1].imshow(X_train[14].reshape(28, 28))

ax[1, 2].imshow(X_train[15].reshape(28, 28))

ax[2, 0].imshow(X_train[16].reshape(28, 28))

ax[2, 1].imshow(X_train[17].reshape(28, 28))

ax[2, 2].imshow(X_train[18].reshape(28, 28))

fig.suptitle("Training instances")

plt.show()

# We will mainly experiment with different latent space

# dimensions. For this instance, i have a 30-D latent space.

hidden_dims = 3

# Define our model

model = BinaryRestrictedBoltzmannMachine(hidden_dims)

# Train the model on our dataset with learning rate 1.0

model.fit(X_train, lr=1.0, burn_in=None, tune=1, epochs=100, verbose=True)

# Use the `decode()` method to generate an image.

images = [model.generate_smooth() for _ in range(9)]

fig, ax = plt.subplots(nrows=3, ncols=3, figsize=(10, 10))

ax[0, 0].imshow(images[0].reshape(28, 28))

ax[0, 1].imshow(images[1].reshape(28, 28))

ax[0, 2].imshow(images[2].reshape(28, 28))

ax[1, 0].imshow(images[3].reshape(28, 28))

ax[1, 1].imshow(images[4].reshape(28, 28))

ax[1, 2].imshow(images[5].reshape(28, 28))

ax[2, 0].imshow(images[6].reshape(28, 28))

ax[2, 1].imshow(images[7].reshape(28, 28))

ax[2, 2].imshow(images[8].reshape(28, 28))

fig.suptitle("Generated instances")

plt.show()

```

Reconstructed images

![Reconstruction](images/rbm_train_5.png)

![Reconstruction](images/rbm_recon_5.png)

Generated Images

![Generated images](images/rbm_gen.png)

### Variational Auto-Encoders

The model is present in [vae.py](vae.py) and the coressponding notebook is present at [VAE.ipynb](VAE.ipynb).

The encoder and decoder are ANNs and the model is trained on 60000 MNIST images for 20 epochs using the RMSprop optimizer. One very important observation is that the model with with low dimensional latent space (`latent_dim=3`) is a better generator and poor reconstructor while the model with high dimensional latent space (`latent_dim=30`) is a good reconstructor but a poor generator. You can play around with the notebook and observe other interesting things. The model is strictly build on TensorFlow 1.x and doesn't work for any 2.x versions. Following are the reconstructed and generated images for a latent space with 3 dimensions.

Reconstructed images

![Reconstruction](images/vae_recon.png)

Generated Images

![Generation](images/vae_gen.png)

### Neural Auto-Regressive Density Estimators

``WIP``

### Masked Auto-Encoder Density Estimators

``WIP``

### Generative Adversarial Networks

``WIP``

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/tirthasheshpatel/generative-models

Awesome Lists containing this project

README