https://github.com/parshakova/dl_rl_assignments

Homeworks from Deep learning course
https://github.com/parshakova/dl_rl_assignments

deep-learning dqn mini-alpha-go-zero q-learning tensorflow

Last synced: about 2 months ago
JSON representation

Homeworks from Deep learning course

Host: GitHub
URL: https://github.com/parshakova/dl_rl_assignments
Owner: parshakova
Created: 2017-11-16T01:04:58.000Z (over 8 years ago)
Default Branch: master
Last Pushed: 2018-06-24T05:54:11.000Z (about 8 years ago)
Last Synced: 2025-10-19T13:33:47.568Z (9 months ago)
Topics: deep-learning, dqn, mini-alpha-go-zero, q-learning, tensorflow
Language: Jupyter Notebook
Homepage:
Size: 56.2 MB
Stars: 1
Watchers: 0
Forks: 2
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Deep Learning assignments
Homeworks from the Deep learning course, implementations done with Tensorflow

Jupyter Notebook files contain code with explanations.

**Hw1:**

Underfitting and overfitting problem for the polynomial regression

**Hw2:**

Back-propagation for 100-layer network, investigation of conditions under which the training fails

**Project1:**

1. Classification on the toy dataset with two classes
2. Transfer learning from MNIST classification of 9 digits to 10 digits
3. PCA on 100 test samples of MNIST classifier

**Project2:**

1. Q learning in the linear environment
2. Q learning for the 4-legged spider
3. Deep Q Network for the Breakout environment

**Project3:**

1. Training a simple neural network playing tic-tac-toe. Proved that the agent can solve any game with all behaviours (while wins agains the random player are not the sufficient conditions, since it gives only the probabilistic conclusion).
2. Mini AlphaGo Zero: train 5 neural networks such that the first is trained based on purely random plays, the second is
trained based on game records played by the first neural network, the third is trained based on
game records played by the second neural network and so on.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/parshakova/dl_rl_assignments

Awesome Lists containing this project

README