https://github.com/escorciav/drlnd-navigation

Deep Q-learning Network for autonomous navigation
https://github.com/escorciav/drlnd-navigation

jupyter-notebook python3 pytorch reinforcement-learning unity

Last synced: 3 months ago
JSON representation

Deep Q-learning Network for autonomous navigation

Host: GitHub
URL: https://github.com/escorciav/drlnd-navigation
Owner: escorciav
License: mit
Created: 2018-10-27T19:25:15.000Z (over 7 years ago)
Default Branch: master
Last Pushed: 2019-10-22T21:54:26.000Z (over 6 years ago)
Last Synced: 2025-02-28T07:59:04.985Z (over 1 year ago)
Topics: jupyter-notebook, python3, pytorch, reinforcement-learning, unity
Language: Jupyter Notebook
Size: 85 KB
Stars: 1
Watchers: 3
Forks: 2
Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

[//]: # (Image References)

[image1]: https://user-images.githubusercontent.com/10624937/42135619-d90f2f28-7d12-11e8-8823-82b970a54d7e.gif "Trained Agent"

# Project 1: Navigation

## Introduction

This project illustrates how to train an agent to navigate (and collect bananas!) in a large, square world.

![Trained Agent][image1]

A reward of +1 is provided for collecting a yellow banana, and a reward of -1 is provided for collecting a blue banana. Thus, the goal of our agent is to collect as many yellow bananas (healthy) as possible while avoiding blue bananas (rotten).

The state space has 37 dimensions and contains the agent's velocity, along with a ray-based perception of objects around the agent's forward direction. Given this information, the agent has to learn how to best select actions. Four discrete actions are available, corresponding to:

- **`0`** - move forward.
- **`1`** - move backward.
- **`2`** - turn left.
- **`3`** - turn right.

The task is episodic, and to solve the environment, our agent must get an average score of +13 over 100 consecutive episodes.

## Getting started

1. Download the environment from one of the links below. You need only select the environment that matches your operating system:
- Linux: [click here](https://s3-us-west-1.amazonaws.com/udacity-drlnd/P1/Banana/Banana_Linux.zip)
- Mac OSX: [click here](https://s3-us-west-1.amazonaws.com/udacity-drlnd/P1/Banana/Banana.app.zip)
- Windows (32-bit): [click here](https://s3-us-west-1.amazonaws.com/udacity-drlnd/P1/Banana/Banana_Windows_x86.zip)
- Windows (64-bit): [click here](https://s3-us-west-1.amazonaws.com/udacity-drlnd/P1/Banana/Banana_Windows_x86_64.zip)

(_For Windows users_) Check out [this link](https://support.microsoft.com/en-us/help/827218/how-to-determine-whether-a-computer-is-running-a-32-bit-version-or-64) if you need help with determining if your computer is running a 32-bit version or 64-bit version of the Windows operating system.

_Note_ unfortunately, I could only test the project in OSX 10.11.6.

2. Place the file in the root folder of this repository, and unzip (or decompress) the file.

3. Install all the required dependencies:

The main requirements of this project are Python==3.6, numpy, matplotlib, jupyter, pytorch and unity-agents. To ease its installation, I recommend the following procedure:

- [Install miniconda](https://conda.io/docs/user-guide/install/index.html).

> Feel free to skip this step, if you already have anaconda or miniconda installed in your machine.

> For OSX users, I would recommend trying the step outlined [here](#Installation-for-conda-and-OSX-users)

- Creating the environment.

`conda create -n drlnd-navigation python=3.6`

- Activate the environment

`conda activate drlnd-navigation`

- Installing dependencies.

`pip install -r requirements.txt`

### Installation for conda and OSX users

You can use the environment [YAML file](environment_osx.yml) provided with repo as follows:

`conda env create -f environment_osx.yml`

## Instructions

Launch a jupyter notebook and follow the tutorial in [Navigation.ipynb](Navigation.ipynb) to train your own agent!

> In case you close the shell running the jupyter server, don't forget to activate the environment. `conda activate drlnd-navigation`

## Do you like the project?

Please gimme a ⭐️ in the GitHub banner 😉. I am also open for discussions especially accompany with ☕ or 🍺.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/escorciav/drlnd-navigation

Awesome Lists containing this project

README