An open API service indexing awesome lists of open source software.

https://github.com/hyeon9mak/hcp_2020

๐ŸŽฎ ํฌ์ผ“๋ชฌ ๊ธธ์ฐพ๊ธฐ ๊ฒŒ์ž„ (๊ด‘์šด๋Œ€ํ•™๊ต ์ปดํ“จํ„ฐ์ •๋ณด๊ณตํ•™๋ถ€ ๊ณ ๊ธ‰Cํ”„๋กœ๊ทธ๋ž˜๋ฐ ํŒ€ํ”„๋กœ์ ํŠธ)
https://github.com/hyeon9mak/hcp_2020

epsilon-greedy frozen-lake-game q-learning q-learning-algorithm

Last synced: 5 months ago
JSON representation

๐ŸŽฎ ํฌ์ผ“๋ชฌ ๊ธธ์ฐพ๊ธฐ ๊ฒŒ์ž„ (๊ด‘์šด๋Œ€ํ•™๊ต ์ปดํ“จํ„ฐ์ •๋ณด๊ณตํ•™๋ถ€ ๊ณ ๊ธ‰Cํ”„๋กœ๊ทธ๋ž˜๋ฐ ํŒ€ํ”„๋กœ์ ํŠธ)

Awesome Lists containing this project

README

          

# ๊ด‘์šด๋Œ€ํ•™๊ต ์ปดํ“จํ„ฐ์ •๋ณด๊ณตํ•™๋ถ€ 2020ํ•™๋…„๋„ 2ํ•™๊ธฐ ๊ณ ๊ธ‰Cํ”„๋กœ๊ทธ๋ž˜๋ฐ ํŒ€ํ”„๋กœ์ ํŠธ
## ํฌ์ผ“๋ชฌ ๊ธธ์ฐพ๊ธฐ ๊ฒŒ์ž„ ์†Œ๊ฐœ
![image](https://user-images.githubusercontent.com/37354145/95642672-fd189500-0ae4-11eb-960d-35753b44e278.png)

- ๊ฒฉ์ž๋กœ ์ด๋ฃจ์–ด์ง„ ๊ฒŒ์ž„ํŒ์— ์ถœ๋ฐœ์ง€์ ๊ณผ ๋„์ฐฉ์ง€์ ์„ ์ž…๋ ฅํ•˜๋ฉด
์Šค์Šค๋กœ ์ตœ์ ํ™”๋œ ๊ฒฝ๋กœ๋ฅผ ์ฐพ์•„๋‚ด๋Š” ๊ฐ•ํ™”ํ•™์Šต ๊ฒŒ์ž„.
- Frozen lake game ๊ณผ ํฌ์ผ“๋ชฌ์Šคํ„ฐ ๊ฒŒ์ž„์—์„œ
์•„์ด๋””์–ด ์ฐฉ์•ˆ.
### 4 x 4 Map
![image](/๋™์ž‘์˜์ƒ/44_45.gif)

### 7 x 7 Map
![image](/๋™์ž‘์˜์ƒ/77_45.gif)

### 10 x 10 Map #[10 x 10 Map ์‹ค์ œ ๋™์ž‘ ์˜์ƒ ๋งํฌ](https://youtu.be/ZSPgoS3yVrI)
![image](/๋™์ž‘์˜์ƒ/1010_45.gif)

์ด ์™ธ์—๋„, Map.txt ํŒŒ์ผ ํŽธ์ง‘์„ ํ†ตํ•ด ์ž์œ ๋กญ๊ฒŒ ๋งต ๊ตฌ์„ฑ ๊ฐ€๋Šฅ!

## ๊ฒŒ์ž„ ์„ค๋ช…
### ๊ฒŒ์ž„ ์‹œ์ž‘ ์ „
![image](/๋™์ž‘์˜์ƒ/before_game.png)
- Map.txt ํŒŒ์ผ์„ ์ด์šฉํ•œ ๊ฒŒ์ž„ ๋งต ๊ตฌ์„ฑ
- Txy -> xy๋Š” ์ง€๋ฆ„๊ธธ/ํ•จ์ • ์ถœ๊ตฌ์˜ ์ขŒํ‘œ
- ์ •ํ™•ํ•œ ๋งต ๊ตฌ์„ฑ์„ ์ด์šฉํ•˜์ง€ ์•Š์„ ์‹œ ์—๋Ÿฌ ๋ฐœ์ƒ!

### ๊ฒŒ์ž„ ์ค‘
![image](/๋™์ž‘์˜์ƒ/in_game.png)
- ์ˆœ์„œ๋Œ€๋กœ ์šฐ, ์ขŒ, ์ƒ, ํ•˜ ๊ธฐ๋Œ€ ๊ฐ’ ์˜๋ฏธ
- 6/30 ํšŒ๋กœ ํ‘œํ˜„๋˜์ง€๋งŒ ์‹ค์ œ 6,000/30,000 ํšŒ์ž„
- 1,000 ๋‹จ์œ„๋กœ ํ™”๋ฉด์— ์ถœ๋ ฅ๋˜๋Š” ์ƒํƒœ

### ๊ฒŒ์ž„ ์ข…๋ฃŒ ํ›„
![image](/๋™์ž‘์˜์ƒ/after_game.png)
- ์ตœ์ข…์ ์œผ๋กœ ํ”Œ๋ ˆ์ด์–ด๊ฐ€ ์ด๋™ํ•œ ์ตœ์ ์˜ ๊ฒฝ๋กœ ํ‘œํ˜„
- Enter ํ‚ค ์ž…๋ ฅ์„ ํ†ตํ•œ ์ข…๋ฃŒ

## ํ”„๋กœ์ ํŠธ ์„ธ๋ถ€
### ํŒ€์›
- ํŒ€์žฅ ๋ฐ•์ •ํ›ˆ ์ปดํ“จํ„ฐ์ •๋ณด๊ณตํ•™๋ถ€ 2020202074
- ํŒ€์› ๊น€ํ˜„์ค‘ ์ปดํ“จํ„ฐ์ •๋ณด๊ณตํ•™๋ถ€ 2020202041
- ํŒ€์› ์ตœ์„ฑ์šฐ ์ปดํ“จํ„ฐ์ •๋ณด๊ณตํ•™๋ถ€ 2019202081
- ํŒ€์› ์ตœํ˜„๊ตฌ ์ปดํ“จํ„ฐ์ •๋ณด๊ณตํ•™๋ถ€ 2015722010

### ์ฃผ์ œ ์„ ์ • ๊ณผ์ •
![image](https://user-images.githubusercontent.com/37354145/95642716-394bf580-0ae5-11eb-8b0f-3c958580a82c.png)

- ์•ŒํŒŒ๊ณ  ์ดํ›„ ๊ฐ•ํ™”ํ•™์Šต AI ๊ด€์‹ฌ ์ฆ๊ฐ€
- Q-learning, E-greedy ํ•™์Šต์„ ํ†ตํ•œ ๊ฐœ๋ฐœ ๊ฐ€๋Šฅ์„ฑ ํ™•์ธ

### ํ”„๋กœ์ ํŠธ ์Šค์ผ€์ฅด๋ง
![image](https://user-images.githubusercontent.com/37354145/95642792-c8590d80-0ae5-11eb-8a27-977fb1ae7614.png)
![image](https://user-images.githubusercontent.com/37354145/95642797-d0b14880-0ae5-11eb-96f9-0d538946b2fe.png)

### ํ•ต์‹ฌ ์•Œ๊ณ ๋ฆฌ์ฆ˜
- Q-learning
- E-greedy

### ์ฐธ๊ณ 
- [์‚ผ์„ฑ sds saida ํŒ€ ์Šคํƒ€ํฌ๋ž˜ํ”„ํŠธ ์ธ๊ณต์ง€๋Šฅ](http://m.hani.co.kr/arti/economy/it/870696.html#cb)
- [์‹œํ–‰์ฐฉ์˜ค ์—†๋Š” ๊ธธ์ฐพ๊ธฐ ์ธ๊ณต์ง€๋Šฅ](http://m.hani.co.kr/arti/science/future/926150.html)
- [ํ™์ฝฉ ๊ณผ๊ธฐ๋Œ€ ๊น€์„ฑํ›ˆ ๊ต์ˆ˜๋‹˜ ๊ฐ•์˜](https://hunkim.github.io/ml/)
- [ํ…์„œํ”Œ๋กœ์šฐ ํ”„๋ ˆ์ž„์›Œํฌ q-learning ์„ค๋ช…](https://www.tensorflow.org/agents/tutorials/0_intro_rl)
- [Frozen lake game](https://colab.research.google.com/github/simoninithomas/Deep_reinforcement_learning_Course/blob/master/Q_Learning_with_FrozenLakev2.ipynb)