https://github.com/redleader962/t13-deep-learning-project

Host: GitHub
URL: https://github.com/redleader962/t13-deep-learning-project
Owner: RedLeader962
Created: 2021-01-24T22:45:33.000Z (over 5 years ago)
Default Branch: master
Last Pushed: 2021-05-05T15:28:40.000Z (about 5 years ago)
Last Synced: 2025-04-13T09:14:07.028Z (over 1 year ago)
Language: Jupyter Notebook
Size: 129 MB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

README

# Exploring Reward Redistribution in Reinforcement Learning

by [Luc Coupal](https://redleader962.github.io), [Francois-Alexandre Tremblay](https://www.linkedin.com/in/francois-alexandre-tremblay-m-sc-2b212146/) and [William-Ricardo Bonilla-Villatoro](william-ricardo.bonilla-villatoro.1@ulaval.ca)

Experimental project based on the paper ***"RUDDER: Return Decomposition for Delayed Rewards"*** by *Arjona-Medina, J. A.* *et al.*, carried out for the course **GLO-7030 Apprentissage par réseaux de neurones profonds** (Deep Neural Network Learning) given at [Université Laval](https://www.fsg.ulaval.ca), QC, Canada.

[Download the PDF](https://github.com/RedLeader962/T13-Deep-Learning-Project/raw/master/T13_Deep_Learning_Project_Report-v1.pdf) of the experiment report.

For a quick overview of the key concepts in the paper, watch our oral presentation, [Une intuition sur *RUDDER* (*Return Decomposition for Delayed Rewards*)](https://youtu.be/2xH1TjVt9I8) ("An intuition for RUDDER"), on ***YouTube*** (6 min 24 s).

The **slides** from the oral presentation are available [here](https://github.com/RedLeader962/Une-intuition-sur-RUDDER).
