Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists by vwxyzjn
A curated list of projects in awesome lists by vwxyzjn.
https://github.com/vwxyzjn/cleanrl
High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
a2c actor-critic advantage-actor-critic ale atari deep-learning deep-reinforcement-learning gym machine-learning phasic-policy-gradient ppo proximal-policy-optimization python pytorch reinforcement-learning wandb
Last synced: 29 Oct 2024
https://github.com/vwxyzjn/ppo-implementation-details
The source code for the blog post "The 37 Implementation Details of Proximal Policy Optimization"
Last synced: 26 Oct 2024
https://github.com/vwxyzjn/portwarden
Create Encrypted Backups of Your Bitwarden Vault with Attachments
bitwarden docker encryption k8s
Last synced: 30 Oct 2024
https://github.com/vwxyzjn/lm-human-preference-details
RLHF implementation details of OpenAI's 2019 codebase
Last synced: 27 Oct 2024
https://github.com/vwxyzjn/invalid-action-masking
Source code for "A Closer Look at Invalid Action Masking in Policy Gradient Algorithms"
Last synced: 27 Oct 2024
https://github.com/vwxyzjn/cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
Last synced: 26 Oct 2024
https://github.com/vwxyzjn/gym-microrts-paper
The source code for the gym-microrts paper.
Last synced: 28 Oct 2024
https://github.com/vwxyzjn/a2c_is_a_special_case_of_ppo
A2C is a special case of PPO!
Last synced: 28 Oct 2024
https://github.com/vwxyzjn/jupyter_disqus
Add Disqus to your Jupyter notebook.
disqus ipython-notebook jupyter jupyter-notebook python
Last synced: 27 Sep 2024
https://github.com/vwxyzjn/vectorized-value-methods
[WIP] Vectorized architecture for value-based methods such as DQN and DDPG
Last synced: 11 Oct 2024
https://github.com/vwxyzjn/lp_optimization_python
Linear Programming for Optimal Scheduling Using Gurobipy
Last synced: 09 Nov 2024
https://github.com/vwxyzjn/sentiment-analysis-lstm
Uses a neural network to classify movie reviews by sentiment
Last synced: 09 Nov 2024
https://github.com/vwxyzjn/launcha
Launcha is a simple Docker-based cloud job launcher.
Last synced: 09 Nov 2024
https://github.com/vwxyzjn/penspider
A web crawler that crawls fountain pen listings
Last synced: 09 Nov 2024
https://github.com/vwxyzjn/plane-shooting-problem-dynamic-programming
Dynamic Programming
Last synced: 09 Nov 2024
https://github.com/vwxyzjn/reproduction_of_newsfeed.fb.com
A reproduction of newsfeed.fb.com built with GSAP and Vue.js
Last synced: 09 Nov 2024
https://github.com/vwxyzjn/tensorflowbyexample
Make TensorFlow more practical and less magical
machine-learning python tensorflow
Last synced: 09 Nov 2024
https://github.com/vwxyzjn/cocalc_docker_python3
cocalc docker jupyter-notebook sagemath
Last synced: 09 Nov 2024
https://github.com/vwxyzjn/abstract_algebra_finite_group_generator
A brute force program that enumerates all possible permutations of binary operations on a given set.
Last synced: 09 Nov 2024