https://github.com/jianzhnie/scalerl

ScaleRL is a simple and scalable distributed reinforcement learning framework based on Python and PyTorch
https://github.com/jianzhnie/scalerl

a3c distributed-systems dppo impala parallel-computing seed-rl

Last synced: 7 months ago
JSON representation

ScaleRL is a simple and scalable distributed reinforcement learning framework based on Python and PyTorch

Host: GitHub
URL: https://github.com/jianzhnie/scalerl
Owner: jianzhnie
License: apache-2.0
Created: 2024-10-15T06:37:13.000Z (12 months ago)
Default Branch: main
Last Pushed: 2024-10-30T02:28:52.000Z (11 months ago)
Last Synced: 2024-10-30T05:01:45.943Z (11 months ago)
Topics: a3c, distributed-systems, dppo, impala, parallel-computing, seed-rl
Language: Python
Homepage: https://jianzhnie.github.io/llmtech/
Size: 221 KB
Stars: 1
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # ScaleRL

ScaleRL is a simple and scalable distributed reinforcement learning framework based on Python and PyTorch

### Distributed RL Libraries

- https://github.com/ray-project/ray

- https://github.com/pytorch/rl

- https://github.com/facebookresearch/torchbeast

- https://github.com/facebookresearch/rlmeta

- https://github.com/alex-petrenko/sample-factory

- https://github.com/sjtu-marl/malib.git

- https://github.com/Replicable-MARL/MARLlib

- https://github.com/seolhokim/DistributedRL-Pytorch-Ray.git

### Distributed RL Blogs

- https://www.jiqizhixin.com/articles/2024-02-15-6?from=synced&keyword=%E5%88%86%E5%B8%83%E5%BC%8F%E5%BC%BA%E5%8C%96%E5%AD%A6%E4%B9%A0

- https://joseluisc99.github.io/posts/distributed-reinforcement-learning-a-draft/

## Distributed Framework

\[1\] Massively Parallel Methods for Deep Reinforcement Learning (SGD, first distributed architecture, Gorilla DQN).

\[2\] Asynchronous Methods for Deep Reinforcement Learning (SGD, A3C).

\[3\] Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU (A3C on GPU).

\[4\] Efficient Parallel Methods for Deep Reinforcement Learning (Batched A2C, GPU).

\[5\] Evolution Strategies as a Scalable Alternative to Reinforcement Learning (ES).

\[6\] Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for

Reinforcement Learning (ES).

\[7\] RLlib: Abstractions for Distributed Reinforcement Learning (Library)

\[8\] Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes (Batched A3C).

\[9\] Distributed Prioritized Experience Replay (Ape-X, distributed replay buffer).

\[10\] IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures (CPU+GPU).

\[11\] Accelerated Methods for Deep Reinforcement Learning (Simulation Acceleration).

\[12\] GPU-Accelerated Robotic Simulation for Distributed Reinforcement Learning (Simulation Acceleration).

\[13\] DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames (DD-PPO)

\[14\] Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning (Sample Factory)

\[15\] SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference (SEED RL)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/jianzhnie/scalerl

Awesome Lists containing this project

README