Projects in Awesome Lists tagged with off-policy | Ecosyste.ms: Awesome

Projects in Awesome Lists tagged with off-policy

A curated list of projects in awesome lists tagged with off-policy .

- Recently synced
- Stars

https://github.com/mishalaskin/curl

CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning

contrastive-learning contrastive-loss contrastive-predictive-coding curl deep-learning deep-learning-algorithms deep-neural-networks deep-q-learning deep-q-network deep-reinforcement-learning deep-rl deeplearning deeplearning-ai gpu model-free-rl off-policy reinforcement-agents reinforcement-learning reinforcement-learning-algorithms sac

Last synced: 05 Apr 2025

https://github.com/MishaLaskin/curl

CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning

contrastive-learning contrastive-loss contrastive-predictive-coding curl deep-learning deep-learning-algorithms deep-neural-networks deep-q-learning deep-q-network deep-reinforcement-learning deep-rl deeplearning deeplearning-ai gpu model-free-rl off-policy reinforcement-agents reinforcement-learning reinforcement-learning-algorithms sac

Last synced: 23 Nov 2024

https://github.com/mishalaskin/rad

RAD: Reinforcement Learning with Augmented Data

codebase data- data-augmentations deep-learning deep-learning-algorithms deep-neural-networks deep-q-learning deep-q-network deep-reinforcement-learning deeplearning-ai dm-control model-free mujoc off-policy ppo rad reinforcement-learning rl sac soft-actor-critic

Last synced: 06 Apr 2025

https://github.com/denisyarats/drq

DrQ: Data regularized Q

actor-critic control data-augmentation deep-learning deep-reinforcement-learning dm-control drq gym model-free mujoco off-policy pixel python pytorch reinforcement-learning rl sac soft-actor-crit

Last synced: 05 May 2025

https://github.com/instadeepai/flashbax

⚡ Flashbax: Accelerated Replay Buffers in JAX

buffers hpc jax machine-learning off-policy reinforcement-learning rl

Last synced: 11 Apr 2025

https://github.com/denisyarats/exorl

ExORL: Exploratory Data for Offline Reinforcement Learning

control datasets deep-learning exporation model-free mujoco off-policy offline-rl python pytorch reinforcement-learning unsupevised

Last synced: 11 May 2025