Projects in Awesome Lists tagged with off-policy
A curated list of projects in awesome lists tagged with off-policy.
https://github.com/mishalaskin/curl
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
contrastive-learning contrastive-loss contrastive-predictive-coding curl deep-learning deep-learning-algorithms deep-neural-networks deep-q-learning deep-q-network deep-reinforcement-learning deep-rl deeplearning deeplearning-ai gpu model-free-rl off-policy reinforcement-agents reinforcement-learning reinforcement-learning-algorithms sac
Last synced: 05 Apr 2025
https://github.com/MishaLaskin/curl
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
contrastive-learning contrastive-loss contrastive-predictive-coding curl deep-learning deep-learning-algorithms deep-neural-networks deep-q-learning deep-q-network deep-reinforcement-learning deep-rl deeplearning deeplearning-ai gpu model-free-rl off-policy reinforcement-agents reinforcement-learning reinforcement-learning-algorithms sac
Last synced: 23 Nov 2024
https://github.com/mishalaskin/rad
RAD: Reinforcement Learning with Augmented Data
codebase data- data-augmentations deep-learning deep-learning-algorithms deep-neural-networks deep-q-learning deep-q-network deep-reinforcement-learning deeplearning-ai dm-control model-free mujoc off-policy ppo rad reinforcement-learning rl sac soft-actor-critic
Last synced: 06 Apr 2025
https://github.com/denisyarats/drq
DrQ: Data regularized Q
actor-critic control data-augmentation deep-learning deep-reinforcement-learning dm-control drq gym model-free mujoco off-policy pixel python pytorch reinforcement-learning rl sac soft-actor-crit
Last synced: 05 May 2025
https://github.com/instadeepai/flashbax
⚡ Flashbax: Accelerated Replay Buffers in JAX
buffers hpc jax machine-learning off-policy reinforcement-learning rl
Last synced: 11 Apr 2025
https://github.com/denisyarats/exorl
ExORL: Exploratory Data for Offline Reinforcement Learning
control datasets deep-learning exporation model-free mujoco off-policy offline-rl python pytorch reinforcement-learning unsupevised
Last synced: 11 May 2025