Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with safe-reinforcement-learning
A curated list of projects in awesome lists tagged with safe-reinforcement-learning.
https://github.com/PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
ai-safety alpaca beaver datasets deepspeed gpt large-language-models llama llm llms reinforcement-learning reinforcement-learning-from-human-feedback rlhf safe-reinforcement-learning safe-reinforcement-learning-from-human-feedback safe-rlhf safety transformer transformers vicuna
Last synced: 03 Aug 2024
https://github.com/pku-alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
ai-safety alpaca beaver datasets deepspeed gpt large-language-models llama llm llms reinforcement-learning reinforcement-learning-from-human-feedback rlhf safe-reinforcement-learning safe-reinforcement-learning-from-human-feedback safe-rlhf safety transformer transformers vicuna
Last synced: 27 Sep 2024
https://github.com/EzgiKorkmaz/adversarial-reinforcement-learning
Reading list for adversarial perspective and robustness in deep reinforcement learning.
adversarial-attacks adversarial-machine-learning adversarial-policies adversarial-reinforcement-learning ai-alignment ai-safety deep-reinforcement-learning explainable-machine-learning explainable-rl machine-learning-safety meta-reinforcement-learning multiagent-reinforcement-learning reinforcement-learning-generalization reinforcement-learning-safety responsible-ai robust-adversarial-reinforcement-learning robust-machine-learning robust-reinforcement-learning safe-reinforcement-learning safe-rlhf
Last synced: 31 Jul 2024
https://github.com/chauncygu/Safe-Multi-Agent-Isaac-Gym
Safe Multi-Agent Isaac Gym benchmark for safe multi-agent reinforcement learning research.
benchmark multi-agent-reinforcement-learning robotics safe-reinforcement-learning
Last synced: 01 Aug 2024
https://github.com/ZhengYinan-AIR/FISOR
[ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"
diffusion-models hamilton-jacobi-reachability imitation-learning jax offline-reinforcement-learning reinforcement-learning safe-reinforcement-learning
Last synced: 02 Aug 2024