https://github.com/vermouth1992/safe_rl_papers


# A List of Safe Reinforcement Learning Papers

## Algorithms
### Safe Exploration
- [Safe Exploration in Continuous Action Spaces](https://arxiv.org/pdf/1801.08757.pdf)
- [Safe Exploration for Interactive Machine Learning](https://arxiv.org/abs/1910.13726)
- [Safely Probabilistically Complete Real-Time Planning and Exploration in Unknown Environments](https://arxiv.org/pdf/1811.07834.pdf)

### Safe Planning
- [Safe Planning via Model Predictive Shielding](https://arxiv.org/pdf/1905.10691.pdf)

### Policy Learning
- [A Lyapunov-based Approach to Safe Reinforcement Learning](https://arxiv.org/pdf/1805.07708.pdf)
- [Constrained Policy Optimization](https://arxiv.org/abs/1705.10528)
- [End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks](https://rcheng805.github.io/files/aaai2019.pdf)
  - [Code](https://github.com/rcheng805/RL-CBF)
- [Risk-Constrained Reinforcement Learning with Percentile Risk Criteria](https://arxiv.org/pdf/1512.01629.pdf)
- [Convergent Policy Optimization for Safe Reinforcement Learning](https://arxiv.org/abs/1910.12156)
- [Lyapunov-based Safe Policy Optimization for Continuous Control](https://arxiv.org/pdf/1901.10031.pdf)
- [Neural Lyapunov Control](http://papers.nips.cc/paper/8587-neural-lyapunov-control.pdf)
- [CAQL: Continuous Action Q-Learning](https://arxiv.org/pdf/1909.12397.pdf)

### Safe Reinforcement Learning with Stability Guarantees
- [Safe Model-based Reinforcement Learning with Stability Guarantees](https://papers.nips.cc/paper/6692-safe-model-based-reinforcement-learning-with-stability-guarantees.pdf)
- [The Lyapunov Neural Network: Adaptive Stability Certification for Safe Learning of Dynamical Systems](https://arxiv.org/pdf/1808.00924.pdf)
- [Safe Learning of Regions of Attraction for Uncertain, Nonlinear Systems with Gaussian Processes](https://arxiv.org/pdf/1603.04915.pdf)
  - [Code](https://github.com/befelix/safe_learning)

### Human in the Loop
- [Trial without Error: Towards Safe Reinforcement Learning via Human Intervention](https://arxiv.org/abs/1707.05173)

## Surveys
- [A Comprehensive Survey on Safe Reinforcement Learning](http://www.jmlr.org/papers/volume16/garcia15a/garcia15a.pdf)

## Benchmarks
- [Benchmarking Safe Exploration in Deep Reinforcement Learning](https://d4mucfpksywv.cloudfront.net/safexp-short.pdf)
- [The CityLearn Challenge](https://sites.google.com/view/citylearnchallenge)
  - [Code](https://github.com/intelligent-environments-lab/CityLearn)

## Thesis
- [Safe Reinforcement Learning](https://people.cs.umass.edu/~pthomas/papers/Thomas2015c.pdf)

## Lectures
- [Safe Reinforcement Learning](https://web.stanford.edu/class/cs234/slides/2017/cs234_guest_lecture_safe_rl.pdf)