https://github.com/Jingliang-Duan/DSAC-v1

DSAC; Distributional Soft Actor-Critic
https://github.com/Jingliang-Duan/DSAC-v1

pytorch reinforcement-learning

Last synced: 5 months ago
JSON representation

DSAC; Distributional Soft Actor-Critic

Host: GitHub
URL: https://github.com/Jingliang-Duan/DSAC-v1
Owner: Jingliang-Duan
Created: 2020-03-03T03:57:34.000Z (about 5 years ago)
Default Branch: master
Last Pushed: 2024-02-22T16:52:44.000Z (about 1 year ago)
Last Synced: 2024-10-11T02:02:32.777Z (7 months ago)
Topics: pytorch, reinforcement-learning
Language: Python
Homepage:
Size: 9.36 MB
Stars: 109
Watchers: 7
Forks: 31
Open Issues: 3
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

StarryDivineSky - Jingliang-Duan/Distributional-Soft-Actor-Critic

README

## Reference
- [Distributional Soft Actor-Critic (DSAC)](https://ieeexplore.ieee.org/document/9448360)

## Requires
1. Windows 7 or greater or Linux.
2. Python 3.8.
3. The installation path must be in English.

## Installation
```bash
# Please make sure not to include Chinese characters in the installation path, as it may result in a failed execution.
# clone DSAC_v1 repository
git clone [email protected]/Jingliang-Duan/DSAC_v1
cd DSAC_v1
# create conda environment
conda env create -f DSAC1.0_environment.yml
conda activate DSAC1.0
# install DSAC1.0
pip install -e.
```

## Train
These are two examples of running DSAC-v1 on two environments.
Train the policy by running:
```bash
cd example_train
#Train a pendulum task
python main.py
#Train a humanoid task. To execute this file, Mujoco and Mujoco-py need to be installed first.
python dsac_mlp_humanoidconti_offserial.py
```
After training, the results will be stored in the "DSAC_v1/results" folder.

## Simulation
In the "DSAC-v1/results" folder, pick the path to the folder where the policy will be applied to the simulation and select the appropriate PKL file for the simulation.
```bash
python run_policy.py
#you may need to "pip install imageio-ffmpeg" before running this file on Windows.
```
After running, the simulation vedio and state&action curve figures will be stored in the "DSAC_v1/figures" folder.

## Acknowledgment
We would like to thank all members in Intelligent Driving Laboratory (iDLab), School of Vehicle and Mobility, Tsinghua University for making excellent contributions and providing helpful advices for DSAC-v1.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/Jingliang-Duan/DSAC-v1

Awesome Lists containing this project

README