https://github.com/zhaoyingjun/general

Alignment成为GPT类大模型微调的必须环节，深度强化学习是Alignment的核心。本项目是一个支持非gym环境训练、支持可视化配置的深度强化学习应用编程框架，30分钟上手强化学习编程。
https://github.com/zhaoyingjun/general

ddpg deep-reinforcement-learning dqn gui gym ppo tensorflow2

Last synced: over 1 year ago
JSON representation

Alignment成为GPT类大模型微调的必须环节，深度强化学习是Alignment的核心。本项目是一个支持非gym环境训练、支持可视化配置的深度强化学习应用编程框架，30分钟上手强化学习编程。

Host: GitHub
URL: https://github.com/zhaoyingjun/general
Owner: zhaoyingjun
License: mit
Created: 2020-01-31T12:34:38.000Z (over 6 years ago)
Default Branch: master
Last Pushed: 2023-03-25T00:18:12.000Z (over 3 years ago)
Last Synced: 2025-03-27T07:35:53.319Z (over 1 year ago)
Topics: ddpg, deep-reinforcement-learning, dqn, gui, gym, ppo, tensorflow2
Language: Python
Homepage:
Size: 106 KB
Stars: 72
Watchers: 9
Forks: 18
Open Issues: 2

Awesome Lists containing this project