https://github.com/daveshap/rlhi
Reinforcement Learning with Heuristic Imperatives - Finetuning LLMs for Post-Conventional Moral Intuition
https://github.com/daveshap/rlhi
Last synced: 12 months ago
JSON representation
Reinforcement Learning with Heuristic Imperatives - Finetuning LLMs for Post-Conventional Moral Intuition
- Host: GitHub
- URL: https://github.com/daveshap/rlhi
- Owner: daveshap
- License: mit
- Created: 2023-04-22T15:08:54.000Z (about 3 years ago)
- Default Branch: main
- Last Pushed: 2023-04-30T22:06:57.000Z (about 3 years ago)
- Last Synced: 2025-04-07T20:44:05.173Z (about 1 year ago)
- Language: Python
- Homepage:
- Size: 20.6 MB
- Stars: 64
- Watchers: 5
- Forks: 19
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE