https://github.com/daveshap/rlhi

Reinforcement Learning with Heuristic Imperatives - Finetuning LLMs for Post-Conventional Moral Intuition
https://github.com/daveshap/rlhi

Last synced: 12 months ago
JSON representation

Reinforcement Learning with Heuristic Imperatives - Finetuning LLMs for Post-Conventional Moral Intuition

Host: GitHub
URL: https://github.com/daveshap/rlhi
Owner: daveshap
License: mit
Created: 2023-04-22T15:08:54.000Z (about 3 years ago)
Default Branch: main
Last Pushed: 2023-04-30T22:06:57.000Z (about 3 years ago)
Last Synced: 2025-04-07T20:44:05.173Z (about 1 year ago)
Language: Python
Homepage:
Size: 20.6 MB
Stars: 64
Watchers: 5
Forks: 19
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project