https://github.com/guyulongcs/Awesome-Deep-Learning-Papers-for-Search-Recommendation-Advertising/blob/master/06_LLM/02_LLM_Classical/2017%20%28OpenAI%29%20%28NIPS%29%20%5BRLHF%5D%20Deep%20Reinforcement%20Learning%20from%20Human%20Preferences.pdf
Last synced: 4 months ago
JSON representation