https://github.com/guyulongcs/Deep-Learning-for-Search-Recommendation-Advertisements/blob/master/07_LLM/01_LLM_Classical/2022%20%28OpenAI%29%20%28Arxiv%29%20%5BInstructGPT%5D%20%5BRLHF%5D%20Training%20language%20models%20to%20follow%20instructions%20with%20human%20feedback.pdf
