An open API service indexing awesome lists of open source software.

https://github.com/ruvenguna94/dialogue-summary-remove-toxic-text-ppo

Fine-tunes FLAN-T5 with PPO and PEFT so that it generates less toxic dialogue summaries. The notebook uses Meta AI's hate speech model as the reward model in an RLHF training loop.
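
As a rough illustration of the idea in the description, the sketch below shows how a hate speech classifier's output could be turned into a scalar reward and how PPO's clipped objective limits each policy update. It is not taken from the notebook: the function names, the two-class logit layout, and the choice of index 0 for the "not hate" class are all assumptions made for illustration, and the code uses only the Python standard library rather than any real model.

```python
import math

# Assumption: the classifier returns two logits ordered [not_hate, hate].
# The real label order depends on the reward model's configuration.
NOT_HATE_INDEX = 0


def softmax(logits):
    """Convert raw logits to probabilities in a numerically stable way."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def toxicity_reward(logits, not_hate_index=NOT_HATE_INDEX):
    """Hypothetical reward: the raw 'not hate' logit.

    A higher value means the classifier considers the summary less toxic,
    so the policy is rewarded for producing it.
    """
    return logits[not_hate_index]


def ppo_clipped_objective(ratio, advantage, clip_eps=0.2):
    """PPO clipped surrogate objective for a single sample.

    ratio is pi_new(a|s) / pi_old(a|s); clipping it to
    [1 - clip_eps, 1 + clip_eps] keeps one update from moving the
    policy too far from the one that generated the sample.
    """
    clipped_ratio = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
    return min(ratio * advantage, clipped_ratio * advantage)


if __name__ == "__main__":
    example_logits = [2.0, -1.0]  # made-up classifier output
    print("not-hate probability:", softmax(example_logits)[NOT_HATE_INDEX])
    print("reward:", toxicity_reward(example_logits))
    print("clipped objective:", ppo_clipped_objective(1.5, 1.0))
```

In a real setup the logits would come from the reward model scoring each generated summary, and a library would handle the PPO update; the sketch only shows the arithmetic behind those two steps.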

detoxification dialogue-summarization generative-ai hate-speech-detection nlp ppo-pytorch reward-model toxic-comment-classification toxicity-analysis

Last synced: about 1 month ago

