https://github.com/rimo02/jigsaw-toxicity
Classifying various toxic comments using DistilBert
https://github.com/rimo02/jigsaw-toxicity
bert nlp toxicity-classification
Last synced: 5 months ago
JSON representation
Classifying various toxic comments using DistilBert
- Host: GitHub
- URL: https://github.com/rimo02/jigsaw-toxicity
- Owner: rimo02
- Created: 2023-03-20T21:19:03.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2024-11-02T12:36:10.000Z (over 1 year ago)
- Last Synced: 2025-04-08T22:22:24.720Z (about 1 year ago)
- Topics: bert, nlp, toxicity-classification
- Language: Jupyter Notebook
- Homepage:
- Size: 26.4 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: readme.md
Awesome Lists containing this project
README
# Toxic Comment Classification using DistilBERT
## Overview
This project aims to build a multi-headed model for detecting various types of toxicity in online comments, including:
- Toxic
- Severe Toxic
- Obscene
- Threat
- Insult
- Identity Hate
Utilizing the DistilBERT model, this approach fine-tunes the model on a dataset of comments from Wikipedia's talk page edits. The model is then deployed using FastAPI, allowing for easy interaction and testing of its capabilities.
# Sample

---
### Note
Run the Jupyter notebook to train the model or contact me to get the pretrained model weights