Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/dhfbk/WhatsApp-Dataset
- Host: GitHub
- URL: https://github.com/dhfbk/WhatsApp-Dataset
- Owner: dhfbk
- Created: 2018-09-03T12:44:04.000Z (about 6 years ago)
- Default Branch: master
- Last Pushed: 2021-03-18T08:22:39.000Z (over 3 years ago)
- Last Synced: 2024-08-02T04:02:26.052Z (3 months ago)
- Size: 5.43 MB
- Stars: 8
- Watchers: 4
- Forks: 8
- Open Issues: 0
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-italian - WhatsApp Dataset - WhatsApp dataset to study cyberbullying among Italian students aged 12-13 in the context of the CREEP EIT project (Corpora / Hate speech recognition)
README
# WhatsApp Dataset
We developed this WhatsApp dataset to study cyberbullying among Italian students aged 12-13 in the context of the [CREEP EIT project](http://creep-project.eu/).
The corpus of WhatsApp chats consists of 14,600 tokens across 10 chats. All chats were annotated by two annotators using the [CAT web-based tool](https://dh.fbk.eu/resources/cat-content-annotation-tool), following the same guidelines.
Our guidelines are an Italian adaptation of the “Guidelines for the Fine-Grained Analysis of Cyberbullying” developed for English by the Language and Translation Technology Team of Ghent University. Compared with the original guidelines, we added a new type of insult called "Body Shame" to cover expressions that criticize someone based on the shape, size, or appearance of his/her body. We also changed the original type "Encouragement to the Harasser" to "Encouragement to the Harassment", so as to include all incitements between the bully and his/her assistants.
**Reference:**
Rachele Sprugnoli, Stefano Menini, Sara Tonelli, Filippo Oncini, Enrico Maria Piras. 2018. "Creating a WhatsApp Dataset to Study Pre-teen Cyberbullying". In Proceedings of the 2nd Workshop on Abusive Language Online (ALW2). https://www.aclweb.org/anthology/W18-5107/