Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/damascenorafael/identify-bots-reddit-comment-network
Characterization and classification of bots using only structural characteristics of the network. Python development of network construction, component analysis and Neural Network for classification.
https://github.com/damascenorafael/identify-bots-reddit-comment-network
bot-detection complex-networks machine-learning network-analysis network-science networkx reddit reddit-bot
Last synced: 3 months ago
JSON representation
Characterization and classification of bots using only structural characteristics of the network. Python development of network construction, component analysis and Neural Network for classification.
- Host: GitHub
- URL: https://github.com/damascenorafael/identify-bots-reddit-comment-network
- Owner: DamascenoRafael
- License: gpl-3.0
- Created: 2019-12-03T19:47:28.000Z (about 5 years ago)
- Default Branch: master
- Last Pushed: 2023-05-22T23:55:30.000Z (over 1 year ago)
- Last Synced: 2024-05-02T06:09:22.685Z (9 months ago)
- Topics: bot-detection, complex-networks, machine-learning, network-analysis, network-science, networkx, reddit, reddit-bot
- Language: Jupyter Notebook
- Homepage:
- Size: 26.9 MB
- Stars: 5
- Watchers: 2
- Forks: 0
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Characterization and Classification of Bots using Reddit’s Comment Network
## About
This repository refers to a study to characterize and identify bots in a social network using only the network structure. This work considers a directed network of users constructed from comments in Reddit. The network characterization highlights the significant structural differences of bots, allowing them to be classified using only network features.
This work was guided by Professor [Daniel R. Figueiredo](http://www.land.ufrj.br/~daniel/) (PESC / COPPE / UFRJ).
## Publications
- [Full Paper on Brazilian Computing Society (pt-BR)](https://sol.sbc.org.br/index.php/wperformance/article/view/6471)
Presented at the XXXIX Congresso da Sociedade Brasileira de Computação- [Undergraduate Project (Poli / COPPE / UFRJ) (pt-BR)](http://monografias.poli.ufrj.br/monografias/monopoli10028221.pdf)
as a partial fulfillment of the requirements for the degree of Computer and Information Engineer## Using this repository
The code elaborated for this work is compatible with [Python 3.7.8](https://www.python.org/).
To install all dependencies and use Jupyter Notebook with a virtual environment:
```bash
python3 -m venv venv
source venv/bin/activate
pip3 install -r requirements.txt
python3 -m ipykernel install --user --name=venv
jupyter notebook
```In the `notebooks` folder are the [Jupyter Notebook](https://jupyter.org/) files that can be used to evaluate other datasets.
Reddit Comments datasets can be found in [pushshift](https://files.pushshift.io/reddit/comments/), the files should be placed uncompressed in `data/raw/`.