An open API service indexing awesome lists of open source software.

https://github.com/philippkant/soccer-matchthread-analysis

Data analysis on Reddit data: r/soccer match thread comments
https://github.com/philippkant/soccer-matchthread-analysis

matplotlib pandas praw praw-reddit reddit reddit-api

Last synced: about 2 months ago
JSON representation

Data analysis on Reddit data: r/soccer match thread comments

Awesome Lists containing this project

README

          

# Data analysis on Reddit data: r/soccer match thread comments

The following graphs show the number of comments in each minute in a specific r/soccer match thread.

## Example graphs

### 2014 FIFA World Cup Final between Germany and Argentina

Match thread link: [``` https://www.reddit.com/r/soccer/comments/2alnzy/match_thread_germany_vs_argentina_2014_fifa_world/ ```](https://www.reddit.com/r/soccer/comments/2alnzy/match_thread_germany_vs_argentina_2014_fifa_world/)

18723 comments

![2014-world-cup-final](./plots/2014_world_cup_final.png)

### Barcelona 6–1 Paris Saint-Germain "La remontada"

Match thread link: [``` https://www.reddit.com/r/soccer/comments/5y9qif/match_thread_barcelona_vs_paris_saintgermain/ ```](https://www.reddit.com/r/soccer/comments/5y9qif/match_thread_barcelona_vs_paris_saintgermain/)

8387 comments

![barcelona-paris](./plots/barcelona_paris.png)

## Tools used

- [PRAW](https://praw.readthedocs.io/en/stable/) (Python Reddit API Wrapper)
- Jupyter Notebook
- Pandas
- Matplotlib
- Seaborn

## About this repository

I created the two graphs with the two Jupyter Notebooks you see in this repository. You can find the same code as a normal Python file in main.py.