Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/anvaka/sayit

Visualization of related subreddits
https://github.com/anvaka/sayit

Last synced: 14 days ago
JSON representation

Visualization of related subreddits

Awesome Lists containing this project

README

        

# visualization of related subreddits

This project builds a graph of related subreddits.

[![demo](https://i.imgur.com/xKlxRkf.gif)](https://anvaka.github.io/sayit/)

Recommendations are constructed based on _Redditors who commented to this subreddit, also commented to..._

Play with it here: https://anvaka.github.io/sayit/

## The data

I used data from two months worth of comments (August and September of 2018) - which contains ~38 millions `user <-> subreddit` records.

You can find original data by [following this discussion](https://www.reddit.com/r/datasets/comments/3bxlg7/i_have_every_publicly_available_reddit_comment/)

I computed Jaccard Similarity between subreddits, and then stored results into github pages. Repository [is available here](https://github.com/anvaka/sayit-data). Please let me know if you are curious to learn more about this or anything else - feel free to reach out to me on [twitter](https://twitter.com/anvaka) or via issues in this repository

Note: for very popular subreddits Jaccard Similarity didn't give meaningful results. They all were connected
to each other (e.g. `/r/aww`, `/r/pics`, `/r/funny` and so on). I manually collected references to other subreddits
from subreddit description where it was available. Where description did not include any recommendation -
I looked into actual comments and used most often mentioned subreddits as "related". You can find list of all
overrides in the [sayit-data](https://github.com/anvaka/sayit-data#override) repository

## Local Build Setup

```bash
# install dependencies
npm install

# serve with hot reload at localhost:8080
npm run dev

# build for production with minification
npm run build

# build for production and view the bundle analyzer report
npm run build --report
```

For a detailed explanation on how things work, check out the [guide](http://vuejs-templates.github.io/webpack/) and [docs for vue-loader](http://vuejs.github.io/vue-loader).

# Thanks!

Credit for data goes to [Jason Baumgartner](https://twitter.com/jasonbaumgartne) also known as [u/Stuck_In_the_Matrix](https://www.reddit.com/user/Stuck_In_the_Matrix/).

Huge thanks to [Felipe Hoffa](https://twitter.com/felipehoffa) for putting the data into BigQuery.

If you like this work and would like to support it - I have [a Patreon page](https://www.patreon.com/anvaka),
[paypal.me](https://www.paypal.com/paypalme2/anvakos/3), [github sponsors](https://github.com/sponsors/anvaka).

Thank you!