Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/jamiebrynes7/reddit-to-wordcloud
Convert Reddit Threads into Wordclouds
https://github.com/jamiebrynes7/reddit-to-wordcloud
aws-lambda python reddit wordcloud
Last synced: 23 days ago
JSON representation
Convert Reddit Threads into Wordclouds
- Host: GitHub
- URL: https://github.com/jamiebrynes7/reddit-to-wordcloud
- Owner: jamiebrynes7
- Created: 2018-02-28T18:52:37.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2022-12-08T00:54:09.000Z (about 2 years ago)
- Last Synced: 2024-12-06T22:16:47.644Z (about 1 month ago)
- Topics: aws-lambda, python, reddit, wordcloud
- Language: Python
- Homepage:
- Size: 16.6 KB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 4
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Reddit To Wordcloud
This tool will turn a Reddit thread into a wordcloud.
This project relies heavily on the [word cloud generator](https://github.com/amueller/word_cloud) Python module written by Andreas Mueller.
## Getting Started
**Prerequisites**
You must have Python 3.6 installed on your system.1. Clone this repository with `git clone [email protected]:jamiebrynes7/reddit-to-wordcloud.git`
2. Create a virtual environment in `./venv/` (you can name this something else but will need to edit the make file if you so wish)
3. Install the Python requirements with `pip install -r requirements.txt`
4. (Optional) If you are going to be using AWS Lambda you should run the `/src/setup_dynamo.py` script which will create the tables needed for the operation of the Lambda functions.### Getting a Reddit API Key
1. Go to [https://www.reddit.com/prefs/apps](https://www.reddit.com/prefs/apps) to register for an API key.
2. Select script as the type and enter a name and description and a redirect uri (localhost works fine for this).
3. Copy the client id and export it as an environment variable `export REDDIT_CLIENT_ID="..."`
4. Copy the secret and export it as an environment variable `export REDDIT_CLIENT_SECRET="..."`### Running the CLI Tool
The CLI tool can be ran with `python reddit2wordcloud.py --url=reddit_url --output=wordcloud.png`. You can optionally include a `--config` argument with a path to a JSON configuration file for the word cloud. An example of this config file can be found [here](config.json).
## AWS Lambda
Included in this repository is three Python scripts that can be used for AWS Lambda (serverless execution) of the script. The reason there are three is because of the Amazon's API Gateway timeout limits on requests. If you pass in a particularly large thread to the program, it will exceed this time limit, so the process was split into three. There are 2 user facing functions: one for kicking off the execution (`front-end`) and one for polling for success (`polling`). The third is the meat of the system and is not meant to be user facing.
### Building the AWS Lambda Package
To build the AWS lambda packages, run `make package` to build all three AWS Lambda functions, or run `make package-front-end`, `make package-execute-service`, or `make package-polling` to build a specific one.