https://github.com/flaque/stackexchange
A data science project to answer the question: "How do you write a good answer to a question?"
https://github.com/flaque/stackexchange
Last synced: 2 months ago
JSON representation
A data science project to answer the question: "How do you write a good answer to a question?"
- Host: GitHub
- URL: https://github.com/flaque/stackexchange
- Owner: Flaque
- Created: 2016-11-05T20:02:44.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2016-12-15T05:39:22.000Z (over 8 years ago)
- Last Synced: 2025-01-20T10:13:49.802Z (4 months ago)
- Language: Python
- Size: 69.3 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# stackexchange
A data science project to answer the question:
"How do you write a good answer to a question?"
## Setup
### Python Installs
1. Install Spacy:
```pip install -U spacy ```2. Install the Spacy English Language Model:
```python -m spacy.en.download all```### Setting up XML files
1. Extract as many stackexchanges as you want [from this archive](https://archive.org/details/stackexchange). Put the `xyz.stackexchange.com` folders into a single folder and call it
what you want.
2. In `populate_database.py`, change `DATA_FOLDER` to be your filename.### Database
1. Setup a mysql instance with a database called `stackexchange`.
2. Edit `toMySQL.py` variables `host`, `user`, and `database` near the top of
the file to represent your database credentials.
3. Run the `createDatabase.sql` file to create the tables in your database. If you're using something like SequelPro (pancakes), you can press `cmd-shift-i` to import an sql file.
4. If you have all your XML and database stuff setup correctly, you should be able to run `python populate_database.py` and it will fill
your database with the extracted features.