https://github.com/abhishekkrthakur/is_that_a_duplicate_quora_question
- Host: GitHub
- URL: https://github.com/abhishekkrthakur/is_that_a_duplicate_quora_question
- Owner: abhishekkrthakur
- Created: 2017-02-27T08:35:53.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2019-03-26T10:29:16.000Z (over 6 years ago)
- Last Synced: 2025-03-29T18:06:36.635Z (6 months ago)
- Topics: classification, deep-learning, machine-learning, quora
- Language: Python
- Size: 4.88 KB
- Stars: 443
- Watchers: 25
- Forks: 175
- Open Issues: 7
# is_that_a_duplicate_quora_question
All the code for the article https://www.linkedin.com/pulse/duplicate-quora-question-abhishek-thakur will be available here.
## How To
1. Install Required Libraries
```
pip install pandas
pip install numpy
pip install scikit-learn
pip install nltk
pip install tqdm
pip install keras
pip install tensorflow
pip install pyemd
pip install fuzzywuzzy
pip install python-levenshtein
pip install --upgrade gensim
```
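After installing, it can help to confirm that every library is importable before moving on. The following is a minimal sketch, not part of the repository: the `PIP_TO_MODULE` mapping and the `missing_modules` helper are hypothetical names, and the import names `sklearn` and `Levenshtein` are the usual ones for `scikit-learn` and `python-levenshtein` rather than something this README states.

```python
from importlib.util import find_spec

# pip package name -> module name used in `import` statements.
# Assumption: `sklearn` and `Levenshtein` are the import names for
# scikit-learn and python-levenshtein; the rest match their pip names.
PIP_TO_MODULE = {
    "pandas": "pandas",
    "numpy": "numpy",
    "scikit-learn": "sklearn",
    "nltk": "nltk",
    "tqdm": "tqdm",
    "keras": "keras",
    "tensorflow": "tensorflow",
    "pyemd": "pyemd",
    "fuzzywuzzy": "fuzzywuzzy",
    "python-levenshtein": "Levenshtein",
    "gensim": "gensim",
}


def missing_modules(module_names):
    """Return the module names that cannot be found on the import path."""
    return [name for name in module_names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(PIP_TO_MODULE.values())
    if missing:
        print("Not importable:", ", ".join(missing))
    else:
        print("All required libraries are importable.")
```

The check only looks the modules up without importing them, so it stays fast even for heavy packages such as `tensorflow`.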
2. Download Required Data and Language Resources
```
mkdir data
cd data
wget http://www-nlp.stanford.edu/data/glove.840B.300d.zip
unzip glove.840B.300d.zip
rm glove.840B.300d.zip
wget http://qim.ec.quoracdn.net/quora_duplicate_questions.tsv
wget https://s3.amazonaws.com/dl4j-distribution/GoogleNews-vectors-negative300.bin.gz
sudo python -m nltk.downloader stopwords
cd ..
```
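Because the downloads above are large and any one of them can fail partway, a quick check that the files landed in `data/` may save a failed run later. This is a sketch under assumptions: the helper `missing_files` and the list `EXPECTED_FILES` are hypothetical, and the name `glove.840B.300d.txt` assumes that the GloVe archive unpacks to a single text file with that name.

```python
from pathlib import Path

# Files the download commands are expected to leave in data/.
EXPECTED_FILES = [
    "glove.840B.300d.txt",  # assumption: name of the file inside the GloVe zip
    "quora_duplicate_questions.tsv",
    "GoogleNews-vectors-negative300.bin.gz",
]


def missing_files(data_dir, expected=EXPECTED_FILES):
    """Return the expected file names that are not present in data_dir."""
    data_dir = Path(data_dir)
    return [name for name in expected if not (data_dir / name).is_file()]


if __name__ == "__main__":
    missing = missing_files("data")
    if missing:
        print("Missing from data/:", ", ".join(missing))
    else:
        print("All expected files are present in data/.")
```

Run it from the repository root, the directory that contains `data/`.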
3. Run
```
python feature_engineering.py
python deepnet.py
```
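The two commands can also be driven from a single Python entry point that stops at the first failure. This is only a sketch: `run_in_order` is a hypothetical helper, not something shipped with the repository, and it assumes the scripts are run from the repository root in the order listed in this README.

```python
import subprocess
import sys


def run_in_order(scripts):
    """Run each script with the current interpreter, stopping on failure.

    Returns the list of scripts that completed. A non-zero exit status
    raises subprocess.CalledProcessError, so later scripts never start.
    """
    completed = []
    for script in scripts:
        # check=True turns a non-zero exit code into an exception.
        subprocess.run([sys.executable, script], check=True)
        completed.append(script)
    return completed
```

Calling `run_in_order(["feature_engineering.py", "deepnet.py"])` then mirrors the two commands above, using the same interpreter that runs the helper.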