https://github.com/likejazz/siamese-lstm

Siamese LSTM for evaluating semantic similarity between sentences of the Quora Question Pairs Dataset.
https://github.com/likejazz/siamese-lstm

deep-learning keras lstm nlp

Last synced: about 2 months ago
JSON representation

Siamese LSTM for evaluating semantic similarity between sentences of the Quora Question Pairs Dataset.

Host: GitHub
URL: https://github.com/likejazz/siamese-lstm
Owner: likejazz
Created: 2018-03-16T05:56:58.000Z (over 7 years ago)
Default Branch: master
Last Pushed: 2018-03-22T02:23:30.000Z (over 7 years ago)
Last Synced: 2025-03-31T18:21:21.372Z (3 months ago)
Topics: deep-learning, keras, lstm, nlp
Language: Python
Homepage:
Size: 29.3 KB
Stars: 254
Watchers: 11
Forks: 70
Open Issues: 5
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

        # Siamese-LSTM

Using MaLSTM model(Siamese networks + LSTM with Manhattan distance) to detect semantic similarity between question pairs. Training dataset used is a subset of the original Quora Question Pairs Dataset(~363K pairs used).

It is Keras implementation based on [Original Paper(PDF)](http://www.mit.edu/~jonasm/info/MuellerThyagarajan_AAAI16.pdf) and [Excellent Medium Article](https://medium.com/mlreview/implementing-malstm-on-kaggles-quora-question-pairs-competition-8b31b0b16a07).

![](https://cloud.githubusercontent.com/assets/9861437/20479493/6ea8ad12-b004-11e6-89e4-53d4d354d32e.png)

## Prerequisite

- Paper, Articles

    - [Siamese Recurrent Architectures for Learning Sentence Similarity](http://www.mit.edu/~jonasm/info/MuellerThyagarajan_AAAI16.pdf)

    - [How to predict Quora Question Pairs using Siamese Manhattan LSTM](https://medium.com/mlreview/implementing-malstm-on-kaggles-quora-question-pairs-competition-8b31b0b16a07)

- Data

    - [GoogleNews-vectors-negative300.bin.gz](https://drive.google.com/file/d/0B7XkCwpI5KDYNlNUTTlSS21pQmM/edit?usp=sharing)

    - [Kaggle's Quora Question Pairs Dataset](https://www.kaggle.com/c/quora-question-pairs/data)

- References

    - [aditya1503/Siamese-LSTM](https://github.com/aditya1503/Siamese-LSTM) Original author's GitHub

    - [dhwajraj/deep-siamese-text-similarity](https://github.com/dhwajraj/deep-siamese-text-similarity) TensorFlow based implementation

Kaggle's `test.csv` is too big, so I had extracted only the top 20 questions and created a file called `test-20.csv` and It is used in the `predict.py`.

You should put all data files to `./data` directory.

## How to Run

### Training

```

$ python3 train.py

```

### Predicting

It uses `test-20.csv` file mentioned above.

```

$ python3 predict.py

```

## The Results

I have tried with various parameters such as number of hidden states of LSTM cell, activation function of LSTM cell and repeated count of epochs.

I have used NVIDIA Tesla P40 GPU x 2 for training and 10% data was used as the validation set(batch size=1024*2).

As a result, I have reached about 82.29% accuracy after 50 epochs about 10 mins later.

```

Epoch 50/50

363861/363861 [==============================] - 12s 33us/step - loss: 0.1172 - acc: 0.8486 - val_loss: 0.1315 - val_acc: 0.8229

Training time finished.

50 epochs in       601.24

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/likejazz/siamese-lstm

Awesome Lists containing this project

README