Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/l11x0m7/Question_Answering_Models
This repo collects and re-produces models related to domains of question answering and machine reading comprehension
https://github.com/l11x0m7/Question_Answering_Models
machine-reading-comprehension mrc python qa question-answering tensorflow
Last synced: 3 months ago
JSON representation
This repo collects and re-produces models related to domains of question answering and machine reading comprehension
- Host: GitHub
- URL: https://github.com/l11x0m7/Question_Answering_Models
- Owner: l11x0m7
- License: mit
- Created: 2018-04-27T06:45:05.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2019-06-16T15:33:18.000Z (over 5 years ago)
- Last Synced: 2024-07-10T23:27:14.243Z (7 months ago)
- Topics: machine-reading-comprehension, mrc, python, qa, question-answering, tensorflow
- Language: Python
- Size: 12.1 MB
- Stars: 155
- Watchers: 7
- Forks: 45
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Question_Answering_Models
This repo collects and re-produces models related to domains of question answering and machine reading comprehension.
It's now still in the process of supplement.
## comunity QA
### Dataset
WikiQA, TrecQA, InsuranceQA
#### data preprocess on WikiQA
```
cd cQA
bash download.sh
python preprocess_wiki.py
```### Siamese-NN model
This model is a simple complementation of a Siamese NN QA model with a pointwise way.
[To this repo for details](./cQA/siamese_nn)
#### train model
`python siamese.py --train`
#### test model
`python siamese.py --test`
### Siamese-CNN model
This model is a simple complementation of a Siamese CNN QA model with a pointwise way.
[To this repo for details](./cQA/siamese_cnn)
#### train model
`python siamese.py --train`
#### test model
`python siamese.py --test`
### Siamese-RNN model
This model is a simple complementation of a Siamese RNN/LSTM/GRU QA model with a pointwise way.
[To this repo for details](./cQA/siamese_rnn)
#### train model
`python siamese.py --train`
#### test model
`python siamese.py --test`
### note
All these three models above are based on the vanilla siamese structure. You can easily combine these basic deep learning module cells together and build your own models.
### QACNN
Given a question, a positive answer and a negative answer, this pairwise model can rank two answers with higher ranking in terms of the right answer.
[To this repo for details](./cQA/qacnn)
![](./figures/qacnn.png)
#### train model
`python qacnn.py --train`
#### test model
`python qacnn.py --test`
Refer to:
* [Applying Deep Learning to Answer Selection: A Study and an Open Task](http://arxiv.org/abs/1508.01585v2)
### Decomposable Attention Model
[To this repo for details](./cQA/decomposable_att_model)
![](./figures/decomposable_att.png)
#### train model
`python decomp_att.py --train`
#### test model
`python decomp_att.py --test`
Refer to:
* [A Decomposable Attention Model for Natural Language Inference](http://arxiv.org/abs/1606.01933v2)
### Compare-Aggregate Model with Multi-Compare
[To this repo for details](./cQA/seq_match_seq)
![](./figures/multi-compare.png)
#### train model
`python seq_match_seq.py --train`
#### test model
`python seq_match_seq.py --test`
Refer to:
* [A Compare-Aggregate Model for Matching Text Sequences](http://arxiv.org/abs/1611.01747v1)
### BiMPM
[To this repo for details](./cQA/bimpm)
![](./figures/bimpm1.png)
![](./figures/bimpm2.png)
#### train model
`python bimpm.py --train`
#### test model
`python bimpm.py --test`
Refer to:
* [Bilateral Multi-Perspective Matching for Natural Language Sentences](http://arxiv.org/abs/1702.03814v3)
## Machine Reading Comprehension
### Dataset
CNN/Daily mail, CBT, SQuAD, MS MARCO, RACE
### GA Reader
To be done
![GA](./figures/GA.png)
Refer to:
* [Gated-Attention Readers for Text Comprehension](http://arxiv.org/abs/1606.01549v3)
### SA Reader
To be done
![SAR](./figures/SAR.png)
Refer to:
* [A Thorough Examination of the Cnn/Daily Mail Reading Comprehension Task](http://arxiv.org/abs/1606.02858v2)
### AoA Reader
To be done
![AoA](./figures/AOA.png)
Refer to:
* Attention-over-Attention Neural Networks for Reading Comprehension
### BiDAF
[To this repo for details](./MRC/BiDAF)
![BiDAF](./figures/BiDAF.png)
The result on dev set(single model) under my experimental environment is shown as follows:
| training step | batch size | hidden size | EM (%) | F1 (%) | speed | device |
| :--------: |:------:|:------:|:------:|:------:|:------:|:------:|
| 12W | 32 | 75 | 67.7 | 77.3 | 3.40 it/s | 1 GTX 1080 Ti |![](./figures/bidaf_train.png)
![](./figures/bidaf_dev.png)
Refer to:
* [Bidirectional Attention Flow for Machine Comprehension](https://arxiv.org/abs/1611.01603)
### RNet
[To this repo for details](./MRC/RNet)
![RNet](./figures/RNet.png)
The result on dev set(single model) under my experimental environment is shown as follows:
| training step | batch size | hidden size | EM (%) | F1 (%) | speed | device | RNN type |
| :--------: |:------:|:------:|:------:|:------:| :------:| :------:| :------:|
| 12W | 32 | 75 | 69.1 | 78.2 | 1.35 it/s | 1 GTX 1080 Ti | cuDNNGRU|
| 6W | 64 | 75 | 66.1 | 75.6 | 2.95 s/it | 1 GTX 1080 Ti | SRU |RNet trained with cuDNNGRU:
![](./figures/rnet_train.png)
![](./figures/rnet_dev.png)
RNet trained with SRU(without optimization on operation efficiency):
![](./figures/rnet_sru_train.png)
![](./figures/rnet_sru_dev.png)
Refer to:
* [RNet](https://www.microsoft.com/en-us/research/publication/mrc/)
* [HKUST-KnowComp/R-Net](https://github.com/HKUST-KnowComp/R-Net)### QANet
[To this repo for details](./MRC/QANet)
![QANet](./figures/QANet.png)
The result on dev set(single model) under my experimental environment is shown as follows:
| training step | batch size |attention heads| hidden size | EM (%) | F1 (%) | speed | device |
| :--------: |:------:|:------:|:------:|:------:|:------:|:------:|:------:|
| 6W | 32 | 1 | 96 | 70.2 | 79.7 |2.4 it/s | 1 GTX 1080 Ti |
| 12W | 32 | 1 | 75 | 70.1 | 79.4 | 2.4 it/s | 1 GTX 1080 Ti |Experimental records for the first experiment:
![](./figures/qanet_train.png)
![](./figures/qanet_dev.png)
Experimental records for the second experiment(without smooth):
![](./figures/qanet_train_2.png)
![](./figures/qanet_dev_2.png)
Refer to:
* QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension
* github repo of [NLPLearn/QANet](https://github.com/NLPLearn/QANet)### Hybrid Network
[To this repo for details](./MRC/Hybrid)
This repo contains my experiments and attempt for MRC problems, and I'm still working on it.
| training step | batch size | hidden size | EM (%) | F1 (%) | speed | device | description |
| :--------: |:------:|:------:|:------:|:------:|:------:|:------:|:------:|
| 12W | 32 | 100 | 70.1 | 78.9 |1.6 it/s | 1 GTX 1080 Ti | \ |
| 12W | 32 | 75 | 70.0 | 79.1 | 1.8 it/s | 1 GTX 1080 Ti | \ |
| 12W | 32 | 75 | 69.5 | 78.8 | 1.8 it/s | 1 GTX 1080 Ti |with spatial dropout on embeddings|Experimental records for the first experiment(without smooth):
![](./figures/hybrid_train_1.png)
![](./figures/hybrid_dev_1.png)
Experimental records for the second experiment(without smooth):
![](./figures/hybrid_train_2.png)
![](./figures/hybrid_dev_2.png)
## Information
For more information, please visit http://skyhigh233.com/blog/2018/04/26/cqa-intro/.