https://github.com/yaserkl/BERTvsULMFIT
Comparing Text Classification results using BERT embedding and ULMFIT embedding
- Host: GitHub
- URL: https://github.com/yaserkl/BERTvsULMFIT
- Owner: yaserkl
- Created: 2019-02-11T19:16:17.000Z (about 6 years ago)
- Default Branch: master
- Last Pushed: 2019-02-13T23:34:54.000Z (about 6 years ago)
- Last Synced: 2024-11-02T22:32:55.451Z (5 months ago)
- Language: Jupyter Notebook
- Homepage:
- Size: 27.9 MB
- Stars: 65
- Watchers: 5
- Forks: 11
- Open Issues: 2
Metadata Files:
- Readme: Readme.md
Awesome Lists containing this project
- awesome-bert - yaserkl/BERTvsULMFIT
README
## Instructions
\[THIS REPOSITORY IS UNDER DEVELOPMENT AND MORE DATASETS AND MODELS WILL BE ADDED\]
\[FEEL FREE TO OPEN A PULL REQUEST FOR A NEW DATASET OR NEW MODEL\]
## 1. Requirements
* CUDA 9.0
* Python 3.6
* Run `bash setup.sh` ([setup.sh](setup.sh)) to download the datasets and install all the required packages.

Run the [prepare_datasets.py](prepare_datasets.py) notebook to prepare the datasets.
For instructions on running each model, see the respective model directory.
The [models](models) directory holds the results of these experiments.

Figures: BERT 20NG confusion matrix; BERT 20NG Sankey plot.

## 2. Results
### 2.1 BERT
Bert (MXNet) | F1-score | Precision | Recall | Accuracy | Error Rate
-------------------------- | :------: | :-------: | :----: | :------: | :--------:
20ng | 91.24 | 91.46 | 91.13 | 91.04 | 8.96
IMDB | 88.59 | 88.61 | 88.62 | 88.6 | 11.4
Reuters 21578 (R8) | 94.38 | 93.62 | 95.64 | 98.12 | 1.88
Reuters 21578 (R52) | **73.80**| 73.48 |**76.01**| 96.35 | 3.65
Ohsumed (all docs) | 70.45 | 73.97 | 68.84 | 79.30 | 20.70
Ohsumed (first 20k docs)   | **56.52**| **61.49** |**56.04**| **71.04**| **28.96**

### 2.2 ULMFit
ULMFit | F1-score | Precision | Recall | Accuracy | Error Rate
-------------------------- | :------: | :-------: | :----: | :------: | :--------:
20ng | **92.87**| **93.02** |**92.82**| **92.82**| **7.18**
IMDB |**91.92** | **91.96** |**91.96**| **91.92**| **8.08**
Reuters 21578 (R8) | **94.79**| **94.07** |**96.12**| **98.18**| **1.82**
Reuters 21578 (R52) | 73.77 | **75.47** | 75.96 | **96.43**| **3.57**
Ohsumed (all docs) |**74.82** | **75.01** |**75.47**| **81.96**| **18.04**
Ohsumed (first 20k docs) | 43.76 | 44.46 | 45.49 | 62.5 | 37.5
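
The tables above report F1-score, precision, recall, accuracy, and error rate for each dataset, and in every row the error rate equals 100 minus the accuracy. The README does not state how the per-class scores are averaged, so the following is only a minimal sketch that assumes macro averaging (an unweighted mean over classes). The function name `summarize_metrics` and the toy labels are hypothetical and are not taken from the repository.

```python
from collections import Counter


def summarize_metrics(y_true, y_pred):
    """Return macro precision, recall, F1, plus accuracy and error rate, all in percent.

    Assumption: macro averaging over classes; the repository may average differently.
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and the same length")

    # Count true positives, false positives, and false negatives per class.
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1
            fn[truth] += 1

    labels = sorted(set(y_true) | set(y_pred))
    precisions, recalls, f1s = [], [], []
    for label in labels:
        predicted = tp[label] + fp[label]
        actual = tp[label] + fn[label]
        p = tp[label] / predicted if predicted else 0.0
        r = tp[label] / actual if actual else 0.0
        f1 = 2 * p * r / (p + r) if (p + r) else 0.0
        precisions.append(p)
        recalls.append(r)
        f1s.append(f1)

    n_classes = len(labels)
    accuracy = sum(tp.values()) / len(y_true)
    return {
        "f1": 100 * sum(f1s) / n_classes,
        "precision": 100 * sum(precisions) / n_classes,
        "recall": 100 * sum(recalls) / n_classes,
        "accuracy": 100 * accuracy,
        "error_rate": 100 * (1 - accuracy),
    }


# Toy example with two hypothetical classes.
report = summarize_metrics(["a", "a", "b", "b"], ["a", "b", "b", "b"])
print(report)
```

With macro averaging, the reported F1 is the mean of the per-class F1 values, so it need not equal the harmonic mean of the reported precision and recall.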