https://github.com/RedditSota/state-of-the-art-result-for-machine-learning-problems

This repository provides state of the art (SoTA) results for all machine learning problems. We do our best to keep this repository up to date. If you do find a problem's SoTA result is out of date or missing, please raise this as an issue or submit Google form (with this information: research paper name, dataset, metric, source code and year). We will fix it immediately.
https://github.com/RedditSota/state-of-the-art-result-for-machine-learning-problems

Last synced: about 2 months ago
JSON representation

Host: GitHub
URL: https://github.com/RedditSota/state-of-the-art-result-for-machine-learning-problems
Owner: RedditSota
License: apache-2.0
Created: 2017-11-09T01:21:40.000Z (over 7 years ago)
Default Branch: master
Last Pushed: 2019-06-25T14:09:52.000Z (almost 6 years ago)
Last Synced: 2024-10-14T21:21:51.676Z (7 months ago)
Homepage:
Size: 147 KB
Stars: 8,948
Watchers: 872
Forks: 1,314
Open Issues: 14
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

awesomeai - Machine Learning Problems
awesome-ai-awesomeness - Machine Learning Problems
awesome-ai-awesomeness - Machine Learning Problems
awesome-nlp-note - NLP RedditSota
personal-awesome-list - state-of-the-art-result-for-machine-learning-problems
awesome-ai-list-guide - state-of-the-art-result-for-machine-learning-problems
awesome-list - RedditSota/state-of-the-art-result-for-machine-learning-problems - This repository provides state of the art (SoTA) results for all machine learning problems. (Machine Learning / JavaScript)
100-AI-Machine-learning-Deep-learning-Computer-vision-NLP - 👆
StarryDivineSky - RedditSota/state-of-the-art-result-for-machine-learning-problems

README

# State-of-the-art result for all Machine Learning Problems

### LAST UPDATE: 20th Februray 2019

### NEWS: I am looking for a Collaborator esp who does research in NLP, Computer Vision and Reinforcement learning. If you are not a researcher, but you are willing, contact me. Email me: [email protected]

This repository provides state-of-the-art (SoTA) results for all machine learning problems. We do our best to keep this repository up to date. If you do find a problem's SoTA result is out of date or missing, please raise this as an issue (with this information: research paper name, dataset, metric, source code and year). We will fix it immediately.

You can also submit this [Google Form](https://docs.google.com/forms/d/e/1FAIpQLSe_fFZVCeCVRGGgOQIpoQSXY7mZWynsx7g6WxZEVpO5vJioUA/viewform?embedded=true) if you are new to Github.

This is an attempt to make one stop for all types of machine learning problems state of the art result. I can not do this alone. I need help from everyone. Please submit the Google form/raise an issue if you find SOTA result for a dataset. Please share this on Twitter, Facebook, and other social media.

This summary is categorized into:

- [Supervised Learning](https://github.com/RedditSota/state-of-the-art-result-for-machine-learning-problems#supervised-learning)
- [Speech](https://github.com/RedditSota/state-of-the-art-result-for-machine-learning-problems#speech)
- [Computer Vision](https://github.com/RedditSota/state-of-the-art-result-for-machine-learning-problems#computer-vision)
- [NLP](https://github.com/RedditSota/state-of-the-art-result-for-machine-learning-problems#nlp)
- [Semi-supervised Learning](https://github.com/RedditSota/state-of-the-art-result-for-machine-learning-problems#semi-supervised-learning)
- Computer Vision
- [Unsupervised Learning](https://github.com/RedditSota/state-of-the-art-result-for-machine-learning-problems#unsupervised-learning)
- Speech
- Computer Vision
- [NLP](https://github.com/RedditSota/state-of-the-art-result-for-machine-learning-problems/blob/master/README.md#nlp-1)
- [Transfer Learning](https://github.com/RedditSota/state-of-the-art-result-for-machine-learning-problems#transfer-learning)
- [Reinforcement Learning](https://github.com/RedditSota/state-of-the-art-result-for-machine-learning-problems#reinforcement-learning)

## Supervised Learning

### NLP
#### 1. Language Modelling

Research Paper
Datasets
Metric
Source Code
Year

Language Models are Unsupervised Multitask Learners

WikiText-2

Perplexity: 35.76

Perplexity: 18.34

Tensorflow
2019

BREAKING THE SOFTMAX BOTTLENECK: A HIGH-RANK RNN LANGUAGE MODEL

WikiText-2

Perplexity: 47.69

Perplexity: 40.68

Pytorch
2017

DYNAMIC EVALUATION OF NEURAL SEQUENCE MODELS

WikiText-2

Perplexity: 51.1

Perplexity: 44.3

Pytorch
2017

Averaged Stochastic Gradient Descent
with Weight Dropped LSTM or QRNN

WikiText-2

Perplexity: 52.8

Perplexity: 52.0

Pytorch
2017

FRATERNAL DROPOUT

WikiText-2

Perplexity: 56.8

Perplexity: 64.1

Pytorch
2017

Factorization tricks for LSTM networks
One Billion Word Benchmark
Perplexity: 23.36
Tensorflow
2017

#### 2. Machine Translation

Research Paper
Datasets
Metric
Source Code
Year

Understanding Back-Translation at Scale

WMT 2014 English-to-French

WMT 2014 English-to-German

BLEU: 45.6

BLEU: 35.0

PyTorch

2018

WEIGHTED TRANSFORMER NETWORK FOR
MACHINE TRANSLATION

WMT 2014 English-to-French

WMT 2014 English-to-German

BLEU: 41.4

BLEU: 28.9

NOT FOUND

2017

Attention Is All You Need

WMT 2014 English-to-French

WMT 2014 English-to-German

BLEU: 41.0

BLEU: 28.4

PyTorch

Tensorflow

2017

NON-AUTOREGRESSIVE
NEURAL MACHINE TRANSLATION

WMT16 Ro→En

BLEU: 31.44

PyTorch

2017

Improving Neural Machine Translation with Conditional Sequence Generative Adversarial Nets

NIST02

NIST03

NIST04

NIST05

38.74

36.01

37.54

33.76

NMTPY

2017

#### 3. Text Classification

Research Paper
Datasets
Metric
Source Code
Year

Learning Structured Text Representations
Yelp
Accuracy: 68.6

Tensorflow

2017

Attentive Convolution
Yelp
Accuracy: 67.36

Theano

2017

#### 4. Natural Language Inference
Leader board:

[Stanford Natural Language Inference (SNLI)](https://nlp.stanford.edu/projects/snli/)

[MultiNLI](https://www.kaggle.com/c/multinli-matched-open-evaluation/leaderboard)

Research Paper
Datasets
Metric
Source Code
Year

NATURAL LANGUAGE INFERENCE OVER INTERACTION SPACE
Stanford Natural Language Inference (SNLI)
Accuracy: 88.9
Tensorflow
2017

BERT-LARGE (ensemble)
Multi-Genre Natural Language Inference (MNLI)

Matched accuracy: 86.7

Mismatched accuracy: 85.9

Tensorflow

PyTorch

2018

#### 5. Question Answering
Leader Board

[SQuAD](https://rajpurkar.github.io/SQuAD-explorer/)

Research Paper
Datasets
Metric
Source Code
Year

BERT-LARGE (ensemble)
The Stanford Question Answering Dataset

Exact Match: 87.4

F1: 93.2

Tensorflow

PyTorch

2018

#### 6. Named entity recognition

Research Paper
Datasets
Metric
Source Code
Year

Named Entity Recognition in Twitter using Images and Text
Ritter

F-measure: 0.59

NOT FOUND
2017

#### 7. Abstractive Summarization

Research Paper | Datasets | Metric | Source Code | Year
------------ | ------------- | ------------ | ------------- | -------------
[Cutting-off redundant repeating generations for neural abstractive summarization](https://aclanthology.info/pdf/E/E17/E17-2047.pdf) |

DUC-2004

Gigaword

DUC-2004

ROUGE-1: **32.28**

ROUGE-2: 10.54

ROUGE-L: **27.80**

Gigaword

ROUGE-1: **36.30**

ROUGE-2: 17.31

ROUGE-L: **33.88**

| NOT YET AVAILABLE | 2017
[Convolutional Sequence to Sequence](https://arxiv.org/pdf/1705.03122.pdf) |

DUC-2004

Gigaword

DUC-2004

ROUGE-1: 33.44

ROUGE-2: **10.84**

ROUGE-L: 26.90

Gigaword

ROUGE-1: 35.88

ROUGE-2: 27.48

ROUGE-L: 33.29

| [PyTorch](https://github.com/facebookresearch/fairseq-py) | 2017

#### 8. Dependency Parsing

Research Paper | Datasets | Metric | Source Code | Year
------------ | ------------- | ------------ | ------------- | -------------
[Globally Normalized Transition-Based Neural Networks](https://arxiv.org/pdf/1603.06042.pdf) |