Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/viisar/awesome-datasets

A curated list of awesome datasets for papers/experiments/validation.
https://github.com/viisar/awesome-datasets

List: awesome-datasets

Last synced: 7 days ago
JSON representation

A curated list of awesome datasets for papers/experiments/validation.

Awesome Lists containing this project

README

        

awesome-datasets
================

A curated list of awesome datasets for papers/experiments/validation.

- [Awesome Datasets](#awesome-datasets)
- [Classification](#classification)
- [Semi-Supervised](#semi-supervised)
- [Regression](#regression)
- [Time-Series](#time-series)
- [Unsupervised (clustering)](#unsupervised)
- [Face Recognition](#face-recognition)
- [Image Processing](#image-processing)
- [Handwriting Recognition](#handwriting-recognition)
- [Text Classification](#text-classification)

## Classification

*Datasets for classification.*

* [KEEL - General](http://sci2s.ugr.es/keel/category.php?cat=clas) - General classification datasets.
* [KEEL - Missing-values](http://sci2s.ugr.es/keel/missing.php) - Missing values datasets.
* [KEEL - Imbalanced datasets](http://sci2s.ugr.es/keel/imbalanced.php) - Imbalanced datasets for classification.
* [KEEL - Multi-label](http://sci2s.ugr.es/keel/multilabel.php) - Multi-label datasets.
* [KEEL - Class noise](http://sci2s.ugr.es/keel/classNoise.php) - Datasets with class noise.
* [KEEL - Attribute noise](http://sci2s.ugr.es/keel/attributeNoise.php) - Datasets with attribute noise.

## Semi-Supervised

*Datasets for semi-supervised applications.*

* [KEEL - semi-supervised](http://sci2s.ugr.es/keel/semisupervised.php) - Datasets for semi-supervised experiments.
* [KEEL - semi-supervised](http://sci2s.ugr.es/keel/semisupervised.php) - Datasets for semi-supervised experiments.

## Regression

*Datasets for regression applications.*

* [KEEL - regression](http://sci2s.ugr.es/keel/category.php?cat=reg) - Datasets for regression experiments.

## Time series

*Datasets for time-series problems.*

* [KEEL - time-series](http://sci2s.ugr.es/keel/category.php?cat=reg) - Datasets for time-series experiments.

## Face Recognition

*Face Recognition datasets.*

* [JAFFE](http://kasrl.org/jaffe.html) - The Japanese Female Facial Expression (JAFFE) Database.
* [Carnegie Mellon](http://www.cs.cmu.edu/afs/cs.cmu.edu/project/theo-8/faceimages/) - Datasets from theo-8 projects at Carnegie Mellon University.
* [Yale Face Database](http://vision.ucsd.edu/content/yale-face-database) - Datasets for facial expression (happy, sad, angry...) recognition.
* [Cohn-Kanade](http://www.pitt.edu/~emotion/ck-spread.htm) - The Cohn-Kanade AU-Coded Facial Expression Database is for research in automatic facial image analysis and synthesis and for perceptual studies.
* [AR face Database](http://www2.ece.ohio-state.edu/~aleix/ARdatabase.html) - Different facial expressions, illumination conditions and occlusions.
* [Face Detection CBCL](http://cbcl.mit.edu/software-datasets/FaceData2.html) - Face Detection Data from MIT.
* [Face Recognition LFW](http://vis-www.cs.umass.edu/lfw/) - Face Recognition from UMASS.
* [Face Recognition ORL](http://www.cl.cam.ac.uk/research/dtg/attarchive/facedatabase.html) - Face Recognition from AT&T.

## Image Processing

*Image Processing.*

* [Microsoft - Salient Object Database](http://research.microsoft.com/en-us/um/people/jiansun/SalientObject/salient_object.htm) - MSRA Salient Object Database.
* [IVRG - Salient Object Database](http://ivrgwww.epfl.ch/supplementary_material/RK_CVPR09/) - Frequency-tuned Salient Region Detection.
* [ICDAR - Robust Reading](http://dag.cvc.uab.es/icdar2013competition/?com=introduction) - Robust Reading Competition.
* [Brodatz - Texture Recognition](http://www.ux.uis.no/~tranden/brodatz.html) - Texture Recognition.
* [Vistex - Texture Recognition](http://vismod.media.mit.edu/vismod/imagery/VisionTexture/vistex.html) - Texture Recognition.
* [Caltech - Object Categorization](http://www.vision.caltech.edu/Image_Datasets/Caltech101/) - Object Categorization from Caltech101.
* [Marcel - Gesture Recognition](http://www.idiap.ch/resource/gestures/) - Gesture Recognition from Marcel.
* [RPPDI - Gesture Recognition](http://rppdi.ecomp.poli.br/gesture/database/) - Gesture Recognition from RPPDI.

## Handwriting Recognition

*Handwriting Recognition*

* [MNIST - Database of Handwritten Digits](http://yann.lecun.com/exdb/mnist/) - THE MNIST DATABASE of handwritten digits.

## Text Classification

*Text Classification*

* [20 Newsgroups](http://qwone.com/~jason/20Newsgroups/) - The 20 newsgroups text dataset.
* [Reuters-21578](https://archive.ics.uci.edu/ml/datasets/Reuters-21578+Text+Categorization+Collection) - Reuters-21578 Text Categorization Collection Data Set