Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/viisar/awesome-datasets
A curated list of awesome datasets for papers/experiments/validation.
https://github.com/viisar/awesome-datasets
List: awesome-datasets
Last synced: 7 days ago
JSON representation
A curated list of awesome datasets for papers/experiments/validation.
- Host: GitHub
- URL: https://github.com/viisar/awesome-datasets
- Owner: viisar
- Created: 2014-07-31T12:07:30.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2016-10-13T10:46:00.000Z (about 8 years ago)
- Last Synced: 2024-05-19T21:05:00.652Z (6 months ago)
- Homepage:
- Size: 3.91 KB
- Stars: 89
- Watchers: 11
- Forks: 11
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- fucking-lists - awesome-datasets
- awesomelist - awesome-datasets
- more-awesome - Awesome-Datasets - Datasets for papers/experiments/validation. (To Sort)
- collection - awesome-datasets
- lists - awesome-datasets
README
awesome-datasets
================A curated list of awesome datasets for papers/experiments/validation.
- [Awesome Datasets](#awesome-datasets)
- [Classification](#classification)
- [Semi-Supervised](#semi-supervised)
- [Regression](#regression)
- [Time-Series](#time-series)
- [Unsupervised (clustering)](#unsupervised)
- [Face Recognition](#face-recognition)
- [Image Processing](#image-processing)
- [Handwriting Recognition](#handwriting-recognition)
- [Text Classification](#text-classification)## Classification
*Datasets for classification.*
* [KEEL - General](http://sci2s.ugr.es/keel/category.php?cat=clas) - General classification datasets.
* [KEEL - Missing-values](http://sci2s.ugr.es/keel/missing.php) - Missing values datasets.
* [KEEL - Imbalanced datasets](http://sci2s.ugr.es/keel/imbalanced.php) - Imbalanced datasets for classification.
* [KEEL - Multi-label](http://sci2s.ugr.es/keel/multilabel.php) - Multi-label datasets.
* [KEEL - Class noise](http://sci2s.ugr.es/keel/classNoise.php) - Datasets with class noise.
* [KEEL - Attribute noise](http://sci2s.ugr.es/keel/attributeNoise.php) - Datasets with attribute noise.## Semi-Supervised
*Datasets for semi-supervised applications.*
* [KEEL - semi-supervised](http://sci2s.ugr.es/keel/semisupervised.php) - Datasets for semi-supervised experiments.
* [KEEL - semi-supervised](http://sci2s.ugr.es/keel/semisupervised.php) - Datasets for semi-supervised experiments.## Regression
*Datasets for regression applications.*
* [KEEL - regression](http://sci2s.ugr.es/keel/category.php?cat=reg) - Datasets for regression experiments.
## Time series
*Datasets for time-series problems.*
* [KEEL - time-series](http://sci2s.ugr.es/keel/category.php?cat=reg) - Datasets for time-series experiments.
## Face Recognition
*Face Recognition datasets.*
* [JAFFE](http://kasrl.org/jaffe.html) - The Japanese Female Facial Expression (JAFFE) Database.
* [Carnegie Mellon](http://www.cs.cmu.edu/afs/cs.cmu.edu/project/theo-8/faceimages/) - Datasets from theo-8 projects at Carnegie Mellon University.
* [Yale Face Database](http://vision.ucsd.edu/content/yale-face-database) - Datasets for facial expression (happy, sad, angry...) recognition.
* [Cohn-Kanade](http://www.pitt.edu/~emotion/ck-spread.htm) - The Cohn-Kanade AU-Coded Facial Expression Database is for research in automatic facial image analysis and synthesis and for perceptual studies.
* [AR face Database](http://www2.ece.ohio-state.edu/~aleix/ARdatabase.html) - Different facial expressions, illumination conditions and occlusions.
* [Face Detection CBCL](http://cbcl.mit.edu/software-datasets/FaceData2.html) - Face Detection Data from MIT.
* [Face Recognition LFW](http://vis-www.cs.umass.edu/lfw/) - Face Recognition from UMASS.
* [Face Recognition ORL](http://www.cl.cam.ac.uk/research/dtg/attarchive/facedatabase.html) - Face Recognition from AT&T.## Image Processing
*Image Processing.*
* [Microsoft - Salient Object Database](http://research.microsoft.com/en-us/um/people/jiansun/SalientObject/salient_object.htm) - MSRA Salient Object Database.
* [IVRG - Salient Object Database](http://ivrgwww.epfl.ch/supplementary_material/RK_CVPR09/) - Frequency-tuned Salient Region Detection.
* [ICDAR - Robust Reading](http://dag.cvc.uab.es/icdar2013competition/?com=introduction) - Robust Reading Competition.
* [Brodatz - Texture Recognition](http://www.ux.uis.no/~tranden/brodatz.html) - Texture Recognition.
* [Vistex - Texture Recognition](http://vismod.media.mit.edu/vismod/imagery/VisionTexture/vistex.html) - Texture Recognition.
* [Caltech - Object Categorization](http://www.vision.caltech.edu/Image_Datasets/Caltech101/) - Object Categorization from Caltech101.
* [Marcel - Gesture Recognition](http://www.idiap.ch/resource/gestures/) - Gesture Recognition from Marcel.
* [RPPDI - Gesture Recognition](http://rppdi.ecomp.poli.br/gesture/database/) - Gesture Recognition from RPPDI.## Handwriting Recognition
*Handwriting Recognition*
* [MNIST - Database of Handwritten Digits](http://yann.lecun.com/exdb/mnist/) - THE MNIST DATABASE of handwritten digits.
## Text Classification
*Text Classification*
* [20 Newsgroups](http://qwone.com/~jason/20Newsgroups/) - The 20 newsgroups text dataset.
* [Reuters-21578](https://archive.ics.uci.edu/ml/datasets/Reuters-21578+Text+Categorization+Collection) - Reuters-21578 Text Categorization Collection Data Set