{"id":13517261,"url":"https://github.com/viisar/awesome-datasets","last_synced_at":"2025-03-31T07:31:10.007Z","repository":{"id":19233212,"uuid":"22467895","full_name":"viisar/awesome-datasets","owner":"viisar","description":"A curated list of awesome datasets for papers/experiments/validation.","archived":false,"fork":false,"pushed_at":"2016-10-13T10:46:00.000Z","size":4,"stargazers_count":89,"open_issues_count":1,"forks_count":11,"subscribers_count":11,"default_branch":"master","last_synced_at":"2024-05-19T21:05:00.652Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/viisar.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2014-07-31T12:07:30.000Z","updated_at":"2024-03-01T21:06:57.000Z","dependencies_parsed_at":"2022-07-06T22:00:22.872Z","dependency_job_id":null,"html_url":"https://github.com/viisar/awesome-datasets","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/viisar%2Fawesome-datasets","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/viisar%2Fawesome-datasets/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/viisar%2Fawesome-datasets/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/viisar%2Fawesome-datasets/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/viisar","download_url":"https://codeload.github.com/viisar/awesome-datasets/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":246365641,"owners_count":20765546,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-08-01T05:01:31.894Z","updated_at":"2025-03-31T07:31:09.983Z","avatar_url":"https://github.com/viisar.png","language":null,"funding_links":[],"categories":["Technical","Others","To Sort"],"sub_categories":["awesome-*"],"readme":"awesome-datasets\n================\n\nA curated list of awesome datasets for papers/experiments/validation.\n\n- [Awesome Datasets](#awesome-datasets)\n\t- [Classification](#classification)\n\t- [Semi-Supervised](#semi-supervised)\n\t- [Regression](#regression)\n\t- [Time-Series](#time-series)\n\t- [Unsupervised (clustering)](#unsupervised)\n\t- [Face Recognition](#face-recognition)\n\t- [Image Processing](#image-processing)\n\t- [Handwriting Recognition](#handwriting-recognition)\n\t- [Text Classification](#text-classification)\n\n## Classification\n\n*Datasets for classification.*\n\n* [KEEL - General](http://sci2s.ugr.es/keel/category.php?cat=clas) - General classification datasets.\n* [KEEL - Missing-values](http://sci2s.ugr.es/keel/missing.php) - Missing values datasets.\n* [KEEL - Imbalanced datasets](http://sci2s.ugr.es/keel/imbalanced.php) - Imbalanced datasets for classification.\n* [KEEL - Multi-label](http://sci2s.ugr.es/keel/multilabel.php) - Multi-label datasets.\n* [KEEL - Class noise](http://sci2s.ugr.es/keel/classNoise.php) - Datasets with class noise.\n* [KEEL - Attribute noise](http://sci2s.ugr.es/keel/attributeNoise.php) - Datasets with attribute noise.\n\n## Semi-Supervised\n\n*Datasets for semi-supervised applications.*\n\n* [KEEL - semi-supervised](http://sci2s.ugr.es/keel/semisupervised.php) - Datasets for semi-supervised experiments.\n* [KEEL - semi-supervised](http://sci2s.ugr.es/keel/semisupervised.php) - Datasets for semi-supervised experiments.\n\n## Regression\n\n*Datasets for regression applications.*\n\n* [KEEL - regression](http://sci2s.ugr.es/keel/category.php?cat=reg) - Datasets for regression experiments.\n\n\n## Time series\n\n*Datasets for time-series problems.*\n\n* [KEEL - time-series](http://sci2s.ugr.es/keel/category.php?cat=reg) - Datasets for time-series experiments.\n\n## Face Recognition\n\n*Face Recognition datasets.*\n\n* [JAFFE](http://kasrl.org/jaffe.html) - The Japanese Female Facial Expression (JAFFE) Database.\n* [Carnegie Mellon](http://www.cs.cmu.edu/afs/cs.cmu.edu/project/theo-8/faceimages/) - Datasets from theo-8 projects at Carnegie Mellon University.\n* [Yale Face Database](http://vision.ucsd.edu/content/yale-face-database) - Datasets for facial expression (happy, sad, angry...) recognition.\n* [Cohn-Kanade](http://www.pitt.edu/~emotion/ck-spread.htm) - The Cohn-Kanade AU-Coded Facial Expression Database is for research in automatic facial image analysis and synthesis and for perceptual studies.\n* [AR face Database](http://www2.ece.ohio-state.edu/~aleix/ARdatabase.html) - Different facial expressions, illumination conditions and occlusions.\n* [Face Detection CBCL](http://cbcl.mit.edu/software-datasets/FaceData2.html) - Face Detection Data from MIT.\n* [Face Recognition LFW](http://vis-www.cs.umass.edu/lfw/) - Face Recognition from UMASS.\n* [Face Recognition ORL](http://www.cl.cam.ac.uk/research/dtg/attarchive/facedatabase.html) - Face Recognition from AT\u0026T.\n\n\n## Image Processing\n\n*Image Processing.*\n\n* [Microsoft - Salient Object Database](http://research.microsoft.com/en-us/um/people/jiansun/SalientObject/salient_object.htm) - MSRA Salient Object Database.\n* [IVRG - Salient Object Database](http://ivrgwww.epfl.ch/supplementary_material/RK_CVPR09/) - Frequency-tuned Salient Region Detection.\n* [ICDAR - Robust Reading](http://dag.cvc.uab.es/icdar2013competition/?com=introduction) - Robust Reading Competition.\n* [Brodatz - Texture Recognition](http://www.ux.uis.no/~tranden/brodatz.html) - Texture Recognition.\n* [Vistex - Texture Recognition](http://vismod.media.mit.edu/vismod/imagery/VisionTexture/vistex.html) - Texture Recognition.\n* [Caltech - Object Categorization](http://www.vision.caltech.edu/Image_Datasets/Caltech101/) - Object Categorization from Caltech101.\n* [Marcel - Gesture Recognition](http://www.idiap.ch/resource/gestures/) - Gesture Recognition from Marcel.\n* [RPPDI - Gesture Recognition](http://rppdi.ecomp.poli.br/gesture/database/) - Gesture Recognition from RPPDI.\n\n\n## Handwriting Recognition\n\n*Handwriting Recognition*\n\n* [MNIST - Database of Handwritten Digits](http://yann.lecun.com/exdb/mnist/) - THE MNIST DATABASE of handwritten digits.\n\n## Text Classification\n\n*Text Classification*\n\n* [20 Newsgroups](http://qwone.com/~jason/20Newsgroups/) - The 20 newsgroups text dataset.\n* [Reuters-21578](https://archive.ics.uci.edu/ml/datasets/Reuters-21578+Text+Categorization+Collection) - Reuters-21578 Text Categorization Collection Data Set\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fviisar%2Fawesome-datasets","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fviisar%2Fawesome-datasets","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fviisar%2Fawesome-datasets/lists"}