"dataset" Awesome Lists
public-apis
A collective list of free APIs
api apis dataset development free list lists open-source public public-api
390,760 stars
41,786 forks
1,432 projects
Last updated: 12 Jan 2026
awesome-project-ideas
Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas
awesome awesome-list classification dataset deep-learning forecasting image-classification machine-learning multi-label-classification series-forecasting
8,893 stars
1,284 forks
57 projects
Last updated: 24 Jan 2026
awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models
bert chinese dataset ernie gpt gpt-2 large-language-models llm multimodel nezha
5,514 stars
512 forks
918 projects
Last updated: 23 Jan 2026
awesome-json-datasets
A curated list of awesome JSON datasets that don't require authentication.
awesome awesome-list data dataset datasets json json-dataset list
3,541 stars
387 forks
300 projects
Last updated: 31 Jan 2026
awesome-industrial-anomaly-detection
Paper list and searchable dataset index for industrial image anomaly/defect detection (continuously updated).
anomaly-detection anomaly-segmentation computer-vision dataset deep-learning defect-detection industrial-image
3,257 stars
287 forks
968 projects
Last updated: 31 Jan 2026
Awesome-Domain-LLM
A collection and organization of open-source models, datasets, and evaluation benchmarks for vertical domains.
awesome-list dataset llm nlp paper-list
2,565 stars
201 forks
303 projects
Last updated: 29 Jan 2026
awesome-remote-sensing-change-detection
A comprehensive and up-to-date compilation of datasets, tools, methods, review papers, and competitions for remote sensing change detection.
awesome change-detection dataset deep-learning remote-sensing
2,087 stars
374 forks
239 projects
Last updated: 20 Jan 2026
awesome-radar-perception
A curated list of radar datasets, detection, tracking and fusion
autonomous-driving autonomous-vehicles dataset deep-learning detection fusion radar slam
1,772 stars
327 forks
559 projects
Last updated: 22 Jan 2026
Awesome-Deepfakes-Detection
A list of tools, papers and code related to Deepfake Detection.
awesome code dataset deepfake-detection deepfakes image paper-with-code paperlist tools video
1,645 stars
149 forks
349 projects
Last updated: 22 Jan 2026
game-datasets
🎮 A curated list of awesome game datasets and tools for artificial intelligence in games
artificial-intelligence awesome awesome-game awesome-list data-mining database dataset game games
986 stars
71 forks
182 projects
Last updated: 20 Jan 2026
awesome-biomechanics
A curated, public list of resources for biomechanics and human motion analysis: datasets, processing tools, software for simulation, educational videos, lectures, etc.
awesome awesome-list bioengineering biomechanics computer-simulation dataset dynamics gait gait-analysis human-motion
926 stars
144 forks
906 projects
Last updated: 22 Jan 2026
awesome-face
😎 Face-related algorithms, datasets, and papers
dataset face face-detection face-recognition face-releated-algorithm paper papers
913 stars
182 forks
116 projects
Last updated: 07 Jan 2026
awesome-chatgpt-dataset
Unlock the Power of LLM: Explore These Datasets to Train Your Own ChatGPT!
awesome chatgpt dataset gpt4 instructions
756 stars
62 forks
88 projects
Last updated: 22 Jan 2026
awesome-segmentation-saliency-dataset
A collection of datasets for segmentation and saliency detection. PRs are welcome 😄
dataset datasets deep-learning deeplearning machine-learning machinelearning saliency-detection
616 stars
100 forks
338 projects
Last updated: 02 Feb 2026
awesome-llm-eval
Awesome-LLM-Eval: a curated list of tools, datasets/benchmarks, demos, leaderboards, papers, docs, and models, mainly for the evaluation of foundation LLMs, aiming to explore the technical boundaries of generative AI.
awsome-list awsome-lists benchmark bert chatglm chatgpt dataset evaluation gpt3 large-language-model
606 stars
51 forks
479 projects
Last updated: 24 Jan 2026
awesome-time-series-segmentation-papers
This repository contains a reading list of papers on Time Series Segmentation. This repository is still being continuously improved.
anomaly-detection awesome awesome-list change-point-detection data-mining data-science dataset deep-learning machine-learning motion-capture
544 stars
13 forks
134 projects
Last updated: 31 Jan 2026
Vehicle_reID-Collection
🚗 A collection of vehicle re-ID papers and datasets. 🚗
awesome awesome-list cvpr-workshop dataset deep-learning paper pku-vehicle ve-ri vehicle vehicle-reid
507 stars
55 forks
101 projects
Last updated: 15 Jan 2026
awesome-robotics-datasets
A collection of useful datasets for robotics and computer vision
computer-vision dataset robotics
497 stars
51 forks
79 projects
Last updated: 15 Jan 2026
Awesome-Traffic-Agent-Trajectory-Prediction
This is a list of papers related to traffic agent trajectory prediction.
awesome dataset deep-learning papers source-code traffic-agent trajectory-prediction
497 stars
61 forks
854 projects
Last updated: 15 Jan 2026
awesome-msr
A curated repository of software engineering repository mining data sets
awesome awesome-list dataset ghtorrent mining msr
462 stars
70 forks
73 projects
Last updated: 06 Jan 2026
Awesome-Table-Recognition
A curated list of resources dedicated to table recognition
dataset ocr ocr-recognition papers papers-with-code table-recognition
406 stars
51 forks
40 projects
Last updated: 25 Nov 2025
Awesome_Satellite_Benchmark_Datasets
Supplementary material for our paper "THERE IS NO DATA LIKE MORE DATA" is provided.
awesome-list benchmark benchmark-datasets dataset deep-learning earth-observation machine-learning remote-sensing satellite-imagery
359 stars
30 forks
31 projects
Last updated: 19 Jan 2026
Awesome_Multimodel_LLM
Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Models (MLLM). It covers datasets, tuning techniques, in-context learning, visual reasoning, foundational models, and more. Stay updated with the latest advancements.
chatgpt dataset gpt llm mllm multimodel nlp pretrained-models
358 stars
22 forks
611 projects
Last updated: 22 Jan 2026
awesome-italian-public-datasets
A selection of interesting open datasets from the Italian Public Administration and civic data use cases
awesome awesome-list civic-hacking civic-tech dataset goverment-data hacktoberfest italy open open-data
311 stars
29 forks
44 projects
Last updated: 28 Jan 2026
awesome-colour
Curated list of awesome colour science resources 😎
awesome awesome-list color color-science color-space color-spaces colorspace colorspaces colour colour-science
299 stars
24 forks
86 projects
Last updated: 02 Jan 2026
Awesome-Foundation-Models-for-Weather-and-Climate
A comprehensive survey of foundation models for weather and climate data understanding.
ai4earth ai4science climate climate-change dataset deep-learning foundation-models large-language-models largemodel machine-learning
278 stars
36 forks
184 projects
Last updated: 14 Jan 2026
awesome-hungarian-nlp
A curated list of NLP resources for Hungarian
awesome awesome-list computational-linguistics corpus corpus-linguistics dataset hungarian hungarian-language information-extraction information-retrieval
265 stars
19 forks
181 projects
Last updated: 12 Jan 2026
Awesome-InverseRendering
List of awesome papers on Intrinsic Decomposition & Inverse Rendering
awesome-list awesome-lists dataset intrinsic-decomposition inverse-rendering paper papers
227 stars
11 forks
68 projects
Last updated: 11 Jan 2026
turkish-nlp-resources
🔡 List of Tools, Libraries, Models, Datasets and other resources for Turkish NLP.
awesome-list dataset dogal-dil-isleme natural-language-processing nlp turkish turkish-language turkish-nlp
159 stars
8 forks
86 projects
Last updated: 19 Jan 2026
awesome-object-detection-in-aerial-images
A curated list of awesome resources for generic object detection in aerial images.
aerial-object-detection dataset object-detection oriented-object-detection remote-sensing rotated-object-detection
151 stars
16 forks
217 projects
Last updated: 16 Jan 2026
awesome-object-detection-datasets
A collection of some awesome public object detection and recognition datasets.
aerial-imagery autonomous-driving awesome-list chatgpt coco dataset datasets infrared large-language-models llm
131 stars
10 forks
143 projects
Last updated: 11 Jan 2026
deep-learning-resources
A curated list of deep learning resources: books, courses, papers, libraries, conferences, sample code, and many more.
awesome awesome-list conference data-science dataset deep-learning indonesia lecture machine-learning paper
121 stars
38 forks
54 projects
Last updated: 24 Nov 2025
awesome-scene-text-detection
Tracking the latest progress in Scene Text Detection and Recognition: Must-read papers well organized with code and dataset
charmve dataset datasets detection irregular-text-recognition level-annotation ocr recognition scene-text-detection scene-text-recognition
92 stars
18 forks
45 projects
Last updated: 09 Dec 2025
awesome-multivariate-time-series-anomaly-detection-algorithms
This repository contains a reading list of papers on multivariate time series anomaly detection. This repository is still being continuously improved.
anomaly-detection anomaly-detection-algorithm awesome awesome-list data-mining data-science dataset deep-learning fault-diagnosis machine-learning
61 stars
8 forks
143 projects
Last updated: 26 Jan 2026
awesome-iran-opendata
A repo introducing websites that share data and datasets about Iran [useful for journalists and researchers]
awesome datajournalism dataset farsi iran opendata persian
43 stars
6 forks
109 projects
Last updated: 31 Dec 2025
publicsectornl
Open Source in the public sector in the Netherlands
awesome-list dataset government netherlands open-data open-government open-science open-source open-state public-sector
38 stars
3 forks
128 projects
Last updated: 20 Jan 2026
awesome-datasets
A comprehensive list of annotated training datasets classified by use case.
annotation awesome-data-science awesome-datasets awesome-public-datasets corpora data dataset datasets document-processing entity-extraction
36 stars
6 forks
87 projects
Last updated: 19 Nov 2025
nepali-devblogs
Collection of blogs/publications by Nepali devs.
awesome-list awesomeness blogs collection dataset developer nepal reads resources
35 stars
10 forks
41 projects
Last updated: 08 Jun 2025
awesome-swedish-nlp
A curated list of resources for natural language processing (NLP) in Swedish
awesome-list corpora corpus dataset datasets natural-language-generation natural-language-processing nlp resource-list swedish
26 stars
2 forks
68 projects
Last updated: 23 Oct 2025
awesome-embodied-ai-datasets
A collection of Embodied AI datasets.
17 stars
3 forks
59 projects
Last updated: 17 Nov 2025
awesome-marine-hacking
Awesome Resources for Ocean Hacking
awesome awesome-list dataset datasets hackathon ocean ocean-hacking oceanography
16 stars
4 forks
35 projects
Last updated: 15 Sep 2025
awesome-multimodal-data-recipe
Curated collection of multimodal data synthesis methods, covering papers, datasets, and best practices for vision-language model training
awesome-list data data-engineering data-generation data-science data-synthesis dataset llms multimodal vision-language-model
5 stars
0 forks
105 projects
Last updated: 20 Nov 2025
awesome-city-data
A curated list of open data sources to analyze and compare cities in a holistic way à la data science and empower citizens.
awesome-list cities data-science dataset
5 stars
0 forks
37 projects
Last updated: 01 Dec 2024
public-apis
A collective list of free APIs
api apis dataset development free list lists open-source public public-api
0 stars
0 forks
1,436 projects
Last updated: 20 Sep 2023
webarchiving-awesome-graph
This graph data is extracted from the Awesome Web Archiving README
0 stars
0 forks
201 projects
Last updated: 25 Nov 2025