An open API service indexing awesome lists of open source software.

0 "datasets" Awesome Lists

awesome-json-datasets

A curated list of awesome JSON datasets that don't require authentication.

awesome awesome-list data dataset datasets json json-dataset list

3,541 stars
387 forks
300 projects

Last updated: 31 Jan 2026

Awesome-Code-LLM

[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.

ai awesome datasets llm nlp papers software-engineering survey tmlr

3,205 stars
216 forks
2,594 projects

Last updated: 02 Feb 2026

awesome-ai4med

[Medical_NLP ➟ Awesome-AI4Med] medical-related LLMs, Multimodal systems, Datasets, Benchmarks, and more.

awesome-lists collection datasets medical medical-imaging medical-llms medical-vlms models

2,517 stars
440 forks
62 projects

Last updated: 12 Feb 2026

LLM4Rec-Awesome-Papers

A list of awesome papers and resources of recommender system on large language model (LLM).

awesome datasets large-language-models llm4rec recommender-system survey

2,177 stars
155 forks
124 projects

Last updated: 11 Jan 2026

awesome-public-real-time-datasets

A list of publicly available datasets with real-time data maintained by the team at bytewax.io

awesome-list data data-science data-visualization datasets real-time streaming

2,163 stars
165 forks
70 projects

Last updated: 01 Feb 2026

Awesome-Cybersecurity-Datasets

A curated list of amazingly awesome Cybersecurity datasets

attack cybersecurity data dataframe datasets deep deeplearning events ids ips

1,910 stars
326 forks
54 projects

Last updated: 02 Feb 2026

awesome-transit

Community list of transit APIs, apps, datasets, research, and software :bus::star2::train::star2::steam_locomotive:

awesome awesome-list bus datasets gtfs gtfs-analysis gtfs-converters gtfs-feed gtfs-files gtfs-libraries

1,673 stars
221 forks
437 projects

Last updated: 17 Feb 2026

awesome-yolo-object-detection

🚀🚀🚀 A collection of some awesome public YOLO object detection series projects and the related object detection datasets.

cuda datasets deepseek few-shot-object-detection gui llama llm mllm object-detection object-detection-datasets

1,667 stars
227 forks
1,704 projects

Last updated: 09 Jan 2026

awesome-robotics

A curated list of awesome links and software libraries that are useful for robots.

awesome awesome-list datasets deep-learning image-segmentation list lists machine-learning optimization optimization-algorithms

1,319 stars
173 forks
72 projects

Last updated: 27 Jan 2026

awesome-instruction-dataset

A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)

awsome-lists datasets gpt-3 gpt-4 instruction-following instruction-tuning language-model llama

1,141 stars
57 forks
89 projects

Last updated: 24 Jan 2026

awesome-dataset-tools

🔧 A curated list of awesome dataset tools

annotation-tool annotations awsome awsome-list datasets machine-learning

934 stars
129 forks
87 projects

Last updated: 18 Jan 2026

awesome-autonomous-vehicle

无人驾驶的资源列表中文版

autonomous-vehicles awesome-list car-driving computer-vision datasets intelligent-vehicle self-driving-car tutorial

900 stars
222 forks
181 projects

Last updated: 19 Jan 2026

awesome-llm-and-aigc

🚀🚀🚀A collection of some awesome public projects about Large Language Model(LLM), Vision Language Model(VLM), Vision Language Action(VLA), AI Generated Content(AIGC), the related Datasets and Applications.

ai4s ai4science aigc awesome-list cuda datasets deepseek gpt langchain llama

802 stars
68 forks
1,402 projects

Last updated: 03 Feb 2026

awesome-data-analysis

🚀 500+ curated resources for Data Analysis & Data Science: Python, SQL, Statistics, ML, AI, Visualization, Cheatsheets, Roadmaps, Interview Prep. For beginners and experts.

ai analytics awesome-list big-data business-intelligence dashboard data-analysis data-science data-visualization datasets

753 stars
89 forks
865 projects

Last updated: 03 Feb 2026

awesome-instruction-datasets

A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。

chatgpt datasets instruction llama llm prompts self-instruct

724 stars
37 forks
132 projects

Last updated: 12 Feb 2026

awesome-mobile-robotics

Useful links of different content related to AI, Computer Vision, and Robotics.

autonomous-robots autonomous-systems awesome-list books companies datasets jobs lab labs localization

670 stars
105 forks
541 projects

Last updated: 03 Feb 2026

awesome-holistic-3d

A list of papers and resources (data,code,etc) for holistic 3D reconstruction in computer vision

3d-reconstruction awesome computer-vision datasets deep-learning machine-learning

645 stars
93 forks
129 projects

Last updated: 14 Feb 2026

awesome-segmentation-saliency-dataset

A collection of some datasets for segmentation / saliency detection. Welcome to PR...:smile:

dataset datasets deep-learning deeplearning machine-learning machinelearning saliency-detection

616 stars
100 forks
338 projects

Last updated: 02 Feb 2026

Awesome-Domain-Generalization

Awesome things about domain generalization, including papers, code, etc.

awesome awesome-list datasets deep-learning domain-generalization libraries papers

522 stars
51 forks
343 projects

Last updated: 10 Feb 2026

awesome-instruction-learning

Papers and Datasets on Instruction Tuning and Following. ✨✨✨

awesome-list datasets in-context-learning instruction instruction-learning instruction-tuning large-language-models paper-list pretrained-language-model prompt

506 stars
23 forks
177 projects

Last updated: 29 Jan 2026

machine-learning-resources

A curated list of awesome machine learning frameworks, libraries, courses, books and many more.

awesome-list conference data-analysis data-science datasets handbook machine-learning natural-language-processing nlp-machine-learning paper

440 stars
130 forks
56 projects

Last updated: 12 Feb 2026

Awesome-3D-LiDAR-Datasets

This reposiotry is the collection for public 3D LiDAR datasets

awesome-lists datasets lidar

400 stars
37 forks
59 projects

Last updated: 12 Feb 2026

awesome-forests

🌳 A curated list of ground-truth forest datasets for the machine learning and forestry community.

biodiversity carbon climate-change datasets deep-learning ecosystems forestry machine-learning

356 stars
41 forks
64 projects

Last updated: 23 Jan 2026

Text-Summarization-Repo

텍스트 요약 분야의 주요 연구 주제, Must-read Papers, 이용 가능한 model 및 data 등을 추천 자료와 함께 정리한 저장소입니다.

awesome curated datasets nlp paper summary text-summarization

348 stars
49 forks
113 projects

Last updated: 22 Jan 2026

awesome-synthetic-datasets

awesome synthetic (text) datasets

ai awesome-list datasets llms synthetic-data synthetic-dataset-generation

322 stars
16 forks
38 projects

Last updated: 07 Feb 2026

awesome-nlp-polish

A curated list of resources dedicated to Natural Language Processing (NLP) in polish. Models, tools, datasets.

datasets nlp nlp-machine-learning polish-language

307 stars
35 forks
44 projects

Last updated: 22 Jan 2026

awesome-colour

Curated list of awesome colour science resources 😎

awesome awesome-list color color-science color-space color-spaces colorspace colorspaces colour colour-science

302 stars
24 forks
86 projects

Last updated: 08 Feb 2026

awesome-rgbd-datasets

This repository contains information for the paper "A Survey on RGB-D Datasets" and is a collaborative initiative to update the datasets list faster.

awesome awesome-list datasets depth depth-estimation lidar rgb-d survey

278 stars
17 forks
232 projects

Last updated: 23 Dec 2025

Graph-Neural-Networks-With-Heterophily

This repository contains the resources on graph neural network (GNN) considering heterophily.

awesome datasets graph-data graph-neural-networks heterophily homophily

270 stars
23 forks
334 projects

Last updated: 23 Jan 2026

Awesome-Earth-Artificial-Intelligence

A curated list of Earth Science's Artificial Intelligence (AI) tutorials, notebooks, software, datasets, courses, books, video lectures and papers. Contributions most welcome.

air-quality awesome-list biosphere datasets deep-learning dust earth-science earthquakes geosphere glacier

233 stars
59 forks
99 projects

Last updated: 06 Jan 2026

awesome-ukrainian-nlp

Curated list of Ukrainian natural language processing (NLP) resources (corpora, pretrained models, libriaries, etc.)

awesome-list datasets natural-language-processing nlp ukrainian ukrainian-nlp

227 stars
22 forks
92 projects

Last updated: 21 Jan 2026

awesome-taxonomy

A curated resource for taxonomy research

datasets hypernymy-detection taxonomy-construction taxonomy-learning

217 stars
28 forks
172 projects

Last updated: 27 Jan 2026

awesome-lidar-place-recognition

A curated list of Place Recognition methods, datasets, and various algorithms for LiDAR

awesome awesome-list datasets lidar place-recognition point-cloud robotics slam

203 stars
6 forks
75 projects

Last updated: 29 Jan 2026

Awesome-Deepfakes

A list of datasets, tools, papers and code related to Deepfakes.

awesome datasets deepfakes image paper-with-code paperlist tools video

186 stars
10 forks
72 projects

Last updated: 05 Feb 2026

awesome-legal-data

A collection of datasets and other resources for legal text processing.

datasets legal legal-tech nlp

174 stars
27 forks
133 projects

Last updated: 10 Feb 2026

awesome-Iran-datasets

Iranian/Persian Datasets. دیتاست‌های فارسی و ایرانی

awesome data-science datasets machine-learning persian persiandataset

146 stars
15 forks
82 projects

Last updated: 02 Feb 2026

awesome-dynamic-graphs

A collection of resources on dynamic/streaming/temporal/evolving graph processing systems, databases, data structures, datasets, and related academic and industrial work

awesome awesome-list awesome-lists datasets dynamic-graph-processing dynamic-graphs evolving-graphs graph graph-analytics graph-databases

145 stars
16 forks
71 projects

Last updated: 30 Jan 2026

awesome-object-detection-datasets

A collection of some awesome public object detection and recognition datasets.

aerial-imagery autonomous-driving awesome-list chatgpt coco dataset datasets infrared large-language-models llm

134 stars
10 forks
143 projects

Last updated: 08 Feb 2026

awesome-healthcare-datasets

Healthcare and biomedical datasets, for AI/ML

awesome-list biomedical clinical datasets healthcare imaging omics

102 stars
15 forks
195 projects

Last updated: 25 Jan 2026

awesome-scene-text-detection

Tracking the latest progress in Scene Text Detection and Recognition: Must-read papers well organized with code and dataset

charmve dataset datasets detection irregular-text-recognition level-annotation ocr recognition scene-text-detection scene-text-recognition

92 stars
18 forks
45 projects

Last updated: 09 Dec 2025

awesome-data-chile

Lista curada de datasets públicos sobre Chile.

awesome awesome-list chile data datasets opendata

74 stars
4 forks
44 projects

Last updated: 04 Feb 2026

Data-Science-and-Machine-Learning-Resources

List of Data Science and Machine Learning Resource that I frequently use

algorithms awesome awesome-list blog blogs collections datascience datasets deep-learning ebooks

73 stars
21 forks
264 projects

Last updated: 24 Nov 2025

awesome-datasets

A comprehensive list of annotated training datasets classified by use case.

annotation awesome-data-science awesome-datasets awesome-public-datasets corpora data dataset datasets document-processing entity-extraction

37 stars
6 forks
87 projects

Last updated: 31 Jan 2026

awesome-malware-benign-datasets

🪲 A list of malware and benign datasets for malware research

android awesome-list cybersecurity datasets deep-learning documents machine-learning malware-analysis malware-classification malware-dataset

29 stars
4 forks
42 projects

Last updated: 31 Jan 2026

awesome-swedish-nlp

A curated list of resources for natural language processing (NLP) in Swedish

awesome-list corpora corpus dataset datasets natural-language-generation natural-language-processing nlp resource-list swedish

27 stars
2 forks
68 projects

Last updated: 09 Feb 2026

awesome-marine-hacking

Awesome Resources for Ocean Hacking

awesome awesome-list dataset datasets hackathon ocean ocean-hacking oceanography

17 stars
4 forks
35 projects

Last updated: 18 Jan 2026

awesome-nba-data

A curated list of awesome NBA Data and resources.

awesome-list data datasets nba nba-data nba-stats

17 stars
1 forks
45 projects

Last updated: 24 Jan 2026

awesome-italian-public-datasets

A selection of interesting Open dataset from the Italian Public Administration and Civic Data use cases

civic-hacking civic-tech data-science datasets government-data opendata

10 stars
3 forks
43 projects

Last updated: 04 Sep 2025

awesome-ai-for-gui-agents

Awesome resources about AI for GUI Agents.

ai awesome awesome-list datasets gui models papers

10 stars
0 forks
30 projects

Last updated: 25 Nov 2025

awesome-turkish-vlm

A curated list of models, datasets and other useful resources for Turkish Vision-Language Models (VLM).

awesome awesome-list computer-vision datasets deep-learning fine-tuning multimodal nlp pretrained-models turkish

3 stars
0 forks
37 projects

Last updated: 29 Jul 2025

awesome-transit

copy of https://github.com/CUTR-at-USF/awesome-transit

awesome awesome-list bus datasets gtfs gtfs-analysis gtfs-converters gtfs-feed gtfs-files gtfs-libraries

0 stars
0 forks
296 projects

Last updated: 25 Jan 2022

Search
Keywords
awesome-list 3,953 awesome 3,595 awesome-lists 561 machine-learning 449 list 423 deep-learning 378 resources 325 ai 264 hacktoberfest 237 python 220 lists 205 llm 203 javascript 193 security 167 programming 165 artificial-intelligence 154 blockchain 139 computer-vision 136 open-source 122 nlp 121 large-language-models 117 tools 115 data-science 111 react 108 chatgpt 102 android 94 curated-list 90 natural-language-processing 88 linux 85 learning 85 devops 81 awesome-readme 81 css 78 awesome-resources 77 ios 77 papers 77 ethereum 74 reinforcement-learning 72 cybersecurity 71 rust 64 computer-science 64 kubernetes 64 nodejs 63 robotics 61 collection 60 playground 60 free-resources 59 design 59 game-development 59 free 57 macos 57 getvm 57 openai 56 java 56 golang 56 survey 55 frontend 54 tutorials 54 developer-tools 53 privacy 53 bitcoin 52 datasets 51 tutorial 51 hacking 50 go 50 web 50 php 49 llms 49 web3 49 education 48 paper 48 data 48 cloud 47 swift 46 books 45 gpt 45 labex 45 dataset 45 hands-on 45 development 45 github 44 html 44 deep-neural-networks 44 automation 43 generative-ai 43 cryptocurrency 43 security-tools 42 exercises 41 data-visualization 41 ml 40 opensource 40 database 40 vue 40 typescript 39 mcp 39 web-development 39 docker 39 gamedev 38 documentation 38 research 38 collections 37 algorithms 37 software 37 agent 36 ai-agents 36 iot 36 productivity 35 neural-network 35 testing 34 game 34 courses 34 reactjs 34 prompt-engineering 34 paper-list 34 aws 33 framework 33 diffusion-models 33 cpp 32 api 32 games 31 community 30 hardware 30 multimodal 30 dotnet 30 solidity 30 software-engineering 30 graph-neural-networks 30 projects 30 transformer 30 flutter 30 ruby 29 video 29 architecture 29 bioinformatics 29 csharp 28 libraries 28 ai-tools 28 react-native 28 smart-contracts 28 agents 27 crypto 27 tensorflow 27 windows 27 defi 27 links 27 science 26 self-hosted 26 object-detection 26 penetration-testing 26 sql 25 django 25 data-analysis 25 self-supervised-learning 25 library 25 angular 25 serverless 25 cli 25 music 25 cryptography 25 pytorch 25 microsoft 25 mlops 24 git 24 pentesting 24 best-practices 24 coding 24 mobile 24 osint 24 neural-networks 24 engineering 24 jobs 24 mathematics 24 slam 24 computer-graphics 24 data-mining 24 infosec 23 markdown 23 claude 23 programming-language 23 c 23 software-development 23 3d 23 learning-resources 23 game-engine 23 unity 23 ui 23 saas 23 awsome-list 23 roadmap 23 r 23 embedded 23 nextjs 23 startups 22 interview 22 reverse-engineering 22 blog 22 python3 22 segmentation 22 kotlin 22 deeplearning 22 knowledge-graph 22 visualization 22 statistics 22 awesomeness 22 cloud-computing 21 audio 21 hacktoberfest-accepted 21 gpt-3 21 federated-learning 21 azure 21 gpt-4 21 time-series 21 autonomous-driving 21 website 21 apple 21 opendata 21 foundation-models 21 free-tutorials 21 chatbot 21 apps 20 blogs 20 awsome 20 marketing 20 startup 20 js 20 llama 20 machinelearning 20 guidelines 20 data-structures 20 raspberry-pi 20 graph 20 chinese 20 gemini 20 front-end 19 youtube 19 big-data 19 claude-code 19 bugbounty 19 curated 19 reasoning 19 transformers 19 bash 18 stable-diffusion 18 graphql 18 prompt 18 aigc 18 optimization 18 code 18 networking 18 backend 18 agentic-ai 18 decentralized 18 finance 18 representation-learning 18 anthropic 18 image-generation 18 monitoring 18 seo 17 analytics 17 beginner-friendly 17 data-engineering 17 open-data 17 rag 17 containers 17 cloud-native 17 generative-art 17 generative-model 17 knowledge 17 cheatsheet 17 prompts 17 bert 16 algorithm 16 autonomous-vehicles 16 image-processing 16 deepseek 16 text-to-image 16 microservices 16 language 16 telegram 16 articles 16 devsecops 16 vscode 16 webgl 16 nerf 16 speech-recognition 15 mcp-server 15 malware-analysis 15 benchmark 15 mysql 15 technology 15 resource 15 animation 15 vibe-coding 15 vlm 15 android-development 15 p2p 15 plugins 15 readme 15 pentest 15 dart 15 ros 15 embodied-ai 15 evm 15 command-line 15 leetcode 15 cms 15 laravel 15 detection 15 selfhosted 15 terminal 15 distributed-systems 15 large-language-model 14 wasm 14 remote-sensing 14 android-library 14 dotnet-core 14 node 14 graphics 14 code-generation 14 generative-adversarial-network 14 videos 14 recommender-system 14 oss 14 anomaly-detection 14 foss 14 multimodal-deep-learning 14 pose-estimation 14 awesome-ai 14 language-model 14 langchain 14 mac 14 icons 14 diffusion 14 vuejs 14 swiftui 14 quantization 14 databases 14 webassembly 14 google 14 react-components 14 art 14 performance 14 semantic-segmentation 13 vision-language-model 13 multimodal-learning 13 svelte 13 model-context-protocol 13 search 13 medical-imaging 13 yolo 13 postgresql 13 hosting 13 podcast 13 deep-reinforcement-learning 13 developer 13 npm 13 transfer-learning 13 pwa 13 wordpress 13 quantum-computing 13 cross-platform 13 frameworks 13 microservice 13 gis 13 application 13 video-generation 13 multimodal-large-language-models 13 fuzzing 13 gan 13 discord 13 webdevelopment 13 interview-questions 13 unicorns 13 system-design 12 programming-languages 12 automl 12 dapp 12 es6 12 search-engine 12 rails 12 article 12 model-compression 12 static-site-generator 12 vision-and-language 12 authentication 12 chain-of-thought 12 time-series-analysis 12 static-analysis 12 guide 12 webapp 12 workflow 12 graphics-programming 12 mllm 12 sustainability 12 infrastructure-as-code 12 storage 12 objective-c 12 ai-agent 12 datascience 12 solana 12 gaming 12 image 12 edge-computing 12 elasticsearch 12 planning 12 web-design 12 free-software 11 llmops 11 ui-design 11 golang-library 11 book 11 infrastructure 11 cicd 11 embedded-systems 11 rl 11 point-cloud 11 bookmarks 11 arduino 11 cursor 11 unity3d 11 course 11 rest-api 11 interpretability 11 font 11 ctf 11 threat-intelligence 11 3d-reconstruction 11 continual-learning 11 software-architecture 11 cuda 11 hacktoberfest2020 11 few-shot-learning 11 open-science 11 machine-learning-algorithms 11 minecraft 11 climate-change 11 front-end-development 11 utilities 11 webpack 11 writing 11 ux 11 ecommerce 11 healthcare 11 design-systems 11 question-answering 11 reading-list 11 sysadmin 11 anime 11 unsupervised-learning 11 cvpr 11 creative-coding 11 in-context-learning 11 3d-graphics 11 augmented-reality 10 ide 10 mcp-servers 10 test-automation 10 swift-library 10 drones 10 fpga 10 speech-processing 10 reddit 10 uav 10 xai 10 game-design 10 semantic-web 10 text-mining 10 mobile-development 10 leadership 10 ionic 10 flutter-apps 10 haskell 10 applications 10 android-application 10 webcomponents 10 students 10 meta-learning 10 qwen 10 bot 10 explainable-ai 10 malware 10 neuroscience 10 knowledge-base 10 templates 10 webdesign 10 cyber-security 10 redis 10 docs 10 tool 10 android-app 10 cv 10 management 10 bug-bounty 10 social-network 10 movies 10 blockchain-technology 10 chatgpt-api 10 hacking-tools 10 classification 10 privacy-tools 10