An open API service indexing awesome lists of open source software.

0 "datasets" Awesome Lists

awesome-json-datasets

A curated list of awesome JSON datasets that don't require authentication.

awesome awesome-list data dataset datasets json json-dataset list

3,526 stars
386 forks
300 projects

Last updated: 19 Dec 2025

Awesome-Code-LLM

[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.

ai awesome datasets llm nlp papers software-engineering survey tmlr

3,090 stars
207 forks
2,552 projects

Last updated: 27 Nov 2025

LLM4Rec-Awesome-Papers

A list of awesome papers and resources of recommender system on large language model (LLM).

awesome datasets large-language-models llm4rec recommender-system survey

2,170 stars
156 forks
124 projects

Last updated: 27 Dec 2025

Awesome-Cybersecurity-Datasets

A curated list of amazingly awesome Cybersecurity datasets

attack cybersecurity data dataframe datasets deep deeplearning events ids ips

1,876 stars
321 forks
54 projects

Last updated: 17 Dec 2025

awesome-public-real-time-datasets

A list of publicly available datasets with real-time data maintained by the team at bytewax.io

awesome-list data data-science data-visualization datasets real-time streaming

1,833 stars
144 forks
70 projects

Last updated: 06 Dec 2025

awesome-yolo-object-detection

🚀🚀🚀 A collection of some awesome public YOLO object detection series projects and the related object detection datasets.

cuda datasets deepseek few-shot-object-detection gui llama llm mllm object-detection object-detection-datasets

1,655 stars
226 forks
1,704 projects

Last updated: 19 Dec 2025

awesome-transit

Community list of transit APIs, apps, datasets, research, and software :bus::star2::train::star2::steam_locomotive:

awesome awesome-list bus datasets gtfs gtfs-analysis gtfs-converters gtfs-feed gtfs-files gtfs-libraries

1,620 stars
219 forks
432 projects

Last updated: 18 Dec 2025

awesome-robotics

A curated list of awesome links and software libraries that are useful for robots.

awesome awesome-list datasets deep-learning image-segmentation list lists machine-learning optimization optimization-algorithms

1,282 stars
173 forks
72 projects

Last updated: 11 Dec 2025

awesome-instruction-dataset

A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)

awsome-lists datasets gpt-3 gpt-4 instruction-following instruction-tuning language-model llama

1,136 stars
56 forks
89 projects

Last updated: 09 Dec 2025

awesome-dataset-tools

🔧 A curated list of awesome dataset tools

annotation-tool annotations awsome awsome-list datasets machine-learning

925 stars
129 forks
87 projects

Last updated: 28 Nov 2025

awesome-autonomous-vehicle

无人驾驶的资源列表中文版

autonomous-vehicles awesome-list car-driving computer-vision datasets intelligent-vehicle self-driving-car tutorial

885 stars
221 forks
181 projects

Last updated: 27 Nov 2025

awesome-llm-and-aigc

🚀🚀🚀A collection of some awesome public projects about Large Language Model(LLM), Vision Language Model(VLM), Vision Language Action(VLA), AI Generated Content(AIGC), the related Datasets and Applications.

ai4s ai4science aigc awesome-list cuda datasets deepseek gpt langchain llama

788 stars
65 forks
1,402 projects

Last updated: 21 Dec 2025

awesome-instruction-datasets

A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。

chatgpt datasets instruction llama llm prompts self-instruct

714 stars
37 forks
132 projects

Last updated: 11 Dec 2025

awesome-mobile-robotics

Useful links of different content related to AI, Computer Vision, and Robotics.

autonomous-robots autonomous-systems awesome-list books companies datasets jobs lab labs localization

661 stars
105 forks
541 projects

Last updated: 15 Nov 2025

awesome-holistic-3d

A list of papers and resources (data,code,etc) for holistic 3D reconstruction in computer vision

3d-reconstruction awesome computer-vision datasets deep-learning machine-learning

640 stars
92 forks
129 projects

Last updated: 26 Oct 2025

awesome-data-analysis

🚀 500+ curated resources for Data Analysis & Data Science: Python, SQL, Statistics, ML, AI, Visualization, Cheatsheets, Roadmaps, Interview Prep. For beginners and experts.

ai analytics awesome-list big-data business-intelligence dashboard data-analysis data-science data-visualization datasets

636 stars
76 forks
859 projects

Last updated: 04 Jan 2026

awesome-segmentation-saliency-dataset

A collection of some datasets for segmentation / saliency detection. Welcome to PR...:smile:

dataset datasets deep-learning deeplearning machine-learning machinelearning saliency-detection

607 stars
99 forks
338 projects

Last updated: 27 Dec 2025

awesome-instruction-learning

Papers and Datasets on Instruction Tuning and Following. ✨✨✨

awesome-list datasets in-context-learning instruction instruction-learning instruction-tuning large-language-models paper-list pretrained-language-model prompt

505 stars
23 forks
177 projects

Last updated: 01 Dec 2025

Awesome-Domain-Generalization

Awesome things about domain generalization, including papers, code, etc.

awesome awesome-list datasets deep-learning domain-generalization libraries papers

495 stars
50 forks
343 projects

Last updated: 11 Nov 2025

machine-learning-resources

A curated list of awesome machine learning frameworks, libraries, courses, books and many more.

awesome-list conference data-analysis data-science datasets handbook machine-learning natural-language-processing nlp-machine-learning paper

436 stars
127 forks
56 projects

Last updated: 14 Dec 2025

Awesome-3D-LiDAR-Datasets

This reposiotry is the collection for public 3D LiDAR datasets

awesome-lists datasets lidar

374 stars
35 forks
59 projects

Last updated: 29 Nov 2025

awesome-forests

🌳 A curated list of ground-truth forest datasets for the machine learning and forestry community.

biodiversity carbon climate-change datasets deep-learning ecosystems forestry machine-learning

350 stars
41 forks
64 projects

Last updated: 09 Dec 2025

Text-Summarization-Repo

텍스트 요약 분야의 주요 연구 주제, Must-read Papers, 이용 가능한 model 및 data 등을 추천 자료와 함께 정리한 저장소입니다.

awesome curated datasets nlp paper summary text-summarization

347 stars
49 forks
113 projects

Last updated: 21 Dec 2025

awesome-synthetic-datasets

awesome synthetic (text) datasets

ai awesome-list datasets llms synthetic-data synthetic-dataset-generation

315 stars
15 forks
33 projects

Last updated: 18 Dec 2025

awesome-nlp-polish

A curated list of resources dedicated to Natural Language Processing (NLP) in polish. Models, tools, datasets.

datasets nlp nlp-machine-learning polish-language

306 stars
36 forks
44 projects

Last updated: 11 Dec 2025

awesome-colour

Curated list of awesome colour science resources 😎

awesome awesome-list color color-science color-space color-spaces colorspace colorspaces colour colour-science

296 stars
23 forks
86 projects

Last updated: 16 Nov 2025

awesome-rgbd-datasets

This repository contains information for the paper "A Survey on RGB-D Datasets" and is a collaborative initiative to update the datasets list faster.

awesome awesome-list datasets depth depth-estimation lidar rgb-d survey

269 stars
16 forks
232 projects

Last updated: 15 Oct 2025

Graph-Neural-Networks-With-Heterophily

This repository contains the resources on graph neural network (GNN) considering heterophily.

awesome datasets graph-data graph-neural-networks heterophily homophily

264 stars
22 forks
334 projects

Last updated: 17 Nov 2025

Awesome-Earth-Artificial-Intelligence

A curated list of Earth Science's Artificial Intelligence (AI) tutorials, notebooks, software, datasets, courses, books, video lectures and papers. Contributions most welcome.

air-quality awesome-list biosphere datasets deep-learning dust earth-science earthquakes geosphere glacier

232 stars
59 forks
99 projects

Last updated: 24 Nov 2025

awesome-ukrainian-nlp

Curated list of Ukrainian natural language processing (NLP) resources (corpora, pretrained models, libriaries, etc.)

awesome-list datasets natural-language-processing nlp ukrainian ukrainian-nlp

217 stars
21 forks
91 projects

Last updated: 03 Nov 2025

awesome-taxonomy

A curated resource for taxonomy research

datasets hypernymy-detection taxonomy-construction taxonomy-learning

216 stars
28 forks
172 projects

Last updated: 09 Dec 2025

awesome-lidar-place-recognition

A curated list of Place Recognition methods, datasets, and various algorithms for LiDAR

awesome awesome-list datasets lidar place-recognition point-cloud robotics slam

200 stars
6 forks
75 projects

Last updated: 20 Dec 2025

Awesome-Deepfakes

A list of datasets, tools, papers and code related to Deepfakes.

awesome datasets deepfakes image paper-with-code paperlist tools video

180 stars
9 forks
72 projects

Last updated: 23 Dec 2025

awesome-legal-data

A collection of datasets and other resources for legal text processing.

datasets legal legal-tech nlp

155 stars
24 forks
133 projects

Last updated: 16 Dec 2025

awesome-Iran-datasets

Iranian/Persian Datasets. دیتاست‌های فارسی و ایرانی

awesome data-science datasets machine-learning persian persiandataset

145 stars
15 forks
82 projects

Last updated: 24 Dec 2025

awesome-dynamic-graphs

A collection of resources on dynamic/streaming/temporal/evolving graph processing systems, databases, data structures, datasets, and related academic and industrial work

awesome awesome-list awesome-lists datasets dynamic-graph-processing dynamic-graphs evolving-graphs graph graph-analytics graph-databases

144 stars
16 forks
71 projects

Last updated: 13 Dec 2025

awesome-object-detection-datasets

A collection of some awesome public object detection and recognition datasets.

aerial-imagery autonomous-driving awesome-list chatgpt coco dataset datasets infrared large-language-models llm

126 stars
10 forks
143 projects

Last updated: 18 Nov 2025

awesome-scene-text-detection

Tracking the latest progress in Scene Text Detection and Recognition: Must-read papers well organized with code and dataset

charmve dataset datasets detection irregular-text-recognition level-annotation ocr recognition scene-text-detection scene-text-recognition

92 stars
18 forks
45 projects

Last updated: 09 Dec 2025

Data-Science-and-Machine-Learning-Resources

List of Data Science and Machine Learning Resource that I frequently use

algorithms awesome awesome-list blog blogs collections datascience datasets deep-learning ebooks

73 stars
21 forks
264 projects

Last updated: 24 Nov 2025

awesome-data-chile

Lista curada de datasets públicos sobre Chile.

awesome awesome-list chile data datasets opendata

72 stars
3 forks
44 projects

Last updated: 11 Dec 2025

awesome-datasets

A comprehensive list of annotated training datasets classified by use case.

annotation awesome-data-science awesome-datasets awesome-public-datasets corpora data dataset datasets document-processing entity-extraction

35 stars
6 forks
87 projects

Last updated: 04 Oct 2025

awesome-swedish-nlp

A curated list of resources for natural language processing (NLP) in Swedish

awesome-list corpora corpus dataset datasets natural-language-generation natural-language-processing nlp resource-list swedish

26 stars
2 forks
68 projects

Last updated: 23 Oct 2025

awesome-malware-benign-datasets

🪲 A curated list of Malware and Benign datasets for security researchers

awesome-list datasets machine-learning malware-analysis malware-researchers security

21 stars
2 forks
32 projects

Last updated: 29 Oct 2025

awesome-marine-hacking

Awesome Resources for Ocean Hacking

awesome awesome-list dataset datasets hackathon ocean ocean-hacking oceanography

16 stars
4 forks
35 projects

Last updated: 15 Sep 2025

awesome-nba-data

A curated list of awesome NBA Data and resources.

awesome-list data datasets nba nba-data nba-stats

14 stars
1 forks
45 projects

Last updated: 29 Nov 2025

awesome-italian-public-datasets

A selection of interesting Open dataset from the Italian Public Administration and Civic Data use cases

civic-hacking civic-tech data-science datasets government-data opendata

10 stars
3 forks
43 projects

Last updated: 04 Sep 2025

awesome-ai-for-gui-agents

Awesome resources about AI for GUI Agents.

ai awesome awesome-list datasets gui models papers

10 stars
0 forks
30 projects

Last updated: 25 Nov 2025

awesome-turkish-vlm

A curated list of models, datasets and other useful resources for Turkish Vision-Language Models (VLM).

awesome awesome-list computer-vision datasets deep-learning fine-tuning multimodal nlp pretrained-models turkish

3 stars
0 forks
37 projects

Last updated: 29 Jul 2025

awesome-transit

copy of https://github.com/CUTR-at-USF/awesome-transit

awesome awesome-list bus datasets gtfs gtfs-analysis gtfs-converters gtfs-feed gtfs-files gtfs-libraries

0 stars
0 forks
296 projects

Last updated: 25 Jan 2022

Search
Keywords
awesome-list 3,789 awesome 3,440 awesome-lists 501 machine-learning 434 list 410 deep-learning 365 resources 315 hacktoberfest 237 ai 233 python 215 lists 202 javascript 190 llm 183 security 165 programming 161 artificial-intelligence 146 computer-vision 133 blockchain 133 open-source 118 nlp 115 tools 114 large-language-models 110 data-science 110 react 103 chatgpt 95 android 91 natural-language-processing 87 linux 84 curated-list 84 learning 83 css 80 awesome-readme 78 devops 77 papers 76 awesome-resources 74 ios 74 ethereum 74 reinforcement-learning 71 cybersecurity 65 computer-science 64 nodejs 62 kubernetes 61 robotics 60 playground 60 rust 60 collection 59 design 58 game-development 58 free-resources 58 macos 57 getvm 57 free 56 java 55 tutorials 54 golang 53 privacy 53 frontend 53 survey 52 tutorial 52 openai 51 web 50 hacking 50 bitcoin 49 datasets 49 go 48 llms 48 paper 48 cloud 46 data 45 swift 45 html 45 hands-on 45 web3 45 labex 45 dataset 45 php 45 developer-tools 45 books 44 education 44 deep-neural-networks 44 development 44 gpt 43 github 42 cryptocurrency 42 security-tools 42 exercises 41 data-visualization 40 typescript 39 web-development 39 vue 39 opensource 39 generative-ai 38 gamedev 38 automation 38 documentation 37 collections 37 software 37 ml 36 algorithms 36 database 36 docker 36 research 35 iot 34 testing 33 neural-network 33 paper-list 33 reactjs 33 framework 32 courses 32 mcp 32 diffusion-models 32 aws 32 game 31 projects 30 solidity 30 productivity 30 community 30 graph-neural-networks 30 cpp 30 agent 29 multimodal 29 dotnet 29 hardware 29 ruby 29 transformer 29 libraries 28 video 28 smart-contracts 28 software-engineering 28 flutter 28 api 28 csharp 28 bioinformatics 28 games 28 architecture 28 defi 27 links 27 tensorflow 27 windows 26 crypto 26 react-native 26 angular 25 self-supervised-learning 25 ai-agents 25 prompt-engineering 25 slam 25 microsoft 25 music 25 pytorch 24 self-hosted 24 computer-graphics 24 penetration-testing 24 science 24 agents 24 pentesting 24 object-detection 24 osint 24 django 24 cryptography 24 git 24 embedded 23 engineering 23 cli 23 infosec 23 coding 23 markdown 23 r 23 ui 23 unity 23 sql 23 library 23 serverless 23 mobile 22 python3 22 roadmap 22 awsome-list 22 game-engine 22 c 22 programming-language 22 visualization 22 deeplearning 22 3d 22 best-practices 22 segmentation 22 data-mining 22 interview 22 mathematics 22 neural-networks 22 mlops 22 data-analysis 22 knowledge-graph 22 hacktoberfest-accepted 21 website 21 jobs 21 gpt-3 21 apple 21 opendata 21 audio 21 learning-resources 21 awesomeness 21 autonomous-driving 21 free-tutorials 21 blog 21 federated-learning 21 statistics 21 kotlin 21 ai-tools 21 startups 21 gpt-4 21 foundation-models 20 blogs 20 data-structures 20 azure 20 graph 20 apps 20 front-end 20 machinelearning 20 software-development 20 cloud-computing 20 guidelines 20 chinese 20 llama 19 js 19 nextjs 19 time-series 19 youtube 19 marketing 19 claude 19 raspberry-pi 19 awsome 19 startup 19 saas 19 reasoning 19 curated 19 bugbounty 19 representation-learning 18 code 18 graphql 18 backend 18 aigc 18 chatbot 18 stable-diffusion 18 decentralized 18 big-data 18 reverse-engineering 18 bash 18 algorithm 17 analytics 17 networking 17 optimization 17 knowledge 17 transformers 17 cloud-native 17 containers 17 monitoring 17 cheatsheet 17 generative-art 17 image-processing 16 articles 16 seo 16 webgl 16 agentic-ai 16 autonomous-vehicles 16 open-data 16 bert 16 data-engineering 16 microservices 16 language 16 deepseek 16 beginner-friendly 16 nerf 16 mysql 15 graphics 15 readme 15 animation 15 prompt 15 p2p 15 technology 15 detection 15 android-development 15 cms 15 dart 15 finance 15 terminal 15 pentest 15 command-line 15 resource 15 ros 15 multimodal-deep-learning 14 language-model 14 recommender-system 14 telegram 14 android-library 14 embodied-ai 14 vlm 14 rag 14 leetcode 14 foss 14 devsecops 14 laravel 14 vscode 14 webassembly 14 node 14 mac 14 code-generation 14 videos 14 plugins 14 awesome-ai 14 performance 14 vuejs 14 react-components 14 icons 14 text-to-image 14 selfhosted 14 distributed-systems 14 oss 14 art 14 generative-model 14 diffusion 14 frameworks 13 large-language-model 13 fuzzing 13 deep-reinforcement-learning 13 dapp 13 webdevelopment 13 anomaly-detection 13 multimodal-large-language-models 13 malware-analysis 13 wordpress 13 dotnet-core 13 claude-code 13 interview-questions 13 vibe-coding 13 mcp-server 13 image-generation 13 cross-platform 13 generative-adversarial-network 13 speech-recognition 13 anthropic 13 swiftui 13 search 13 pose-estimation 13 remote-sensing 13 transfer-learning 13 developer 13 wasm 13 vision-language-model 13 podcast 13 postgresql 13 evm 13 application 13 unicorns 13 microservice 13 quantum-computing 13 gis 13 gaming 12 vision-and-language 12 rails 12 multimodal-learning 12 guide 12 image 12 infrastructure-as-code 12 mllm 12 prompts 12 authentication 12 npm 12 graphics-programming 12 sustainability 12 ui-design 12 storage 12 gan 12 discord 12 static-site-generator 12 pwa 12 webapp 12 elasticsearch 12 article 12 chain-of-thought 12 video-generation 12 svelte 12 es6 12 datascience 12 search-engine 12 hosting 12 workflow 12 static-analysis 12 hacktoberfest2020 11 point-cloud 11 creative-coding 11 open-science 11 web-design 11 3d-graphics 11 sysadmin 11 threat-intelligence 11 databases 11 unity3d 11 system-design 11 free-software 11 rl 11 objective-c 11 planning 11 webpack 11 benchmark 11 rest-api 11 model-compression 11 question-answering 11 programming-languages 11 ctf 11 time-series-analysis 11 cuda 11 ecommerce 11 ux 11 climate-change 11 few-shot-learning 11 cvpr 11 front-end-development 11 font 11 automl 11 yolo 11 haskell 11 cyber-security 11 utilities 11 gemini 11 design-systems 11 course 11 writing 11 google 11 book 11 interpretability 11 flutter-apps 10 management 10 3d-reconstruction 10 langchain 10 interview-preparation 10 podcasts 10 hacking-tools 10 functional-programming 10 solana 10 network 10 jupyter-notebook 10 edge-computing 10 knowledge-base 10 bug-bounty 10 applications 10 webdesign 10 movies 10 golang-library 10 xai 10 llmops 10 in-context-learning 10 semantic-segmentation 10 ansible 10 reading-list 10 ide 10 model-context-protocol 10 social-network 10 android-application 10 privacy-tools 10 fpga 10 deepseek-r1 10 text-mining 10 explainable-ai 10 operating-system 10 neuroscience 10 unsupervised-learning 10 machine-learning-algorithms 10 cv 10 medical-imaging 10 geospatial 10 test-automation 10 android-app 10 semantic-web 10 students 10 malware 10 templates 10 docs 10 speech-processing 10 anime 10 bot 10 email 10 vr 10 ai-agent 10 drones 10 quantization 10 classification 10 hacktoberfest2021 10 arduino 10 bookmarks 10 augmented-reality 10 compiler 9 arkit 9 politics 9 fairness 9 continual-learning 9 news 9 sentiment-analysis 9 ai-art 9 gpts 9 elixir 9 lua 9