An open API service indexing awesome lists of open source software.

0 "datasets" Awesome Lists

Awesome-Code-LLM

[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.

ai awesome datasets llm nlp papers software-engineering survey tmlr

3,273 stars
222 forks
2,651 projects

Last updated: 25 Mar 2026

awesome-ai4med

[Medical_NLP ➟ Awesome-AI4Med] medical-related LLMs, Multimodal systems, Datasets, Benchmarks, and more.

awesome-lists collection datasets medical medical-imaging medical-llms medical-vlms models

2,601 stars
447 forks
69 projects

Last updated: 25 Mar 2026

awesome-public-real-time-datasets

A list of publicly available datasets with real-time data maintained by the team at bytewax.io

awesome-list data data-science data-visualization datasets real-time streaming

2,372 stars
188 forks
83 projects

Last updated: 23 Mar 2026

LLM4Rec-Awesome-Papers

A list of awesome papers and resources of recommender system on large language model (LLM).

awesome datasets large-language-models llm4rec recommender-system survey

2,252 stars
162 forks
124 projects

Last updated: 26 Mar 2026

Awesome-Cybersecurity-Datasets

A curated list of amazingly awesome Cybersecurity datasets

attack cybersecurity data dataframe datasets deep deeplearning events ids ips

1,940 stars
327 forks
54 projects

Last updated: 12 Mar 2026

awesome-transit

Community list of transit APIs, apps, datasets, research, and software :bus::star2::train::star2::steam_locomotive:

awesome awesome-list bus datasets gtfs gtfs-analysis gtfs-converters gtfs-feed gtfs-files gtfs-libraries

1,711 stars
226 forks
437 projects

Last updated: 25 Mar 2026

awesome-yolo-object-detection

🚀🚀🚀 A collection of some awesome public YOLO object detection series projects and the related object detection datasets.

cuda datasets deepseek few-shot-object-detection gui llama llm mllm object-detection object-detection-datasets

1,703 stars
229 forks
1,704 projects

Last updated: 13 Mar 2026

awesome-robotics

A curated list of awesome links and software libraries that are useful for robots.

awesome awesome-list datasets deep-learning image-segmentation list lists machine-learning optimization optimization-algorithms

1,356 stars
175 forks
72 projects

Last updated: 19 Mar 2026

awesome-instruction-dataset

A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)

awsome-lists datasets gpt-3 gpt-4 instruction-following instruction-tuning language-model llama

1,146 stars
56 forks
89 projects

Last updated: 24 Mar 2026

awesome-dataset-tools

🔧 A curated list of awesome dataset tools

annotation-tool annotations awsome awsome-list datasets machine-learning

937 stars
130 forks
87 projects

Last updated: 13 Mar 2026

awesome-autonomous-vehicle

无人驾驶的资源列表中文版

autonomous-vehicles awesome-list car-driving computer-vision datasets intelligent-vehicle self-driving-car tutorial

900 stars
223 forks
181 projects

Last updated: 05 Mar 2026

awesome-data-analysis

🚀 500+ curated resources for Data Analysis & Data Science: Python, SQL, Statistics, ML, AI, Visualization, Cheatsheets, Roadmaps, Interview Prep. For beginners and experts.

ai analytics awesome-list big-data business-intelligence dashboard data-analysis data-science data-visualization datasets

881 stars
117 forks
921 projects

Last updated: 30 Mar 2026

awesome-llm-and-aigc

🚀🚀🚀A collection of some awesome public projects about Large Language Model(LLM), Vision Language Model(VLM), Vision Language Action(VLA), AI Generated Content(AIGC), the related Datasets and Applications.

ai4s ai4science aigc awesome-list cuda datasets deepseek gpt langchain llama

807 stars
72 forks
1,402 projects

Last updated: 31 Mar 2026

awesome-instruction-datasets

A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。

chatgpt datasets instruction llama llm prompts self-instruct

725 stars
40 forks
132 projects

Last updated: 11 Mar 2026

awesome-mobile-robotics

Useful links of different content related to AI, Computer Vision, and Robotics.

autonomous-robots autonomous-systems awesome-list books companies datasets jobs lab labs localization

676 stars
105 forks
541 projects

Last updated: 06 Mar 2026

awesome-holistic-3d

A list of papers and resources (data,code,etc) for holistic 3D reconstruction in computer vision

3d-reconstruction awesome computer-vision datasets deep-learning machine-learning

644 stars
93 forks
129 projects

Last updated: 17 Mar 2026

awesome-segmentation-saliency-dataset

A collection of some datasets for segmentation / saliency detection. Welcome to PR...:smile:

dataset datasets deep-learning deeplearning machine-learning machinelearning saliency-detection

621 stars
100 forks
338 projects

Last updated: 28 Mar 2026

Awesome-Domain-Generalization

Awesome things about domain generalization, including papers, code, etc.

awesome awesome-list datasets deep-learning domain-generalization libraries papers

530 stars
53 forks
343 projects

Last updated: 19 Mar 2026

awesome-instruction-learning

Papers and Datasets on Instruction Tuning and Following. ✨✨✨

awesome-list datasets in-context-learning instruction instruction-learning instruction-tuning large-language-models paper-list pretrained-language-model prompt

508 stars
23 forks
177 projects

Last updated: 17 Feb 2026

machine-learning-resources

A curated list of awesome machine learning frameworks, libraries, courses, books and many more.

awesome-list conference data-analysis data-science datasets handbook machine-learning natural-language-processing nlp-machine-learning paper

439 stars
132 forks
56 projects

Last updated: 16 Mar 2026

Awesome-3D-LiDAR-Datasets

This reposiotry is the collection for public 3D LiDAR datasets

awesome-lists datasets lidar

404 stars
37 forks
59 projects

Last updated: 08 Mar 2026

awesome-forests

🌳 A curated list of ground-truth forest datasets for the machine learning and forestry community.

biodiversity carbon climate-change datasets deep-learning ecosystems forestry machine-learning

362 stars
41 forks
64 projects

Last updated: 26 Mar 2026

Text-Summarization-Repo

텍스트 요약 분야의 주요 연구 주제, Must-read Papers, 이용 가능한 model 및 data 등을 추천 자료와 함께 정리한 저장소입니다.

awesome curated datasets nlp paper summary text-summarization

348 stars
49 forks
113 projects

Last updated: 22 Jan 2026

awesome-synthetic-datasets

awesome synthetic (text) datasets

ai awesome-list datasets llms synthetic-data synthetic-dataset-generation

326 stars
17 forks
38 projects

Last updated: 18 Mar 2026

awesome-nlp-polish

A curated list of resources dedicated to Natural Language Processing (NLP) in polish. Models, tools, datasets.

datasets nlp nlp-machine-learning polish-language

307 stars
35 forks
44 projects

Last updated: 22 Jan 2026

awesome-colour

Curated list of awesome colour science resources 😎

awesome awesome-list color color-science color-space color-spaces colorspace colorspaces colour colour-science

302 stars
24 forks
86 projects

Last updated: 08 Feb 2026

awesome-rgbd-datasets

This repository contains information for the paper "A Survey on RGB-D Datasets" and is a collaborative initiative to update the datasets list faster.

awesome awesome-list datasets depth depth-estimation lidar rgb-d survey

284 stars
17 forks
232 projects

Last updated: 22 Feb 2026

Graph-Neural-Networks-With-Heterophily

This repository contains the resources on graph neural network (GNN) considering heterophily.

awesome datasets graph-data graph-neural-networks heterophily homophily

270 stars
23 forks
334 projects

Last updated: 05 Apr 2026

Awesome-Earth-Artificial-Intelligence

A curated list of Earth Science's Artificial Intelligence (AI) tutorials, notebooks, software, datasets, courses, books, video lectures and papers. Contributions most welcome.

air-quality awesome-list biosphere datasets deep-learning dust earth-science earthquakes geosphere glacier

238 stars
61 forks
102 projects

Last updated: 24 Mar 2026

awesome-ukrainian-nlp

Curated list of Ukrainian natural language processing (NLP) resources (corpora, pretrained models, libriaries, etc.)

awesome-list datasets natural-language-processing nlp ukrainian ukrainian-nlp

227 stars
23 forks
92 projects

Last updated: 19 Feb 2026

awesome-taxonomy

A curated resource for taxonomy research

datasets hypernymy-detection taxonomy-construction taxonomy-learning

219 stars
27 forks
172 projects

Last updated: 06 Mar 2026

awesome-lidar-place-recognition

A curated list of Place Recognition methods, datasets, and various algorithms for LiDAR

awesome awesome-list datasets lidar place-recognition point-cloud robotics slam

204 stars
6 forks
75 projects

Last updated: 25 Feb 2026

awesome-legal-data

A collection of datasets and other resources for legal text processing.

datasets legal legal-tech nlp

192 stars
29 forks
134 projects

Last updated: 15 Mar 2026

Awesome-Deepfakes

A list of datasets, tools, papers and code related to Deepfakes.

awesome datasets deepfakes image paper-with-code paperlist tools video

191 stars
10 forks
72 projects

Last updated: 27 Feb 2026

awesome-Iran-datasets

Iranian/Persian Datasets. دیتاست‌های فارسی و ایرانی

awesome data-science datasets machine-learning persian persiandataset

149 stars
15 forks
82 projects

Last updated: 26 Feb 2026

awesome-dynamic-graphs

A collection of resources on dynamic/streaming/temporal/evolving graph processing systems, databases, data structures, datasets, and related academic and industrial work

awesome awesome-list awesome-lists datasets dynamic-graph-processing dynamic-graphs evolving-graphs graph graph-analytics graph-databases

145 stars
16 forks
71 projects

Last updated: 30 Jan 2026

awesome-object-detection-datasets

A collection of some awesome public object detection and recognition datasets.

aerial-imagery autonomous-driving awesome-list chatgpt coco dataset datasets infrared large-language-models llm

136 stars
10 forks
143 projects

Last updated: 05 Mar 2026

awesome-healthcare-datasets

Healthcare and biomedical datasets, for AI/ML

awesome-list biomedical clinical datasets healthcare imaging omics

116 stars
18 forks
195 projects

Last updated: 11 Mar 2026

awesome-human-activity-recognition

Always up-to-date, most comprehensive HAR resource — continuously scanned and auto-updated from Papers with Code. 53 datasets integrated across all modalities.

action-recognition awesome awesome-list benchmark computer-vision datasets deep-learning human-activity-recognition machine-learning motion-detection

93 stars
1 forks
123 projects

Last updated: 08 Apr 2026

awesome-scene-text-detection

Tracking the latest progress in Scene Text Detection and Recognition: Must-read papers well organized with code and dataset

charmve dataset datasets detection irregular-text-recognition level-annotation ocr recognition scene-text-detection scene-text-recognition

92 stars
18 forks
45 projects

Last updated: 09 Dec 2025

awesome-data-chile

Lista curada de datasets públicos sobre Chile.

awesome awesome-list chile data datasets opendata

77 stars
4 forks
44 projects

Last updated: 07 Mar 2026

Data-Science-and-Machine-Learning-Resources

List of Data Science and Machine Learning Resource that I frequently use

algorithms awesome awesome-list blog blogs collections datascience datasets deep-learning ebooks

73 stars
21 forks
264 projects

Last updated: 20 Feb 2026

awesome-datasets

A comprehensive list of annotated training datasets classified by use case.

annotation awesome-data-science awesome-datasets awesome-public-datasets corpora data dataset datasets document-processing entity-extraction

38 stars
6 forks
87 projects

Last updated: 01 Mar 2026

awesome-malware-benign-datasets

🪲 A list of malware and benign datasets for malware research

android awesome-list cybersecurity datasets deep-learning machine-learning malware-analysis malware-classification malware-dataset malware-detection

33 stars
4 forks
42 projects

Last updated: 10 Mar 2026

awesome-swedish-nlp

A curated list of resources for natural language processing (NLP) in Swedish

awesome-list corpora corpus dataset datasets natural-language-generation natural-language-processing nlp resource-list swedish

27 stars
2 forks
68 projects

Last updated: 09 Feb 2026

awesome-nba-data

A curated list of awesome NBA Data and resources.

awesome-list data datasets nba nba-data nba-stats

24 stars
2 forks
45 projects

Last updated: 05 Mar 2026

awesome-marine-hacking

Awesome Resources for Ocean Hacking

awesome awesome-list dataset datasets hackathon ocean ocean-hacking oceanography

17 stars
4 forks
35 projects

Last updated: 18 Jan 2026

awesome-pdf

A curated list of amazingly libraries, services and resources to work with PDF files

awesome awesome-list data-science datasets ocr pdf pdf-generation pdf-viewer text-extraction

16 stars
6 forks
38 projects

Last updated: 20 Feb 2026

awesome-italian-public-datasets

A selection of interesting Open dataset from the Italian Public Administration and Civic Data use cases

civic-hacking civic-tech data-science datasets government-data opendata

10 stars
3 forks
43 projects

Last updated: 04 Sep 2025

awesome-ai-for-gui-agents

Awesome resources about AI for GUI Agents.

ai awesome awesome-list datasets gui models papers

10 stars
1 forks
30 projects

Last updated: 25 Nov 2025

awesome-turkish-vlm

A curated list of models, datasets and other useful resources for Turkish Vision-Language Models (VLM).

awesome awesome-list computer-vision datasets deep-learning fine-tuning multimodal nlp pretrained-models turkish

3 stars
0 forks
37 projects

Last updated: 29 Jul 2025

awesome-transit

copy of https://github.com/CUTR-at-USF/awesome-transit

awesome awesome-list bus datasets gtfs gtfs-analysis gtfs-converters gtfs-feed gtfs-files gtfs-libraries

0 stars
0 forks
296 projects

Last updated: 25 Jan 2022

Search
Keywords
awesome-list 4,049 awesome 3,666 awesome-lists 573 machine-learning 457 list 420 deep-learning 383 resources 329 ai 282 hacktoberfest 234 python 223 llm 220 lists 204 javascript 193 security 174 programming 165 artificial-intelligence 161 computer-vision 142 blockchain 140 open-source 127 nlp 125 large-language-models 120 tools 117 data-science 111 react 108 chatgpt 104 android 94 curated-list 93 natural-language-processing 91 linux 87 learning 86 devops 84 awesome-readme 84 awesome-resources 80 papers 80 ios 78 css 77 cybersecurity 77 reinforcement-learning 75 ethereum 74 robotics 66 kubernetes 65 computer-science 65 nodejs 65 rust 64 developer-tools 62 collection 62 macos 60 playground 60 game-development 59 free-resources 59 survey 59 design 59 openai 58 getvm 57 free 57 java 56 golang 56 tutorials 55 frontend 53 privacy 53 bitcoin 52 datasets 52 hacking 52 ai-agents 51 tutorial 51 web3 51 go 50 web 50 php 50 paper 49 generative-ai 49 llms 49 mcp 48 education 48 swift 47 data 47 cloud 46 gpt 46 books 46 github 45 development 45 hands-on 45 labex 45 automation 45 dataset 44 deep-neural-networks 44 security-tools 43 html 43 cryptocurrency 42 opensource 42 research 41 data-visualization 41 documentation 41 exercises 41 typescript 40 ml 40 agent 40 vue 40 database 40 docker 40 gamedev 39 web-development 38 collections 38 diffusion-models 37 algorithms 37 productivity 37 prompt-engineering 37 iot 36 game 36 software 36 ai-tools 35 neural-network 35 paper-list 35 courses 34 api 34 reactjs 34 testing 34 cpp 33 framework 33 aws 33 flutter 32 agents 31 games 31 multimodal 31 projects 31 solidity 31 software-engineering 31 community 31 video 30 transformer 30 graph-neural-networks 30 hardware 30 react-native 30 libraries 29 bioinformatics 29 ruby 29 dotnet 29 architecture 29 smart-contracts 28 crypto 28 cli 27 csharp 27 tensorflow 27 science 27 defi 27 penetration-testing 27 self-hosted 27 claude 27 links 27 object-detection 27 windows 27 serverless 26 slam 26 pytorch 26 music 26 self-supervised-learning 26 cryptography 26 mlops 26 angular 25 django 25 reverse-engineering 25 data-mining 25 best-practices 25 sql 25 git 25 pentesting 25 library 25 microsoft 25 data-analysis 25 mobile 24 engineering 24 jobs 24 3d 24 coding 24 mathematics 24 computer-graphics 24 infosec 24 agentic-ai 24 awsome-list 24 osint 24 saas 24 ui 23 unity 23 nextjs 23 markdown 23 game-engine 23 c 23 embedded 23 segmentation 23 claude-code 23 learning-resources 23 roadmap 23 software-development 23 programming-language 23 knowledge-graph 23 neural-networks 23 blog 23 gemini 22 awesomeness 22 rag 22 startups 22 time-series 22 python3 22 r 22 visualization 22 deeplearning 22 kotlin 22 statistics 22 interview 22 apple 22 foundation-models 22 autonomous-driving 22 opendata 21 hacktoberfest-accepted 21 chatbot 21 azure 21 free-tutorials 21 finance 21 networking 21 marketing 21 cloud-computing 21 blogs 21 gpt-4 21 gpt-3 21 audio 21 website 21 federated-learning 21 startup 21 llama 20 chinese 20 monitoring 20 js 20 reasoning 20 graph 20 awsome 20 data-structures 20 machinelearning 20 apps 20 raspberry-pi 20 front-end 19 bugbounty 19 youtube 19 anthropic 19 transformers 19 big-data 19 curated 19 image-generation 19 prompts 19 representation-learning 18 stable-diffusion 18 prompt 18 decentralized 18 optimization 18 devsecops 18 guidelines 18 aigc 18 backend 18 code 18 graphql 18 cloud-native 17 beginner-friendly 17 open-data 17 malware-analysis 17 model-context-protocol 17 containers 17 generative-art 17 vibe-coding 17 bash 17 nerf 17 autonomous-vehicles 17 seo 17 plugins 17 data-engineering 17 knowledge 17 cheatsheet 17 generative-model 17 analytics 17 algorithm 16 dart 16 articles 16 command-line 16 language 16 benchmark 16 ros 16 animation 16 speech-recognition 16 bert 16 vscode 16 text-to-image 16 langchain 16 deepseek 16 webgl 16 telegram 16 microservices 16 readme 15 pentest 15 technology 15 swiftui 15 multimodal-deep-learning 15 laravel 15 resource 15 remote-sensing 15 distributed-systems 15 p2p 15 mcp-server 15 leetcode 15 terminal 15 evm 15 mysql 15 android-development 15 vlm 15 embodied-ai 15 cms 15 pose-estimation 15 large-language-model 15 image-processing 15 code-generation 14 react-components 14 vision-language-model 14 node 14 selfhosted 14 wasm 14 mac 14 webassembly 14 performance 14 generative-adversarial-network 14 graphics 14 databases 14 art 14 vuejs 14 language-model 14 android-library 14 npm 14 oss 14 icons 14 awesome-ai 14 google 14 quantization 14 anomaly-detection 14 diffusion 14 recommender-system 14 detection 14 foss 14 dotnet-core 14 multimodal-learning 14 videos 14 pwa 13 dapp 13 fuzzing 13 transfer-learning 13 yolo 13 application 13 gan 13 semantic-segmentation 13 ai-agent 13 interview-questions 13 video-generation 13 cursor 13 deep-reinforcement-learning 13 skills 13 objective-c 13 storage 13 gis 13 svelte 13 wordpress 13 developer 13 discord 13 medical-imaging 13 microservice 13 utilities 13 multimodal-large-language-models 13 unicorns 13 webdevelopment 13 podcast 13 quantum-computing 13 cross-platform 13 frameworks 13 search 13 postgresql 13 es6 12 agent-skills 12 elasticsearch 12 guide 12 mllm 12 llmops 12 cuda 12 authentication 12 planning 12 webapp 12 static-site-generator 12 chain-of-thought 12 gaming 12 hosting 12 graphics-programming 12 search-engine 12 book 12 3d-graphics 12 model-compression 12 article 12 datascience 12 infrastructure-as-code 12 rails 12 minecraft 12 web-design 12 static-analysis 12 time-series-analysis 12 vision-and-language 12 edge-computing 12 image 12 sysadmin 12 infrastructure 12 system-design 12 ui-design 12 programming-languages 12 solana 12 workflow 12 cicd 12 sustainability 12 automl 12 question-answering 12 embedded-systems 11 ux 11 drones 11 movies 11 creative-coding 11 ecommerce 11 open-science 11 unsupervised-learning 11 threat-intelligence 11 hacktoberfest2020 11 free-software 11 course 11 sre 11 font 11 continual-learning 11 mobile-development 11 podcasts 11 in-context-learning 11 flutter-apps 11 interpretability 11 unity3d 11 ai-safety 11 ros2 11 climate-change 11 rest-api 11 writing 11 cvpr 11 point-cloud 11 vr 11 mcp-servers 11 ctf 11 reading-list 11 few-shot-learning 11 math 11 anime 11 rl 11 webpack 11 software-architecture 11 3d-reconstruction 11 arduino 11 machine-learning-algorithms 11 healthcare 11 uav 10 generative-models 10 bookmarks 10 explainable-ai 10 design-systems 10 flutter-examples 10 speech-processing 10 augmented-reality 10 geospatial 10 fpga 10 redis 10 apis 10 golang-library 10 lua 10 pdf 10 malware 10 flask 10 qwen 10 news 10 applications 10 codex 10 haskell 10 autonomous-agents 10 neuroscience 10 bug-bounty 10 openclaw 10 docs 10 reddit 10 trading 10 hacking-tools 10 simulation 10 tool 10 ionic 10 semantic-web 10 webdesign 10 ide 10 students 10 privacy-tools 10