An open API service indexing awesome lists of open source software.

"dataset" Awesome Lists

public-apis

A collective list of free APIs

api apis dataset development free list lists open-source public public-api

337,535 stars
35,647 forks
1,432 projects

Last updated: 23 Apr 2025

awesome-project-ideas

Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas

awesome awesome-list classification dataset deep-learning forecasting image-classification machine-learning multi-label-classification series-forecasting

8,241 stars
1,254 forks
57 projects

Last updated: 11 Apr 2025

awesome-pretrained-chinese-nlp-models

Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合

bert chinese dataset ernie gpt gpt-2 large-language-models llm multimodel nezha

5,217 stars
494 forks
865 projects

Last updated: 18 Apr 2025

awesome-json-datasets

A curated list of awesome JSON datasets that don't require authentication.

awesome awesome-list data dataset datasets json json-dataset list

3,405 stars
383 forks
300 projects

Last updated: 10 Apr 2025

Awesome-Domain-LLM

收集和梳理垂直领域的开源模型、数据集及评测基准。

awesome-list dataset llm nlp paper-list

2,478 stars
201 forks
303 projects

Last updated: 14 Apr 2025

awesome-industrial-anomaly-detection

Paper list and datasets for industrial image anomaly/defect detection (updating). 工业异常/瑕疵检测论文及数据集检索库(持续更新)。

anomaly-detection anomaly-segmentation computer-vision dataset deep-learning defect-detection industrial-image

2,053 stars
184 forks
769 projects

Last updated: 19 Apr 2025

awesome-remote-sensing-change-detection

List of datasets, codes, and contests related to remote sensing change detection

awesome change-detection dataset remote-sensing

1,807 stars
352 forks
169 projects

Last updated: 15 Apr 2025

awesome-radar-perception

A curated list of radar datasets, detection, tracking and fusion

autonomous-driving autonomous-vehicles dataset deep-learning detection fusion radar slam

1,535 stars
306 forks
559 projects

Last updated: 11 Apr 2025

Awesome-Deepfakes-Detection

A list of tools, papers and code related to Deepfake Detection.

awesome code dataset deepfake-detection deepfakes image paper-with-code paperlist tools video

1,306 stars
124 forks
345 projects

Last updated: 16 Apr 2025

awesome-face

😎 face releated algorithm, dataset and paper

dataset face face-detection face-recognition face-releated-algorithm paper papers

903 stars
181 forks
116 projects

Last updated: 21 Mar 2025

game-datasets

:video_game: A curated list of awesome game datasets, and tools to artificial intelligence in games

artificial-intelligence awesome awesome-game awesome-list data-mining database dataset game games

831 stars
54 forks
191 projects

Last updated: 07 Apr 2025

awesome-biomechanics

A curated, public list of resources for biomechanics and human motion analysis: datasets, processing tools, software for simulation, educational videos, lectures, etc.

awesome awesome-list bioengineering biomechanics computer-simulation dataset dynamics gait gait-analysis human-motion

827 stars
136 forks
869 projects

Last updated: 17 Apr 2025

awesome-chatgpt-dataset

Unlock the Power of LLM: Explore These Datasets to Train Your Own ChatGPT!

awesome chatgpt dataset gpt4 instructions

729 stars
60 forks
70 projects

Last updated: 09 Apr 2025

awesome-segmentation-saliency-dataset

A collection of some datasets for segmentation / saliency detection. Welcome to PR...:smile:

dataset datasets deep-learning deeplearning machine-learning machinelearning saliency-detection

566 stars
96 forks
326 projects

Last updated: 08 Apr 2025

awesome-llm-eval

Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具、基准/数据、演示、排行榜和大模型等组成的精选列表,主要面向基础大模型评测,旨在探求生成式AI的技术边界.

awsome-list awsome-lists benchmark bert chatglm chatgpt dataset evaluation gpt3 large-language-model

514 stars
44 forks
378 projects

Last updated: 18 Apr 2025

NL2SQL_Handbook

This is a continuously updated handbook for readers to easily track the latest NL2SQL (Text-to-SQL) techniques in the literature and provide practical guidance for researchers and practitioners. If we missed any interesting work, feel free to contact us.

ai ai4db awesome awesome-agents awesome-nl2sql awesome-text-to-sql awesome-text2sql dataset db llms

498 stars
31 forks
102 projects

Last updated: 07 Apr 2025

Vehicle_reID-Collection

:red_car: the collection of vehicle re-ID papers, datasets. :red_car:

awesome awesome-list cvpr-workshop dataset deep-learning paper pku-vehicle ve-ri vehicle vehicle-reid

481 stars
54 forks
101 projects

Last updated: 07 Apr 2025

awesome-time-series-segmentation-papers

This repository contains a reading list of papers on Time Series Segmentation. This repository is still being continuously improved.

anomaly-detection awesome awesome-list change-point-detection data-mining data-science dataset deep-learning machine-learning motion-capture

461 stars
10 forks
114 projects

Last updated: 10 Apr 2025

Awesome-Traffic-Agent-Trajectory-Prediction

This is a list of papers related to traffic agent trajectory prediction.

awesome dataset deep-learning papers source-code traffic-agent trajectory-prediction

452 stars
51 forks
783 projects

Last updated: 21 Apr 2025

awesome-msr

A curated repository of software engineering repository mining data sets

awesome awesome-list dataset ghtorrent mining msr

432 stars
67 forks
70 projects

Last updated: 04 Apr 2025

awesome-robotics-datasets

A collection of useful datasets for robotics and computer vision

computer-vision dataset robotics

411 stars
47 forks
79 projects

Last updated: 06 Apr 2025

Awesome-Table-Recognition

A curated list of resources dedicated to table recognition

dataset ocr ocr-recognition papers papers-with-code table-recognition

401 stars
51 forks
40 projects

Last updated: 05 Apr 2025

Awesome_Satellite_Benchmark_Datasets

Supplementary material for our paper "THERE IS NO DATA LIKE MORE DATA" is provided.

awesome-list benchmark benchmark-datasets dataset deep-learning earth-observation machine-learning remote-sensing satellite-imagery

347 stars
28 forks
31 projects

Last updated: 04 Apr 2025

awesome-italian-public-datasets

A selection of interesting Open dataset from the Italian Public Administration and Civic Data use cases

awesome awesome-list civic-hacking civic-tech dataset goverment-data hacktoberfest italy open open-data

293 stars
31 forks
44 projects

Last updated: 15 Apr 2025

awesome-colour

Curated list of awesome colour science resources 😎

awesome awesome-list color color-science color-space color-spaces colorspace colorspaces colour colour-science

280 stars
22 forks
85 projects

Last updated: 02 Apr 2025

awesome-hungarian-nlp

A curated list of NLP resources for Hungarian

awesome awesome-list computational-linguistics corpus corpus-linguistics dataset hungarian hungarian-language information-extraction information-retrieval

244 stars
18 forks
174 projects

Last updated: 03 Apr 2025

Awesome-Foundation-Models-for-Weather-and-Climate

A comprehesive survey about foundation models for weather and cliamte data understanding.

ai4earth ai4science climate climate-change dataset deep-learning foundation-models large-language-models largemodel machine-learning

220 stars
32 forks
184 projects

Last updated: 03 Apr 2025

Awesome-InverseRendering

List of awesome papers on Intrinsic Decomposition & Inverse Rendering

awesome-list awesome-lists dataset intrinsic-decomposition inverse-rendering paper papers

215 stars
11 forks
68 projects

Last updated: 11 Apr 2025

Awesome_Multimodel_LLM

Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Models (MLLM). It covers datasets, tuning techniques, in-context learning, visual reasoning, foundational models, and more. Stay updated with the latest advancement.

chatgpt dataset gpt llm mllm multimodel nlp pretrained-models

181 stars
14 forks
611 projects

Last updated: 22 May 2024

turkish-nlp-resources

🔡 List of Tools, Libraries, Models, Datasets and other resources for Turkish NLP.

awesome-list dataset dogal-dil-isleme natural-language-processing nlp turkish turkish-language turkish-nlp

135 stars
6 forks
73 projects

Last updated: 15 Apr 2025

awesome-object-detection-in-aerial-images

A curated list of awesome resources for generic object detection in aerial images.

aerial-object-detection dataset object-detection oriented-object-detection remote-sensing rotated-object-detection

121 stars
14 forks
211 projects

Last updated: 23 Mar 2025

deep-learning-resources

A curated list of deep learning resources books, courses, papers, libraries, conferences, sample code, and many more.

awesome awesome-list conference data-science dataset deep-learning indonesia lecture machine-learning paper

119 stars
37 forks
54 projects

Last updated: 11 Mar 2025

awesome-object-detection-datasets

A collection of some awesome public object detection and recognition datasets.

aerial-imagery autonomous-driving awesome-list chatgpt coco dataset datasets infrared large-language-models llm

94 stars
9 forks
143 projects

Last updated: 15 Apr 2025

awesome-scene-text-detection

Tracking the latest progress in Scene Text Detection and Recognition: Must-read papers well organized with code and dataset

charmve dataset datasets detection irregular-text-recognition level-annotation ocr recognition scene-text-detection scene-text-recognition

86 stars
17 forks
45 projects

Last updated: 04 Apr 2025

awesome-datasets

A comprehensive list of annotated training datasets classified by use case.

annotation awesome-data-science awesome-datasets awesome-public-datasets corpora data dataset datasets document-processing entity-extraction

33 stars
6 forks
87 projects

Last updated: 29 Mar 2025

awesome-iran-opendata

A repo to introduce website that share Data and Dataset about Iran

awesome datajournalism dataset farsi iran opendata persian

33 stars
5 forks
109 projects

Last updated: 13 Apr 2025

nepali-devblogs

Collection of blogs/publications by Nepali devs.

awesome-list awesomeness blogs collection dataset developer nepal reads resources

33 stars
9 forks
40 projects

Last updated: 19 Jan 2024

publicsectornl

Open Source in the public sector in the Netherlands

awesome-list dataset government netherlands open-data open-government open-science open-source open-state public-sector

31 stars
3 forks
128 projects

Last updated: 31 Mar 2025

awesome-swedish-nlp

A curated list of resources for natural language processing (NLP) in Swedish

awesome-list corpora corpus dataset datasets natural-language-generation natural-language-processing nlp resource-list swedish

24 stars
2 forks
68 projects

Last updated: 28 Jan 2025

awesome-marine-hacking

Awesome Resources for Ocean Hacking

awesome awesome-list dataset datasets hackathon ocean ocean-hacking oceanography

15 stars
3 forks
35 projects

Last updated: 20 Dec 2024

awesome-city-data

A curated list of open data sources to analyze and compare cities in a holistic way à la data science and empower citizens.

awesome-list cities data-science dataset

5 stars
0 forks
37 projects

Last updated: 01 Dec 2024

public-apis

A collective list of free APIs

api apis dataset development free list lists open-source public public-api

0 stars
0 forks
1,436 projects

Last updated: 20 Sep 2023

Search
Keywords
awesome-list 3,494 awesome 3,181 machine-learning 414 list 402 awesome-lists 379 deep-learning 354 resources 303 hacktoberfest 240 python 201 lists 194 javascript 186 ai 178 programming 158 security 156 artificial-intelligence 134 llm 133 computer-vision 130 blockchain 128 nlp 108 tools 105 data-science 104 open-source 103 react 101 chatgpt 90 large-language-models 89 android 85 natural-language-processing 84 linux 78 learning 77 awesome-readme 76 css 74 papers 73 curated-list 72 ios 72 ethereum 72 devops 70 awesome-resources 69 nodejs 63 computer-science 60 reinforcement-learning 60 playground 60 free-resources 58 courses 58 game-development 58 macos 57 getvm 57 rust 56 kubernetes 56 cybersecurity 55 robotics 54 privacy 53 java 53 collection 52 golang 52 tutorial 52 design 52 web 49 paper 48 frontend 48 hacking 47 labex 47 hands-on 47 bitcoin 47 go 47 swift 46 datasets 45 openai 44 education 44 php 44 deep-neural-networks 43 books 43 survey 43 web3 42 cloud 42 html 42 dataset 42 cryptocurrency 42 development 41 security-tools 40 exercises 40 opensource 40 data 40 gpt 40 free 39 vue 39 gamedev 38 data-visualization 38 tutorials 38 docker 37 typescript 36 github 35 collections 35 developer-tools 35 llms 34 database 34 reactjs 33 paper-list 33 research 33 neural-network 33 documentation 33 automation 32 iot 32 aws 32 framework 32 software 31 game 31 ruby 30 hardware 30 testing 29 cpp 29 ml 29 community 29 games 28 solidity 28 architecture 28 flutter 28 graph-neural-networks 28 video 28 algorithms 28 transformer 27 smart-contracts 27 generative-ai 27 libraries 27 web-development 27 tensorflow 27 dotnet 26 react-native 26 csharp 26 windows 26 bioinformatics 26 links 26 music 25 software-engineering 25 crypto 25 git 25 api 25 science 24 pentesting 24 computer-graphics 24 penetration-testing 24 diffusion-models 24 angular 24 django 24 defi 24 embedded 23 library 23 unity 23 r 23 agent 23 self-supervised-learning 23 pytorch 23 game-engine 22 infosec 22 gpt-4 22 coding 22 productivity 22 markdown 22 microsoft 22 self-hosted 22 cryptography 22 best-practices 22 object-detection 22 awsome-list 22 youtube 22 gpt-3 22 neural-networks 21 ui 21 engineering 21 serverless 21 osint 21 projects 21 data-analysis 21 kotlin 21 mobile 21 awesomeness 21 apple 21 knowledge-graph 21 hacktoberfest-accepted 21 opendata 20 cloud-computing 20 startups 20 python3 20 visualization 20 mathematics 20 website 20 data-mining 20 programming-language 20 segmentation 20 multimodal 20 audio 20 roadmap 20 c 20 slam 20 apps 20 deeplearning 19 jobs 19 sql 19 awsome 19 3d 19 guidelines 19 azure 19 federated-learning 19 graph 19 raspberry-pi 19 cli 18 bugbounty 18 time-series 18 blogs 18 startup 18 interview 18 graphql 18 chinese 18 llama 18 front-end 18 bash 18 chatbot 18 marketing 18 prompt-engineering 18 blog 18 js 18 machinelearning 18 knowledge 17 autonomous-driving 17 saas 17 statistics 17 nextjs 17 representation-learning 17 algorithm 17 cheatsheet 17 generative-art 17 curated 17 decentralized 17 language 17 code 17 containers 17 mlops 16 bert 16 aigc 16 articles 16 image-processing 16 webgl 16 optimization 16 data-structures 16 stable-diffusion 16 foundation-models 16 analytics 16 learning-resources 16 resource 16 networking 16 microservices 16 software-development 15 devsecops 15 autonomous-vehicles 15 nerf 15 cloud-native 15 technology 15 monitoring 15 reverse-engineering 15 detection 15 big-data 15 distributed-systems 15 graphics 15 transformers 15 ros 15 pentest 15 dart 14 android-development 14 multimodal-deep-learning 14 mac 14 telegram 14 speech-recognition 14 seo 14 selfhosted 14 data-engineering 14 recommender-system 14 vuejs 14 mysql 14 webassembly 14 node 14 react-components 14 terminal 14 icons 14 android-library 14 prompt 14 backend 14 animation 14 readme 14 performance 14 unicorns 13 laravel 13 open-data 13 gis 13 guide 13 mcp 13 swiftui 13 elasticsearch 13 foss 13 podcast 13 art 13 remote-sensing 13 evm 13 pose-estimation 13 finance 13 microservice 13 command-line 13 deep-reinforcement-learning 13 generative-adversarial-network 13 p2p 13 dotnet-core 13 font 12 static-analysis 12 es6 12 storage 12 transfer-learning 12 wordpress 12 gan 12 svelte 12 graphics-programming 12 objective-c 12 point-cloud 12 dapp 12 frameworks 12 large-language-model 12 oss 12 language-model 12 gaming 12 wasm 12 rails 12 videos 12 image 12 ecommerce 12 discord 12 ai-agents 12 postgresql 12 fuzzing 12 malware-analysis 12 plugins 12 pwa 12 search 12 search-engine 12 cms 12 vscode 11 cvpr 11 hacktoberfest2020 11 quantum-computing 11 question-answering 11 webapp 11 anomaly-detection 11 ansible 11 ctf 11 course 11 sustainability 11 programming-languages 11 network 11 unity3d 11 multimodal-learning 11 text-to-image 11 ui-design 11 npm 11 leetcode 11 cross-platform 11 reasoning 11 beginner-friendly 11 vision-and-language 11 open-science 11 planning 11 sysadmin 11 free-software 11 databases 11 bookmarks 11 automl 11 webpack 11 creative-coding 11 front-end-development 11 application 11 design-systems 11 speech-processing 11 3d-graphics 11 guides 10 hacking-tools 10 reading-list 10 medical-imaging 10 writing 10 vr 10 time-series-analysis 10 rag 10 gpu 10 haskell 10 interview-questions 10 semantic-web 10 rest-api 10 threat-intelligence 10 datascience 10 developer 10 benchmark 10 arduino 10 diffusion 10 operating-system 10 model-compression 10 docs 10 management 10 cuda 10 android-application 10 android-app 10 students 10 image-generation 10 chatgpt-api 10 golang-library 10 yolo 10 geospatial 10 agents 10 bot 10 text-mining 10 flutter-apps 10 deepseek 10 authentication 10 edge-computing 10 webdevelopment 10 interpretability 10 climate-change 10 unsupervised-learning 10 indonesia 10 gpts 10 cv 10 infrastructure-as-code 10 hosting 10 movies 10 hacktoberfest2021 10 code-generation 10 machine-learning-algorithms 10 unix 9 math 9 websites 9 arkit 9 ux 9 system-design 9 few-shot-learning 9 applications 9 3d-reconstruction 9 article 9 ros2 9 explainable-ai 9 static-site-generator 9 semantic-segmentation 9 jupyter-notebook 9 erlang 9 webcomponents 9 quantization 9 book 9 electronics 9 solana 9 ide 9 trading 9 neuroscience 9 podcasts 9 reddit 9 privacy-tools 9 sre 9 social-network 9 swift-library 9 fpga 9 mobile-development 9 blockchain-technology 9 fairness 9 anime 9 bug-bounty 9 google 9 microservices-architecture 9 conference 9 reference 9 in-context-learning 9 malware 9 leadership 9 exploit 9 flask 9 ionic 9 flutter-examples 9 simulation 9 cyber-security 9 shell 9 classification 9 news 9 video-generation 9 chain-of-thought 9 cicd 9 sentiment-analysis 9 chrome 9 knowledge-base 9 augmented-reality 9 material-design 9 email 9 interview-preparation 8 paperlist 8 opengl 8 command-line-tool 8 chinese-translation 8 functional-programming 8 elixir 8 spring-boot 8 notes 8