An open API service indexing awesome lists of open source software.

0 "document-processing" Awesome Lists

awesome-datasets

A comprehensive list of annotated training datasets classified by use case.

annotation awesome-data-science awesome-datasets awesome-public-datasets corpora data dataset datasets document-processing entity-extraction

36 stars
6 forks
87 projects

Last updated: 19 Nov 2025

Search
Keywords
awesome-list 3,860 awesome 3,515 awesome-lists 529 machine-learning 438 list 416 deep-learning 367 resources 320 ai 246 hacktoberfest 238 python 216 lists 202 javascript 191 llm 189 security 166 programming 162 artificial-intelligence 149 blockchain 137 computer-vision 133 open-source 118 nlp 115 tools 114 data-science 111 large-language-models 110 react 105 chatgpt 97 android 91 natural-language-processing 87 curated-list 86 linux 85 learning 84 awesome-readme 79 devops 79 css 79 papers 76 awesome-resources 75 ethereum 75 ios 74 reinforcement-learning 71 cybersecurity 68 computer-science 64 rust 61 robotics 61 nodejs 61 kubernetes 61 playground 60 collection 59 free-resources 59 design 59 game-development 59 getvm 57 macos 57 free 56 java 55 tutorials 54 golang 54 openai 54 privacy 53 frontend 53 survey 52 tutorial 52 hacking 50 bitcoin 50 web 50 go 49 datasets 49 llms 49 web3 49 data 48 developer-tools 48 paper 48 cloud 47 php 46 education 45 hands-on 45 labex 45 dataset 45 swift 45 books 44 deep-neural-networks 44 html 44 development 44 gpt 43 cryptocurrency 43 github 42 security-tools 42 exercises 41 data-visualization 41 generative-ai 41 vue 40 database 39 typescript 39 opensource 39 automation 39 web-development 39 gamedev 39 docker 38 software 37 collections 37 research 37 algorithms 37 documentation 37 ml 36 mcp 35 iot 34 neural-network 33 paper-list 33 courses 33 framework 33 testing 33 reactjs 33 diffusion-models 32 aws 32 productivity 32 cpp 31 game 31 projects 30 graph-neural-networks 30 software-engineering 30 multimodal 30 agent 30 ai-agents 30 community 30 api 30 solidity 30 transformer 29 hardware 29 dotnet 29 ruby 29 libraries 28 csharp 28 architecture 28 bioinformatics 28 games 28 smart-contracts 28 video 28 flutter 28 defi 27 links 27 tensorflow 27 prompt-engineering 27 react-native 26 agents 26 crypto 26 windows 26 sql 25 microsoft 25 music 25 self-hosted 25 cryptography 25 self-supervised-learning 25 penetration-testing 25 angular 25 science 25 slam 25 object-detection 24 pytorch 24 coding 24 ai-tools 24 django 24 computer-graphics 24 data-analysis 24 pentesting 24 git 24 serverless 24 osint 24 infosec 23 library 23 embedded 23 neural-networks 23 cli 23 r 23 roadmap 23 markdown 23 ui 23 unity 23 engineering 23 game-engine 23 best-practices 23 c 22 mobile 22 statistics 22 deeplearning 22 awsome-list 22 knowledge-graph 22 3d 22 data-mining 22 python3 22 programming-language 22 segmentation 22 mlops 22 jobs 22 visualization 22 interview 22 website 21 opendata 21 awesomeness 21 autonomous-driving 21 startups 21 mathematics 21 hacktoberfest-accepted 21 data-structures 21 kotlin 21 claude 21 federated-learning 21 audio 21 apple 21 learning-resources 21 blog 21 gpt-4 21 gpt-3 21 free-tutorials 21 cloud-computing 20 graph 20 awsome 20 guidelines 20 front-end 20 chinese 20 js 20 software-development 20 foundation-models 20 apps 20 blogs 20 nextjs 20 machinelearning 20 saas 20 azure 20 time-series 19 startup 19 youtube 19 curated 19 big-data 19 llama 19 reverse-engineering 19 marketing 19 raspberry-pi 19 reasoning 19 bugbounty 19 aigc 18 graphql 18 optimization 18 chatbot 18 backend 18 representation-learning 18 code 18 decentralized 18 bash 18 stable-diffusion 18 transformers 17 cheatsheet 17 deepseek 17 anthropic 17 networking 17 prompt 17 knowledge 17 agentic-ai 17 generative-art 17 open-data 17 analytics 17 monitoring 17 seo 17 cloud-native 17 containers 17 articles 16 algorithm 16 beginner-friendly 16 bert 16 autonomous-vehicles 16 language 16 data-engineering 16 vscode 16 microservices 16 webgl 16 claude-code 16 image-processing 16 finance 16 devsecops 16 nerf 16 resource 15 graphics 15 cms 15 image-generation 15 ros 15 pentest 15 android-development 15 readme 15 dart 15 command-line 15 terminal 15 text-to-image 15 animation 15 plugins 15 mysql 15 detection 15 mcp-server 15 technology 15 webassembly 14 prompts 14 multimodal-deep-learning 14 vlm 14 foss 14 gemini 14 icons 14 selfhosted 14 rag 14 embodied-ai 14 distributed-systems 14 language-model 14 art 14 node 14 videos 14 diffusion 14 databases 14 leetcode 14 recommender-system 14 oss 14 generative-model 14 react-components 14 awesome-ai 14 p2p 14 performance 14 evm 14 android-library 14 mac 14 telegram 14 malware-analysis 14 vuejs 14 code-generation 14 laravel 14 remote-sensing 13 benchmark 13 svelte 13 dotnet-core 13 quantum-computing 13 multimodal-large-language-models 13 unicorns 13 anomaly-detection 13 vibe-coding 13 vision-language-model 13 cross-platform 13 wasm 13 postgresql 13 speech-recognition 13 large-language-model 13 deep-reinforcement-learning 13 interview-questions 13 video-generation 13 fuzzing 13 pose-estimation 13 application 13 transfer-learning 13 wordpress 13 podcast 13 pwa 13 webdevelopment 13 search 13 frameworks 13 generative-adversarial-network 13 gis 13 gan 13 microservice 13 developer 13 swiftui 13 gaming 12 dapp 12 graphics-programming 12 mllm 12 workflow 12 storage 12 automl 12 static-analysis 12 es6 12 search-engine 12 multimodal-learning 12 authentication 12 infrastructure-as-code 12 vision-and-language 12 static-site-generator 12 elasticsearch 12 datascience 12 solana 12 chain-of-thought 12 model-context-protocol 12 google 12 article 12 hosting 12 rails 12 webapp 12 guide 12 sustainability 12 ui-design 12 npm 12 image 12 discord 12 sysadmin 11 ai-agent 11 webpack 11 planning 11 time-series-analysis 11 system-design 11 course 11 ecommerce 11 creative-coding 11 design-systems 11 yolo 11 ux 11 point-cloud 11 3d-graphics 11 golang-library 11 front-end-development 11 font 11 interpretability 11 model-compression 11 rl 11 threat-intelligence 11 open-science 11 web-design 11 hacktoberfest2020 11 utilities 11 writing 11 cuda 11 few-shot-learning 11 objective-c 11 programming-languages 11 unity3d 11 cvpr 11 climate-change 11 ctf 11 free-software 11 rest-api 11 question-answering 11 book 11 mcp-servers 10 flask 10 edge-computing 10 semantic-web 10 augmented-reality 10 ansible 10 classification 10 management 10 reading-list 10 hacktoberfest2021 10 speech-processing 10 privacy-tools 10 docs 10 drones 10 social-network 10 langchain 10 email 10 webdesign 10 infrastructure 10 malware 10 bot 10 cicd 10 llmops 10 semantic-segmentation 10 geospatial 10 cyber-security 10 interview-preparation 10 neuroscience 10 redis 10 machine-learning-algorithms 10 cv 10 cursor 10 bug-bounty 10 android-app 10 3d-reconstruction 10 medical-imaging 10 students 10 quantization 10 applications 10 unsupervised-learning 10 webcomponents 10 podcasts 10 bookmarks 10 blockchain-technology 10 flutter-apps 10 ionic 10 test-automation 10 movies 10 in-context-learning 10 xai 10 explainable-ai 10 text-mining 10 fpga 10 templates 10 knowledge-base 10 hacking-tools 10 anime 10 haskell 10 deepseek-r1 10 vr 10 android-application 10 arduino 10 ide 10 jupyter-notebook 10 reddit 9 blogging 9 microservices-architecture 9 python-library 9 indonesia 9