An open API service indexing awesome lists of open source software.

0 "data-validation" Awesome Lists

awesome-data-quality

A comprehensive collection of data quality resources, tools, papers, and projects across various data types including traditional data, LLM pretraining/fine-tuning data, multimodal data, and more. Essential reference for researchers and practitioners in data-centric AI.

awesome awesome-list data-centric-ai data-cleaning data-curation data-preprocessing data-quality data-validation graph-data llm

23 stars
4 forks
86 projects

Last updated: 28 Nov 2025

Search
Keywords
awesome-list 3,898 awesome 3,554 awesome-lists 539 machine-learning 443 list 419 deep-learning 371 resources 323 ai 252 hacktoberfest 238 python 219 lists 203 llm 196 javascript 193 security 166 programming 163 artificial-intelligence 151 blockchain 139 computer-vision 133 open-source 119 nlp 117 tools 114 large-language-models 113 data-science 111 react 107 chatgpt 98 android 92 curated-list 87 natural-language-processing 87 linux 85 learning 84 devops 81 css 79 awesome-readme 79 papers 76 awesome-resources 75 ethereum 75 ios 75 reinforcement-learning 71 cybersecurity 70 computer-science 65 nodejs 63 kubernetes 63 rust 63 collection 60 robotics 60 playground 60 game-development 59 design 59 free-resources 59 macos 57 getvm 57 free 57 java 56 golang 55 tutorials 54 openai 54 frontend 54 survey 53 privacy 53 tutorial 52 bitcoin 51 developer-tools 51 datasets 51 hacking 50 web 50 go 49 llms 49 data 48 web3 48 paper 48 cloud 47 education 46 php 46 dataset 46 labex 45 swift 45 hands-on 45 development 45 html 44 github 44 books 44 deep-neural-networks 44 gpt 43 cryptocurrency 43 security-tools 42 data-visualization 41 generative-ai 41 automation 41 exercises 41 vue 40 web-development 39 typescript 39 database 39 opensource 39 gamedev 38 documentation 38 docker 38 algorithms 38 research 37 software 37 collections 37 ml 37 iot 35 mcp 35 paper-list 34 neural-network 34 reactjs 33 diffusion-models 33 testing 33 courses 33 framework 33 aws 33 agent 32 productivity 32 api 32 cpp 32 game 31 community 30 ai-agents 30 multimodal 30 hardware 30 projects 30 graph-neural-networks 30 software-engineering 30 transformer 30 solidity 30 flutter 29 ruby 29 dotnet 29 architecture 28 smart-contracts 28 video 28 bioinformatics 28 libraries 28 csharp 28 prompt-engineering 28 defi 28 games 28 tensorflow 27 links 27 react-native 27 crypto 26 penetration-testing 26 cryptography 26 agents 26 self-hosted 26 windows 26 ai-tools 25 self-supervised-learning 25 music 25 cli 25 microsoft 25 angular 25 data-analysis 25 science 25 sql 25 pytorch 25 serverless 25 slam 24 git 24 computer-graphics 24 coding 24 osint 24 data-mining 24 object-detection 24 django 24 pentesting 24 r 23 3d 23 neural-networks 23 claude 23 embedded 23 unity 23 c 23 engineering 23 game-engine 23 best-practices 23 infosec 23 mobile 23 roadmap 23 library 23 ui 23 markdown 23 interview 22 hacktoberfest-accepted 22 nextjs 22 visualization 22 programming-language 22 learning-resources 22 jobs 22 deeplearning 22 statistics 22 mlops 22 segmentation 22 python3 22 knowledge-graph 22 awsome-list 22 awesomeness 21 mathematics 21 kotlin 21 blog 21 audio 21 reverse-engineering 21 website 21 data-structures 21 apple 21 cloud-computing 21 federated-learning 21 gpt-4 21 startups 21 free-tutorials 21 autonomous-driving 21 software-development 21 time-series 21 gpt-3 21 azure 21 opendata 21 chatbot 20 apps 20 front-end 20 chinese 20 saas 20 js 20 marketing 20 graph 20 blogs 20 machinelearning 20 awsome 20 foundation-models 20 guidelines 20 startup 19 bugbounty 19 decentralized 19 youtube 19 llama 19 raspberry-pi 19 reasoning 19 big-data 19 curated 19 code 18 backend 18 monitoring 18 agentic-ai 18 stable-diffusion 18 bash 18 optimization 18 aigc 18 finance 18 graphql 18 networking 18 anthropic 18 representation-learning 18 transformers 17 open-data 17 cheatsheet 17 seo 17 beginner-friendly 17 prompt 17 generative-art 17 analytics 17 cloud-native 17 knowledge 17 containers 17 language 16 image-generation 16 algorithm 16 articles 16 devsecops 16 autonomous-vehicles 16 bert 16 microservices 16 image-processing 16 data-engineering 16 deepseek 16 vscode 16 claude-code 16 webgl 16 mysql 15 evm 15 graphics 15 ros 15 p2p 15 plugins 15 malware-analysis 15 gemini 15 readme 15 nerf 15 mcp-server 15 terminal 15 cms 15 technology 15 android-development 15 selfhosted 15 pentest 15 text-to-image 15 resource 15 dart 15 laravel 15 command-line 15 animation 15 detection 15 multimodal-deep-learning 14 leetcode 14 large-language-model 14 telegram 14 distributed-systems 14 vlm 14 rag 14 diffusion 14 node 14 anomaly-detection 14 oss 14 android-library 14 art 14 embodied-ai 14 generative-adversarial-network 14 language-model 14 prompts 14 databases 14 awesome-ai 14 benchmark 14 react-components 14 foss 14 videos 14 generative-model 14 recommender-system 14 webassembly 14 wasm 14 vuejs 14 remote-sensing 14 interview-questions 14 performance 14 mac 14 icons 14 code-generation 14 developer 13 fuzzing 13 multimodal-large-language-models 13 dotnet-core 13 vision-language-model 13 npm 13 webdevelopment 13 video-generation 13 microservice 13 pose-estimation 13 pwa 13 gis 13 postgresql 13 swiftui 13 transfer-learning 13 speech-recognition 13 search 13 application 13 cross-platform 13 vibe-coding 13 deep-reinforcement-learning 13 gan 13 podcast 13 wordpress 13 quantum-computing 13 svelte 13 unicorns 13 frameworks 13 time-series-analysis 12 web-design 12 discord 12 article 12 ui-design 12 google 12 static-analysis 12 guide 12 vision-and-language 12 graphics-programming 12 automl 12 rails 12 sustainability 12 datascience 12 image 12 static-site-generator 12 model-compression 12 solana 12 infrastructure-as-code 12 programming-languages 12 search-engine 12 mllm 12 webapp 12 dapp 12 workflow 12 model-context-protocol 12 hosting 12 multimodal-learning 12 chain-of-thought 12 authentication 12 elasticsearch 12 es6 12 gaming 12 storage 12 ux 11 ecommerce 11 hacktoberfest2020 11 golang-library 11 objective-c 11 climate-change 11 ctf 11 rl 11 ai-agent 11 3d-graphics 11 interpretability 11 font 11 design-systems 11 sysadmin 11 cuda 11 langchain 11 system-design 11 utilities 11 cvpr 11 point-cloud 11 book 11 unity3d 11 unsupervised-learning 11 medical-imaging 11 rest-api 11 free-software 11 yolo 11 open-science 11 writing 11 anime 11 few-shot-learning 11 question-answering 11 course 11 front-end-development 11 creative-coding 11 webpack 11 planning 11 threat-intelligence 11 privacy-tools 10 infrastructure 10 email 10 docs 10 hacktoberfest2021 10 ionic 10 augmented-reality 10 hacking-tools 10 neuroscience 10 ide 10 jupyter-notebook 10 quantization 10 ansible 10 android-app 10 android-application 10 classification 10 interview-preparation 10 cyber-security 10 edge-computing 10 malware 10 vr 10 redis 10 social-network 10 flutter-apps 10 bug-bounty 10 bookmarks 10 applications 10 haskell 10 management 10 3d-reconstruction 10 knowledge-base 10 sre 10 arduino 10 test-automation 10 llmops 10 semantic-segmentation 10 bot 10 mcp-servers 10 podcasts 10 in-context-learning 10 movies 10 cv 10 machine-learning-algorithms 10 text-mining 10 semantic-web 10 blockchain-technology 10 cursor 10 webdesign 10 templates 10 software-architecture 10 chatgpt-api 10 drones 10 healthcare 10 webcomponents 10 xai 10 cicd 10 reading-list 10 geospatial 10 speech-processing 10 students 10 flask 10 fpga 10 qwen 10 deepseek-r1 9 functional-programming 9