An open API service indexing awesome lists of open source software.

0 "data-deduplication" Awesome Lists

awesome-data-llm

Official Repository of "LLM × DATA" Survey Paper

data-acquisition data-deduplication data-filtering data-mixing data-provenance data-selection data-synthesis data-transformation llm vlm

770 stars
68 forks
795 projects

Last updated: 27 Apr 2026

Search
Keywords
awesome-list 4,276 awesome 3,846 awesome-lists 620 machine-learning 469 list 425 deep-learning 391 resources 333 ai 312 llm 256 hacktoberfest 231 python 227 lists 205 javascript 198 security 180 artificial-intelligence 170 programming 165 blockchain 145 computer-vision 145 open-source 142 large-language-models 130 nlp 128 tools 123 data-science 115 curated-list 111 react 109 chatgpt 109 ai-agents 101 android 96 natural-language-processing 92 devops 89 linux 87 learning 87 awesome-readme 85 cybersecurity 85 ios 81 papers 81 mcp 81 awesome-resources 79 reinforcement-learning 79 developer-tools 78 css 77 ethereum 76 robotics 72 openai 70 rust 69 kubernetes 67 nodejs 66 computer-science 65 collection 62 macos 61 design 61 survey 61 playground 61 game-development 60 free-resources 59 golang 59 generative-ai 58 free 58 tutorials 58 getvm 57 web3 57 java 56 privacy 56 datasets 55 hacking 54 frontend 54 bitcoin 53 php 53 automation 52 tutorial 52 education 51 llms 51 ai-tools 51 go 50 paper 50 web 49 swift 49 gpt 48 data 47 github 46 security-tools 46 books 46 typescript 45 development 45 hands-on 45 labex 45 dataset 45 agent 45 cloud 45 deep-neural-networks 44 cryptocurrency 44 prompt-engineering 44 claude-code 43 html 43 research 43 data-visualization 43 productivity 42 documentation 42 database 42 gamedev 41 exercises 41 opensource 41 vue 41 claude 40 api 40 docker 40 diffusion-models 39 web-development 39 ml 39 paper-list 38 collections 38 software 38 agentic-ai 37 algorithms 37 game 36 iot 35 neural-network 35 agents 34 courses 34 cli 34 reactjs 34 aws 34 testing 34 hardware 33 multimodal 33 framework 33 cpp 33 community 32 video 32 software-engineering 32 crypto 32 flutter 32 rag 32 defi 31 solidity 31 games 31 react-native 31 projects 31 libraries 30 graph-neural-networks 30 mlops 30 transformer 30 self-hosted 30 bioinformatics 29 ruby 29 architecture 29 penetration-testing 29 smart-contracts 28 gemini 28 mobile 28 microsoft 28 pentesting 28 marketing 28 dotnet 28 best-practices 27 music 27 links 27 object-detection 27 angular 27 csharp 27 saas 27 cryptography 27 science 27 slam 27 tensorflow 27 windows 27 anthropic 26 reverse-engineering 26 data-analysis 26 serverless 26 library 26 prompts 26 self-supervised-learning 26 pytorch 26 infosec 25 data-mining 25 coding 25 django 25 3d 25 jobs 25 nextjs 25 sql 25 osint 25 knowledge-graph 25 computer-graphics 24 neural-networks 24 chatbot 24 time-series 24 git 24 kotlin 24 markdown 24 learning-resources 24 awsome-list 24 embedded 24 mathematics 24 open-data 24 game-engine 23 engineering 23 visualization 23 blog 23 apple 23 ai-agent 23 c 23 unity 23 audio 23 programming-language 23 autonomous-driving 23 segmentation 23 chinese 23 openclaw 23 software-development 23 interview 23 roadmap 23 ui 23 image-generation 23 reasoning 22 model-context-protocol 22 networking 22 awesomeness 22 website 22 deeplearning 22 foundation-models 22 monitoring 22 statistics 22 startup 22 python3 22 skills 22 seo 22 devsecops 22 startups 22 federated-learning 21 cloud-computing 21 agent-skills 21 azure 21 awsome 21 blogs 21 r 21 free-tutorials 21 hacktoberfest-accepted 21 gpt-3 21 finance 21 data-structures 20 machinelearning 20 llama 20 graphql 20 curated 20 transformers 20 decentralized 20 opendata 20 apps 20 graph 20 js 20 youtube 20 plugins 20 terminal 20 raspberry-pi 19 backend 19 command-line 19 langchain 19 front-end 19 mcp-server 19 gpt-4 19 aigc 19 benchmark 19 big-data 19 code 18 stable-diffusion 18 embodied-ai 18 vibe-coding 18 optimization 18 representation-learning 18 prompt 18 analytics 18 generative-art 18 vlm 18 guidelines 18 cursor 18 cloud-native 18 bugbounty 18 speech-recognition 18 telegram 18 text-to-image 18 animation 17 cheatsheet 17 laravel 17 bash 17 autonomous-vehicles 17 deepseek 17 generative-model 17 data-engineering 17 knowledge 17 swiftui 17 malware-analysis 17 nerf 17 pentest 17 beginner-friendly 17 containers 17 ros 16 video-generation 16 language 16 bert 16 algorithm 16 webgl 16 articles 16 quantization 16 image-processing 16 reddit 16 dart 16 videos 16 microservices 16 large-language-model 16 evm 16 ai-safety 16 autonomous-agents 16 foss 15 resource 15 mcp-servers 15 databases 15 anomaly-detection 15 awesome-ai 15 readme 15 technology 15 leetcode 15 android-development 15 cms 15 vuejs 15 graphics 15 mysql 15 art 15 distributed-systems 15 performance 15 remote-sensing 15 codex 15 p2p 15 vscode 15 multimodal-large-language-models 15 pose-estimation 15 multimodal-deep-learning 15 mac 14 wasm 14 drones 14 selfhosted 14 android-library 14 detection 14 gaming 14 npm 14 llmops 14 google 14 webassembly 14 dotnet-core 14 recommender-system 14 llm-agents 14 language-model 14 discord 14 multimodal-learning 14 icons 14 podcast 14 system-design 14 gis 14 oss 14 deep-reinforcement-learning 14 medical-imaging 14 vision-language-model 14 code-generation 14 node 14 generative-adversarial-network 14 diffusion 14 react-components 14 webdevelopment 13 unicorns 13 uav 13 wordpress 13 gan 13 search 13 copilot 13 semantic-segmentation 13 automl 13 utilities 13 quantum-computing 13 cicd 13 multi-agent 13 interview-questions 13 frameworks 13 storage 13 microservice 13 mobile-development 13 edge-computing 13 bug-bounty 13 programming-languages 13 sysadmin 13 rest-api 13 sustainability 13 mllm 13 model-compression 13 sre 13 developer 13 claude-skills 13 solana 13 svelte 13 ecommerce 13 objective-c 13 transfer-learning 13 yolo 13 dapp 13 planning 13 workflow 13 postgresql 13 rl 13 infrastructure 13 continual-learning 13 minecraft 13 cyber-security 13 pwa 13 static-site-generator 12 guide 12 ai-art 12 infrastructure-as-code 12 webapp 12 image 12 open-science 12 article 12 cuda 12 rails 12 observability 12 software-architecture 12 leadership 12 graphics-programming 12 email 12 datascience 12 ui-design 12 trading 12 3d-graphics 12 cross-platform 12 vision-and-language 12 creative-coding 12 es6 12 search-engine 12 fuzzing 12 time-series-analysis 12 speech-to-text 12 vector-database 12 ctf 12 chain-of-thought 12 privacy-tools 12 application 12 neuroscience 12 authentication 12 elasticsearch 12 threat-intelligence 12 static-analysis 12 web-design 12 question-answering 12 ai-coding 12 ux 12 math 11 point-cloud 11 anime 11 embedded-systems 11 writing 11 management 11 reading-list 11 vr 11 test-automation 11 swift-library 11 climate-change 11 healthcare 11 drone 11 cvpr 11 incident-response 11 hacktoberfest2020 11 cv 11 webpack 11 elixir 11 compliance 11 packages 11 fpga 11 3d-reconstruction 11 gpu 11 podcasts 11 simulation 11 lua 11 free-software 11 few-shot-learning 11 in-context-learning 11 xai 11 ros2 11 unity3d 11 font 11 hosting 11 geospatial 11 machine-learning-algorithms 11 unsupervised-learning 11 movies 11 evaluation 11 arduino 11 book 11 students 11 course 11 knowledge-base 11 blockchain-technology 10