An open API service indexing awesome lists of open source software.

"data-centric-machine-learning" Awesome Lists

awesome-data-efficient-llm

A list of data-efficient and data-centric LLM (Large Language Model) papers. Our survey paper: "Towards Efficient LLM Post Training: A Data-centric Perspective".

data-centric data-centric-ai data-centric-machine-learning data-efficient efficient large-language-models llm

55 stars
4 forks
215 projects

Last updated: 15 May 2026
