An open API service indexing awesome lists of open source software.

0 "data-centric-machine-learning" Awesome Lists

awesome-data-efficient-llm

A list of data-efficient and data-centric LLM (Large Language Model) papers. Our survey paper: "Towards Efficient LLM Post Training: A Data-centric Perspective"

data-centric data-centric-ai data-centric-machine-learning data-efficient efficient large-language-models llm

39 stars
4 forks
215 projects

Last updated: 27 Sep 2025
