"data-synthesis" Awesome Lists
Awesome-Knowledge-Distillation-of-LLMs
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.
alignment compression data-augmentation data-synthesis feedback instruction-following kd knowledge-distillation large-language-model llm
1,250 stars
71 forks
347 projects
Last updated: 05 Feb 2026
awesome-data-llm
Official Repository of "LLM × DATA" Survey Paper
data-acquisition data-deduplication data-filtering data-mixing data-provenance data-selection data-synthesis data-transformation llm vlm
705 stars
67 forks
793 projects
Last updated: 11 Feb 2026
awesome-multimodal-data-recipe
Curated collection of multimodal data synthesis methods, covering papers, datasets, and best practices for vision-language model training
awesome-list data data-engineering data-generation data-science data-synthesis dataset llms multimodal vision-language-model
5 stars
0 forks
105 projects
Last updated: 20 Nov 2025