awesome-datacentric-llm
Trending projects & awesome papers about data-centric LLM studies.
https://github.com/koalazf99/awesome-datacentric-llm
Projects & Blogs
- [datasets
- [code - evaluation-harness)
- [code
- [code
- [datasets - fineweb-v1)] [[pdf](https://arxiv.org/abs/2406.17557)] [May 2024]
- [blog
- [datasets
- [blog
- [code - 2)] [Oct 2024] 
- [code - sg/sailcraft)
Papers
- [pdf - nlp/QuRating)] [Feb 2024] 
- [pdf - art-projection/MAP-NEO)] [May 2024] 
- [pdf - sg/regmix)] [Jul 2024] 
- [pdf - NLP/ProX)] [[data](https://huggingface.co/collections/gair-prox/prox-dataset-66e81c9d560911b836bb3704)] [Sep 2024] 
- [abs
Tutorials