Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-datacentric-llm
Trending projects & awesome papers about data-centric llm studies.
https://github.com/koalazf99/awesome-datacentric-llm
Last synced: about 21 hours ago
JSON representation
-
Projects & Blogs
- [datasets
- [code - evaluation-harness)
- [code
- [code
- [datasets - fineweb-v1)] [[pdf](https://arxiv.org/abs/2406.17557)] [May 2024]
- [blog
- [datasets
- [blog
- [code - evaluation-harness)
- [code
- [code
- [datasets - fineweb-v1)] [[pdf](https://arxiv.org/abs/2406.17557)] [May 2024]
- [blog
- [datasets
- [blog
- [code - 2)] [Oct 2024] ![stars](https://img.shields.io/github/stars/huggingface/fineweb-2)
- [code - sg/sailcraft)
-
Papers
- [pdf - nlp/QuRating)] [Feb 2024] ![stars](https://img.shields.io/github/stars/princeton-nlp/QuRating)
- [pdf - art-projection/MAP-NEO)] [May 2024] ![stars](https://img.shields.io/github/stars/multimodal-art-projection/MAP-NEO)
- [pdf - sg/regmix)] [Jul 2024] ![stars](https://img.shields.io/github/stars/sail-sg/regmix)
- [pdf - NLP/ProX)] [[data](https://huggingface.co/collections/gair-prox/prox-dataset-66e81c9d560911b836bb3704)] [Sep 2024] ![stars](https://img.shields.io/github/stars/GAIR-NLP/ProX)
- [abs
- [pdf - nlp/QuRating)] [Feb 2024] ![stars](https://img.shields.io/github/stars/princeton-nlp/QuRating)
- [pdf - sg/regmix)] [Jul 2024] ![stars](https://img.shields.io/github/stars/sail-sg/regmix)
- [pdf - NLP/ProX)] [[data](https://huggingface.co/collections/gair-prox/prox-dataset-66e81c9d560911b836bb3704)] [Sep 2024] ![stars](https://img.shields.io/github/stars/GAIR-NLP/ProX)
- [pdf - art-projection/MAP-NEO)] [May 2024] ![stars](https://img.shields.io/github/stars/multimodal-art-projection/MAP-NEO)
- [abs
-
Tutorials
Programming Languages
Categories
Sub Categories