An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with high-quality-datasets

A curated list of projects in awesome lists tagged with high-quality-datasets .

https://github.com/ksm26/pretraining-llms

Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectures, execute training runs, and assess model performance for efficient and effective LLM pretraining.

ai-training cost-effective-pretraining data-preparation depth-upscaling developer-advocacy high-quality-datasets hugging-face large-language-models llm-evaluation machine-learning meta-llama model-configuration model-initialization performance-assessment pretraining-llms text-generation training-runs

Last synced: 28 Mar 2025