Projects in Awesome Lists tagged with high-quality-datasets
A curated list of projects in awesome lists tagged with high-quality-datasets .
https://github.com/ksm26/pretraining-llms
Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectures, execute training runs, and assess model performance for efficient and effective LLM pretraining.
ai-training cost-effective-pretraining data-preparation depth-upscaling developer-advocacy high-quality-datasets hugging-face large-language-models llm-evaluation machine-learning meta-llama model-configuration model-initialization performance-assessment pretraining-llms text-generation training-runs
Last synced: 28 Mar 2025