https://github.com/huggingface/smol-course

A course on aligning smol models.
https://github.com/huggingface/smol-course

Last synced: 7 months ago
JSON representation

A course on aligning smol models.

Host: GitHub
URL: https://github.com/huggingface/smol-course
Owner: huggingface
License: apache-2.0
Created: 2024-11-25T19:22:43.000Z (8 months ago)
Default Branch: main
Last Pushed: 2024-12-02T19:25:22.000Z (7 months ago)
Last Synced: 2024-12-02T20:26:43.862Z (7 months ago)
Language: Jupyter Notebook
Size: 650 KB
Stars: 4
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

awesome-repositories - huggingface/smol-course - A course on aligning smol models. (Jupyter Notebook)
StarryDivineSky - huggingface/smol-course - course 是一个关于对齐小型语言模型的教程项目。它旨在帮助用户理解和实践如何使小型模型更好地遵循人类指令和意图。该教程可能涵盖了微调、强化学习、奖励建模等对齐技术，并可能提供代码示例和实践指导。通过学习本课程，用户可以掌握训练和对齐更安全、更有用的小型语言模型的方法。该项目可能包含数据集、训练脚本和评估指标，以方便用户进行实验和验证。课程内容可能涉及指令遵循、有害内容过滤和价值观对齐等关键方面。该项目适合对小型语言模型对齐感兴趣的研究人员、开发者和爱好者。它可能提供了一种低成本、易于上手的方式来探索和改进语言模型的行为。该课程的重点是让小型模型在特定任务上表现出色，并避免产生不良行为。 (A01_文本生成_文本对话 / 大语言对话模型及数据)

README

![smolcourse image](./banner.png)

# a smol course

This is a practical course on aligning language models for your specific use case. It's a handy way to get started with aligning language models, because everything runs on most local machines. There are minimal GPU requirements and no paid services. The course is based on the [SmolLM2](https://github.com/huggingface/smollm/tree/main) series of models, but you can transfer the skills you learn here to larger models or other small language models.

Participation is open, free, and now!

This course is open and peer reviewed. To get involved with the course open a pull request and submit your work for review. Here are the steps:

Fork the repo here

Read the material, make changes, do the exercises, add your own examples.

Open a PR on the december_2024 branch

Get it reviewed and merged

This should help you learn and to build a community-driven course that is always improving.

We can discuss the process in this [discussion thread](https://github.com/huggingface/smol-course/discussions/2#discussion-7602932).

## Course Outline

This course provides a practical, hands-on approach to working with small language models, from initial training through to production deployment.

| Module | Description | Status | Release Date |
|--------|-------------|---------|--------------|
| [Instruction Tuning](./1_instruction_tuning) | Learn supervised fine-tuning, chat templating, and basic instruction following | ✅ Complete | Dec 3, 2024 |
| [Preference Alignment](./2_preference_alignment) | Explore DPO and ORPO techniques for aligning models with human preferences | ✅ Complete | Dec 6, 2024 |
| [Parameter-efficient Fine-tuning](./3_parameter_efficient_finetuning) | Learn LoRA, prompt tuning, and efficient adaptation methods | [🚧 WIP](https://github.com/huggingface/smol-course/pull/41) | Dec 9, 2024 |
| [Evaluation](./4_evaluation) | Use automatic benchmarks and create custom domain evaluations | [🚧 WIP](https://github.com/huggingface/smol-course/issues/42) | Dec 13, 2024 |
| [Vision-language Models](./5_vision_language_models) | Adapt multimodal models for vision-language tasks | [🚧 WIP](https://github.com/huggingface/smol-course/issues/49) | Dec 16, 2024 |
| [Synthetic Datasets](./6_synthetic_datasets) | Create and validate synthetic datasets for training | 📝 Planned | Dec 20, 2024 |
| [Inference](./7_inference) | Infer with models efficiently | 📝 Planned | Dec 23, 2024 |

## Why Small Language Models?

While large language models have shown impressive capabilities, they often require significant computational resources and can be overkill for focused applications. Small language models offer several advantages for domain-specific applications:

- **Efficiency**: Require significantly less computational resources to train and deploy
- **Customization**: Easier to fine-tune and adapt to specific domains
- **Control**: Better understanding and control of model behavior
- **Cost**: Lower operational costs for training and inference
- **Privacy**: Can be run locally without sending data to external APIs
- **Green Technology**: Advocates efficient usage of resources with reduced carbon footprint
- **Easier Academic Research Development**: Provides an easy starter for academic research with cutting-edge LLMs with less logistical constraints

## Prerequisites

Before starting, ensure you have the following:
- Basic understanding of machine learning and natural language processing.
- Familiarity with Python, PyTorch, and the `transformers` library.
- Access to a pre-trained language model and a labeled dataset.

## Installation

We maintain the course as a package so you can install dependencies easily via a package manager. We recommend [uv](https://github.com/astral-sh/uv) for this purpose, but you could use alternatives like `pip` or `pdm`.

### Using `uv`

With `uv` installed, you can install the course like this:

```bash
uv venv --python 3.11.0
uv sync
```

### Using `pip`

All the examples run in the same **python 3.11** environment, so you should create an environment and install dependencies like this:

```bash
# python -m venv .venv
# source .venv/bin/activate
pip install -r requirements.txt
```

### Google Colab

**From Google Colab** you will need to install dependencies flexibly based on the hardware you're using. Like this:

```bash
pip install -r transformers trl datasets huggingface_hub
```

## Engagement

Let's share this, so that loads of people can learn to finetune LLMs without expensive hardware.

[![Star History Chart](https://api.star-history.com/svg?repos=huggingface/smol-course&type=Date)](https://star-history.com/#huggingface/smol-course&Date)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/huggingface/smol-course

Awesome Lists containing this project

README

Participation is open, free, and now!