# ChatLLaMA

> 📢 Open source implementation for LLaMA-based ChatGPT runnable on a single GPU. 15x faster training process than `ChatGPT`

- 🔥 Please check [`pyllama`](https://github.com/juncongmoo/pyllama) for `LLaMA` installation and `single GPU inference` setup.
- 🔥 To train a ChatGPT-like model in 5 minutes, see [minichatgpt](https://github.com/juncongmoo/minichatgpt).

Meta has recently released LLaMA, a collection of foundational large language models ranging from 7 to 65 billion parameters.
LLaMA is creating a lot of excitement because it is smaller than GPT-3 but delivers better performance. For example, LLaMA's 13B model outperforms GPT-3 despite being 10 times smaller. This new collection of foundation models opens the door to faster inference and ChatGPT-like real-time assistants, while being cost-effective and running on a single GPU.

However, LLaMA was not fine-tuned for instruction-following tasks with a Reinforcement Learning from Human Feedback (RLHF) training process.
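To make the RLHF idea concrete, the sketch below shows the pairwise reward-model loss commonly used in such pipelines: the reward model is trained to score the human-preferred response above the rejected one. This is a minimal, self-contained PyTorch illustration, not ChatLLaMA's actual training code.

```python
import torch
import torch.nn.functional as F

# Pairwise preference loss used to train RLHF reward models:
# -log(sigmoid(r_chosen - r_rejected)), averaged over the batch.
# The reward model is pushed to score the preferred response higher.
def reward_model_loss(chosen_scores: torch.Tensor, rejected_scores: torch.Tensor) -> torch.Tensor:
    return -F.logsigmoid(chosen_scores - rejected_scores).mean()

# Toy scores for three comparison pairs (illustrative values only)
chosen = torch.tensor([1.2, 0.7, 2.1])
rejected = torch.tensor([0.3, 0.9, 1.0])
print(reward_model_loss(chosen, rejected))  # loss shrinks as chosen scores exceed rejected ones
```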

The good news is that we introduce `ChatLLaMA`, the first open source implementation of LLaMA based on RLHF:

- A complete open source implementation that enables you to build a ChatGPT-style service based on pre-trained LLaMA models.
- Compared to the original ChatGPT, the training process and single-GPU inference are much faster and cheaper by taking advantage of the smaller size of LLaMA architectures.
- ChatLLaMA has built-in support for DeepSpeed ZeRO to speed up the fine-tuning process (a minimal ZeRO configuration sketch follows this list).
- The library also supports all LLaMA model architectures (7B, 13B, 33B, 65B), so that you can fine-tune the model according to your preferences for training time and inference performance.
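To give a concrete picture of what enabling ZeRO involves, here is a minimal DeepSpeed configuration written as the Python dict/JSON form that DeepSpeed accepts. The values are assumptions for a memory-constrained single-GPU run; ChatLLaMA's own YAML config may expose or rename these options differently.

```python
# Minimal DeepSpeed ZeRO stage-2 configuration, expressed as the dict/JSON form
# DeepSpeed accepts. Values are illustrative assumptions for a single-GPU run;
# ChatLLaMA's YAML config may wrap these options under its own keys.
ds_config = {
    "train_micro_batch_size_per_gpu": 1,
    "gradient_accumulation_steps": 8,
    "fp16": {"enabled": True},
    "zero_optimization": {
        "stage": 2,
        "offload_optimizer": {"device": "cpu"},  # offload optimizer state to CPU RAM
    },
}
```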

Image from [OpenAI's blog](https://openai.com/blog/chatgpt).

# Installation

```
pip install chatllama
```
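As a quick sanity check after installation, the package should import cleanly (a minimal check, nothing more):

```python
# Verify the installation: the import should succeed without errors.
import chatllama

print("chatllama imported successfully")
```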

# Get started with ChatLLaMA

> :warning: Please note this code represents the algorithmic implementation of the RLHF training process for LLaMA and does not contain the model weights. To access the model weights, you need to apply through Meta's [form](https://forms.gle/jk851eBVbX1m5TAv5).

ChatLLaMA allows you to easily train LLaMA-based architectures in a similar way to ChatGPT, using RLHF.
For example, the code below starts training for ChatLLaMA 7B.

```python
from chatllama.rlhf.trainer import RLTrainer
from chatllama.rlhf.config import Config

# Load the YAML configuration (model, dataset, and training settings)
path = "path_to_config_file.yaml"
config = Config(path=path)

# Build the RLHF trainer from the trainer section of the config
trainer = RLTrainer(config.trainer)

# Run the distillation step, then RLHF training
trainer.distillate()
trainer.train()

# Plot the collected training statistics
trainer.training_stats.plot()
```
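Before launching a run, it can help to look at what the YAML configuration contains. The snippet below is a minimal sketch using PyYAML; it simply prints whatever top-level sections the config template shipped with the repository defines, without assuming their names.

```python
# Minimal sketch: inspect the top-level sections of the training config before a run.
# Requires PyYAML (`pip install pyyaml`); the path is the same placeholder used above.
import yaml

with open("path_to_config_file.yaml") as f:
    config = yaml.safe_load(f)

for section, values in config.items():
    print(section, "->", list(values) if isinstance(values, dict) else values)
```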

Note that you should provide Meta's original weights and your custom dataset before starting the fine-tuning process. Alternatively, you can generate your own dataset using LangChain's agents.

```
python generate_dataset.py
```
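If you build the dataset yourself instead, the sketch below shows one plausible shape for it: a JSON list of prompt/response records. The field names (`user_input`, `completion`) and the file name are assumptions for illustration; check the dataset examples in the repository for the exact schema the trainer expects.

```python
# Hypothetical dataset layout: a JSON list of prompt/response pairs.
# Field names are illustrative assumptions, not the schema guaranteed by ChatLLaMA.
import json

examples = [
    {
        "user_input": "Explain what RLHF is in one sentence.",
        "completion": "RLHF fine-tunes a language model with a reward model trained on human preference comparisons.",
    },
]

with open("my_dataset.json", "w") as f:
    json.dump(examples, f, indent=2)
```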
> The code originally comes from [nebuly-ai](https://github.com/nebuly-ai/nebullvm/tree/main/apps/accelerate/chatllama) with some changes; more changes will follow soon. The original license is available [here](https://github.com/nebuly-ai/nebullvm/blob/main/LICENSE).