https://github.com/metriccoders/one-line-llm-tuner
This repository is the source code for fine-tuning any LLM in just one line 🔥
- Host: GitHub
- URL: https://github.com/metriccoders/one-line-llm-tuner
- Owner: metriccoders
- License: apache-2.0
- Created: 2024-08-03T22:24:44.000Z (5 months ago)
- Default Branch: master
- Last Pushed: 2024-08-08T22:03:12.000Z (5 months ago)
- Last Synced: 2024-12-19T04:36:22.974Z (10 days ago)
- Topics: fine-tuning, gpt-2, hugging-face, huggingface, large-language-models, llama2, python3
- Language: Python
- Homepage: https://pypi.org/project/one-line-llm-tuner/
- Size: 30.3 KB
- Stars: 3
- Watchers: 2
- Forks: 1
- Open Issues: 4
- Metadata Files:
- Readme: README.md
- Contributing: Contributing.md
- License: LICENSE.txt
Awesome Lists containing this project
README
# 🔥 One Line LLM Tuner 🔥
Fine-tune any Large Language Model (LLM) available on [Hugging Face](https://www.huggingface.co) in a single line.
## Overview
`one-line-llm-tuner` is a Python package designed to simplify the process of fine-tuning large language models (LLMs) like GPT-2, Llama-2, GPT-3, and more. With just one line of code, you can fine-tune a pre-trained model on your own dataset. Consider it a wrapper for the `transformers` library, much as `keras` is for `tensorflow`.
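For a sense of what that wrapping saves, here is a rough sketch of the equivalent workflow written directly against the `transformers` Trainer API. This illustrates the general pattern being wrapped, not the package's actual internals:

```python
# Illustrative sketch of a manual causal-LM fine-tune with transformers.
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    TextDataset,  # deprecated in recent releases, but shows the idea
    Trainer,
    TrainingArguments,
)

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

# Chunk a plain-text file into fixed-length language-modeling examples.
train_dataset = TextDataset(tokenizer=tokenizer, file_path="train.txt", block_size=128)
collator = DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm=False)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="./results", num_train_epochs=2),
    train_dataset=train_dataset,
    data_collator=collator,
)
trainer.train()
```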
## Features
- **Simple**: Fine-tune Large Language Models (LLMs) with minimal code.
- **Supports Popular LLMs**: Works with models from the `transformers` library, including GPT, BERT, and more.
- **Customizable**: Advanced users can customize the fine-tuning process with additional parameters.

## Installation
You can install `one-line-llm-tuner` using pip:
```bash
pip install one-line-llm-tuner
```

## Usage
After installation, the package can be used as follows.
```python
from one_line_llm_tuner.tuner import llm_tuner

fine_tune_obj = llm_tuner.FineTuneModel()
fine_tune_obj.fine_tune_model(input_file_path="train.txt")
fine_tune_obj.predict_text("Elon Musk founded SpaceX in ")
```

The above example uses the default values. To change defaults such as the model, the tokenizer, and other settings, use the following code.
```python
from one_line_llm_tuner.tuner import llm_tuner

fine_tune_obj = llm_tuner.FineTuneModel(
    model_name="gpt2",
    test_size=0.3,
    training_dataset_filename="train_dataset.txt",
    testing_dataset_filename="test_dataset.txt",
    tokenizer_truncate=True,
    tokenizer_padding=True,
    output_dir="./results",
    num_train_epochs=2,
    logging_steps=500,
    save_steps=500,
    per_device_train_batch_size=128,
    per_device_eval_batch_size=128,
    max_output_length=100,
    num_return_sequences=1,
    skip_special_tokens=True,
)

fine_tune_obj.fine_tune_model(input_file_path="train.txt")
fine_tune_obj.predict_text("Elon Musk founded SpaceX in ")
```
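Because the package builds on `transformers`, a model fine-tuned this way can also be loaded and queried directly, assuming the Trainer's default checkpointing into `output_dir`. The checkpoint path below is a hypothetical example; inspect `./results` for the actual directory name:

```python
# Hedged follow-up sketch: load a checkpoint written under output_dir and
# generate from it with plain transformers. The checkpoint directory name
# is an assumption, not something the package documents.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("./results/checkpoint-500")

inputs = tokenizer("Elon Musk founded SpaceX in ", return_tensors="pt")
outputs = model.generate(**inputs, max_length=100)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```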
## Contributing

We welcome contributions! Please see the [contributing guide](Contributing.md) for more details.

## License
This project is licensed under the terms of the Apache license. See the [LICENSE](LICENSE.txt) file for details.