https://github.com/unclecode/callama

Democratizing Function Calling Capabilities for Open-Source Language Models
https://github.com/unclecode/callama

Last synced: 8 months ago
JSON representation

Democratizing Function Calling Capabilities for Open-Source Language Models

Host: GitHub
URL: https://github.com/unclecode/callama
Owner: unclecode
License: apache-2.0
Created: 2024-04-28T07:28:43.000Z (about 2 years ago)
Default Branch: main
Last Pushed: 2024-05-05T05:39:41.000Z (about 2 years ago)
Last Synced: 2025-10-06T10:57:48.411Z (9 months ago)
Language: Python
Homepage:
Size: 1.87 MB
Stars: 41
Watchers: 4
Forks: 6
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # 📞🦙 CaLLama: Hola "Function" are you there?

![CaLLama](https://res.cloudinary.com/kidocode/image/upload/c_pad,w_300,h_300,ar_1:1/v1714302845/callama_3_ey59xu.png)

This repository is dedicated to advancing the "function-call" features for open-source large language models (LLMs). We believe that the future of AI, specifically AI agents, depends on proper function-calling capabilities. While proprietary models like OpenAI's have these features, it is crucial for the open-source community to have access to high-quality function-calling abilities to democratize AI.

Recently, Facebook released LLaMA3, perhaps the best open-source LLM available. We have fine-tuned and created a version of LLaMA3 that natively supports function calls.

## 🎯 Solutions

We are focusing on two directions:

1. We are developing a cool library focused on function-call to build a uniform way of working with function calls (tool calls) for all LLMs. This library will be released in its first version soon.

2. Fine-tuning small models specifically for function calling, which has already been done for Llama 3 and Tiny Llama.

## Usage Methods 🛠️

[![Model in HuggingFace](https://huggingface.co/datasets/huggingface/badges/resolve/main/model-on-hf-md.svg)](https://huggingface.co/unclecode)

### 🖥️ Colab

1. To know how to run using helper class, check this colab [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1qyrNeAjURfWFAwEM0ozVEfRQeHUWK4Kq?usp=sharing)

2. For a more detailed experience, check out the [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://tinyurl.com/ucfllm)

3. To use GGFU version, check out the [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1EobHQ9fLkNvpWpXfegRUVDRpdd-Va5H_#scrollTo=rh3IlMxDduXw)

### 🛠️ Usage Locally

To use the models in this repository, follow these steps:

1. Clone the repository:

```

git clone https://github.com/unclecode/fllm.git

```

2. Create a virtual environment and activate it:

```

conda create --name env python=3.10

conda activate env

```

3. Install PyTorch if you haven't already:

```

conda install pytorch-cuda=<12.1/11.8> pytorch cudatoolkit xformers -c pytorch -c nvidia -c xformers

```

4. Install the required dependencies:

```

python setup.py

```

5. Add your HuggingFace token in ".env.text", and then rename the file to ".env".

6. Run the example code in the `examples` folder to see the models in action.

You can also refer to the `callama.py` file in the `llms` folder to see the LLM chat template.

### 🦙 Using the ollama

Steps to run the example:

1. Make sure Ollama is installed, and ollama server is running.

2. Pull the model from the ollaama hub.

```

ollama pull unclecode/llama3callama

ollama pull unclecode/tinycallama

```

3. Make sure to check the ollama example [here](https://github.com/unclecode/callama/blob/main/examples/ocallama.py)

Link to models:

- [llama3callama](https://ollama.com/unclecode/llama3callama)

- [tinycallama](https://ollama.com/unclecode/tinycallama)

## ✅ Features TODO List

- [x] Single function detection

- [x] Support for various model sizes and quantization levels

- [x] Available as a LoRA adapter that can be merged with many models

- [ ] Multi-function detection

- [ ] Function binding, allowing the model to detect the order of execution and bind the output of one function to another

- [ ] Fine-tuning models with less than 1B parameters for efficient function calling

## 🤗 Models

The following models are available on Hugging Face:

- 🦙 [unclecode/llama3-function-call-lora-adapter-240424](https://huggingface.co/unclecode/llama3-function-call-lora-adapter-240424)

- 🦙 [unclecode/llama3-function-call-Q4_K_M_GGFU-240424](https://huggingface.co/unclecode/llama3-function-call-Q4_K_M_GGFU-240424)

- 🦙 [unclecode/tinyllama-function-call-lora-adapter-250424](https://huggingface.co/unclecode/tinyllama-function-call-lora-adapter-250424)

- 🦙 [unclecode/tinyllama-function-call-Q4_K_M_GGFU-250424](https://huggingface.co/unclecode/tinyllama-function-call-Q4_K_M_GGFU-250424)

## 📊 Dataset

The models were fine-tuned using a modified version of the `ilacai/glaive-function-calling-v2-sharegpt` dataset, which can be found at [unclecode/glaive-function-calling-llama3](https://huggingface.co/datasets/unclecode/glaive-function-calling-llama3).

## 🤝 Contributing

We welcome contributions from the community. If you are interested in joining this project or have any questions, please open an issue in this repository.

Twitter (X): https://x.com/unclecode

## 📜 License

These models are released under the Apache 2.0 license.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/unclecode/callama

Awesome Lists containing this project

README