https://github.com/unclecode/callama
Democratizing Function Calling Capabilities for Open-Source Language Models
https://github.com/unclecode/callama
Last synced: 8 months ago
JSON representation
Democratizing Function Calling Capabilities for Open-Source Language Models
- Host: GitHub
- URL: https://github.com/unclecode/callama
- Owner: unclecode
- License: apache-2.0
- Created: 2024-04-28T07:28:43.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2024-05-05T05:39:41.000Z (about 2 years ago)
- Last Synced: 2025-10-06T10:57:48.411Z (9 months ago)
- Language: Python
- Homepage:
- Size: 1.87 MB
- Stars: 41
- Watchers: 4
- Forks: 6
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# 📞🦙 CaLLama: Hola "Function" are you there?

This repository is dedicated to advancing the "function-call" features for open-source large language models (LLMs). We believe that the future of AI, specifically AI agents, depends on proper function-calling capabilities. While proprietary models like OpenAI's have these features, it is crucial for the open-source community to have access to high-quality function-calling abilities to democratize AI.
Recently, Facebook released LLaMA3, perhaps the best open-source LLM available. We have fine-tuned and created a version of LLaMA3 that natively supports function calls.
## 🎯 Solutions
We are focusing on two directions:
1. We are developing a cool library focused on function-call to build a uniform way of working with function calls (tool calls) for all LLMs. This library will be released in its first version soon.
2. Fine-tuning small models specifically for function calling, which has already been done for Llama 3 and Tiny Llama.
## Usage Methods 🛠️
[](https://huggingface.co/unclecode)
### 🖥️ Colab
1. To know how to run using helper class, check this colab [](https://colab.research.google.com/drive/1qyrNeAjURfWFAwEM0ozVEfRQeHUWK4Kq?usp=sharing)
2. For a more detailed experience, check out the [](https://tinyurl.com/ucfllm)
3. To use GGFU version, check out the [](https://colab.research.google.com/drive/1EobHQ9fLkNvpWpXfegRUVDRpdd-Va5H_#scrollTo=rh3IlMxDduXw)
### 🛠️ Usage Locally
To use the models in this repository, follow these steps:
1. Clone the repository:
```
git clone https://github.com/unclecode/fllm.git
```
2. Create a virtual environment and activate it:
```
conda create --name env python=3.10
conda activate env
```
3. Install PyTorch if you haven't already:
```
conda install pytorch-cuda=<12.1/11.8> pytorch cudatoolkit xformers -c pytorch -c nvidia -c xformers
```
4. Install the required dependencies:
```
python setup.py
```
5. Add your HuggingFace token in ".env.text", and then rename the file to ".env".
6. Run the example code in the `examples` folder to see the models in action.
You can also refer to the `callama.py` file in the `llms` folder to see the LLM chat template.
### 🦙 Using the ollama
Steps to run the example:
1. Make sure Ollama is installed, and ollama server is running.
2. Pull the model from the ollaama hub.
```
ollama pull unclecode/llama3callama
ollama pull unclecode/tinycallama
```
3. Make sure to check the ollama example [here](https://github.com/unclecode/callama/blob/main/examples/ocallama.py)
Link to models:
- [llama3callama](https://ollama.com/unclecode/llama3callama)
- [tinycallama](https://ollama.com/unclecode/tinycallama)
## ✅ Features TODO List
- [x] Single function detection
- [x] Support for various model sizes and quantization levels
- [x] Available as a LoRA adapter that can be merged with many models
- [ ] Multi-function detection
- [ ] Function binding, allowing the model to detect the order of execution and bind the output of one function to another
- [ ] Fine-tuning models with less than 1B parameters for efficient function calling
## 🤗 Models
The following models are available on Hugging Face:
- 🦙 [unclecode/llama3-function-call-lora-adapter-240424](https://huggingface.co/unclecode/llama3-function-call-lora-adapter-240424)
- 🦙 [unclecode/llama3-function-call-Q4_K_M_GGFU-240424](https://huggingface.co/unclecode/llama3-function-call-Q4_K_M_GGFU-240424)
- 🦙 [unclecode/tinyllama-function-call-lora-adapter-250424](https://huggingface.co/unclecode/tinyllama-function-call-lora-adapter-250424)
- 🦙 [unclecode/tinyllama-function-call-Q4_K_M_GGFU-250424](https://huggingface.co/unclecode/tinyllama-function-call-Q4_K_M_GGFU-250424)
## 📊 Dataset
The models were fine-tuned using a modified version of the `ilacai/glaive-function-calling-v2-sharegpt` dataset, which can be found at [unclecode/glaive-function-calling-llama3](https://huggingface.co/datasets/unclecode/glaive-function-calling-llama3).
## 🤝 Contributing
We welcome contributions from the community. If you are interested in joining this project or have any questions, please open an issue in this repository.
Twitter (X): https://x.com/unclecode
## 📜 License
These models are released under the Apache 2.0 license.