Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Large Language Model (LLM) Inference API and Chatbot
- Host: GitHub
- URL: https://github.com/aniketmaurya/llm-inference
- Owner: aniketmaurya
- License: MIT
- Archived: true
- Created: 2023-04-10T11:06:26.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2024-04-02T15:11:22.000Z (10 months ago)
- Last Synced: 2024-09-23T11:33:24.088Z (4 months ago)
- Topics: chatbot, langchain, llama, llm, llm-inference, mistral
- Language: Python
- Homepage: https://pypi.org/project/llm-inference/
- Size: 3.33 MB
- Stars: 123
- Watchers: 3
- Forks: 14
- Open Issues: 0
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
- Codeowners: .github/CODEOWNERS
Awesome Lists containing this project
- awesome-ChatGPT-repositories - llm-inference - Large Language Model (LLM) Inference API and Chatbot (Chatbots)
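The record above is also available as machine-readable JSON through the open ecosyste.ms API. A minimal sketch in Python, assuming a `projects/lookup` style endpoint on `awesome.ecosyste.ms` (the exact route and response fields are assumptions; check the API documentation):

```python
import requests

# Assumed endpoint: awesome.ecosyste.ms exposes indexed project records as JSON.
# The exact route is an assumption; consult the ecosyste.ms API docs.
API_URL = "https://awesome.ecosyste.ms/api/v1/projects/lookup"
REPO_URL = "https://github.com/aniketmaurya/llm-inference"

response = requests.get(API_URL, params={"url": REPO_URL}, timeout=30)
response.raise_for_status()

# Print the raw JSON record (stars, forks, topics, etc. as listed above).
print(response.json())
```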
README
# Large Language Model (LLM) Inference API and Chatbot 🦙
![project banner](https://github.com/aniketmaurya/llm-inference/raw/main/assets/llm-inference-min.png)
Inference API for LLMs like LLaMA and Falcon powered by Lit-GPT from [Lightning AI](https://lightning.ai)
## Installation

```
pip install llm-inference
```

### Install from main branch
```bash
pip install git+https://github.com/aniketmaurya/llm-inference.git@main
```

You need to manually install [Lit-GPT](https://github.com/Lightning-AI/lit-gpt) and set up the model weights to use this project.

```bash
pip install lit_gpt@git+https://github.com/aniketmaurya/install-lit-gpt.git@install
```

## For Inference
```python
from llm_inference import LLMInference, prepare_weights

path = prepare_weights("EleutherAI/pythia-70m")

model = LLMInference(checkpoint_dir=path)

print(model("New York is located in"))
```

## How to use the Chatbot
![chatbot image](./assets/llm-inference-llama2_chatbot.png)
```python
from llm_chain import LitGPTConversationChain, LitGPTLLM
from llm_inference import prepare_weights

path = str(prepare_weights("meta-llama/Llama-2-7b-chat-hf"))

llm = LitGPTLLM(checkpoint_dir=path, quantize="bnb.nf4")  # 4-bit (bnb.nf4) quantization, ~7 GB GPU memory
bot = LitGPTConversationChain.from_llm(llm=llm, prompt=llama2_prompt_template)

print(bot.send("hi, what is the capital of France?"))
```

## Launch Chatbot App
**1. Download weights**
```py
from llm_inference import prepare_weights
path = prepare_weights("meta-llama/Llama-2-7b-chat-hf")
```

**2. Launch Gradio App**
```
python examples/chatbot/gradio_demo.py
```
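For reference, `examples/chatbot/gradio_demo.py` is the repository's own demo script. The sketch below shows how a comparable Gradio chat UI could be wired around `LitGPTConversationChain`; it is illustrative only and assumes the same weights as above, `from_llm` working with its default prompt template, and `gradio` being installed. The actual demo script in the repo may differ.

```python
import gradio as gr

from llm_chain import LitGPTConversationChain, LitGPTLLM
from llm_inference import prepare_weights

# Build the chatbot as in the snippet above (4-bit quantized Llama 2 7B chat).
path = str(prepare_weights("meta-llama/Llama-2-7b-chat-hf"))
llm = LitGPTLLM(checkpoint_dir=path, quantize="bnb.nf4")
bot = LitGPTConversationChain.from_llm(llm=llm)  # default prompt; the README passes llama2_prompt_template


def respond(message: str, history) -> str:
    # Gradio tracks the visible chat history; the conversation chain keeps
    # its own memory, so only the new message is forwarded here.
    return bot.send(message)


gr.ChatInterface(respond).launch()
```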