https://github.com/sazonovanton/sirchatalot

SirChatalot is a Telegram bot leveraging ChatGPT, Claude or YandexGPT. It uses Whisper for speech-to-text and DALL-E, Stability AI or YandexART for image creation. It can use vision capabilities, tools and semantic search in vector DB.
https://github.com/sazonovanton/sirchatalot

agentic-ai anthropic chatgpt claude claude-api dall-e function-calling openai openai-api python-telegram-bot rag semantic-search stability-ai telegram-bot tool-use web-search whisper yandex-gpt yandexart yandexgpt

Last synced: 10 days ago
JSON representation

Host: GitHub
URL: https://github.com/sazonovanton/sirchatalot
Owner: sazonovanton
License: gpl-3.0
Created: 2023-03-01T22:20:41.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2025-03-03T18:28:53.000Z (7 months ago)
Last Synced: 2025-09-30T08:12:52.312Z (14 days ago)
Topics: agentic-ai, anthropic, chatgpt, claude, claude-api, dall-e, function-calling, openai, openai-api, python-telegram-bot, rag, semantic-search, stability-ai, telegram-bot, tool-use, web-search, whisper, yandex-gpt, yandexart, yandexgpt
Language: Python
Homepage:
Size: 674 KB
Stars: 72
Watchers: 4
Forks: 14
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# SirChatalot
A Telegram bot that proves you don't need a body to have a personality. It can use various text and image generation APIs to generate responses to user messages.

For text generation, the bot can use:
* OpenAI's [ChatGPT API](https://platform.openai.com/docs/guides/chat) (or other compatible API). Vision capabilities can be used with [GPT-4](https://platform.openai.com/docs/models/gpt-4-and-gpt-4-turbo) models. Function calling can be used with [Function calling](https://platform.openai.com/docs/guides/function-calling).
* Anthropic's [Claude API](https://docs.anthropic.com/claude/docs/text-generation). Vision capabilities can be used with [Claude 3](https://docs.anthropic.com/claude/docs/models-overview) models. Function calling can be used with [tool use](https://docs.anthropic.com/claude/docs/tool-use).
* [YandexGPT API](https://yandex.cloud/ru/docs/yandexgpt/).
* Any other [OpenAI compatible API](#using-openai-compatible-apis).

Bot can also generate images with:
* OpenAI's [DALL-E](https://platform.openai.com/docs/guides/images)
* [Stability AI](https://platform.stability.ai/)
* [Yandex ART](https://yandex.cloud/ru/docs/foundation-models/quickstart/yandexart)

This bot can also be used to generate responses to voice and video messages. Bot will convert voice/video message to text and then it will generate a response. Optionally it can only generate audio/video transcript, without answering. Speech recognition is done using the OpenAI's [Whisper model](https://platform.openai.com/docs/guides/speech-to-text). To use this feature, you need to install the [ffmpeg](https://ffmpeg.org/) library.

If function calling is enabled, bot can [generate images](#image-generation) and [search the web](#web-search) by itself.

## Navigation
* [Getting Started](#getting-started)
* [Configuration](#configuration)
* [Using Claude](#using-claude)
* [Using YandexGPT](#using-yandexgpt)
* [Voice](#voice)
* [Vision](#vision)
* [Image generation](#image-generation)
* [Web Search](#web-search)
* [Function calling](#function-calling)
* [Using OpenAI compatible APIs](#using-openai-compatible-apis)
* [Styles](#styles)
* [Files](#files)
* [Running the Bot](#running-the-bot)
* [Whitelisting users](#whitelisting-users)
* [Banning Users](#banning-users)
* [Safety practices](#safety-practices)
* [Rate limiting users](#rate-limiting-users)
* [Using Docker](#using-docker)
* [Read messages](#read-messages)
* [Warinings](#warinings)
* [License](#license)
* [Acknowledgements](#acknowledgements)

## Getting Started
* Create a bot using the [BotFather](https://t.me/botfather) and get the token.
* Clone the repository.
* Install the required packages by running the command `pip install -r requirements.txt`.
* Install the [ffmpeg](https://ffmpeg.org/) library for voice/video message support (for converting to supported format) and test it calling `ffmpeg --version` in the terminal.
* Create a `.config` file in the `data` directory using the example files (`config.example.*`) there as a template. Don't forget to set access codes if you want to restrict access to the bot (you will be added to whitelist when you use one of them ([learn more](#whitelisting-users))).
* You can run the bot by running the command `python3 main.py` or by using Docker with `docker compose up -d` (learn more [here](#using-docker)).

Bot is designed to talk to you in a style of a knight in the middle ages by default. You can change that in the `./data/.config` file (`SystemMessage`).
There are also some additional styles that you can choose from: Alice, Bob, Charlie and Diana. You can change style from chat by sending a message with `/style` command, but your current session will be dropped.

Styles can be set up in the `./data/chat_modes.ini` file. You can add your own styles there or change the existing ones.
`whitelist.txt`, `banlist.txt`, `.config`, `chat_modes.ini`, are stored in the `./data` directory. Logs rotate every day and are stored in the `./logs` directory.

## Configuration
The bot requires a configuration file to run. The configuration file should be in [INI file format](https://en.wikipedia.org/wiki/INI_file). Example configuration file is in the `./data` directory.
File should contain (for OpenAI API):
```ini
[Telegram]
Token = 0000000000:xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
AccessCodes = whitelistcode,secondwhitelistcode
RateLimitTime = 3600
GeneralRateLimit = 100
TextEngine = OpenAI
SpeechEngine = OpenAI
ReplyToMessage = False

[Logging]
LogLevel = WARNING
LogChats = False

[OpenAI]
SecretKey = xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
ChatModel = gpt-3.5-turbo
ChatModelPromptPrice = 0.0015
ChatModelCompletionPrice = 0.002
Temperature = 0.7
MaxTokens = 3997
MinLengthTokens = 100
SystemMessage = You are a helpful assistant named Sir Chat-a-lot, who answers in a style of a knight in the middle ages.
MaxSessionLength = 15
ChatDeletion = False
EndUserID = True
Moderation = False
Vision = False
ImageSize = 512
FunctionCalling = False
DeleteImageAfterAnswer = False
ImageDescriptionOnDelete = False
SummarizeTooLong = False

[AudioTranscript]
Engine = whisper
AudioModel = whisper-1
AudioModelPrice = 0.006
AudioFormat = mp3
TranscribeOnly = True

[Files]
Enabled = True
MaxFileSizeMB = 10
MaxSummaryTokens = 1000
MaxFileLength = 10000
DeleteAfterProcessing = True
```
Telegram:
* Telegram.Token: The token for the Telegram bot.
* Telegram.AccessCodes: A comma-separated list of access codes that can be used to add users to the whitelist. If no access codes are provided, anyone who not in the banlist will be able to use the bot.
* Telegram.RateLimitTime: The time in seconds to calculate user rate-limit. Optional.
* Telegram.GeneralRateLimit: The maximum number of messages that can be sent by a user in the `Telegram.RateLimitTime` period. Applied to all users. Optional.
* Telegram.TextEngine: The text engine to use. Optional, default is `OpenAI`. Other options are `YandexGPT` and `Claude`.
* Telegram.SpeechEngine: The speech engine to use. Optional, default is `OpenAI`.
* Telegram.ReplyToMessage: If set to `True`, bot will directly reply to the user's message. Optional, default is `False`.

Logging:
* Logging.LogLevel: The logging level. Optional, default is `WARNING`.
* Logging.LogChats: If set to `True`, bot will log all chats. Optional, default is `False`.

OpenAI:
* OpenAI.SecretKey: The secret key for the OpenAI API.
* OpenAI.ChatModel: The model to use for generating responses (learn more about OpenAI models [here](https://platform.openai.com/docs/models/)).
* OpenAI.ChatModelPrice: The [price of the model](https://openai.com/pricing) to use for generating responses (per 1000 tokens, in USD).
* OpenAI.Temperature: The temperature to use for generating responses.
* OpenAI.MaxTokens: The maximum number of tokens to use for generating responses.
* OpenAI.MinLengthTokens: The minimum number of tokens to use for generating responses. Optional, default 100.
* OpenAI.SystemMessage: The message that will shape your bot's personality.
* OpenAI.MaxSessionLength: The maximum number of user messages in a session (can be used to reduce tokens used). Optional.
* OpenAI.ChatDeletion: Whether to delete the user's history if conversation is too long. Optional.
* OpenAI.EndUserID: Whether to add the user's ID to the API request. Optional.
* OpenAI.Moderation: Whether to use the OpenAI's moderation engine. Optional.
* OpenAI.Vision: Whether to use vision capabilities of GPT-4 models. Default: `False`. See [Vision](#vision).
* OpenAI.ImageSize: Maximum size of images. If image is bigger than that it will be resized. Default: `512`
* OpenAI.DeleteImageAfterAnswer: Whether to delete image after it was seen by model. Enable it to keep cost of API usage low. Default: `False`.
* OpenAI.ImageDescriptionOnDelete: Whether to replace image with it description after it was deleted (see `OpenAI.DeleteImageAfterAnswer`). Default: `False`.
* OpenAI.FunctionCalling: Whether to use function calling capabilities (see section [Function calling](#function-calling)). Default: `False`.
* OpenAI.SummarizeTooLong: Whether to summarize first set of messages if session is too long instead of deleting it. Default: `False`.
* OpenAI.Proxy: The proxy for the OpenAI API. Optional. Default: `None`. It should be in the format `http://login:password@proxy:port`.

Files:
* Files.Enabled: Whether to enable files support. Optional. Default: `True`.
* Files.MaxFileSizeMB: The maximum file size in megabytes. Optional. Default: `20`.
* Files.MaxSummaryTokens: The maximum number of tokens to use for generating summaries. Optional. Default: `OpenAI.MaxTokens`/2.
* Files.MaxFileLength: The maximum number of tokens to use for generating summaries. Optional. Default: `10000`.
* Files.DeleteAfterProcessing: Whether to delete files after processing. Optional. Deafult: `True`.

Configuration should be stored in the `./data/.config` file. Use the `config.example` file in the `./data` directory as a template.
Claude and YandexGPT configurations are different, see [Using Claude](#using-claude-anthropic-api) and [Using YandexGPT](#using-yandexgpt) sections for more details.

## Using Claude
Claude is a family of large language models developed by [Anthropic](https://www.anthropic.com/). You should [get access](https://docs.anthropic.com/claude/docs/getting-access-to-claude) to it first.
You need to install [Anthropic's Python SDK](https://github.com/anthropics/anthropic-sdk-python) beforehand by running:
```bash
pip install anthropic
```
To use Claude, you need to change the `Telegram.TextEngine` field to `Claude` or `Anthropic` in the `./data/.config` file and replace the `OpenAI` section with `Anthropic` section:
```ini
[Telegram]
Token = 111111111:xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
AccessCodes = 123456789
TextEngine = Claude

[Anthropic]
SecretKey = sk-ant-******
ChatModel = claude-3-haiku-20240307
ChatModelPromptPrice = 0.00025
ChatModelCompletionPrice = 0.00125
Temperature = 0.7
MaxTokens = 1500
SystemMessage = You are a librarian named Bob whom one may met in tavern. You a chatting with user via Telegram messenger.
Vision = True
ImageSize = 768
DeleteImageAfterAnswer = False
ImageDescriptionOnDelete = False
SummarizeTooLong = True
FunctionCalling = False
```

* Anthropic.SecretKey: The secret key for the Anthropic API.
* Anthropic.ChatModel: The model to use for generating responses (`claude-3-haiku-20240307` by default).
* Anthropic.ChatModelPromptPrice: The price of the model to use for generating responses (per 1000 tokens, in USD).
* Anthropic.ChatModelCompletionPrice: The price of the model to use for generating responses (per 1000 tokens, in USD).
* Anthropic.Temperature: The temperature to use for generating responses.
* Anthropic.MaxTokens: The maximum number of tokens to use for generating responses.
* Anthropic.SystemMessage: The message that will shape your bot's personality.
* Anthropic.Vision: Whether to use vision capabilities of Claude 3 models. Default: `False`.
* Anthropic.ImageSize: Maximum size of images. If image is bigger than that it will be resized. Default: `512`
* Anthropic.DeleteImageAfterAnswer: Whether to delete image after it was seen by model. Enable it to keep cost of API usage low. Default: `False`.
* Anthropic.ImageDescriptionOnDelete: Whether to replace image with it description after it was deleted (see `OpenAI.DeleteImageAfterAnswer`). Default: `False`.
* Anthropic.SummarizeTooLong: Whether to summarize first set of messages if session is too long instead of deleting it. Default: `False`.
* Anthropic.FunctionCalling: Whether to use function calling capabilities (see section [Function calling](#function-calling)). Default: `False`.

You can find Claude models [here](https://docs.anthropic.com/claude/docs/models-overview).

You can also set up HTTP proxy for API requests in the `./data/.config` file (tested) like this:
```ini
[Anthropic]
...
Proxy = http://login:password@proxy:port
...
```
Example of configuration for using Claude API is in the `./data/config.claude.example` file.

## Using YandexGPT
YandexGPT is in Preview, you should request access to it.
You should have a service Yandex Cloud account to use YandexGPT (https://yandex.cloud/en/docs/yandexgpt/quickstart). Service account should have access to the YandexGPT API and role `ai.languageModels.user` or higher.
To use YandexGPT, you need to set the `Telegram.TextEngine` field to `YandexGPT` in the `./data/.config` file:
```ini
[Telegram]
Token = 111111111:xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
AccessCodes = 123456789
TextEngine = YandexGPT

[YandexGPT]
SecretKey=******
CatalogID=******
ChatModel=gpt:///yandexgpt/latest
Temperature=700
MaxTokens=1500
SystemMessage=You are a helpful assistant named Sir Chatalot.
SummarizeTooLong = True
RequestLogging = False
```
* YandexGPT.SecretKey: The secret key for the Yandex Cloud.
* YandexGPT.CatalogID: The catalog ID for the Yandex Cloud.
* YandexGPT.Endpoint: The endpoint for the Yandex GPT API. Optional, default is `https://llm.api.cloud.yandex.net/foundationModels/v1/completion`.
* YandexGPT.ChatModel: The model to use for generating responses (learn more [here](https://yandex.cloud/en/docs/yandexgpt/concepts/models)). You can use `gpt:///yandexgpt-lite/latest` or just `yandexgpt-lite/latest` (default) for the latest model in the default catalog.
* YandexGPT.ChatModelCompletionPrice: The price of the model to use for generating responses (per 1000 tokens, in USD).
* YandexGPT.ChatModelPromptPrice: The price of the model to use for generating responses (per 1000 tokens, in USD).
* YandexGPT.SummarisationModel: The model to use for summarisation. Optional, default is `summarization/latest`.
* YandexGPT.Temperature: The temperature to use for generating responses.
* YandexGPT.MaxTokens: The maximum number of tokens to use for generating responses.
* YandexGPT.SystemMessage: The message that will shape your bot's personality.
* YandexGPT.SummarizeTooLong: Whether to summarize first set of messages if session is too long instead of deleting it. Default: `False`.
* YandexGPT.RequestLogging: Whether to disable logging of API requests by the Yandex Cloud (learn more [here](https://yandex.cloud/en/docs/yandexgpt/operations/disable-logging)). Default: `False`.

## Voice
Bot can understand voice messages. To use this functionality you should make some changes in configuration file.
Example:
```ini
...
[AudioTranscript]
Engine = whisper
APIKey = ******
AudioModel = whisper-1
AudioModelPrice = 0.006
AudioFormat = mp3
TranscribeOnly = False
...
```
* AudioTranscript.Engine: The engine to use (currently only `whisper` is allowed).
* AudioTranscript.SecretKey: The secret key for the audio model (OpenAI whisper only). If unset, takes value from `OpenAI.SecretKey`.
* AudioTranscript.AudioModel: The model to use for speech recognition (Speech-to-text can be powered by `whisper-1` for now).
* AudioTranscript.AudioModelPrice: The [price of the model](https://openai.com/pricing) to use for speech recognition (per minute, in USD).
* AudioTranscript.AudioFormat: The audio format to convert voice messages (`ogg`) to (can be `wav`, `mp3` or other supported by Whisper). Stated whithout a dot.
* AudioTranscript.TranscribeOnly: If set to True, will only respond with Video/Audio transcript. If False (default), it will answer the message.

**Alternatively** you can set up Whisper in OpenAI section of the `./data/.config` file (deprecated, support can be removed in the future).
If config has section `AudioTranscript` it will be used instead and this method will be ignored.
```ini
...
WhisperModel = whisper-1
WhisperModelPrice = 0.006
AudioFormat = wav
...
```

> [!NOTE]
> Audio conversion is done with [ffmpeg](https://www.ffmpeg.org/). You should have it installed.

## Vision
Bot can understand images with [OpenAI GPT-4](https://platform.openai.com/docs/guides/vision) or [Claude 3](https://docs.anthropic.com/claude/docs/vision) models.
To use this functionality you should make some changes in configuration file (change OpenAI to Anthropic if you use Claude).
Example:
```ini
...
[OpenAI]
ChatModel = gpt-4-vision-preview
ChatModelPromptPrice = 0.01
ChatModelCompletionPrice = 0.03
...
Vision = True
ImageSize = 512
DeleteImageAfterAnswer = False
ImageDescriptionOnDelete = False
...
```
Check if you have an access to GPT-4V or Claude 3 models with vision capabilities.
OpenAI models can be found [here](https://platform.openai.com/docs/models/gpt-4) and prices can be found [here](https://openai.com/pricing).
Claude 3 models and prices can be found [here](https://docs.anthropic.com/claude/docs/models-overview).

Beware that right now functionalty for calculating cost of usage is not working for images, so you should pay attenion to that.

## Function calling
You can use function calling capabilities with some [OpenAI](https://platform.openai.com/docs/guides/function-calling) or [Claude](https://docs.anthropic.com/claude/docs/tool-use) models.
This way model will decide what function to call by itself. For example, you can ask the bot to generate an image and it will do it.
Right now image generation and some web tools are supported.
To use this functionality you should make some changes in configuration file. Example (OpenAI, for Claude change OpenAI to Anthropic):
```ini
...
[OpenAI]
FunctionCalling = True
...
```
Don't forget to enable Image generation (see [Image generation](#image-generation)).
This feature is experimental, please submit an issue if you find a problem.

## Image generation
You can generate images. Right now only [DALL-E](https://platform.openai.com/docs/guides/images) and [Stability AI](https://platform.stability.ai/) are supported.

To generate an image, send the bot a message with the `/imagine ` command. The bot will then generate an image based on the text prompt. Images are not stored on the server and processed as base64 strings.
Also if `FunctionCalling` is set to `True` in the `./data/.config` file (see [Function calling](#function-calling)), you can generate images with function calling just by asking the bot to do it.

`RateLimitCount`, `RateLimitTime` and `ImageGenerationPrice` parameters are not required, default values for them are zero. So if not set rate limit will not be applied and price will be zero.

### OpenAI DALL-E
To use this functionality with Dall-E you should make some changes in configuration file. Example:
```ini
...
[ImageGeneration]
Engine = dalle
APIKey = ******
Model = dall-e-3
RateLimitCount = 16
RateLimitTime = 3600
ImageGenerationPrice = 0.04
...
```
If you want to use OpenAI text engine and image generation you can omit `APIKey` field in the `ImageGeneration` section. Key will be taken from the `OpenAI` section.
For OpenAI you can also set `BaseURL` field in the `ImageGeneration` section. If it was set in `OpenAI` section, it will be used instead, to override it you cat set `ImageGeneration.BaseURL` to `None`.
Parameters set in `ImageGeneration` have priority over `OpenAI` section for image generation.

**Alternatively** you can set up DALL-E in OpenAI section of the `./data/.config` file (deprecated, support can be removed in the future).
If config has section `ImageGeneration` it will be used instead and this method will be ignored.
```ini
...
ImageGeneration = False
ImageGenModel = dall-e-3
ImageGenerationSize = 1024x1024
ImageGenerationStyle = vivid
ImageGenerationPrice = 0.04
...
```
### Stability AI
To use this functionality with Stability AI you should make some changes in configuration file. Example:
```ini
[ImageGeneration]
Engine = stability
ImageGenURL = https://api.stability.ai/v2beta/stable-image/generate/core
APIKey = ******
ImageGenerationRatio = 1:1
RateLimitCount = 16
RateLimitTime = 3600
ImageGenerationPrice = 0.04
```
You can also set `NegativePrompt` (str) and `Seed` (int) parameters in the `ImageGeneration` section if you want to use them.
`ImageGenURL` and `ImageGenerationRatio` are not required, default values (in example) are used if they are not set.

### Yandex ART
To use this functionality with Yandex ART you should add a section in the configuration file. Example:
```ini
[ImageGeneration]
Engine = yandex
APIKey = ******
ImageGenModel = yandex-art/latest
CatalogID = ******
RateLimitCount = 5
RateLimitTime = 3600
```
`ImageGenModel` can also have a value `art:///yandex-art/latest`.
You can also set `ImageGenerationPrice` (float) parameter in the `ImageGeneration` section if you want to use it. Also you can fix seed for image generation by setting `Seed` (int) parameter.
Service Yandex Foundation Models is on Preview, stage so it can be unstable.
YandexART API demands IAM token for requests. Service account should have access to the Yandex ART API and role `ai.imageGeneration.user` or higher.
Learn more about Yandex ART [here](https://yandex.cloud/ru/docs/foundation-models/quickstart/yandexart) (ru).

> [!WARNING]
> There can be some changes in the way Yandex ART API works, so it can be unstable.

## Web Search
You can use web search capabilities with function calling.
Right now only Google search is supported (via [Google Search API](https://developers.google.com/custom-search/v1/overview)).
To enable web search you should make some changes in configuration file. Example:
```ini
...

[Web]
SearchEngine = google
APIKey = ******
CSEID = ******
URLSummary = False
TrimLength = 3000
...
```
Keep in mind that you should also set `FunctionCalling` to `True` in the `./data/.config` file (see [Function calling](#function-calling)).
If `SearchEngine` is not set, web search functionality will not be enabled.
SirChatalot will only have information about the first 5 results (title, link and description).
It can try to open only links provided (or from history), but will not walk through the pages when using web search.
`URLSummary` parameter is used to tell the bot to summarize the content of the page.
`TrimLength` is used to limit the length of the parsed text (context can be lost).

## Using OpenAI compatible APIs
You can use APIs compatible with OpenAI's API. To do that, you need to set endpoint in the `OpenAI` section of the `./data/.config` file.
This example for running [Meta's Llama 3.1 405b via OpenRouter API](https://openrouter.ai/models/meta-llama/llama-3.1-405b-instruct):
```ini
[OpenAI]
...
APIBase = https://openrouter.ai/api/v1
SecretKey = sk-or-v1-***
ChatModel = meta-llama/llama-3.1-405b-instruct
ChatModelPromptPrice = 0.003
ChatModelCompletionPrice = 0.003
Temperature = 0.7
Moderation = False
...
```
Also it is possible to set `APIType` and `APIVersion`.
All this values are optional.

> [!NOTE]
> Tested with [LocalAI](https://github.com/mudler/LocalAI) and [OpenRouter](https://openrouter.ai/), also should be possible to use with [Ollama](https://ollama.com/) and [LM Studio](https://lmstudio.ai/).

## Styles
Bot supports different styles that can be triggered with `/style` command.
You can add your own style in the `./data/chat_modes.ini` file or change the existing ones. Styles are stored in the INI file format.
Example:
```ini
[Alice]
Description = Empathetic and friendly
SystemMessage = You are a empathetic and friendly woman named Alice, who answers helpful, funny and a bit flirty.

[Bob]
Description = Brief and informative
SystemMessage = You are a helpful assistant named Bob, who is informative and explains everything succinctly with fewer words.

```
Here is a list of the fields in this example:
* Alice or Bob: The name of the style.
* Description: Short description of the style. Is used in message that is shown when `/style` command is called.
* SystemMessage: The message that will shape your bot's personality. You will need some prompt engineering to make it work properly.

## Files
SirChatalot supports working with files through a Retrieval-Augmented Generation (RAG) system. When you send a supported file to the bot, it extracts the text content, processes it into semantic chunks, and stores these in a vector database. The bot can then use this information to provide more informed responses to your questions when needed (if function calling is enabled).

### Supported File Types
Currently supported file types: `.docx`, `.doc`, `.pptx`, `.ppt`, `.pdf`, `.txt`, `.md`, `.csv`, `.log`

### Requirements
- Install `catdoc` for `.doc` and `.ppt` files support and test it by calling `catdoc` in the terminal.

### Configuration
To enable file handling, add the following section to your `.config` file (example):
```ini
[OpenAI]
...
FunctionCalling = True

[Files]
Enabled = True
MaxFileSizeMB = 20

[Files]
Enabled = True
MaxFileSizeMB = 150

[Embeddings]
SecretKey = ***
Engine = OpenAI
Model = text-embedding-3-small
BaseURL = https://api.openai.com/v1

```
* Files.Enabled: Whether to enable files support. Default: `True`.
* Files.MaxFileSizeMB: The maximum file size in megabytes. Default: `20` (limited by Telegram).
* Embeddings.SecretKey: The secret key for the Embeddings API (OpenAI).
* Embeddings.Model: The model to use for generating embeddings. Default: `text-embedding-3-small`.
* Embeddings.BaseURL: The base URL for the Embeddings API. Default: `https://api.openai.com/v1`.
* Embeddings.Proxy: The HTTP proxy for the Embeddings API. Default: `None`. It should be in the format `http://login:password@proxy:port`.
* Embeddings.Engine: The engine to use for generating embeddings. Default: `OpenAI`.

### How It Works
1. When you send a file to the bot, it extracts the text content and adds file summary to system message.
2. The text is intelligently split into overlapping chunks at natural boundaries.
3. These chunks are embedded and stored in a ChromaDB vector database.
4. When you ask questions, the bot can search this database for relevant information.
5. If function calling is enabled, the bot can automatically search the database when it thinks information from your files might be helpful.

The implementation uses function calling to interact with the RAG database, making it more similar to an agent-based approach rather than a classic RAG system. This allows the model to decide when and how to retrieve information from your files.

### Common files
You can add files that will be accessible to all users. To do that, add files to the `./data/files/common` directory. They will be processed and added to the database on the bot start.

### Commands
* `/listfiles` - List all files you've added to the RAG database
* `/deletefiles` - Delete all your files from the RAG database

Files are temporarily stored in the `./data/files` directory. After successful processing, they are deleted if `DeleteAfterProcessing` is set to `True` in the config file.

## Running the Bot
To run the bot, simply run the command `python3 main.py`. The bot will start and will wait for messages.
The bot has the following commands:
* `/start`: starts the conversation with the bot.
* `/help`: shows the help message.
* `/delete`: deletes the conversation history.
* `/statistics`: shows the bot usage.
* `/style`: changes the style of the bot from chat.
* `/limit`: shows the current rate-limit for the user.
* `/imagine `: generates an image based on the text. You can use it only if `OpenAI.ImageGeneration` is set to `True` (see *Image generation*).
* Any other message (including voice message) will generate a response from the bot.

Users need to be whitelisted to use the bot. To whitelist yourself, send an access code to the bot using the `/start` command. The bot will then add you to the whitelist and will send a message to you confirming that you have been added to the whitelist.
Access code should be changed in the `./data/.config` file (see [Configuration](#configuration)).
Codes are shown in terminal when the bot is started.

## Whitelisting users
To restrict access to the bot, you should provide an access code (or multiple codes) in the `./data/.config` file.
If no access codes are provided, anyone who not in the banlist will be able to use the bot.

Bot is doing authorization by Telegram ID that is stored in the `./data/whitelist.txt` file.
To add yourself to the whitelist, send the bot a message with one of the access codes (see [Configuration](#configuration)). You will be added to the whitelist authomatically.
Alternatively, you can add users to the whitelist manually. To do that, add the user's Telegram ID to the `./data/whitelist.txt` file (each ID should be on a separate line). Example:
```txt
132456
789123
```

## Banning Users
To ban a user you should add their Telegram ID to the `./data/banlist.txt` file. Each ID should be on a separate line. Example:
```txt
123456
789123
```
Banlist has a higher priority than the whitelist.
If a user is on the banlist, they will not be able to use the bot and the will see a message saying that they have been banned.

## Safety practices
To prevent the bot from being used for purposes that violate the OpenAI's usage policy, you can use:
* Moderation: Moderation will filter out messages that can violate the OpenAI's usage policy with free OpenAI's [Moderation API](https://platform.openai.com/docs/guides/moderation). In this case, message is sent to the Moderation API and if it is flagged, it is not sent to the OpenAI's API. If you want to use it, set `OpenAI.Moderation` to `true` in the `./data/.config` file (see [Configuration](#configuration)). User will be notified if their message is flagged.
* End-user IDs: End-user IDs will be added to the API request if `OpenAI.EndUserID` is set to `true` in the `./data/.config` file (see [Configuration](#configuration)). Sending end-user IDs in your requests can be a useful tool to help OpenAI monitor and detect abuse. This allows OpenAI to provide your team with more actionable feedback in the event of bot abuse. End-user ID is a hashed Telegram ID of the user.
* Rate limiting: Rate limiting will limit the number of messages a user can send to the bot. If you want to use it, set `Telegram.GeneralRateLimit` to a number of messages a user can send to the bot in a time period in the `./data/.config` file (see [Configuration](#configuration)).
* Banlist: Banlist will prevent users from using the bot. If you want to use it, add user's Telegram ID to the `./data/banlist.txt` file (see *Banning Users*).
* Whitelist: Whitelist will allow only whitelisted users to use the bot. If you want to use it, add user's Telegram ID to the `./data/whitelist.txt` file (see *Whitelisting Users*).

## Rate limiting users
To limit the number of messages a user can send to the bot, add their Telegram ID and limit to the `./data/rates.txt` file. Each ID should be on a separate line.
Example:
```ini
123456789,10
987654321,500
111111,0
```
Rate limit is a number of messages a user can send to the bot in a time period. In example user with ID 123456789 has 10 and user 987654321 has 500 messages limit. User 111111 has no limit (overriding `GeneralRateLimit`).
Time period (in seconds) can be set in the `./data/.config` file in `RateLimitTime` variable in `Telegram` section (see [Configuration](#configuration)). If no time period is provided, limit is not applied.
General rate limit can be set in the `./data/.config` file in `GeneralRateLimit` variable in `Telegram` section (see [Configuration](#configuration)). To override general rate limit for a user, set their limit in the `rates.txt` file.
Users can check your limit by sending the bot a message with the `/limit` command.

## Using Docker
You can use Docker to run the bot.

First, you need to install Docker.
You can do it with [installation script](https://get.docker.com/).
```bash
curl -fsSL https://get.docker.com -o get-docker.sh
sh get-docker.sh
```
[Set it up](https://docs.docker.com/engine/install/linux-postinstall/) to start on boot and add your user to the `docker` group so you can run Docker commands without `sudo`:
```bash
sudo systemctl enable docker
sudo usermod -aG docker $USER
```

Then, you need to build the bot image.
Run the following command in the root directory of the project after configuring the bot (see [Configuration](#configuration)):
```bash
docker compose up -d
```
This will build the image and run the container.

To rebuild the image add `--build` flag to the command:
```bash
docker compose up -d --build
```
To stop the container, run the following command:
```bash
docker compose down
```
If you are using custom docker-compose file, you can specify it with `-f` flag:
```bash
docker compose -f docker-compose.yml up -d --build
```
To stop the container:
```bash
docker compose -f docker-compose.yml down
```

## Read messages
You can read user messages for moderation purposes with `read_messages.py`.
Call it from projects `chatutils` directory with:
```bash
python3 read_messages.py
```

## Warnings
* Use this bot at your own risk. I am not responsible for any damage caused by this bot.
* The bot stores the whitelist in plain text.
* The bot stores chat history in as a pickle file.
* Configurations are stored in plain text.
* The bot can store messages in a log file in a event of an error or if logger level set to `DEBUG`.
* The bot will store messages if `Logging.LogChats` set to `True` in the `./data/.config` file.
* The bot temporarily stores voice/video messages in `./data/voice` directory.
* The bot is not designed to be used in production environments. It is not secure and was build as a proof of concept.
* The bot can work with files. If file was not processed or `Files.DeleteAfterProcessing` is set to `False` in the `./data/.config` file (see [Configuration](#configuration)), the file will be stored in `./data/files` directory.
* If message is flagged by the OpenAI Moderation API, it will not be sent to the OpenAI's API, but it will be stored in `./data/moderation.txt` file for manual review.
* Functionalty for calculating cost of usage can be inaccurate for images with Dall-E.

## License
This project is licensed under [GPLv3](https://www.gnu.org/licenses/gpl-3.0.en.html). See the `LICENSE` file for more details.

## Acknowledgements
* [OpenAI ChatGPT API](https://platform.openai.com/docs/guides/chat) - The API used for generating responses.
* [OpenAI Whisper API](https://platform.openai.com/docs/guides/speech-to-text) - The API used for speech recognition.
* [OpenAI DALL-E API](https://platform.openai.com/docs/guides/images) - The API used for generating images.
* [Yandex GPT API](https://cloud.yandex.ru/docs/yandexgpt/) - The API used for generating responses.
* [Anthropic Claude API](https://docs.anthropic.com/claude/docs/text-generation) - The API used for generating responses.
* [python-telegram-bot](https://github.com/python-telegram-bot/python-telegram-bot) - The library used for interacting with the Telegram API.
* [FFmpeg](https://ffmpeg.org/) - The library used for converting voice messages.
* [pydub](https://github.com/jiaaro/pydub) - The library used for finding the duration of voice messages.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/sazonovanton/sirchatalot

Awesome Lists containing this project

README