https://github.com/abi/secret-llama

Fully private LLM chatbot that runs entirely with a browser with no server needed. Supports Mistral and LLama 3.
https://github.com/abi/secret-llama

Last synced: about 1 month ago
JSON representation

Fully private LLM chatbot that runs entirely with a browser with no server needed. Supports Mistral and LLama 3.

Host: GitHub
URL: https://github.com/abi/secret-llama
Owner: abi
License: apache-2.0
Created: 2024-04-30T21:24:56.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2024-06-05T02:04:17.000Z (over 1 year ago)
Last Synced: 2025-04-14T11:11:51.332Z (7 months ago)
Language: TypeScript
Homepage: https://secretllama.com
Size: 360 KB
Stars: 2,608
Watchers: 20
Forks: 164
Open Issues: 24
Metadata Files:
- Readme: README.md
- Funding: .github/FUNDING.yml
- License: LICENSE

Awesome Lists containing this project

stars - secret-llama
stars - secret-llama
AiTreasureBox - abi/secret-llama - 11-03_2666_0](https://img.shields.io/github/stars/abi/secret-llama.svg)|Fully private LLM chatbot that runs entirely with a browser with no server needed. Supports Mistral and LLama 3.| (Repos)

README

# Secret Llama

![secret llama](https://github.com/abi/secret-llama/assets/23818/0bf43a95-4fe5-4c53-87bc-b558e5c4968f)

Entirely-in-browser, fully private LLM chatbot supporting Llama 3, Mistral and other open source models.

- Fully private = No conversation data ever leaves your computer
- Runs in the browser = No server needed and no install needed!
- Works offline
- Easy-to-use interface on par with ChatGPT, but for open source LLMs

Big thanks to the inference engine provided by [webllm](https://github.com/mlc-ai/web-llm).

Join us on Discord

[![](https://dcbadge.vercel.app/api/server/QkVzykMc9V)](https://discord.gg/QkVzykMc9V)

https://discord.gg/QkVzykMc9V

## System Requirements

To run this, you need a modern browser with support for WebGPU. According to [caniuse](https://caniuse.com/?search=WebGPU), WebGPU is supported on:

- Google Chrome
- Microsoft Edge

It's also available in Firefox, but it needs to be enabled manually through the dom.webgpu.enabled flag. Safari on MacOS also has experimental support for WebGPU which can be enabled through the WebGPU experimental feature.

In addition to WebGPU support, various models might have specific RAM requirements.

## Try it out

You can [try it here](https://secretllama.com).

To compile the React code yourself, download the repo and then, run

```
yarn
yarn build-and-preview
```

If you're looking to make changes, run the development environment with live reload:

```
yarn
yarn dev
```

## Supported models

| Model | Model Size |
|---------------------------|------------|
| TinyLlama-1.1B-Chat-v0.4-q4f32_1-1k | 600MB |
| Llama-3-8B-Instruct-q4f16_1 ⭐ | 4.3GB |
| Phi1.5-q4f16_1-1k | 1.2GB |
| Mistral-7B-Instruct-v0.2-q4f16_1 ⭐ | 4GB |

## Looking for contributors

We would love contributions to improve the interface, support more models, speed up initial model loading time and fix bugs.

## Other Projects by Author

Check out [screenshot to code](https://github.com/abi/screenshot-to-code) and [Pico - AI-powered app builder](https://picoapps.xyz)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/abi/secret-llama

Awesome Lists containing this project

README