https://github.com/souze-san/webllm-bot
Lightweight Conversational AI running on WebGPU
- Host: GitHub
- URL: https://github.com/souze-san/webllm-bot
- Owner: SouZe-San
- License: apache-2.0
- Created: 2025-01-09T21:20:33.000Z (5 months ago)
- Default Branch: main
- Last Pushed: 2025-01-10T10:05:11.000Z (5 months ago)
- Last Synced: 2025-01-10T10:33:40.716Z (5 months ago)
- Topics: llm, slm, svelte, webgpu, webllm
- Language: Svelte
- Homepage:
- Size: 396 KB
- Stars: 2
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# Lightweight Conversational AI with WebGPU
This project demonstrates an architecture for deploying lightweight, optimized large language models (LLMs) in web applications. By leveraging the WebGPU API, the solution enables efficient inference directly on client-side devices, promoting energy efficiency and sustainability in conversational AI. This demo was created for an ICCRET 2025 paper submission.
## Features
- **Client-Side Inference**: Utilizes WebGPU API for running models directly in the browser.
- **Optimized Model**: Uses the Qwen2.5-0.5B-Instruct-q4f16_1-MLC model.
- **User-Friendly Interface**: Built with Svelte.js for a responsive and intuitive user experience.
## Installation
To get started with this project, clone the repository and install the necessary dependencies:
```bash
git clone https://github.com/SouZe-San/webllm-bot.git
cd webllm-bot
bun install
```
## Usage
Run the application locally:
```bash
bun run dev
```
Open your browser and navigate to `http://localhost:5173` to access the demo customer support site and the chatbot.
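Under the hood, the chatbot loads the model through the WebLLM library listed in the features above. A minimal sketch of that flow is shown below; the model ID matches the one named in the README, but the system prompt, helper names, and options are illustrative assumptions, not the app's actual code.

```javascript
// Sketch: chatting with the Qwen2.5 model via WebLLM in the browser.
// The system prompt and function names here are assumptions for illustration.

const MODEL_ID = "Qwen2.5-0.5B-Instruct-q4f16_1-MLC";

// Pure helper: build the OpenAI-style message list WebLLM expects.
function buildMessages(userText) {
  return [
    { role: "system", content: "You are a helpful customer-support assistant." },
    { role: "user", content: userText },
  ];
}

async function chat(userText) {
  // Dynamic import so this file can still be parsed outside the browser.
  const { CreateMLCEngine } = await import("@mlc-ai/web-llm");

  // Downloads the pre-compiled weights and compiles WebGPU shaders on
  // first use; subsequent loads are served from the browser cache.
  const engine = await CreateMLCEngine(MODEL_ID, {
    initProgressCallback: (p) => console.log(p.text),
  });

  const reply = await engine.chat.completions.create({
    messages: buildMessages(userText),
  });
  return reply.choices[0].message.content;
}

// Only attempt inference when WebGPU is actually available,
// i.e. when running in a supported browser.
if (globalThis.navigator?.gpu) {
  chat("How do I reset my password?").then(console.log);
}
```

Because the model is fetched and cached client-side, the first page load is slow (the q4f16_1 quantized weights are a few hundred MB), but every later request runs entirely in the browser with no server round-trip.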
## Acknowledgements
We would like to thank the team behind the [WebLLM](https://github.com/mlc-ai/web-llm) JavaScript library, which made it straightforward to integrate pre-compiled models and load them onto the GPU via WebGPU.