Projects in Awesome Lists tagged with ggml

https://github.com/ggerganov/llama.cpp

LLM inference in C/C++

ggml llama

Last synced: 22 Dec 2024

https://github.com/rustformers/llm

An ecosystem of Rust libraries for working with large language models

ai ggml llm ml rust

Last synced: 15 Oct 2024

https://github.com/xorbitsai/inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

artificial-intelligence chatglm deployment flan-t5 gemma ggml glm4 inference llama llama3 llamacpp llm machine-learning mistral openai-api pytorch qwen vllm whisper wizardlm

Last synced: 22 Dec 2024

https://github.com/leejet/stable-diffusion.cpp

Stable Diffusion and Flux in pure C/C++

ai cplusplus diffusion flux flux-dev flux-schnell ggml image-generation image2image img2img latent-diffusion stable-diffusion text2image txt2img

Last synced: 18 Dec 2024

https://github.com/rwkv/rwkv.cpp

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

deep-learning ggml language-model llm machine-learning quantization rwkv

Last synced: 19 Dec 2024

https://github.com/RWKV/rwkv.cpp

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

deep-learning ggml language-model llm machine-learning quantization rwkv

Last synced: 08 Nov 2024

https://github.com/guinmoon/llmfarm

llama and other large language models on iOS and MacOS offline using GGML library.

ai ggml gpt-2 gptneox ios llama macos rwkv starcoder swift

Last synced: 22 Dec 2024

https://github.com/rahulschand/gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

ggml gpu huggingface language-model llama llama2 llamacpp llm pytorch quantization

Last synced: 20 Dec 2024

https://github.com/RahulSChand/gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

ggml gpu huggingface language-model llama llama2 llamacpp llm pytorch quantization

Last synced: 08 Nov 2024

https://github.com/guinmoon/LLMFarm

llama and other large language models on iOS and MacOS offline using GGML library.

ai ggml gpt-2 gptneox ios llama macos rwkv starcoder swift

Last synced: 28 Oct 2024

https://github.com/azkadev/whisper

Whisper Dart is a cross platform library for dart and flutter that allows converting audio to text / speech to text / inference from Open AI models

ai android dart flutter ggml indonesia ios linux macos openai speech speech-recognition speech-synthesis speech-to-text transcribe transformer whisper whisper-dart whisper-flutter windows

Last synced: 21 Dec 2024

https://github.com/Maknee/minigpt4.cpp

Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)

c cpp deep-learning ggml machine-learning minigpt4 multimodal quantization

Last synced: 08 Nov 2024

https://github.com/the-crypt-keeper/can-ai-code

Self-evaluating interview for AI coders

ai ggml humaneval langchain llama-cpp llm transformers

Last synced: 29 Nov 2024

https://github.com/shm007g/LLaMA-Cult-and-More

Large Language Models for All, 🦙 Cult and More, Stay in touch !

alpaca chatgpt deepspeed ggml gpt gpt4 gptq llama llm loralib pytorch tensorflow transformers vicuna

Last synced: 27 Oct 2024

https://github.com/azkadev/bark

WIP Library Text To Speech From Suno AI's Bark in C/C++ for fast inference

ai bark clone dart deep-learning fake ggml machine-learning neural-network text-to-speech tts voice

Last synced: 16 Dec 2024

https://github.com/azkadev/general_ai

GENERAL Ai Library For DART & Flutter

ai artificial-intelligence azkadev dart deep-learning flutter ggml library machine-learning ml piper stable-diffusion whisper

Last synced: 16 Dec 2024

https://github.com/staghado/vit.cpp

Inference Vision Transformer (ViT) in plain C/C++ with ggml

ai c computer-vision cpp cpu edge-computing ggml image-classification llamacpp vision-transformer whisper-cpp

Last synced: 17 Dec 2024

https://github.com/mayooear/private-chatbot-mpt30b-langchain

Chat with your data privately using MPT-30b

ggml gpt langchain llm

Last synced: 19 Dec 2024

https://github.com/balisujohn/tortoise.cpp

A ggml (C++) re-implementation of tortoise-tts

ggml local speech text text-to-speech to tortoise-tts tts

Last synced: 15 Dec 2024

https://github.com/mgonzs13/llama_ros

llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2

cpp embeddings ggml gguf gpt langchain llama llamacpp llava llavacpp llm rerank reranking ros2 vlm

Last synced: 21 Dec 2024

https://github.com/chenhunghan/ialacol

🪶 Lightweight OpenAI drop-in replacement for Kubernetes

ai cloudnative cuda ggml gptq gpu helm kubernetes langchain llamacpp llm llm-inference llm-serving openai python

Last synced: 27 Sep 2024

https://github.com/gotzmann/booster

Booster - open accelerator for LLM models. Better inference and debugging for AI hackers

chatgpt exllama ggml gpt llama llama-cpp llamacpp llm ollama oobabooga openai vllm

Last synced: 15 Dec 2024

https://github.com/zatevakhin/obsidian-local-llm

Obsidian Local LLM is a plugin for Obsidian that provides access to a powerful neural network, allowing users to generate text in a wide range of styles and formats using a local LLM.

ggml llama llamacpp obsidian-md obsidian-plugin

Last synced: 03 Dec 2024

https://github.com/guoriyue/langcommand

LangCommand is a local inference command-line tool that transforms natural language descriptions into shell commands.

command command-line command-line-tool ggml llama llm shell terminal

Last synced: 16 Dec 2024

https://github.com/guoriyue/llmc

LangCommand is a local inference command-line tool that transforms natural language descriptions into shell commands.

command command-line command-line-tool ggml llama llm shell terminal

Last synced: 20 Dec 2024

https://github.com/sevagh/demucs.cpp

C++17 port of Demucs v3 (hybrid) and v4 (hybrid transformer) models with ggml and Eigen3

blas demixing demucs eigen3 ggml guitar music-source-separation piano pytorch

Last synced: 27 Oct 2024

https://github.com/rbourgeat/impai

😈 ImpAI is an advanced role play app using large language and diffusion models.

ai character-ai chat docker game ggml gguf linux llama llama-cpp llm macos roleplay stable-diffusion windows

Last synced: 17 Nov 2024

https://github.com/Mobile-Artificial-Intelligence/maid_llm

maid_llm is a dart implementation of llama.cpp used by the mobile artificial intelligence distribution (maid)

facebook flutter-ai gemma ggml gguf llama llama2 llamacpp llm llm-inference local-ai meta mistral mixtral mobile-ai

Last synced: 25 Nov 2024

https://github.com/mobile-artificial-intelligence/maid_llm

maid_llm is a dart implementation of llama.cpp used by the mobile artificial intelligence distribution (maid)

facebook flutter-ai gemma ggml gguf llama llama2 llamacpp llm llm-inference local-ai meta mistral mixtral mobile-ai

Last synced: 18 Dec 2024

https://github.com/mgonzs13/whisper_ros

Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2

ggml ros2 speech-recognition speech-to-text vad voice-activity-detection whisper-cpp

Last synced: 06 Dec 2024

https://github.com/ahoylabs/gguf.js

A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.

ggml gguf large-language-models llamacpp llm machine-learning

Last synced: 03 Dec 2024

https://github.com/cztomsik/ggml-js

JavaScript bindings for the ggml-js library

ggml machine-learning node-js

Last synced: 20 Nov 2024

https://github.com/latestissue/altaeraai

A set of bash scripts to automate deployment of GGML/GGUF models [default: RWKV] with the use of KoboldCpp on Android - Termux

ai android ggml gguf koboldai koboldcpp llamacpp mamba mistral phi rwkv rwkv4 rwkvcpp termux tinydolphin tinyllama tinyvicuna vicuna

Last synced: 03 Dec 2024

https://github.com/latestissue/AltaeraAI

A set of bash scripts to automate deployment of GGML/GGUF models [default: RWKV] with the use of KoboldCpp on Android - Termux

ai android ggml gguf koboldai koboldcpp llamacpp mamba mistral phi rwkv rwkv4 rwkvcpp termux tinydolphin tinyllama tinyvicuna vicuna

Last synced: 07 Nov 2024

https://github.com/uminosachi/open-llm-webui

This repository contains a web application designed to execute relatively compact, locally-operated Large Language Models (LLMs).

chatbot ggml gradio huggingface language-model llama llama2 llama3 llava llava-llama3 llm nlp transformers

Last synced: 10 Oct 2024

https://github.com/teleprint-me/py.gpt.prompt

PyGPTPrompt: A CLI tool that manages context windows for AI models, facilitating user interaction and data ingestion for optimized long-term memory and task automation.

chatgpt chromadb cli-app developer-tools ggml gguf huggingface llamacpp openai prompt-toolkit python

Last synced: 27 Sep 2024

https://github.com/albertstarfield/project-zephyrine

Introducing Project Zephyrine: Elevating Your Interaction Plug and Play, and Employing GPU Acceleration within a Modernized Automata Local Graphical User Interface.

chatgpt cuda electron falcon gemma ggml gguf gpt-3 gui llama llama-2 llama-3 llm metal opencl

Last synced: 30 Nov 2024

https://github.com/coderonion/ggml-zig

[ ggml: Tensor library for machine learning ] written in zig.

aigc auto-differentiation chatglm chatgpt deep-learning ggml gpt gpt4 large-language-models llama llm machine-learning tensor wizardcoder zig ziglang

Last synced: 13 Nov 2024

https://github.com/matthoffner/backseat-pilot

Bring your own copilot server and customize commands to refactor instead of autofill or tabbed completion.

copilot ggml llm starcoder vscode-extension wizardcoder

Last synced: 09 Dec 2024

https://github.com/archishmansengupta/finllm

finLLM using llama-2-7b-chat-ggml, langchain, chainlit, and faiss vector db on encyclopedia of finance, sfsu

chainlink faiss finance ggml langchain llama2 ollama

Last synced: 17 Dec 2024

https://github.com/zackshen/gguf

a GGUF file parser

ai ggml gguf gpt llama llm model

Last synced: 22 Oct 2024

https://github.com/johnclaw/chatllm.vb

VB.NET api wrapper for llm-inference chatllm.cpp

api-wrapper bindings chatllm cpu-inference gemma ggml int8 int8-inference int8-quantization llama llm-inference mistral qwen vb-net vbnet

Last synced: 22 Dec 2024

https://github.com/tbogdala/sentient_core

A terminal style user interface to chat with AI characters using llama LLMs for locally processed AI.

ai chat-application ggml llama llamacpp llm llm-inference rust terminal-ui

Last synced: 17 Nov 2024

https://github.com/johnclaw/chatllm.v

V-lang api wrapper for llm-inference chatllm.cpp

api-wrapper bindings chatbot chatllm cpu-inference gemma ggml inference llama llm llm-inference llms mistral phi3 quantization qwen v-lang vlang

Last synced: 22 Dec 2024

https://github.com/johnclaw/chatllm.nim

Nim api-wrapper for llm-inference chatllm.cpp

api-wrapper bindings chatbot chatllm cpu-inference gemma ggml inference llama llm llm-inference llms mistral nim nim-lang nim-language nimlang phi quantization qwen

Last synced: 22 Dec 2024

https://github.com/johnclaw/chatllm.cs

C# api wrapper for llm-inference chatllm.cpp

api-wrapper bindings chatllm cpu-inference csharp gemma ggml inference int8 int8-inference int8-quantization llama llm llm-inference llms mistral qwen

Last synced: 22 Dec 2024

https://github.com/iakashpaul/ghudsavar

Ghudsavar (Horse rider) - Is a quick llama.cpp server for CPU only runtimes

gemma gemma-2b ggml gguf google huggingface llama llamacpp server

Last synced: 10 Oct 2024

https://github.com/jeddyhhh/ziptie

A web interface for llama.cpp cli written in js, jQuery and php.

cpp ggml jquery js llama llamacpp llm php wsl ziptie

Last synced: 12 Nov 2024

https://github.com/thefrieber/koboldcs

Adaption of KoboldCPP with the goal to add missing core features

ai chatbot ggml gguf koboldai language-model llama llamacpp llm-interface llms

Last synced: 11 Dec 2024

https://github.com/rainyl/ggml.dart

ggml bindings for dart language

dart flutter ggml

Last synced: 19 Dec 2024

https://github.com/johnclaw/chatllm.lua

lua api wrapper for llm-inference chatllm.cpp

api-wrapper bindings chatbot chatllm cpu-inference gemma ggml inference llama llm llm-inference llms lua luajit mistral quantization qwen

Last synced: 22 Dec 2024

https://github.com/johnclaw/chatllm.kt

kotlin api wrapper for llm-inference chatllm.cpp

api-wrapper bindings chatbot chatllm cpu-inference gemma ggml inference kotlin llama llm llm-inference llms mistral quantization qwen

Last synced: 22 Dec 2024

https://github.com/johnclaw/chatllm.rs

rust api wrapper for llm-inference chatllm.cpp

api-wrapper bindings chatbot chatllm cpu-inference gemma ggml inference llama llm llm-inference llms mistral quantization qwen rust

Last synced: 22 Dec 2024

https://github.com/johnclaw/chatllm.d

D-lang api wrapper for llm-inference chatllm.cpp

api-wrapper bindings chatbot chatllm cpu-inference d-lang d-language dlang gemma ggml inference llama llm llm-inference llms mistral quantization qwen

Last synced: 22 Dec 2024

https://github.com/rbourgeat/llm-cmd

✨ LLM CMD is a toolbox allowing you to use LLM in daily developer commands 💻

ai docker ggml git llama-cpp llm

Last synced: 05 Nov 2024

https://github.com/developer239/llama.cpp-ts

llama.cpp 🦙 LLM inference in TypeScript

ggml gguf llama llama3 llm llms meta-ai node-addon-api nodejs typescript

Last synced: 03 Dec 2024

https://github.com/incubated-geek-cc/native_gpt4j

A portable and native standalone GPT bot assistant *with chat interface) built using Java bindings from GPT4All. JDK 8 compatible.

chat-application chatgpt chatgpt-app ggml gpt java java-8

Last synced: 15 Nov 2024

https://github.com/iam-baivab/llama-2-ggml-medical-chatbot

Llama2-Medical-Chatbot is a medical chatbot that uses the Llama-2-7B-Chat-GGML model and the pdf The Gale Encyclopedia of Medicine, Volume 1, 2nd Edition. It is still under development, but it has the potential to be a valuable tool for patients, healthcare professionals, and researchers.

all-minilm-l6-v2 chainlit ggml llama2 llama2-7b sentence-transformers streamlit