Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with ggml
A curated list of projects in awesome lists tagged with ggml.
https://github.com/xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
artificial-intelligence chatglm deployment flan-t5 gemma ggml glm4 inference llama llama3 llamacpp llm machine-learning mistral openai-api pytorch qwen vllm whisper wizardlm
Last synced: 22 Dec 2024
https://github.com/leejet/stable-diffusion.cpp
Stable Diffusion and Flux in pure C/C++
ai cplusplus diffusion flux flux-dev flux-schnell ggml image-generation image2image img2img latent-diffusion stable-diffusion text2image txt2img
Last synced: 18 Dec 2024
https://github.com/rwkv/rwkv.cpp
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
deep-learning ggml language-model llm machine-learning quantization rwkv
Last synced: 19 Dec 2024
https://github.com/RWKV/rwkv.cpp
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
deep-learning ggml language-model llm machine-learning quantization rwkv
Last synced: 08 Nov 2024
https://github.com/rahulschand/gpu_poor
Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
ggml gpu huggingface language-model llama llama2 llamacpp llm pytorch quantization
Last synced: 20 Dec 2024
https://github.com/RahulSChand/gpu_poor
Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
ggml gpu huggingface language-model llama llama2 llamacpp llm pytorch quantization
Last synced: 08 Nov 2024
https://github.com/azkadev/whisper
Whisper Dart is a cross-platform library for Dart and Flutter that converts audio to text (speech to text) by running inference on OpenAI models
ai android dart flutter ggml indonesia ios linux macos openai speech speech-recognition speech-synthesis speech-to-text transcribe transformer whisper whisper-dart whisper-flutter windows
Last synced: 21 Dec 2024
https://github.com/Maknee/minigpt4.cpp
Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)
c cpp deep-learning ggml machine-learning minigpt4 multimodal quantization
Last synced: 08 Nov 2024
https://github.com/the-crypt-keeper/can-ai-code
Self-evaluating interview for AI coders
ai ggml humaneval langchain llama-cpp llm transformers
Last synced: 29 Nov 2024
https://github.com/shm007g/LLaMA-Cult-and-More
Large Language Models for All, 🦙 Cult and More, Stay in touch!
alpaca chatgpt deepspeed ggml gpt gpt4 gptq llama llm loralib pytorch tensorflow transformers vicuna
Last synced: 27 Oct 2024
https://github.com/azkadev/bark
WIP library for text-to-speech with Suno AI's Bark in C/C++ for fast inference
ai bark clone dart deep-learning fake ggml machine-learning neural-network text-to-speech tts voice
Last synced: 16 Dec 2024
https://github.com/azkadev/general_ai
General AI library for Dart & Flutter
ai artificial-intelligence azkadev dart deep-learning flutter ggml library machine-learning ml piper stable-diffusion whisper
Last synced: 16 Dec 2024
https://github.com/staghado/vit.cpp
Inference Vision Transformer (ViT) in plain C/C++ with ggml
ai c computer-vision cpp cpu edge-computing ggml image-classification llamacpp vision-transformer whisper-cpp
Last synced: 17 Dec 2024
https://github.com/mayooear/private-chatbot-mpt30b-langchain
Chat with your data privately using MPT-30b
Last synced: 19 Dec 2024
https://github.com/balisujohn/tortoise.cpp
A ggml (C++) re-implementation of tortoise-tts
ggml local speech text text-to-speech to tortoise-tts tts
Last synced: 15 Dec 2024
https://github.com/chenhunghan/ialacol
🪶 Lightweight OpenAI drop-in replacement for Kubernetes
ai cloudnative cuda ggml gptq gpu helm kubernetes langchain llamacpp llm llm-inference llm-serving openai python
Last synced: 27 Sep 2024
https://github.com/zatevakhin/obsidian-local-llm
Obsidian Local LLM is a plugin for Obsidian that provides access to a powerful neural network, allowing users to generate text in a wide range of styles and formats using a local LLM.
ggml llama llamacpp obsidian-md obsidian-plugin
Last synced: 03 Dec 2024
https://github.com/guoriyue/langcommand
LangCommand is a local inference command-line tool that transforms natural language descriptions into shell commands.
command command-line command-line-tool ggml llama llm shell terminal
Last synced: 16 Dec 2024
https://github.com/guoriyue/llmc
LangCommand is a local inference command-line tool that transforms natural language descriptions into shell commands.
command command-line command-line-tool ggml llama llm shell terminal
Last synced: 20 Dec 2024
https://github.com/sevagh/demucs.cpp
C++17 port of Demucs v3 (hybrid) and v4 (hybrid transformer) models with ggml and Eigen3
blas demixing demucs eigen3 ggml guitar music-source-separation piano pytorch
Last synced: 27 Oct 2024
https://github.com/Mobile-Artificial-Intelligence/maid_llm
maid_llm is a Dart implementation of llama.cpp used by the Mobile Artificial Intelligence Distribution (Maid)
facebook flutter-ai gemma ggml gguf llama llama2 llamacpp llm llm-inference local-ai meta mistral mixtral mobile-ai
Last synced: 25 Nov 2024
https://github.com/mobile-artificial-intelligence/maid_llm
maid_llm is a Dart implementation of llama.cpp used by the Mobile Artificial Intelligence Distribution (Maid)
facebook flutter-ai gemma ggml gguf llama llama2 llamacpp llm llm-inference local-ai meta mistral mixtral mobile-ai
Last synced: 18 Dec 2024
https://github.com/mgonzs13/whisper_ros
Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2
ggml ros2 speech-recognition speech-to-text vad voice-activity-detection whisper-cpp
Last synced: 06 Dec 2024
https://github.com/ahoylabs/gguf.js
A JavaScript library (with TypeScript types) to parse metadata of GGML-based GGUF files.
ggml gguf large-language-models llamacpp llm machine-learning
Last synced: 03 Dec 2024
https://github.com/cztomsik/ggml-js
JavaScript bindings for the ggml-js library
Last synced: 20 Nov 2024
https://github.com/latestissue/altaeraai
A set of bash scripts to automate deployment of GGML/GGUF models [default: RWKV] with the use of KoboldCpp on Android - Termux
ai android ggml gguf koboldai koboldcpp llamacpp mamba mistral phi rwkv rwkv4 rwkvcpp termux tinydolphin tinyllama tinyvicuna vicuna
Last synced: 03 Dec 2024
https://github.com/latestissue/AltaeraAI
A set of bash scripts to automate deployment of GGML/GGUF models [default: RWKV] with the use of KoboldCpp on Android - Termux
ai android ggml gguf koboldai koboldcpp llamacpp mamba mistral phi rwkv rwkv4 rwkvcpp termux tinydolphin tinyllama tinyvicuna vicuna
Last synced: 07 Nov 2024
https://github.com/uminosachi/open-llm-webui
This repository contains a web application designed to execute relatively compact, locally-operated Large Language Models (LLMs).
chatbot ggml gradio huggingface language-model llama llama2 llama3 llava llava-llama3 llm nlp transformers
Last synced: 10 Oct 2024
https://github.com/teleprint-me/py.gpt.prompt
PyGPTPrompt: A CLI tool that manages context windows for AI models, facilitating user interaction and data ingestion for optimized long-term memory and task automation.
chatgpt chromadb cli-app developer-tools ggml gguf huggingface llamacpp openai prompt-toolkit python
Last synced: 27 Sep 2024
https://github.com/albertstarfield/project-zephyrine
Introducing Project Zephyrine: Elevating Your Interaction Plug and Play, and Employing GPU Acceleration within a Modernized Automata Local Graphical User Interface.
chatgpt cuda electron falcon gemma ggml gguf gpt-3 gui llama llama-2 llama-3 llm metal opencl
Last synced: 30 Nov 2024
https://github.com/coderonion/ggml-zig
[ ggml: Tensor library for machine learning ] written in zig.
aigc auto-differentiation chatglm chatgpt deep-learning ggml gpt gpt4 large-language-models llama llm machine-learning tensor wizardcoder zig ziglang
Last synced: 13 Nov 2024
https://github.com/matthoffner/backseat-pilot
Bring your own copilot server and customize commands to refactor instead of autofill or tabbed completion.
copilot ggml llm starcoder vscode-extension wizardcoder
Last synced: 09 Dec 2024
https://github.com/johnclaw/chatllm.vb
VB.NET API wrapper for llm-inference chatllm.cpp
api-wrapper bindings chatllm cpu-inference gemma ggml int8 int8-inference int8-quantization llama llm-inference mistral qwen vb-net vbnet
Last synced: 22 Dec 2024
https://github.com/tbogdala/sentient_core
A terminal style user interface to chat with AI characters using llama LLMs for locally processed AI.
ai chat-application ggml llama llamacpp llm llm-inference rust terminal-ui
Last synced: 17 Nov 2024
https://github.com/johnclaw/chatllm.v
V-lang API wrapper for llm-inference chatllm.cpp
api-wrapper bindings chatbot chatllm cpu-inference gemma ggml inference llama llm llm-inference llms mistral phi3 quantization qwen v-lang vlang
Last synced: 22 Dec 2024
https://github.com/johnclaw/chatllm.nim
Nim API wrapper for llm-inference chatllm.cpp
api-wrapper bindings chatbot chatllm cpu-inference gemma ggml inference llama llm llm-inference llms mistral nim nim-lang nim-language nimlang phi quantization qwen
Last synced: 22 Dec 2024
https://github.com/johnclaw/chatllm.cs
C# API wrapper for llm-inference chatllm.cpp
api-wrapper bindings chatllm cpu-inference csharp gemma ggml inference int8 int8-inference int8-quantization llama llm llm-inference llms mistral qwen
Last synced: 22 Dec 2024
https://github.com/iakashpaul/ghudsavar
Ghudsavar (Horse rider) is a quick llama.cpp server for CPU-only runtimes
gemma gemma-2b ggml gguf google huggingface llama llamacpp server
Last synced: 10 Oct 2024
https://github.com/thefrieber/koboldcs
Adaptation of KoboldCPP with the goal of adding missing core features
ai chatbot ggml gguf koboldai language-model llama llamacpp llm-interface llms
Last synced: 11 Dec 2024
https://github.com/johnclaw/chatllm.lua
Lua API wrapper for llm-inference chatllm.cpp
api-wrapper bindings chatbot chatllm cpu-inference gemma ggml inference llama llm llm-inference llms lua luajit mistral quantization qwen
Last synced: 22 Dec 2024
https://github.com/johnclaw/chatllm.kt
Kotlin API wrapper for llm-inference chatllm.cpp
api-wrapper bindings chatbot chatllm cpu-inference gemma ggml inference kotlin llama llm llm-inference llms mistral quantization qwen
Last synced: 22 Dec 2024
https://github.com/johnclaw/chatllm.rs
Rust API wrapper for llm-inference chatllm.cpp
api-wrapper bindings chatbot chatllm cpu-inference gemma ggml inference llama llm llm-inference llms mistral quantization qwen rust
Last synced: 22 Dec 2024
https://github.com/johnclaw/chatllm.d
D-lang API wrapper for llm-inference chatllm.cpp
api-wrapper bindings chatbot chatllm cpu-inference d-lang d-language dlang gemma ggml inference llama llm llm-inference llms mistral quantization qwen
Last synced: 22 Dec 2024
https://github.com/developer239/llama.cpp-ts
llama.cpp 🦙 LLM inference in TypeScript
ggml gguf llama llama3 llm llms meta-ai node-addon-api nodejs typescript
Last synced: 03 Dec 2024
https://github.com/incubated-geek-cc/native_gpt4j
A portable and native standalone GPT bot assistant (with chat interface) built using Java bindings from GPT4All. JDK 8 compatible.
chat-application chatgpt chatgpt-app ggml gpt java java-8
Last synced: 15 Nov 2024
https://github.com/iam-baivab/llama-2-ggml-medical-chatbot
Llama2-Medical-Chatbot is a medical chatbot that uses the Llama-2-7B-Chat-GGML model and the PDF of The Gale Encyclopedia of Medicine, Volume 1, 2nd Edition. It is still under development, but it has the potential to be a valuable tool for patients, healthcare professionals, and researchers.
all-minilm-l6-v2 chainlit ggml llama2 llama2-7b sentence-transformers streamlit
Last synced: 10 Oct 2024
https://github.com/teleprint-me/nn
A detailed walk-through of the ggml library
Last synced: 09 Nov 2024
https://github.com/kartiksood10/blog-generator
An LLM app that generates blog posts based on the style, topic, and word count supplied by the user.
ggml huggingface llm python streamlit
Last synced: 17 Nov 2024
https://github.com/shubham0204/smolchat-android
Running any GGUF SLMs/LLMs locally, on-device in Android
android cpp ggml kotlin llamacpp small-language-models
Last synced: 03 Dec 2024
https://github.com/godmode2k/whisper.cpp.android
whisper.cpp.android with CLBlast(OpenCL), Translation (Google ML-Kit) and TTS
android clblast ggml kotlin ml-kit openai-whisper opencl tts whisper whisper-ai whisper-cpp
Last synced: 10 Nov 2024