An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with llama-cpp

A curated list of projects in awesome lists tagged with llama-cpp.

https://github.com/getumbrel/llama-gpt

A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!

ai chatgpt code-llama codellama gpt gpt-4 gpt4all llama llama-2 llama-cpp llama2 llamacpp llm localai openai self-hosted

Last synced: 13 May 2025

https://github.com/scisharp/llamasharp

A C#/.NET library to run LLMs (🦙 LLaMA/LLaVA) efficiently on your local device.

chatbot gpt llama llama-cpp llama2 llama3 llamacpp llava llm multi-modal semantic-kernel

Last synced: 14 May 2025

https://github.com/mobile-artificial-intelligence/maid

Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.

android android-ai chatbot chatgpt facebook flutter free-chatgpt gguf large-language-models llama llama-cpp llama2 llamacpp local-ai mistral mobile-ai mobile-artificial-intelligence ollama openai openorca

Last synced: 11 Apr 2025

https://github.com/withcatai/node-llama-cpp

Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforce a JSON schema on the model output at the generation level.

ai bindings catai cmake cmake-js cuda embedding function-calling gguf gpu grammar json-schema llama llama-cpp llm metal nodejs prebuilt-binaries self-hosted vulkan

Last synced: 26 Jan 2026

https://github.com/gotzmann/llama.go

llama.go is like llama.cpp in pure Golang!

alpaca chatgpt dalai gpt gpt3 gpt4 gpt4all llama llama-cpp llm vicuna

Last synced: 15 May 2025

https://github.com/mybigday/llama.rn

React Native binding of llama.cpp

android ios llama llama-cpp llm react-native

Last synced: 13 Apr 2026

https://github.com/the-crypt-keeper/can-ai-code

Self-evaluating interview for AI coders

ai ggml humaneval langchain llama-cpp llm transformers

Last synced: 05 Apr 2025

https://github.com/jlonge4/local_llama

This repo showcases how to run a model locally and offline, free of OpenAI dependencies.

artificial-intelligence langchain llama-cpp llamaindex machinelearning offline python

Last synced: 03 Apr 2025

https://github.com/phronmophobic/llama.clj

Run LLMs locally. A clojure wrapper for llama.cpp.

clojure llama llama-cpp llm

Last synced: 09 Apr 2025

https://github.com/gpustack/gguf-parser-go

Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

gguf go llama-box llama-cpp stable-diffusion-cpp

Last synced: 19 Apr 2025

https://github.com/gotzmann/booster

Booster - open accelerator for LLM models. Better inference and debugging for AI hackers

chatgpt exllama ggml gpt llama llama-cpp llamacpp llm ollama oobabooga openai vllm

Last synced: 11 Jun 2025

https://github.com/nuance1979/llama-server

LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.

chatbot-ui llama llama-cpp llamacpp

Last synced: 12 May 2025

https://github.com/nrl-ai/customchar

Your customized AI assistant - Personal assistants on any hardware! With llama.cpp, whisper.cpp, ggml, LLaMA-v2.

cpp ggml llama llama-cpp llama-v2 llm stt tts whisper-cpp

Last synced: 25 Aug 2025

https://github.com/ferranpons/llamatik

True on-device AI for Kotlin Multiplatform (Android, iOS, Desktop, JVM, WASM). LLM, Speech-to-Text and Image Generation — powered by llama.cpp, whisper.cpp and stable-diffusion.cpp.

ai android desktop edge-ai ggml inference ios kmp kmp-library kotlin ktor llama llama-cpp llm mobile-ai multiplatform offline-ai on-device-ai privacy rag

Last synced: 17 Apr 2026

https://github.com/vtuber-plan/langport

Langport is a language model inference service

api chatgpt chatgpt-api fauxpilot langchain language-model llama llama-cpp llm openai tabby

Last synced: 30 Jun 2025

https://github.com/abhi5h3k/privatedocbot

📚 Local PDF-Integrated Chat Bot: Secure Conversations and Document Assistance with LLM-Powered Privacy

ai chatgpt generative gpt gpt-4 gpt4all langchain llama llama-2 llama-cpp llama2 llamacpp llm localai openai pdf private privategpt self-hosted vectorstore

Last synced: 05 Oct 2025

https://github.com/r3gm/insightsolver-colab

InsightSolver: Colab notebooks for exploring and solving operational issues using deep learning, machine learning, and related models.

ai-ops aiops autogpt colab-notebook colorization computer-vision deep-learning llama-2 llama-cpp llm machine-learning object-detection stable-diffusion text-to-speech

Last synced: 12 Oct 2025

https://github.com/zhouwg/kantv

Workbench for learning and practising AI tech in real scenarios on Android devices, powered by GGML (Georgi Gerganov Machine Learning), NCNN (Nihui Convolutional Neural Network), FFmpeg, and OpenCV.

edge-ai ffmpeg ffmpeg-android livetv llama-cpp ncnn-android whisper-cpp

Last synced: 04 Apr 2025

https://github.com/rbourgeat/impai

😈 ImpAI is an advanced role play app using large language and diffusion models.

ai character-ai chat docker game ggml gguf linux llama llama-cpp llm macos roleplay stable-diffusion windows

Last synced: 29 Jul 2025

https://github.com/fboulnois/llama-cpp-docker

Run llama.cpp in a GPU accelerated Docker container

chatgpt docker docker-compose llama llama-cpp llama2 llama3 llm mistral

Last synced: 07 Mar 2026

https://github.com/lifevalue/healthwallet.me

Open-source, patient-controlled health record app with on-device AI. Aggregates medical data from 52K+ providers via FHIR R4. Offline-first. Flutter.

ai dart digital-health ehr emr fhir flutter health healthcare llama-cpp llm medical-records mobile-health offline-first on-device-ai open-source patient-data personal-health-record privacy self-hosted

Last synced: 24 Apr 2026

https://github.com/aj-archipelago/cortex

Simplify and accelerate AI-powered application development with structured interfaces to models and powerful prompt execution environments.

ai chatgpt gpt-3 gpt-35-turbo gpt-4 graphql langchain llama llama-cpp llamacpp llm openai palm palm2 rest-api vertex-ai

Last synced: 11 Feb 2026

https://github.com/defilantech/llmkube

Kubernetes operator for local LLM inference with llama.cpp, vLLM, and TGI - multi-GPU, autoscaling, air-gapped, production-ready

ai ai-infrastructure apple-silicon autoscaling edge-computing gguf gpu homelab inference kubernetes kubernetes-operator llama-cpp llm local-llm metal mlops multi-gpu nvidia self-hosted vllm

Last synced: 27 Apr 2026

https://github.com/countzero/windows_llama.cpp

PowerShell automation to rebuild llama.cpp for a Windows environment.

cmake conda cuda llama-cpp openblas powershell windows

Last synced: 26 Apr 2026

https://github.com/stampby/halo-ai-core

Bare-metal AI platform for AMD Strix Halo. One script. Everything works. Lego blocks — snap in what you need.

agent-framework ai amd arch-linux bare-metal caddy gaia gpu inference lemonade llama-cpp local-ai privacy rocm ryzen-ai self-hosted strix-halo systemd

Last synced: 18 Apr 2026

https://github.com/opencsgs/llm-inference

llm-inference is a platform for publishing and managing LLM inference, providing a wide range of out-of-the-box features for model deployment, such as a UI, a RESTful API, auto-scaling, computing resource management, monitoring, and more.

deepspeed llama-cpp llm-inference ray transformer vllm

Last synced: 12 Apr 2025

https://github.com/hyparam/hyllama

llama.cpp GGUF file parser for JavaScript

gguf javascript js llama-cpp llamacpp llm machine-learning ml parser

Last synced: 17 Mar 2025

https://github.com/Lizonghang/prima.cpp

prima.cpp: Speeding up 70B-scale LLM inference on low-resource everyday home clusters

distributed-ai llama-cpp llm-inference on-device-llms

Last synced: 23 Apr 2025

https://github.com/pranavkumaarofficial/nlcli-wizard

Natural language control for Python CLI tools using locally-trained SLMs (CPU inference)

cli-tools fine-tuning gemma llama-cpp llm local-first machine-learning nlp qlora quantization slm unsloth

Last synced: 07 Mar 2026

https://github.com/shakfu/cyllama

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

cython cython-wrapper llama-cpp python3 stable-diffusion-cpp whisper-cpp

Last synced: 02 Apr 2026

https://github.com/mybigday/llama.node

Node.js binding of llama.cpp

llama llama-cpp llamacpp llm node-js nodejs

Last synced: 01 Apr 2026

https://github.com/brutalcoding/llama_dart

Flutter / Dart bindings for llama.cpp

dart flutter llama-cpp llama-dart shady-ai

Last synced: 17 Sep 2025

https://github.com/kocort/kocort

Desktop AI agent runtime with dual-brain safety review, GUI-first operation, local model support, and multi-channel delivery.

agent-runtime ai-agent ai-agents desktop-ai desktop-app golang llama-cpp local-first local-llm multi-agent nextjs openai-compatible slack-bot subagents task-scheduler telegram-bot tool-calling webhook workflow-automation

Last synced: 12 Apr 2026

https://github.com/itzderock/llama-playground

A simple to use and powerful web-interface to mess around with Meta's LLaMA LLM.

llama llama-cpp llama-inference-server llamacpp nextjs trpc

Last synced: 29 Oct 2025

https://github.com/rhinodevel/mt_llm

Pure C wrapper library that makes using llama.cpp on Linux and Windows as simple as possible.

inference llama-cpp llm

Last synced: 06 Oct 2025

https://github.com/robjsliwa/llama-agent

Fun project to run your own LLM chat bot using llama.cpp

agents ai langchain langchain-python llama llama-cpp llamacpp llm

Last synced: 14 Jul 2025

https://github.com/e-lab/syntaxshaper

Powering Agent Chains by Constraining LLM Outputs

agent ai ai-eng llama llama-cpp llm llm-agent

Last synced: 08 Mar 2025

https://github.com/acai66/qwen2.5_numpy

Implements the inference process of DeepSeek-R1-Distill-Qwen-1.5B in NumPy, making it easy to learn LLM (large language model) inference and to port it to other programming languages for acceleration.

deepseek deepseek-r1 llama-cpp llm-inference numpy qwen qwen2

Last synced: 22 Apr 2025

https://github.com/rudolfolah/metatron

Metatron is a project that brings together whisper.cpp, llama.cpp, and piper into a deployable stack with an awesome Node.js API wrapper for each of them.

dockerized llama-cpp llamacpp piper whisper-cpp whispercpp

Last synced: 28 Aug 2025

https://github.com/nonatofabio/luna-agent

Custom minimal AI agent with persistent memory, MCP tools, and Discord

agent-framework ai-agent discord-bot homelab llama-cpp llm local-llm mcp openai-compatible python sqlite vector-search

Last synced: 04 Apr 2026

https://github.com/rbourgeat/llm-cmd

✨ LLM CMD is a toolbox allowing you to use LLM in daily developer commands 💻

ai docker ggml git llama-cpp llm

Last synced: 11 Apr 2025

https://github.com/bdqfork/go-llama.cpp

Go binding for llama.cpp, offering low-level and high-level APIs

chatgpt go gpt llama llama-cpp llamacpp

Last synced: 08 Mar 2026

https://github.com/kevinknights29/llama-v2-gpu-gtx-1650

Running Llama v2 with Llama.cpp on a GTX 1650 with 4 GB of VRAM.

docker gpu llama-cpp python

Last synced: 23 Apr 2025

https://github.com/mili-tan/onllama.gguflinkout

Create symbolic links to the GGUF models stored in Ollama blobs, for use in other applications such as Llama.cpp, Jan, and LM Studio.

gguf gguf-models jan llama-cpp llamacpp lmstudio ollama

Last synced: 12 Apr 2025

https://github.com/rafaelpierre/openai-agents-redis

Native OpenAI Agents SDK session management implementation using Redis as the persistence layer.

agents artificial-intelligence llama llama-cpp multiagent-systems ollama openai openai-agents-sdk redis

Last synced: 06 Aug 2025

https://github.com/1038lab/ComfyUI-MiniCPM

A ComfyUI custom node for MiniCPM vision-language models, enabling high-quality image captioning and analysis.

comfyui custom-nodes gguf llama-cpp minicpm minicpm-v muti-models stable-diffusion

Last synced: 02 Sep 2025

https://github.com/sullygreene/tinyagi

TinyAGI is a lightweight, modular, and extensible Python-based AGI framework designed to create and manage AI agents seamlessly. It supports various model backends like OpenAI, Llama.cpp, Ollama, AlpacaX, and Tabitha, along with dynamic plugin loading for enhanced flexibility.

agents agi ai api artificial-intelligence cli developer-tools extensible framework llama-cpp machine-learning modular ollama openai plugins python task-automation

Last synced: 06 Oct 2025

https://github.com/statikfintechllc/godcore

All-in-one local AI stack for Mistral-13B and Llama.cpp, with one-step CUDA wheel install, OpenAI-compatible API, and modern web dashboard. Switch between local and cloud chat, run on your own GPU, and deploy instantly—no API keys or paywalls. Designed for easy install, custom builds, and fast remote access. Enjoy!

ai chatbot chatgpt cuda dashboard fastapi llama-cpp llm local-ai mistral openai-compatible react selfhosted webui

Last synced: 25 Jun 2025

https://github.com/countzero/windows_manage_large_language_models

PowerShell automation to download large language models (LLMs) from Git repositories and quantize them with llama.cpp into the GGUF format.

gguf git large-language-models lfs llama-cpp powershell quantization windows

Last synced: 26 Apr 2026

https://github.com/alichherawalla/offline-mobile-llm-manager

The Swiss Army Knife of Offline AI. Chat, Speak, and Generate Images: Privacy First, Zero Internet. Download an LLM and use it on your mobile device. No data ever leaves your phone. Supports text-to-text, vision, and text-to-image.

edge-ai edge-ai-image-gen gguf llama-3-android llama-cpp llm local-ai local-image-gen offline offline-image-gen offline-llm privacy-first stable-diffusion-android whisper whisper-android

Last synced: 12 Feb 2026

https://github.com/ahitokun/hushai-android

Offline AI assistant for Android. Runs Qwen3 locally via custom JNI bridge. No servers, no accounts, no data collection.

android gguf jni kotlin llama-cpp llm offline on-device-ai privacy qwen3

Last synced: 12 Mar 2026

https://github.com/mizy/local-agent-chat

A Flutter chat UI for llama.cpp

chatgpt flutter ios-app llama-cpp llm localai macos-app

Last synced: 22 Aug 2025

https://github.com/slb350/octoroute

Smart HTTP router for local LLMs (Ollama, LM Studio, llama.cpp). Rule-based + LLM-powered routing, health checks, load balancing, Prometheus metrics. Rust-native, zero-overhead.

ai artificial-intelligence homelab llama-cpp llm lm-studio local-llm ollama prometheus rust

Last synced: 13 Jan 2026

https://github.com/eniompw/llama-cpp-gpu

Load larger models by offloading model layers to both GPU and CPU

colab colab-notebook gpu gpu-acceleration llama llama-cpp llamacpp

Last synced: 05 May 2026

https://github.com/kaust-generative-ai/local-deployment-of-generative-ai-models

Training materials on how to deploy generative AI models locally on your laptop or workstation.

ai carpentries-incubator deployment english generative-ai lesson llama-cpp llamafile llm-inference ollama pre-alpha python

Last synced: 17 Aug 2025

https://github.com/shekharp1536/ollama-web

Ollama Web UI is a simple yet powerful web-based interface for interacting with large language models. It offers chat history, voice commands, voice output, model download and management, conversation saving, terminal access, multi-model chat, and more—all in one streamlined platform.

llama llama-cpp llama3 llm-inference ollama ollama-app ollama-chat ollama-client ollama-gui ollama-interface ollama-python ollama-ui ollama-webui python-llm-integration

Last synced: 06 Oct 2025

https://github.com/mycellm/mycellm

Distributed LLM inference across heterogeneous hardware. Pool GPUs into a P2P network with QUIC transport, Ed25519 identity, and an OpenAI-compatible API.

decentralized distributed-computing fleet-management gpu inference llama-cpp llm machine-learning openai-api peer-to-peer python quic self-hosted

Last synced: 08 Apr 2026

https://github.com/scitrera/sparkrun

sparkrun - launch, manage, and stop LLM inference workloads on NVIDIA DGX Spark systems

dgx-spark inference llama-cpp sglang vllm

Last synced: 24 Feb 2026

https://github.com/zuellni/orpheus-gguf

Orpheus-TTS inference.

ai llama-cpp tts

Last synced: 18 Apr 2026

https://github.com/jonasneves/colab-slm-playground

SLM inference and chatbot notebooks for Google Colab

gguf google-colab huggingface jupyter-notebook llama-cpp llm onnx python slm transformers

Last synced: 07 Apr 2026

https://github.com/mrprincerawat/openwhispr

Open source WhisperFlow alternative for Mac. Hold Fn, speak, release — transcribes and pastes into any app. Runs fully on device, nothing leaves your machine.

apple-silicon dictation llama-cpp local-first macos open-source privacy speech-to-text swift swiftui voice-to-text whisper whisperkit

Last synced: 02 May 2026

https://github.com/prithivsakthiur/triangulum

Triangulum 10B: Multilingual Large Language Models (LLMs)

10b 1b 5b llama-cpp llama-cpp-python llm ollama text-generation

Last synced: 17 May 2026

https://github.com/abhrankan-chakrabarti/llamainteract

An interactive AI platform with both terminal and web-based interfaces for real-time model interactions, featuring dynamic model integration and immediate feedback streaming. Developed by *The Vanguards*.

ai chatbot flask interactive-cli llama llama-cpp llm nlp ollama python real-time-streaming text-generation web-app

Last synced: 10 May 2026

https://github.com/gramanoid/focuslens

A native macOS menu bar app that silently captures your screen, classifies activity with a local vision model, and visualizes work patterns — all on-device, no cloud.

activity-tracker llama-cpp macos menu-bar-app on-device-ai privacy productivity screen-capture swift swiftui

Last synced: 07 Apr 2026

https://github.com/haschka/cli-rag

Command-line tool to interact with a llama.cpp server. Also implements a basic vector database with cosine similarity search.

artificial-intelligence cli large-language-models llama-cpp llm unix-shell

Last synced: 22 Feb 2025

https://github.com/kevinknights29/llama_to_llama.cpp

This project aims to create a guide for generating Llama.cpp GGUF model files from base Llama v2 model weights.

llama-cpp llama-v2 notebook

Last synced: 04 Sep 2025

https://github.com/jadonduff/corellm

Efficient LLM framework for local GGUF models with streaming and GUI

ai chat gguf interface llama-cpp llm local-llm nlp streaming text-generation

Last synced: 13 Jan 2026

https://github.com/jwinman91/ai-ocr

An AI-powered but model-agnostic optical character recognition (OCR) tool

genai image-to-plot-generation image-to-text-generation llama-cpp ocr-python ocr-recognition python3

Last synced: 29 Dec 2025

https://github.com/runedgeai/agents-sdk

A modern, high performance C++ SDK for AI Agents

agentic-ai agents anthropic artificial-intelligence cpp gemini llama-cpp llm ollama openai sdk

Last synced: 08 Oct 2025

https://github.com/miyako/llama-cpp

Local inference engine

4d-dependency 4d-llm llama-cpp

Last synced: 06 Apr 2026

https://github.com/n-engine/devit

Rust CLI dev agent — patch-only, sandboxed, with local LLMs (Ollama/LM Studio).

ai-agent ai-agents approval-policy cli code-generation developer-tools git llama-cpp lm-studio ollama patch-only rust sandbox testing unified-diff wasm

Last synced: 07 Oct 2025

https://github.com/lloyal-ai/lloyal.node

Covalent inference for Node.js

beam-search inference llama-cpp test-time-scaling

Last synced: 08 Mar 2026

https://github.com/fangyuan025/hushdoc

Chat with your documents — privately, offline, on your own machine. Local-first RAG over PDFs/DOCX/images with GPU-accelerated streaming, optional voice mode, multi-conversation history, and citation-anchored sources. Bilingual (中/EN). FastAPI + React + llama.cpp.

bilingual chromadb document-ai fastapi llama-cpp llm local-llm offline-first pdf-chat privacy rag react typescript voice-assistant whisper

Last synced: 17 May 2026

https://github.com/alexg-4w/autonomous-engineering-copilot

The Autonomous Engineering Copilot is a cross-platform Python desktop application designed to act as an offline AI assistant. By utilizing local LLMs and embedded vector databases, it provides powerful RAG capabilities entirely on the user's hardware.

ai-assistant lancedb llama-cpp local-llm offline pyside6 rag

Last synced: 06 Apr 2026

https://github.com/mlorentedev/pollex

Text polishing API (Go) + Chrome extension + llama.cpp GPU inference on Jetson Nano. Self-hosted, private, fast.

chrome-extension go gpu-inference jetson-nano llama-cpp llm self-hosted text-processing

Last synced: 05 Mar 2026

https://github.com/m4k15y6666fk/llm-in-browser

Using LLM with browser features.

llama-cpp llm

Last synced: 03 Jul 2025

https://github.com/dwain-barnes/llm-gguf-auto-converter

Automated Jupyter notebook solution for batch converting Large Language Models to GGUF format with multiple quantization options. Built on llama.cpp with HuggingFace integration.

auto-converter batch-processing cuda gguf huggingface jupyter-notebook llama-cpp llm model-quantization

Last synced: 17 Jun 2025

https://github.com/testli-ai/outlines-llama-cpp-python-streaming-output

This repository demonstrates how to use outlines and llama-cpp-python for structured JSON generation with streaming output, integrating llama.cpp for local model inference and outlines for schema-based text generation.

gguf gguf-models llama-cpp llama-cpp-python llamacpp llamacpp-python outlines

Last synced: 18 Apr 2026

https://github.com/sshoecraft/shepherd

An interactive multi-backend LLM runtime with intelligent cache eviction and persistent retrieval-augmented memory.

anthropic cli cpp cuda gemini grok inference kv-cache llama-cpp llm mcp ollama openai openai-server rag smart-evictions tensorrt tool-calling ulimited-context

Last synced: 10 Apr 2026

https://github.com/pranav11024/genai-text-based-rpg-with-npcs

A text-based role-playing game with AI-powered NPC dialogue, built with Python and Mistral 7B via llama.cpp.

ai-chat ai-npc interactive-story llama-cpp llm-integration local-ai mistral-7b npc python python-game role-playing-game rpg streamlit text-adventure text-rpg

Last synced: 19 Apr 2026

https://github.com/michaelkrauty/llamesh

OpenAI-compatible mesh proxy for llama.cpp

ai inference llama-cpp llm load-balancer openai-api proxy rust

Last synced: 24 Apr 2026

https://github.com/countzero/presentation_on_artificial_intelligence

Presentation on Artificial Intelligence for the Free Drawing and Print Graphics class of the Muthesius Academy of Art.

ai diffusion-models large-language-models llama-cpp stable-diffusion

Last synced: 26 Apr 2026

https://github.com/syedazharmbnr1/computer-use-mcp

macOS Computer Use MCP Server - 33 tools for screen control via Model Context Protocol. Works with Claude Code, Cursor, LM Studio, Ollama, llama.cpp, MLX, and all MCP clients.

ai-tools automation claude-code computer-use cursor llama-cpp lm-studio macos mcp mcp-server mlx model-context-protocol ollama

Last synced: 27 Apr 2026

https://github.com/magicteamc/autogguf

Let me make GGUF files quickly

gguf llama-cpp llamacpp llm llm-quantization

Last synced: 28 Apr 2026