Projects in Awesome Lists tagged with llama-cpp-python
A curated list of projects in awesome lists tagged with llama-cpp-python.
https://github.com/woolverine94/biniou
A self-hosted webui for 30+ generative AI models
animatediff audiocraft bark controlnet diffusers flux generative-ai gfpgan gradio huggingface insightface ip-adapter kandinsky llama-cpp-python photomaker real-esrgan stable-diffusion stable-diffusion-3-5 webui whisper
Last synced: 15 May 2025
https://github.com/Woolverine94/biniou
A self-hosted webui for 30+ generative AI models
animatediff audiocraft bark controlnet diffusers flux generative-ai gfpgan gradio huggingface insightface ip-adapter kandinsky llama-cpp-python photomaker real-esrgan stable-diffusion stable-diffusion-3 webui whisper
Last synced: 24 Mar 2025
https://github.com/jasonacox/tinyllm
Set up and run a local LLM and chatbot using consumer-grade hardware.
artificial-intelligence chatbot large-language-models llama-cpp-python llm openai rag retrieval-augmented-generation vllm
Last synced: 06 Sep 2025
https://github.com/unixwzrd/oobabooga-macOS
Information on optimizing Python libraries specifically for oobabooga to take advantage of Apple Silicon and the Accelerate framework.
blas journal llama-cpp-python macos numpy oobabooga pytorch
Last synced: 22 Jul 2025
https://github.com/mlc-delgado/pytldr-oss
An open source, Gradio-based chatbot app that combines the best of retrieval augmented generation and prompt engineering into an intelligent assistant for modern professionals.
cassandra docker docker-compose gradio llama-cpp-python llama2 python3
Last synced: 20 Aug 2025
https://github.com/laelhalawani/gguf_modeldb
A quick and optimized solution to manage Llama-based GGUF quantized models: download GGUF files, retrieve message formatting, add more models from HF repos, and more. It's super easy to use and comes prepacked with the best preconfigured open source models: dolphin phi-2 2.7b, mistral 7b v0.2, mixtral 8x7b v0.1, solar 10.7b and zephyr 3b.
database hugginface inference llama llama-cpp-python llama2 llm model-database python3
Last synced: 08 Sep 2025
https://github.com/woheller69/llama_tk_chat
Simple chat interface for local AI using llama-cpp-python and llama-cpp-agent
gui llama-cpp-agent llama-cpp-python llm-inference
Last synced: 12 Apr 2025
https://github.com/renatoelho/llama-cpp-local
Llama.cpp is a library written in C++ for the efficient implementation of large language models, such as Meta's LLaMA. Optimized to run on a variety of platforms, including resource-constrained devices, it offers performance, inference speed, and efficient memory use, which are essential for running large models.
ia llama-cpp-python llama2 llama3 llms python shell-script
Last synced: 10 Sep 2025
https://github.com/prithivsakthiur/triangulum
Triangulum 10B: Multilingual Large Language Models (LLMs)
10b 1b 5b llama-cpp llama-cpp-python llm ollama text-generation
Last synced: 22 Feb 2025
https://github.com/wambugu71/offlinegpt-
Local GPT using llama.cpp models, with a chat interface
llama-cpp-python llamacpp openai python python3 streamlit
Last synced: 06 Sep 2025
https://github.com/perpendicularai/sekernel_for_llm_ui
This is the repository for the UI for the SeKernel_for_LLM module
chat database-management internet llama-cpp-python pyqt5 semantic-kernel
Last synced: 17 Mar 2025
https://github.com/testli-ai/outlines-llama-cpp-python-streaming-output
This repository demonstrates how to use outlines and llama-cpp-python for structured JSON generation with streaming output, integrating llama.cpp for local model inference and outlines for schema-based text generation.
gguf gguf-models llama-cpp llama-cpp-python llamacpp llamacpp-python outlines
Last synced: 06 Mar 2025
https://github.com/magnuss0/huginnhears
Huginn Hears is a local app that transcribes and summarizes your meetings in Norwegian and English, using state-of-the-art models and open-source libraries. No cloud needed: everything runs offline.
faster-whisper langchain llama-cpp-python llamacpp local-llm norwegian speech-to-text
Last synced: 08 Apr 2025
https://github.com/serhaturtis/ai-blackshiftworkflow
A flexible Python framework for building complex workflows with LLM integration, robust error handling, and structured data processing.
agentic-ai agentic-workflow agents framework llama llama-cpp-python llm python workflow
Last synced: 18 Jun 2025
https://github.com/pchsu-hsupc/edge_ai_13th
This project optimizes the LLaMA-3.2B-Instruct model for fast inference on a single NVIDIA T4 GPU (16 GB), targeting high throughput and low perplexity for efficient edge deployment.
gguf llama-cpp-python llama3 lora
Last synced: 04 Jul 2025
https://github.com/ankitajadhav611/chatbot_llm_moinvonbremen
LLM chatbot for multimodal processing
llama llama-cpp-python llm llm-chatbot multimodal rag whisper-ai
Last synced: 30 Sep 2025
https://github.com/serhaturtis/ai-flowlib
A Python framework for building structured, flow-based LLM applications with built-in pipeline management, model configuration, and validation capabilities.
ai async flow framework llama-cpp-python llm pipeline python structured-data validation
Last synced: 26 Feb 2025