An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with llama-cpp-python

A curated list of projects in awesome lists tagged with llama-cpp-python .

https://github.com/jasonacox/tinyllm

Setup and run a local LLM and Chatbot using consumer grade hardware.

artificial-intelligence chatbot large-language-models llama-cpp-python llm openai rag retrieval-augmented-generation vllm

Last synced: 06 Sep 2025

https://github.com/unixwzrd/oobabooga-macOS

Information on optimizing python libraries specifically for oobabooga to take advantage of Apple Silicon and Accelerate Framework.

blas journal llama-cpp-python macos numpy oobabooga pytorch

Last synced: 22 Jul 2025

https://github.com/mlc-delgado/pytldr-oss

An open source, Gradio-based chatbot app that combines the best of retrieval augmented generation and prompt engineering into an intelligent assistant for modern professionals.

cassandra docker docker-compose gradio llama-cpp-python llama2 python3

Last synced: 20 Aug 2025

https://github.com/laelhalawani/gguf_modeldb

A quick and optimized solution to manage llama based gguf quantized models, download gguf files, retreive messege formatting, add more models from hf repos and more. It's super easy to use and comes prepacked with best preconfigured open source models: dolphin phi-2 2.7b, mistral 7b v0.2, mixtral 8x7b v0.1, solar 10.7b and zephyr 3b

database hugginface inference llama llama-cpp-python llama2 llm model-database python3

Last synced: 08 Sep 2025

https://github.com/woheller69/llama_tk_chat

Simple chat interface for local AI using llama-cpp-python and llama-cpp-agent

gui llama-cpp-agent llama-cpp-python llm-inference

Last synced: 12 Apr 2025

https://github.com/renatoelho/llama-cpp-local

Llama.cpp é uma biblioteca desenvolvida em C++ para a implementação eficiente de grandes modelos de linguagem, como o LLaMA da Meta. Otimizada para rodar em diversas plataformas, incluindo dispositivos com recursos limitados, oferece performance, velocidade de inferência e uso eficiente da memória, essenciais para a execução de grandes. modelos

ia llama-cpp-python llama2 llama3 llms python shell-script

Last synced: 10 Sep 2025

https://github.com/prithivsakthiur/triangulum

Triangulum 10B: Multilingual Large Language Models (LLMs)

10b 1b 5b llama-cpp llama-cpp-python llm ollama text-generation

Last synced: 22 Feb 2025

https://github.com/wambugu71/offlinegpt-

Local gpt in llama.cpp models with chat interface

llama-cpp-python llamacpp openai python python3 streamlit

Last synced: 06 Sep 2025

https://github.com/perpendicularai/sekernel_for_llm_ui

This is the repository for the UI for the SeKernel_for_LLM module

chat database-management internet llama-cpp-python pyqt5 semantic-kernel

Last synced: 17 Mar 2025

https://github.com/testli-ai/outlines-llama-cpp-python-streaming-output

This repository demonstrates how to use outlines and llama-cpp-python for structured JSON generation with streaming output, integrating llama.cpp for local model inference and outlines for schema-based text generation.

gguf gguf-models llama-cpp llama-cpp-python llamacpp llamacpp-python outlines

Last synced: 06 Mar 2025

https://github.com/magnuss0/huginnhears

Huginn Hears is a local app that transcribes and summarizes your meetings in Norwegian and English, using state-of-the-art models and open-source libraries. No cloud needed, run everything offline.

faster-whisper langchain llama-cpp-python llamacpp local-llm norwegian speech-to-text

Last synced: 08 Apr 2025

https://github.com/serhaturtis/ai-blackshiftworkflow

A flexible Python framework for building complex workflows with LLM integration, robust error handling, and structured data processing.

agentic-ai agentic-workflow agents framework llama llama-cpp-python llm python workflow

Last synced: 18 Jun 2025

https://github.com/pchsu-hsupc/edge_ai_13th

This project optimizes the LLaMA-3.2B-Instruct model for fast inference on a single NVIDIA T4 GPU (16 GB), targeting high throughput and low perplexity for efficient edge deployment.

gguf llama-cpp-python llama3 lora

Last synced: 04 Jul 2025

https://github.com/dougeeai/llama-cpp-python-wheels

Pre-built wheels for llama-cpp-python across platforms and CUDA versions

ampere cuda cuda13 gguf llama-cpp-python llm machine-learning prebuilt python313 rtx3060 rtx3070 rtx3080 rtx3090 wheels windows

Last synced: 03 Nov 2025

https://github.com/serhaturtis/ai-flowlib

A Python framework for building structured, flow-based LLM applications with built-in pipeline management, model configuration, and validation capabilities.

ai async flow framework llama-cpp-python llm pipeline python structured-data validation

Last synced: 26 Feb 2025