Projects in Awesome Lists tagged with llamacpp

https://github.com/khoj-ai/khoj

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

agent ai assistant chat chatgpt emacs image-generation llama3 llamacpp llm obsidian obsidian-md offline-llm productivity rag research self-hosted semantic-search stt whatsapp-ai

Last synced: 12 May 2025

https://github.com/menloresearch/jan

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer

electron gpt llama2 llamacpp localai self-hosted

Last synced: 12 May 2025

https://github.com/janhq/jan

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)

electron gpt llama2 llamacpp localai self-hosted

Last synced: 17 Mar 2025

https://github.com/llmware-ai/llmware

Unified framework for building enterprise RAG pipelines with small, specialized models

agents generative-ai-tools llamacpp llm onnx openvino parsing retrieval-augmented-generation small-specialized-models

Last synced: 12 May 2025

https://github.com/getumbrel/llama-gpt

A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!

ai chatgpt code-llama codellama gpt gpt-4 gpt4all llama llama-2 llama-cpp llama2 llamacpp llm localai openai self-hosted

Last synced: 13 May 2025

https://github.com/reorproject/reor

Private & local AI personal knowledge management app for high entropy people.

ai lancedb llama llamacpp local-first markdown note-taking ollama pkm rag second-brain vector-database

Last synced: 12 May 2025

https://github.com/xorbitsai/inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

artificial-intelligence chatglm deployment flan-t5 gemma ggml glm4 inference llama llama3 llamacpp llm machine-learning mistral openai-api pytorch qwen vllm whisper wizardlm

Last synced: 13 May 2025

https://github.com/LostRuins/koboldcpp

Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

gemma ggml gguf koboldai koboldcpp language-model llama llamacpp llm mistral

Last synced: 23 Mar 2025

https://llmware-ai.github.io/llmware/

Unified framework for building enterprise RAG pipelines with small, specialized models

agents generative-ai-tools llamacpp llm parsing retrieval-augmented-generation small-specialized-models vector-db

Last synced: 16 Jan 2025

https://github.com/serge-chat/serge

A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.

alpaca docker fastapi llama llamacpp nginx python svelte sveltekit tailwindcss web

Last synced: 23 Apr 2025

https://github.com/johnsnowlabs/spark-nlp

State of the Art Natural Language Processing

bert entity-extraction language-detection lemmatizer llamacpp llm machine-translation named-entity-recognition natural-language-processing nlp onnx part-of-speech-tagger pyspark question-answering sentiment-analysis spark spell-checker tensorflow text-classification transformers

Last synced: 13 May 2025

https://github.com/JohnSnowLabs/spark-nlp

State of the Art Natural Language Processing

bert entity-extraction language-detection lemmatizer llamacpp llm machine-translation named-entity-recognition natural-language-processing nlp onnx part-of-speech-tagger pyspark question-answering sentiment-analysis spark spell-checker tensorflow text-classification transformers

Last synced: 07 Apr 2025

https://github.com/gptme/gptme

Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.

ai-agents ai-assistant anthropic chatbot chatgpt cli code-generation llamacpp llm llm-agent llm-apps openai openrouter rag

Last synced: 13 May 2025

https://github.com/lostruins/koboldcpp

A simple one-file way to run various GGML and GGUF models with KoboldAI's UI

koboldcpp llamacpp llm

Last synced: 20 Jan 2025

https://github.com/twinnydotdev/twinny

The most no-nonsense, locally or API-hosted AI code completion plugin for Visual Studio Code - like GitHub Copilot but 100% free.

artificial-intelligence code-chat code-completion code-generation codellama copilot free llama2 llamacpp ollama ollama-api ollama-chat private symmetry vscode-extension

Last synced: 29 Apr 2025

https://github.com/erikbjare/gptme

Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.

ai-agents ai-assistant anthropic chatbot chatgpt cli code-generation llamacpp llm llm-agent llm-apps openai openrouter rag

Last synced: 04 Mar 2025

https://github.com/scisharp/llamasharp

A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

chatbot gpt llama llama-cpp llama2 llama3 llamacpp llava llm multi-modal semantic-kernel

Last synced: 14 May 2025

https://github.com/SciSharp/LLamaSharp

A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

chatbot gpt llama llama-cpp llama2 llama3 llamacpp llava llm multi-modal semantic-kernel

Last synced: 24 Mar 2025

https://github.com/josh-xt/agixt

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.

agent-llm agi agixt ai artificial automation chromadb intelligence llama llamacpp llm llmops openai python

Last synced: 12 May 2025

https://github.com/silasmarvin/lsp-ai

LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them.

ai auto-completion developer-tools ide language-client llama llamacpp llm lsp mistral openai self-hosted

Last synced: 13 May 2025

https://github.com/gpustack/gpustack

Manage GPU clusters for running AI models

ascend cuda deepseek distributed distributed-inference genai ggml inference llama llamacpp llm llm-inference llm-serving maas metal mindie openai qwen rocm vllm

Last synced: 15 May 2025

https://github.com/menloresearch/cortex.cpp

Local AI API Platform

gguf llamacpp onnx onnxruntime

Last synced: 14 May 2025

https://josh-xt.github.io/AGiXT/

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.

agent-llm agi agixt ai artificial automation chromadb intelligence llama llamacpp llm llmops openai python

Last synced: 16 Jan 2025

https://github.com/SilasMarvin/lsp-ai

LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them.

ai auto-completion developer-tools ide language-client llama llamacpp llm lsp mistral openai self-hosted

Last synced: 26 Mar 2025

https://github.com/Josh-XT/AGiXT

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.

agent-llm agi agixt ai artificial automation chromadb intelligence llama llamacpp llm llmops openai python

Last synced: 24 Mar 2025

https://github.com/janhq/cortex.cpp

Local AI API Platform

gguf llamacpp onnx onnxruntime tensorrt-llm

Last synced: 13 Mar 2025

https://github.com/janhq/nitro

Local AI API Platform

gguf llamacpp onnx onnxruntime tensorrt-llm

Last synced: 08 Mar 2025

https://github.com/mobile-artificial-intelligence/maid

Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.

android android-ai chatbot chatgpt facebook flutter free-chatgpt gguf large-language-models llama llama-cpp llama2 llamacpp local-ai mistral mobile-ai mobile-artificial-intelligence ollama openai openorca

Last synced: 11 Apr 2025

https://github.com/floneum/floneum

Instant, controllable, local pre-trained AI models in Rust

ai candle constrained-generation dioxus floneum-v3 kalosm llama llamacpp llm mistral rust transcription whisper

Last synced: 13 May 2025

https://github.com/alexpinel/dot

Text-To-Speech, RAG, and LLMs. All local!

document-chat embeddings faiss langchain llamacpp llm local phi-3 privategpt rag self-hosted standalone standalone-app tts whisper-cpp

Last synced: 16 May 2025

https://github.com/Mobile-Artificial-Intelligence/maid

Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.

android android-ai chatbot chatgpt facebook flutter free-chatgpt gguf large-language-models llama llama-cpp llama2 llamacpp local-ai mistral mobile-ai mobile-artificial-intelligence ollama openai openorca

Last synced: 24 Mar 2025

https://github.com/alexpinel/Dot

Text-To-Speech, RAG, and LLMs. All local!

document-chat embeddings faiss langchain llamacpp llm local phi-3 privategpt rag self-hosted standalone standalone-app tts whisper-cpp

Last synced: 24 Mar 2025

https://github.com/containers/ramalama

The goal of RamaLama is to make working with AI boring.

ai containers inference-server llamacpp llm podman vllm

Last synced: 13 May 2025

https://github.com/alexrozanski/llamachat

Chat with your favourite LLaMA models in a native macOS app

ai llama llamacpp machine-learning macos swift swiftui

Last synced: 16 May 2025

https://github.com/alexrozanski/LlamaChat

Chat with your favourite LLaMA models in a native macOS app

ai llama llamacpp machine-learning macos swift swiftui

Last synced: 24 Mar 2025

https://github.com/rahulschand/gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

ggml gpu huggingface language-model llama llama2 llamacpp llm pytorch quantization

Last synced: 14 May 2025

https://github.com/vercel/modelfusion

The TypeScript library for building AI applications.

ai artificial-intelligence chatbot claude dall-e embedding gpt-3 huggingface javascript js llamacpp llm mistral multi-modal ollama openai stable-diffusion ts typescript whisper

Last synced: 15 May 2025

https://github.com/lgrammel/ai-utils.js

The TypeScript library for building AI applications.

ai artificial-intelligence chatbot claude dall-e embedding gpt-3 huggingface javascript js llamacpp llm mistral multi-modal ollama openai stable-diffusion ts typescript whisper

Last synced: 03 Mar 2025

https://github.com/RahulSChand/gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

ggml gpu huggingface language-model llama llama2 llamacpp llm pytorch quantization

Last synced: 17 Apr 2025

https://github.com/Dicklesworthstone/swiss_army_llama

A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.

Last synced: 09 Apr 2025

https://github.com/dicklesworthstone/swiss_army_llama

A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.

Last synced: 15 May 2025

https://github.com/awaescher/ollamasharp

The easiest way to use the Ollama API in .NET

ai gpt ichatclient llama llamacpp llm localllama microsoft-extensions-ai ollama ollama-api streaming

Last synced: 13 May 2025

https://github.com/awaescher/OllamaSharp

The easiest way to use the Ollama API in .NET

ai gpt ichatclient llama llamacpp llm localllama microsoft-extensions-ai ollama ollama-api streaming

Last synced: 30 Mar 2025

https://github.com/Atome-FE/llama-node

Believe in AI democratization. llama for nodejs backed by llama-rs, llama.cpp and rwkv.cpp, work locally on your laptop CPU. support llama/alpaca/gpt4all/vicuna/rwkv model.

ai embeddings gpt langchain large-language-models llama llama-node llama-rs llamacpp llm napi napi-rs nodejs rwkv

Last synced: 14 Apr 2025

https://github.com/atome-fe/llama-node

Believe in AI democratization. llama for nodejs backed by llama-rs, llama.cpp and rwkv.cpp, work locally on your laptop CPU. support llama/alpaca/gpt4all/vicuna/rwkv model.

ai embeddings gpt langchain large-language-models llama llama-node llama-rs llamacpp llm napi napi-rs nodejs rwkv

Last synced: 30 Mar 2025

https://github.com/huggingface/llm-ls

LSP server leveraging LLMs for code completion (and more?)

ai code-generation huggingface ide llamacpp llm lsp lsp-server openai self-hosted

Last synced: 16 May 2025

https://github.com/mukel/llama3.java

Practical Llama 3 inference in Java

chatgpt genai gguf huggingface java llama llama3 llamacpp llm llm-inference llms openai simd transformers

Last synced: 15 May 2025

https://github.com/ngxson/wllama

WebAssembly binding for llama.cpp - Enabling on-browser LLM inference

llama llamacpp llm wasm webassembly

Last synced: 13 Apr 2025

https://github.com/if-ai/comfyui-if_ai_tools

ComfyUI-IF_AI_tools is a set of custom nodes for ComfyUI that allows you to generate prompts using a local Large Language Model (LLM) via Ollama. This tool enables you to enhance your image generation workflow by leveraging the power of language models.

anthropic comfyui flux gemini graphrag groq koboldcpp llamacpp lmstudio mistral ocr ollama omost rag stable-diffusion supervision textgeneration transformers xai

Last synced: 15 May 2025

https://github.com/xnul/code-llama-for-vscode

Use Code Llama with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.

assistant code code-llama codellama continue continuedev copilot llama llama2 llamacpp llm local meta ollama studio visual vscode

Last synced: 04 Apr 2025

https://github.com/xNul/code-llama-for-vscode

Use Code Llama with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.

assistant code code-llama codellama continue continuedev copilot llama llama2 llamacpp llm local meta ollama studio visual vscode

Last synced: 07 Apr 2025

https://github.com/maximilian-winter/llama-cpp-agent

The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM models, execute structured function calls and get structured output. Works also with models not fine-tuned to JSON output and function calls.

agents function-calling llamacpp llm llm-agent llm-framework llms parallel-function-call

Last synced: 15 May 2025

https://github.com/distantmagic/paddler

Stateful load balancer custom-tailored for llama.cpp

ai llamacpp llm llmops load-balancer

Last synced: 15 May 2025

https://github.com/if-ai/ComfyUI-IF_AI_tools

ComfyUI-IF_AI_tools is a set of custom nodes for ComfyUI that allows you to generate prompts using a local Large Language Model (LLM) via Ollama. This tool enables you to enhance your image generation workflow by leveraging the power of language models.

anthropic comfyui flux gemini graphrag groq koboldcpp llamacpp lmstudio mistral ocr ollama omost rag stable-diffusion supervision textgeneration transformers xai

Last synced: 19 Dec 2024

https://github.com/mostlygeek/llama-swap

Model swapping for llama.cpp (or any local OpenAPI compatible server)

golang llama llamacpp localllama localllm openai openai-api vllm

Last synced: 11 Apr 2025

https://github.com/pythops/tenere

🤖 TUI interface for LLMs written in Rust

chatgpt cli llamacpp llm ollama ratatui rust tui

Last synced: 14 May 2025

https://github.com/lxe/llavavision

A simple "Be My Eyes" web app with a llama.cpp/llava backend

ai artificial-intelligence computer-vision llama llamacpp llm local-llm machine-learning multimodal webapp

Last synced: 05 Apr 2025

https://github.com/Maximilian-Winter/llama-cpp-agent

The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM models, execute structured function calls and get structured output. Works also with models not fine-tuned to JSON output and function calls.

agents function-calling llamacpp llm llm-agent llm-framework llms parallel-function-call

Last synced: 09 Apr 2025

https://github.com/fynnfluegge/codeqai

Local first semantic code search and chat | Leverage custom copilots with fine-tuning datasets from code in Alpaca, Conversational, Completion and Instruction format

codellama faiss gpt huggingface langchain llama2 llamacpp llm ollama openai sentence-transformers

Last synced: 15 May 2025

https://github.com/kelindar/search

Go library for embedded vector search and semantic embeddings using llama.cpp

ai bert embeddings gguf gpu llamacpp search-engine semantic-search simd vector-search

Last synced: 16 May 2025

https://github.com/Fuzzy-Search/realtime-bakllava

llama.cpp with BakLLaVA model describes what does it see

bakllavva cpp demo-application inference llama llamacpp llm

Last synced: 17 Apr 2025

https://github.com/fuzzy-search/realtime-bakllava

llama.cpp with BakLLaVA model describes what does it see

bakllavva cpp demo-application inference llama llamacpp llm

Last synced: 06 Apr 2025

https://github.com/intel/neural-speed

An innovative library for efficient LLM inference via low-bit quantization

cpu fp4 fp8 gaudi2 gpu int1 int2 int3 int4 int5 int6 int7 int8 llamacpp llm-fine-tuning llm-inference low-bit mxformat nf4 sparsity

Last synced: 10 Feb 2025

https://github.com/morpheuslord/hackbot

AI-powered cybersecurity chatbot designed to provide helpful and accurate answers to your cybersecurity-related queries and also do code analysis and scan analysis.

ai automation chatbot cli-chat-app cybersecurity cybersecurity-education cybersecurity-tools llama-api llama2 llama2-7b llamacpp llm-inference runpod

Last synced: 16 May 2025

https://github.com/vicuna-tools/vicuna-installation-guide

The "vicuna-installation-guide" provides step-by-step instructions for installing and configuring Vicuna 13 and 7B

large-language-models llamacpp llm vicuna vicuna-installation-guide

Last synced: 07 Apr 2025

https://github.com/andrewkchan/yalm

Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O

cpp cuda inference-engine llama llamacpp llm llm-inference machine-learning mistral

Last synced: 12 Apr 2025

https://github.com/brutalcoding/aub.ai

AubAI brings you on-device gen-AI capabilities, including offline text generation and more, directly within your app.

android dart flutter gemini gemini-nano gen-ai genai indiedev ios ipados linux llamacpp localllama macos mistral-7b native-apps nlp on-device on-device-ai pubdev

Last synced: 09 Apr 2025

https://github.com/staghado/vit.cpp

Inference Vision Transformer (ViT) in plain C/C++ with ggml

ai c computer-vision cpp cpu edge-computing ggml image-classification llamacpp vision-transformer whisper-cpp

Last synced: 07 Apr 2025

https://github.com/joone/loz

Loz is a command-line tool that enables your preferred LLM to execute system commands and utilize Unix pipes, integrating AI capabilities with other Unix tools.

automation cli codellama git gpt llama2 llamacpp llm nodejs ollama openai-api typescript

Last synced: 16 May 2025

https://github.com/shubham0204/smolchat-android

Running any GGUF SLMs/LLMs locally, on-device in Android

android cpp ggml kotlin llamacpp small-language-models

Last synced: 13 Apr 2025

https://github.com/inferflow/inferflow

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).

baichuan2 bloom deepseek falcon gemma internlm llama2 llamacpp llm-inference m2m100 minicpm mistral mixtral mixture-of-experts model-quantization moe multi-gpu-inference phi-2 qwen

Last synced: 07 Apr 2025

https://github.com/su77ungr/casalioy

♾️ toolkit for air-gapped LLMs on consumer-grade hardware

langchain llamacpp llm qdrant question-answering

Last synced: 08 Apr 2025

https://github.com/Genta-Technology/Kolosal

Kolosal AI is an OpenSource and Lightweight alternative to LM Studio to run LLMs 100% offline on your device.

c cpp deepseek gemma gemma2 gemma3 gpt llama llama2 llama3 llamacpp llava llm llms localai mistral phi3 phi4 qwen self-hosted

Last synced: 04 May 2025

https://github.com/OEvortex/Webscout

Webscout is the all-in-one search and AI toolkit you need. Discover insights with Yep.com, DuckDuckGo, and Phind; access cutting-edge AI models; transcribe YouTube videos; generate temporary emails and phone numbers; perform text-to-speech conversions; and much more!

ai chatgpt-free deepseek-r1 free freeai freegpt4 gguf llamacpp ml openai openinterpreter python tempnumber text-generation websearch youtube youtube-api

Last synced: 13 May 2025

https://github.com/distantmagic/resonance

PHP Framework designed to build IO-intensive web applications.

ai framework llamacpp llm php swoole

Last synced: 16 May 2025

https://github.com/nuked88/comfyui-n-nodes

A suite of custom nodes for ConfyUI that includes GPT text-prompt generation, LoadVideo, SaveVideo, LoadFramesFromFolder and FrameInterpolator

comfyui gpt llama llamacpp loadvideo savevideo stablediffusion videonode

Last synced: 07 Apr 2025

https://github.com/oevortex/webscout

Webscout is the all-in-one search and AI toolkit you need. Discover insights with Yep.com, DuckDuckGo, and Phind; access cutting-edge AI models; transcribe YouTube videos; generate temporary emails and phone numbers; perform text-to-speech conversions; and much more!

ai api free freeai g4f gguf llamacpp localgpt ml ollama openai openinterpreter python tempmail tempnumber text-generation websearch youtube youtube-api

Last synced: 05 Apr 2025

https://github.com/Nuked88/ComfyUI-N-Nodes

A suite of custom nodes for ConfyUI that includes GPT text-prompt generation, LoadVideo, SaveVideo, LoadFramesFromFolder and FrameInterpolator

comfyui gpt llama llamacpp loadvideo savevideo stablediffusion videonode

Last synced: 19 Dec 2024

https://github.com/nerve-sparks/iris_android

IRIS is an android app for interfacing with GGUF / llama.cpp models locally.

ai android huggingface llama llamacpp llm local

Last synced: 23 Apr 2025

https://github.com/mrdbourke/mac-ml-speed-test

A few quick scripts focused on testing TensorFlow/PyTorch/Llama 2 on macOS.

apple-silicon benchmark deep-learning llama2 llamacpp llm m1 m1-mac m2-mac m3-mac machine-learning macos metal metal-performance-shaders ml mps pytorch speedtest tensorflow2

Last synced: 10 Apr 2025

https://github.com/austin-starks/promptimizer

An Automated AI-Powered Prompt Optimization Framework

ai anthropic aritificial-intelligence genetic-algorithm large-language-model llama3 llama3-1 llamacpp llm machine-learning mongodb ollama open-source openai optimization prompt-engineering prompt-management

Last synced: 09 May 2025

https://github.com/genta-technology/kolosal

Kolosal AI is an OpenSource and Lightweight alternative to LM Studio to run LLMs 100% offline on your device.

c cpp deepseek gemma gemma2 gemma3 gpt llama llama2 llama3 llamacpp llava llm llms localai mistral phi3 phi4 qwen self-hosted

Last synced: 09 Apr 2025

https://github.com/nekomeowww/ollama-operator

🚢 Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫

ai kubernetes kubernetes-operators llama llamacpp llm ollama

Last synced: 04 Apr 2025

https://github.com/adriankhl/godot-llm

LLM in Godot

game-development gamedev gdextension godot godot-engine godotengine llamacpp llm-inference

Last synced: 11 Apr 2025

https://github.com/jabberjabberjabber/imageindexer

Creates an index of images, queries a local LLM and adds tags to the image metadata

ai dataset-generation exif-metadata exiftool image-classification image-processing image-recognition keywords koboldcpp large-language-models llamacpp local multimodal tags

Last synced: 06 Apr 2025

https://github.com/1b5d/llm-api

Run any Large Language Model behind a unified API

chatgpt gptq huggingface langchain llama llamacpp llm llm-inference machine-learning python

Last synced: 24 Jan 2025

https://github.com/austin-starks/Promptimizer

An Automated AI-Powered Prompt Optimization Framework

ai anthropic aritificial-intelligence genetic-algorithm large-language-model llama3 llama3-1 llamacpp llm machine-learning mongodb ollama open-source openai optimization prompt-engineering prompt-management

Last synced: 07 Dec 2024

https://github.com/eliranwong/toolmate

ToolMate AI, developed by Eliran Wong, is a cutting-edge AI companion that seamlessly integrates agents, tools, and plugins to excel in conversations, generative work, and task execution. Supports custom workflow and plugins to automate multi-step actions.

agent ai autogen chatgpt claude dalle-3 fabric gemini google grok groq imagen-3 llama3 llamacpp mistral ollama openai tool vision xai

Last synced: 16 May 2025

https://github.com/iohub/collama

VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.

ai-assistant code-generation cody copilot llama2 llamacpp vscode-extension

Last synced: 20 Jan 2025

https://github.com/mgonzs13/llama_ros

llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2

cpp embeddings ggml gguf gpt langchain llama llamacpp llava llavacpp llm rerank reranking ros2 vlm

Last synced: 04 Apr 2025

https://github.com/chenhunghan/ialacol

🪶 Lightweight OpenAI drop-in replacement for Kubernetes

ai cloudnative cuda ggml gptq gpu helm kubernetes langchain llamacpp llm llm-inference llm-serving openai python

Last synced: 20 Jan 2025

https://github.com/oe-lucifer/webscout

Search for anything using Google, DuckDuckGo, phind.com, Contains AI models, can transcribe yt videos, temporary email and phone number generation, has TTS support, webai (terminal gpt and open interpreter) and offline LLMs

ai api free freeai g4f gguf llamacpp localgpt ml ollama openai openinterpreter python tempmail tempnumber text-generation tgpt websearch youtube youtube-api

Last synced: 28 Mar 2025

https://github.com/OE-LUCIFER/Webscout

Search for anything using Google, DuckDuckGo, phind.com, Contains AI models, can transcribe yt videos, temporary email and phone number generation, has TTS support, webai (terminal gpt and open interpreter) and offline LLMs

ai api free freeai g4f gguf llamacpp localgpt ml ollama openai openinterpreter python tempmail tempnumber text-generation tgpt websearch youtube youtube-api

Last synced: 06 Mar 2025

https://github.com/gotzmann/booster

Booster - open accelerator for LLM models. Better inference and debugging for AI hackers

chatgpt exllama ggml gpt llama llama-cpp llamacpp llm ollama oobabooga openai vllm

Last synced: 16 Feb 2025

https://github.com/zatevakhin/obsidian-local-llm

Obsidian Local LLM is a plugin for Obsidian that provides access to a powerful neural network, allowing users to generate text in a wide range of styles and formats using a local LLM.

ggml llama llamacpp obsidian-md obsidian-plugin

Last synced: 24 Jan 2025