Projects in Awesome Lists tagged with llamacpp
A curated list of projects in awesome lists tagged with llamacpp .
https://github.com/khoj-ai/khoj
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
agent ai assistant chat chatgpt emacs image-generation llama3 llamacpp llm obsidian obsidian-md offline-llm productivity rag research self-hosted semantic-search stt whatsapp-ai
Last synced: 12 May 2025
https://github.com/menloresearch/jan
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer
electron gpt llama2 llamacpp localai self-hosted
Last synced: 12 May 2025
https://github.com/janhq/jan
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)
electron gpt llama2 llamacpp localai self-hosted
Last synced: 17 Mar 2025
https://github.com/llmware-ai/llmware
Unified framework for building enterprise RAG pipelines with small, specialized models
agents generative-ai-tools llamacpp llm onnx openvino parsing retrieval-augmented-generation small-specialized-models
Last synced: 12 May 2025
https://github.com/getumbrel/llama-gpt
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!
ai chatgpt code-llama codellama gpt gpt-4 gpt4all llama llama-2 llama-cpp llama2 llamacpp llm localai openai self-hosted
Last synced: 13 May 2025
https://github.com/reorproject/reor
Private & local AI personal knowledge management app for high entropy people.
ai lancedb llama llamacpp local-first markdown note-taking ollama pkm rag second-brain vector-database
Last synced: 12 May 2025
https://github.com/xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
artificial-intelligence chatglm deployment flan-t5 gemma ggml glm4 inference llama llama3 llamacpp llm machine-learning mistral openai-api pytorch qwen vllm whisper wizardlm
Last synced: 13 May 2025
https://llmware-ai.github.io/llmware/
Unified framework for building enterprise RAG pipelines with small, specialized models
agents generative-ai-tools llamacpp llm parsing retrieval-augmented-generation small-specialized-models vector-db
Last synced: 16 Jan 2025
https://github.com/johnsnowlabs/spark-nlp
State of the Art Natural Language Processing
bert entity-extraction language-detection lemmatizer llamacpp llm machine-translation named-entity-recognition natural-language-processing nlp onnx part-of-speech-tagger pyspark question-answering sentiment-analysis spark spell-checker tensorflow text-classification transformers
Last synced: 13 May 2025
https://github.com/JohnSnowLabs/spark-nlp
State of the Art Natural Language Processing
bert entity-extraction language-detection lemmatizer llamacpp llm machine-translation named-entity-recognition natural-language-processing nlp onnx part-of-speech-tagger pyspark question-answering sentiment-analysis spark spell-checker tensorflow text-classification transformers
Last synced: 07 Apr 2025
https://github.com/gptme/gptme
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.
ai-agents ai-assistant anthropic chatbot chatgpt cli code-generation llamacpp llm llm-agent llm-apps openai openrouter rag
Last synced: 13 May 2025
https://github.com/lostruins/koboldcpp
A simple one-file way to run various GGML and GGUF models with KoboldAI's UI
Last synced: 20 Jan 2025
https://github.com/twinnydotdev/twinny
The most no-nonsense, locally or API-hosted AI code completion plugin for Visual Studio Code - like GitHub Copilot but 100% free.
artificial-intelligence code-chat code-completion code-generation codellama copilot free llama2 llamacpp ollama ollama-api ollama-chat private symmetry vscode-extension
Last synced: 29 Apr 2025
https://github.com/erikbjare/gptme
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.
ai-agents ai-assistant anthropic chatbot chatgpt cli code-generation llamacpp llm llm-agent llm-apps openai openrouter rag
Last synced: 04 Mar 2025
https://github.com/scisharp/llamasharp
A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.
chatbot gpt llama llama-cpp llama2 llama3 llamacpp llava llm multi-modal semantic-kernel
Last synced: 14 May 2025
https://github.com/SciSharp/LLamaSharp
A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.
chatbot gpt llama llama-cpp llama2 llama3 llamacpp llava llm multi-modal semantic-kernel
Last synced: 24 Mar 2025
https://github.com/josh-xt/agixt
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
agent-llm agi agixt ai artificial automation chromadb intelligence llama llamacpp llm llmops openai python
Last synced: 12 May 2025
https://github.com/silasmarvin/lsp-ai
LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them.
ai auto-completion developer-tools ide language-client llama llamacpp llm lsp mistral openai self-hosted
Last synced: 13 May 2025
https://github.com/gpustack/gpustack
Manage GPU clusters for running AI models
ascend cuda deepseek distributed distributed-inference genai ggml inference llama llamacpp llm llm-inference llm-serving maas metal mindie openai qwen rocm vllm
Last synced: 15 May 2025
https://github.com/menloresearch/cortex.cpp
Local AI API Platform
gguf llamacpp onnx onnxruntime
Last synced: 14 May 2025
https://josh-xt.github.io/AGiXT/
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
agent-llm agi agixt ai artificial automation chromadb intelligence llama llamacpp llm llmops openai python
Last synced: 16 Jan 2025
https://github.com/SilasMarvin/lsp-ai
LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them.
ai auto-completion developer-tools ide language-client llama llamacpp llm lsp mistral openai self-hosted
Last synced: 26 Mar 2025
https://github.com/Josh-XT/AGiXT
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
agent-llm agi agixt ai artificial automation chromadb intelligence llama llamacpp llm llmops openai python
Last synced: 24 Mar 2025
https://github.com/janhq/cortex.cpp
Local AI API Platform
gguf llamacpp onnx onnxruntime tensorrt-llm
Last synced: 13 Mar 2025
https://github.com/janhq/nitro
Local AI API Platform
gguf llamacpp onnx onnxruntime tensorrt-llm
Last synced: 08 Mar 2025
https://github.com/mobile-artificial-intelligence/maid
Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.
android android-ai chatbot chatgpt facebook flutter free-chatgpt gguf large-language-models llama llama-cpp llama2 llamacpp local-ai mistral mobile-ai mobile-artificial-intelligence ollama openai openorca
Last synced: 11 Apr 2025
https://github.com/floneum/floneum
Instant, controllable, local pre-trained AI models in Rust
ai candle constrained-generation dioxus floneum-v3 kalosm llama llamacpp llm mistral rust transcription whisper
Last synced: 13 May 2025
https://github.com/alexpinel/dot
Text-To-Speech, RAG, and LLMs. All local!
document-chat embeddings faiss langchain llamacpp llm local phi-3 privategpt rag self-hosted standalone standalone-app tts whisper-cpp
Last synced: 16 May 2025
https://github.com/Mobile-Artificial-Intelligence/maid
Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.
android android-ai chatbot chatgpt facebook flutter free-chatgpt gguf large-language-models llama llama-cpp llama2 llamacpp local-ai mistral mobile-ai mobile-artificial-intelligence ollama openai openorca
Last synced: 24 Mar 2025
https://github.com/alexpinel/Dot
Text-To-Speech, RAG, and LLMs. All local!
document-chat embeddings faiss langchain llamacpp llm local phi-3 privategpt rag self-hosted standalone standalone-app tts whisper-cpp
Last synced: 24 Mar 2025
https://github.com/containers/ramalama
The goal of RamaLama is to make working with AI boring.
ai containers inference-server llamacpp llm podman vllm
Last synced: 13 May 2025
https://github.com/alexrozanski/llamachat
Chat with your favourite LLaMA models in a native macOS app
ai llama llamacpp machine-learning macos swift swiftui
Last synced: 16 May 2025
https://github.com/alexrozanski/LlamaChat
Chat with your favourite LLaMA models in a native macOS app
ai llama llamacpp machine-learning macos swift swiftui
Last synced: 24 Mar 2025
https://github.com/rahulschand/gpu_poor
Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
ggml gpu huggingface language-model llama llama2 llamacpp llm pytorch quantization
Last synced: 14 May 2025
https://github.com/vercel/modelfusion
The TypeScript library for building AI applications.
ai artificial-intelligence chatbot claude dall-e embedding gpt-3 huggingface javascript js llamacpp llm mistral multi-modal ollama openai stable-diffusion ts typescript whisper
Last synced: 15 May 2025
https://github.com/lgrammel/ai-utils.js
The TypeScript library for building AI applications.
ai artificial-intelligence chatbot claude dall-e embedding gpt-3 huggingface javascript js llamacpp llm mistral multi-modal ollama openai stable-diffusion ts typescript whisper
Last synced: 03 Mar 2025
https://github.com/RahulSChand/gpu_poor
Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
ggml gpu huggingface language-model llama llama2 llamacpp llm pytorch quantization
Last synced: 17 Apr 2025
https://github.com/Dicklesworthstone/swiss_army_llama
A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.
embedding-similarity embedding-vectors embeddings llama2 llamacpp semantic-search
Last synced: 09 Apr 2025
https://github.com/dicklesworthstone/swiss_army_llama
A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.
embedding-similarity embedding-vectors embeddings llama2 llamacpp semantic-search
Last synced: 15 May 2025
https://github.com/awaescher/ollamasharp
The easiest way to use the Ollama API in .NET
ai gpt ichatclient llama llamacpp llm localllama microsoft-extensions-ai ollama ollama-api streaming
Last synced: 13 May 2025
https://github.com/awaescher/OllamaSharp
The easiest way to use the Ollama API in .NET
ai gpt ichatclient llama llamacpp llm localllama microsoft-extensions-ai ollama ollama-api streaming
Last synced: 30 Mar 2025
https://github.com/Atome-FE/llama-node
Believe in AI democratization. llama for nodejs backed by llama-rs, llama.cpp and rwkv.cpp, work locally on your laptop CPU. support llama/alpaca/gpt4all/vicuna/rwkv model.
ai embeddings gpt langchain large-language-models llama llama-node llama-rs llamacpp llm napi napi-rs nodejs rwkv
Last synced: 14 Apr 2025
https://github.com/atome-fe/llama-node
Believe in AI democratization. llama for nodejs backed by llama-rs, llama.cpp and rwkv.cpp, work locally on your laptop CPU. support llama/alpaca/gpt4all/vicuna/rwkv model.
ai embeddings gpt langchain large-language-models llama llama-node llama-rs llamacpp llm napi napi-rs nodejs rwkv
Last synced: 30 Mar 2025
https://github.com/huggingface/llm-ls
LSP server leveraging LLMs for code completion (and more?)
ai code-generation huggingface ide llamacpp llm lsp lsp-server openai self-hosted
Last synced: 16 May 2025
https://github.com/mukel/llama3.java
Practical Llama 3 inference in Java
chatgpt genai gguf huggingface java llama llama3 llamacpp llm llm-inference llms openai simd transformers
Last synced: 15 May 2025
https://github.com/ngxson/wllama
WebAssembly binding for llama.cpp - Enabling on-browser LLM inference
llama llamacpp llm wasm webassembly
Last synced: 13 Apr 2025
https://github.com/if-ai/comfyui-if_ai_tools
ComfyUI-IF_AI_tools is a set of custom nodes for ComfyUI that allows you to generate prompts using a local Large Language Model (LLM) via Ollama. This tool enables you to enhance your image generation workflow by leveraging the power of language models.
anthropic comfyui flux gemini graphrag groq koboldcpp llamacpp lmstudio mistral ocr ollama omost rag stable-diffusion supervision textgeneration transformers xai
Last synced: 15 May 2025
https://github.com/xnul/code-llama-for-vscode
Use Code Llama with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.
assistant code code-llama codellama continue continuedev copilot llama llama2 llamacpp llm local meta ollama studio visual vscode
Last synced: 04 Apr 2025
https://github.com/xNul/code-llama-for-vscode
Use Code Llama with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.
assistant code code-llama codellama continue continuedev copilot llama llama2 llamacpp llm local meta ollama studio visual vscode
Last synced: 07 Apr 2025
https://github.com/maximilian-winter/llama-cpp-agent
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM models, execute structured function calls and get structured output. Works also with models not fine-tuned to JSON output and function calls.
agents function-calling llamacpp llm llm-agent llm-framework llms parallel-function-call
Last synced: 15 May 2025
https://github.com/distantmagic/paddler
Stateful load balancer custom-tailored for llama.cpp
ai llamacpp llm llmops load-balancer
Last synced: 15 May 2025
https://github.com/if-ai/ComfyUI-IF_AI_tools
ComfyUI-IF_AI_tools is a set of custom nodes for ComfyUI that allows you to generate prompts using a local Large Language Model (LLM) via Ollama. This tool enables you to enhance your image generation workflow by leveraging the power of language models.
anthropic comfyui flux gemini graphrag groq koboldcpp llamacpp lmstudio mistral ocr ollama omost rag stable-diffusion supervision textgeneration transformers xai
Last synced: 19 Dec 2024
https://github.com/mostlygeek/llama-swap
Model swapping for llama.cpp (or any local OpenAPI compatible server)
golang llama llamacpp localllama localllm openai openai-api vllm
Last synced: 11 Apr 2025
https://github.com/lxe/llavavision
A simple "Be My Eyes" web app with a llama.cpp/llava backend
ai artificial-intelligence computer-vision llama llamacpp llm local-llm machine-learning multimodal webapp
Last synced: 05 Apr 2025
https://github.com/Maximilian-Winter/llama-cpp-agent
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM models, execute structured function calls and get structured output. Works also with models not fine-tuned to JSON output and function calls.
agents function-calling llamacpp llm llm-agent llm-framework llms parallel-function-call
Last synced: 09 Apr 2025
https://github.com/fynnfluegge/codeqai
Local first semantic code search and chat | Leverage custom copilots with fine-tuning datasets from code in Alpaca, Conversational, Completion and Instruction format
codellama faiss gpt huggingface langchain llama2 llamacpp llm ollama openai sentence-transformers
Last synced: 15 May 2025
https://github.com/kelindar/search
Go library for embedded vector search and semantic embeddings using llama.cpp
ai bert embeddings gguf gpu llamacpp search-engine semantic-search simd vector-search
Last synced: 16 May 2025
https://github.com/Fuzzy-Search/realtime-bakllava
llama.cpp with BakLLaVA model describes what does it see
bakllavva cpp demo-application inference llama llamacpp llm
Last synced: 17 Apr 2025
https://github.com/fuzzy-search/realtime-bakllava
llama.cpp with BakLLaVA model describes what does it see
bakllavva cpp demo-application inference llama llamacpp llm
Last synced: 06 Apr 2025
https://github.com/morpheuslord/hackbot
AI-powered cybersecurity chatbot designed to provide helpful and accurate answers to your cybersecurity-related queries and also do code analysis and scan analysis.
ai automation chatbot cli-chat-app cybersecurity cybersecurity-education cybersecurity-tools llama-api llama2 llama2-7b llamacpp llm-inference runpod
Last synced: 16 May 2025
https://github.com/vicuna-tools/vicuna-installation-guide
The "vicuna-installation-guide" provides step-by-step instructions for installing and configuring Vicuna 13 and 7B
large-language-models llamacpp llm vicuna vicuna-installation-guide
Last synced: 07 Apr 2025
https://github.com/andrewkchan/yalm
Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O
cpp cuda inference-engine llama llamacpp llm llm-inference machine-learning mistral
Last synced: 12 Apr 2025
https://github.com/brutalcoding/aub.ai
AubAI brings you on-device gen-AI capabilities, including offline text generation and more, directly within your app.
android dart flutter gemini gemini-nano gen-ai genai indiedev ios ipados linux llamacpp localllama macos mistral-7b native-apps nlp on-device on-device-ai pubdev
Last synced: 09 Apr 2025
https://github.com/staghado/vit.cpp
Inference Vision Transformer (ViT) in plain C/C++ with ggml
ai c computer-vision cpp cpu edge-computing ggml image-classification llamacpp vision-transformer whisper-cpp
Last synced: 07 Apr 2025
https://github.com/joone/loz
Loz is a command-line tool that enables your preferred LLM to execute system commands and utilize Unix pipes, integrating AI capabilities with other Unix tools.
automation cli codellama git gpt llama2 llamacpp llm nodejs ollama openai-api typescript
Last synced: 16 May 2025
https://github.com/shubham0204/smolchat-android
Running any GGUF SLMs/LLMs locally, on-device in Android
android cpp ggml kotlin llamacpp small-language-models
Last synced: 13 Apr 2025
https://github.com/inferflow/inferflow
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
baichuan2 bloom deepseek falcon gemma internlm llama2 llamacpp llm-inference m2m100 minicpm mistral mixtral mixture-of-experts model-quantization moe multi-gpu-inference phi-2 qwen
Last synced: 07 Apr 2025
https://github.com/su77ungr/casalioy
♾️ toolkit for air-gapped LLMs on consumer-grade hardware
langchain llamacpp llm qdrant question-answering
Last synced: 08 Apr 2025
https://github.com/Genta-Technology/Kolosal
Kolosal AI is an OpenSource and Lightweight alternative to LM Studio to run LLMs 100% offline on your device.
c cpp deepseek gemma gemma2 gemma3 gpt llama llama2 llama3 llamacpp llava llm llms localai mistral phi3 phi4 qwen self-hosted
Last synced: 04 May 2025
https://github.com/OEvortex/Webscout
Webscout is the all-in-one search and AI toolkit you need. Discover insights with Yep.com, DuckDuckGo, and Phind; access cutting-edge AI models; transcribe YouTube videos; generate temporary emails and phone numbers; perform text-to-speech conversions; and much more!
ai chatgpt-free deepseek-r1 free freeai freegpt4 gguf llamacpp ml openai openinterpreter python tempnumber text-generation websearch youtube youtube-api
Last synced: 13 May 2025
https://github.com/nuked88/comfyui-n-nodes
A suite of custom nodes for ConfyUI that includes GPT text-prompt generation, LoadVideo, SaveVideo, LoadFramesFromFolder and FrameInterpolator
comfyui gpt llama llamacpp loadvideo savevideo stablediffusion videonode
Last synced: 07 Apr 2025
https://github.com/oevortex/webscout
Webscout is the all-in-one search and AI toolkit you need. Discover insights with Yep.com, DuckDuckGo, and Phind; access cutting-edge AI models; transcribe YouTube videos; generate temporary emails and phone numbers; perform text-to-speech conversions; and much more!
ai api free freeai g4f gguf llamacpp localgpt ml ollama openai openinterpreter python tempmail tempnumber text-generation websearch youtube youtube-api
Last synced: 05 Apr 2025
https://github.com/Nuked88/ComfyUI-N-Nodes
A suite of custom nodes for ConfyUI that includes GPT text-prompt generation, LoadVideo, SaveVideo, LoadFramesFromFolder and FrameInterpolator
comfyui gpt llama llamacpp loadvideo savevideo stablediffusion videonode
Last synced: 19 Dec 2024
https://github.com/nerve-sparks/iris_android
IRIS is an android app for interfacing with GGUF / llama.cpp models locally.
ai android huggingface llama llamacpp llm local
Last synced: 23 Apr 2025
https://github.com/mrdbourke/mac-ml-speed-test
A few quick scripts focused on testing TensorFlow/PyTorch/Llama 2 on macOS.
apple-silicon benchmark deep-learning llama2 llamacpp llm m1 m1-mac m2-mac m3-mac machine-learning macos metal metal-performance-shaders ml mps pytorch speedtest tensorflow2
Last synced: 10 Apr 2025
https://github.com/austin-starks/promptimizer
An Automated AI-Powered Prompt Optimization Framework
ai anthropic aritificial-intelligence genetic-algorithm large-language-model llama3 llama3-1 llamacpp llm machine-learning mongodb ollama open-source openai optimization prompt-engineering prompt-management
Last synced: 09 May 2025
https://github.com/genta-technology/kolosal
Kolosal AI is an OpenSource and Lightweight alternative to LM Studio to run LLMs 100% offline on your device.
c cpp deepseek gemma gemma2 gemma3 gpt llama llama2 llama3 llamacpp llava llm llms localai mistral phi3 phi4 qwen self-hosted
Last synced: 09 Apr 2025
https://github.com/nekomeowww/ollama-operator
🚢 Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫
ai kubernetes kubernetes-operators llama llamacpp llm ollama
Last synced: 04 Apr 2025
https://github.com/adriankhl/godot-llm
LLM in Godot
game-development gamedev gdextension godot godot-engine godotengine llamacpp llm-inference
Last synced: 11 Apr 2025
https://github.com/jabberjabberjabber/imageindexer
Creates an index of images, queries a local LLM and adds tags to the image metadata
ai dataset-generation exif-metadata exiftool image-classification image-processing image-recognition keywords koboldcpp large-language-models llamacpp local multimodal tags
Last synced: 06 Apr 2025
https://github.com/1b5d/llm-api
Run any Large Language Model behind a unified API
chatgpt gptq huggingface langchain llama llamacpp llm llm-inference machine-learning python
Last synced: 24 Jan 2025
https://github.com/austin-starks/Promptimizer
An Automated AI-Powered Prompt Optimization Framework
ai anthropic aritificial-intelligence genetic-algorithm large-language-model llama3 llama3-1 llamacpp llm machine-learning mongodb ollama open-source openai optimization prompt-engineering prompt-management
Last synced: 07 Dec 2024
https://github.com/eliranwong/toolmate
ToolMate AI, developed by Eliran Wong, is a cutting-edge AI companion that seamlessly integrates agents, tools, and plugins to excel in conversations, generative work, and task execution. Supports custom workflow and plugins to automate multi-step actions.
agent ai autogen chatgpt claude dalle-3 fabric gemini google grok groq imagen-3 llama3 llamacpp mistral ollama openai tool vision xai
Last synced: 16 May 2025
https://github.com/iohub/collama
VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.
ai-assistant code-generation cody copilot llama2 llamacpp vscode-extension
Last synced: 20 Jan 2025
https://github.com/chenhunghan/ialacol
🪶 Lightweight OpenAI drop-in replacement for Kubernetes
ai cloudnative cuda ggml gptq gpu helm kubernetes langchain llamacpp llm llm-inference llm-serving openai python
Last synced: 20 Jan 2025
https://github.com/oe-lucifer/webscout
Search for anything using Google, DuckDuckGo, phind.com, Contains AI models, can transcribe yt videos, temporary email and phone number generation, has TTS support, webai (terminal gpt and open interpreter) and offline LLMs
ai api free freeai g4f gguf llamacpp localgpt ml ollama openai openinterpreter python tempmail tempnumber text-generation tgpt websearch youtube youtube-api
Last synced: 28 Mar 2025
https://github.com/OE-LUCIFER/Webscout
Search for anything using Google, DuckDuckGo, phind.com, Contains AI models, can transcribe yt videos, temporary email and phone number generation, has TTS support, webai (terminal gpt and open interpreter) and offline LLMs
ai api free freeai g4f gguf llamacpp localgpt ml ollama openai openinterpreter python tempmail tempnumber text-generation tgpt websearch youtube youtube-api
Last synced: 06 Mar 2025
https://github.com/zatevakhin/obsidian-local-llm
Obsidian Local LLM is a plugin for Obsidian that provides access to a powerful neural network, allowing users to generate text in a wide range of styles and formats using a local LLM.
ggml llama llamacpp obsidian-md obsidian-plugin
Last synced: 24 Jan 2025
https://github.com/sinanuozdemir/oreilly-hands-on-gpt-llm
Mastering the Art of Scalable and Efficient AI Model Deployment
deepseek distillation docker gguf gpt groq k8s llamacpp llm mlops quantization
Last synced: 04 Apr 2025
https://github.com/cactus-compute/cactus
Framework for AI on mobile devices and wearables, hardware-aware C/CPP backend, with wrappers for Kotlin, Java, Swift, React, Flutter.
android dart flutter framework ios java javascript kotlin library llamacpp llm llm-inference llms objective-c react-native swift transformer transformers typescript
Last synced: 07 May 2025
https://github.com/iongpt/llm-for-whatsapp
WhatsApp auto responder with LLM integration. It support OpenAI API and also local LLMs
autoanswer huggingface llamacpp llm ollama openai openai-chatgpt whatsapp whatsapp-bot
Last synced: 05 Apr 2025
https://github.com/nuance1979/llama-server
LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.
chatbot-ui llama llama-cpp llamacpp
Last synced: 12 May 2025
https://github.com/xnul/chat-llama-discord-bot
A Discord Bot for chatting with LLaMA, Vicuna, Alpaca, MPT, or any other Large Language Model (LLM) supported by text-generation-webui or llama.cpp.
alpaca bot chat chat-bot chatbot chatgpt chatllama discord gpt-4 gpt4 large-language-model large-language-models llama llamacpp llm text-generation-webui vicuna
Last synced: 24 Jan 2025