An open API service indexing awesome lists of open source software.

Large Language Model

A large language model (LLM) is a type of machine learning model designed for understanding, generating, and interacting with human language. These models are trained on extensive datasets containing text from books, articles, websites, and other sources to learn patterns, context, and semantics in language. LLMs are widely used in applications like chatbots, code generation, translation, summarization, and more. They are often built using transformer architectures and are central to the field of generative AI.

https://github.com/liguodongiot/llm-action

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

llm llm-inference llm-serving llm-training llmops

Last synced: 15 May 2025

https://github.com/deepseek-ai/janus

Janus-Series: Unified Multimodal Understanding and Generation Models

any-to-any foundation-models llm multimodal unified-model vision-language-pretraining

Last synced: 14 May 2025

https://github.com/eosphoros-ai/db-gpt

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

agents bgi database deepseek gpt gpt-4 hacktoberfest llm private rag security vicuna

Last synced: 09 Sep 2025

https://github.com/pydantic/pydantic-ai

AI Agent Framework, the Pydantic way

agent-framework genai llm pydantic python

Last synced: 14 May 2026

https://github.com/1panel-dev/maxkb

💬 MaxKB is an open-source AI assistant for enterprise. It seamlessly integrates RAG pipelines, supports robust workflows, and provides MCP tool-use capabilities.

chatbot deepseek-r1 knowledgebase langchain llama3 llm maxkb mcp-server ollama pgvector qwen3 rag

Last synced: 01 Apr 2026

https://github.com/transformeroptimus/superagi

<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.

agents agi ai artificial-general-intelligence artificial-intelligence autonomous-agents gpt-4 hacktoberfest llm llmops nextjs openai pinecone python superagi

Last synced: 12 May 2025

https://github.com/letta-ai/letta

Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.

ai ai-agents llm llm-agent

Last synced: 15 Dec 2025

https://github.com/microsoft/GraphRAG

A modular graph-based Retrieval-Augmented Generation (RAG) system

gpt gpt-4 gpt4 graphrag llm llms rag

Last synced: 21 Aug 2025

https://github.com/kubesphere/kubesphere

The container platform tailored for Kubernetes multi-cloud, datacenter, and edge management ⎈ 🖥 ☁️

argocd cloud-native cncf container-management devops ebpf hacktoberfest istio jenkins k8s kubernetes kubernetes-platform-solution kubesphere llm multi-cluster observability servicemesh

Last synced: 12 May 2025

https://github.com/thudm/chatglm2-6b

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

chatglm chatglm-6b large-language-models llm

Last synced: 13 May 2025

https://github.com/nirdiamant/rag_techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.

ai langchain llama-index llm llms opeani python rag tutorials

Last synced: 14 May 2025

https://github.com/THUDM/ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

chatglm chatglm-6b large-language-models llm

Last synced: 20 Mar 2025

https://github.com/cpacker/MemGPT

Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.

ai ai-agents llm llm-agent

Last synced: 06 Apr 2025

https://github.com/swe-agent/swe-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

agent agent-based-model ai cybersecurity developer-tools llm lms

Last synced: 14 May 2025

https://github.com/yamadashy/repomix

📦 Repomix (formerly Repopack) is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more.

ai anthropic artificial-intelligence chatbot chatgpt claude deepseek developer-tools gemini genai generative-ai gpt javascript language-model llama llm mcp nodejs openai typescript

Last synced: 13 May 2025

https://github.com/arc53/DocsGPT

DocsGPT is an open-source genAI tool that helps users get reliable answers from knowledge source, while avoiding hallucinations. It enables private and reliable information retrieval, with tooling and agentic system capability built in.

ai chatgpt docsgpt hacktoberfest information-retrieval language-model llm machine-learning natural-language-processing python pytorch rag react semantic-search transformers web-app

Last synced: 14 Mar 2025

https://github.com/datawhalechina/self-llm

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

chatglm chatglm3 gemma-2b-it glm-4 internlm2 llama3 llm lora minicpm q-wen qwen qwen1-5 qwen2

Last synced: 14 May 2025

https://github.com/TransformerOptimus/SuperAGI

<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.

agents agi ai artificial-general-intelligence artificial-intelligence autonomous-agents gpt-4 hacktoberfest llm llmops nextjs openai pinecone python superagi

Last synced: 27 Mar 2025

https://github.com/mlc-ai/web-llm

High-performance In-browser LLM Inference Engine

chatgpt deep-learning language-model llm tvm webgpu webml

Last synced: 12 May 2025

https://github.com/llmware-ai/llmware

Unified framework for building enterprise RAG pipelines with small, specialized models

agents generative-ai-tools llamacpp llm onnx openvino parsing retrieval-augmented-generation small-specialized-models

Last synced: 14 Apr 2026

https://github.com/LlamaChinese/Llama-Chinese

Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用

agent llama llama4 llm pretraining rl

Last synced: 06 Apr 2026

https://github.com/llamafamily/llama-chinese

Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用

agent llama llama4 llm pretraining rl

Last synced: 14 May 2025

https://github.com/plandex-ai/plandex

Open source AI coding agent. Designed for large projects and real world tasks.

ai ai-agents ai-developer-tools ai-tools cli command-line developer-tools git golang gpt-4 llm openai polyglot-programming terminal terminal-based terminal-ui

Last synced: 04 Oct 2025

https://llmware-ai.github.io/llmware/

Unified framework for building enterprise RAG pipelines with small, specialized models

agents generative-ai-tools llamacpp llm onnx openvino parsing retrieval-augmented-generation small-specialized-models

Last synced: 25 Sep 2025

https://github.com/mediar-ai/screenpipe

AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording

agents agi ai computer-vision llm machine-learning ml multimodal vision

Last synced: 03 Feb 2026

https://github.com/sillytavern/sillytavern

LLM Frontend for Power Users.

ai characters chat llm openai

Last synced: 14 Feb 2026

https://github.com/esengine/DeepSeek-Reasonix

DeepSeek-native AI coding agent for your terminal. Engineered around prefix-cache stability — leave it running.

agent agent-framework ai-agent ai-coding cli coding-agent deepseek developer-tools ink llm prompt-caching r1 terminal tool-use tui typescript

Last synced: 01 Jun 2026

https://github.com/LlamaFamily/Llama-Chinese

Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

finetune-llm llama llama3 llm pretraining

Last synced: 14 Mar 2025

https://github.com/QwenLM/Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

chinese flash-attention large-language-models llm natural-language-processing pretrained-models

Last synced: 16 Mar 2025

https://princeton-nlp.github.io/SWE-agent

[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.

agent agent-based-model ai cybersecurity developer-tools llm lms

Last synced: 11 Sep 2025

https://github.com/princeton-nlp/SWE-agent

[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.

agent agent-based-model ai cybersecurity developer-tools llm lms

Last synced: 01 Apr 2025

https://github.com/SWE-agent/SWE-agent

[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.

agent agent-based-model ai cybersecurity developer-tools llm lms

Last synced: 07 Aug 2025

https://github.com/tencent/weknora

LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.

agent agentic ai chatbot chatbots embeddings evaluation generative-ai golang knowledge-base llm multi-tenant multimodel ollama openai question-answering rag reranking semantic-search vector-search

Last synced: 15 Apr 2026

https://github.com/modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...) (AAAI 2025).

deepseek-r1 embedding grpo internvl liger llama llama4 llm lora megatron moe multimodal open-r1 peft qwen3 qwen3-6 qwen3-omni qwen3-vl reranker sft

Last synced: 25 Apr 2026

https://github.com/jujumilk3/leaked-system-prompts

Collection of leaked system prompts

ai document llm prompt

Last synced: 16 Jan 2026

https://github.com/botpress/botpress

The open-source hub to build & deploy GPT/LLM Agents ⚡️

agent ai botpress chatbot chatgpt gpt gpt-4 langchain llm nlp openai prompt

Last synced: 08 Oct 2025

https://github.com/eosphoros-ai/DB-GPT

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

agents bgi database gpt gpt-4 hacktoberfest langchain llm private rag security vicuna

Last synced: 27 Mar 2025

https://github.com/skyvern-ai/skyvern

Automate browser-based workflows with LLMs and Computer Vision

api automation browser browser-automation computer gpt llm playwright python rpa vision workflow

Last synced: 09 Apr 2026

https://github.com/cocktailpeanut/dalai

The simplest way to run LLaMA on your local machine

ai llama llm

Last synced: 13 May 2025

https://cocktailpeanut.github.io/dalai/

The simplest way to run LLaMA on your local machine

ai llama llm

Last synced: 25 Sep 2025

https://github.com/alibaba/mnn

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/README.md). MNN TaoAvatar Android - Local 3D Avatar Intelligence: apps/Android/Mnn3dAvatar/README.md

arm convolution deep-learning embedded-devices llm machine-learning ml mnn transformer vulkan winograd-algorithm

Last synced: 07 Feb 2026

https://github.com/mastra-ai/mastra

The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama.

agents ai chatbots evals javascript llm mcp nextjs nodejs reactjs tts typescript workflows

Last synced: 14 May 2025

https://github.com/memvid/memvid

Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval and long-term memory.

ai context embedded faiss knowledge-base knowledge-graph llm machine-learning memory memvid mv2 nlp offline-first opencv python rag retrieval-augmented-generation semantic-search vector-database video-processing

Last synced: 15 Feb 2026

https://github.com/Skyvern-AI/skyvern

Automate browser-based workflows with LLMs and Computer Vision

api automation browser browser-automation computer gpt llm playwright python rpa vision workflow

Last synced: 07 Apr 2025

https://github.com/eugeneyan/open-llms

📋 A list of open LLMs available for commercial use.

commercial large-language-models llm llms

Last synced: 25 Jan 2026

https://github.com/Skyvern-AI/Skyvern

Automate browser-based workflows with LLMs and Computer Vision

api automation browser browser-automation computer gpt llm playwright python rpa vision workflow

Last synced: 09 Mar 2025

https://github.com/78/xiaozhi-esp32

Build your own AI friend

chatbot esp32 llm

Last synced: 14 May 2025

https://github.com/CopilotKit/CopilotKit

React UI + elegant infrastructure for AI Copilots, in-app AI agents, AI chatbots, and AI-powered Textareas 🪁

agent agents ai ai-agent ai-assistant assistant copilot copilot-chat hacktoberfest langchain langgraph llm nextjs open-source react reactjs ts typescript

Last synced: 24 Mar 2025

https://github.com/PaddlePaddle/PaddleNLP

👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.

bert compression distributed-training document-intelligence embedding ernie information-extraction llama llm neural-search nlp paddlenlp pretrained-models question-answering search-engine semantic-analysis sentiment-analysis transformers uie

Last synced: 18 Mar 2025

https://github.com/HKUDS/LightRAG

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

genai gpt gpt-4 graphrag knowledge-graph large-language-models llm rag retrieval-augmented-generation

Last synced: 27 Feb 2025

https://github.com/lightning-ai/litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

ai artificial-intelligence deep-learning large-language-models llm llm-inference llms

Last synced: 12 May 2025

https://github.com/1Panel-dev/MaxKB

💬 Ready-to-use, flexible RAG Chatbot.

chatbot knowledgebase langchain llm maxkb ollama pgvector rag

Last synced: 24 Mar 2025

https://github.com/shishirpatil/gorilla

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

api api-documentation chatgpt claude-api gpt-4-api llm openai-api openai-functions

Last synced: 13 May 2025

https://github.com/codexu/note-gen

A cross-platform Markdown AI note-taking software.

agent chatbot knowledge-base llm markdown mcp nextjs note-taking rag tauri webdav

Last synced: 31 May 2026

https://github.com/nirdiamant/genai_agents

This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive AI systems.

agents ai genai langchain langgraph llm llms openai tutorials

Last synced: 15 May 2025

https://github.com/Lightning-AI/litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

ai artificial-intelligence deep-learning large-language-models llm llm-inference llms

Last synced: 26 Mar 2025

https://github.com/StarTrail-org/LEANN

[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

ai faiss gpt-oss langchain llama-index llm localstorage offline-first ollama privacy python rag retrieval-augmented-generation vector-database vector-search vectors

Last synced: 02 Jun 2026

https://github.com/langchain4j/langchain4j

LangChain4j is an idiomatic, open-source Java library for building LLM-powered applications on the JVM. It offers a unified API over popular LLM providers and vector stores, and makes implementing tool calling (including MCP support), agents and RAG easy. It integrates seamlessly with enterprise Java frameworks like Quarkus and Spring Boot.

anthropic chatgpt chroma embeddings gemini gpt huggingface java langchain llama llm llms milvus ollama onnx openai openai-api pgvector pinecone vector-database

Last synced: 30 Apr 2026

https://github.com/h2oai/h2ogpt

Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/

ai chatgpt embeddings generative gpt gpt4all llama2 llm mixtral pdf private privategpt vectorstore

Last synced: 13 May 2025

https://github.com/zai-org/CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

cogvideox image-to-video llm sora text-to-video video-generation

Last synced: 30 Jul 2025

https://github.com/ShishirPatil/gorilla

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

api api-documentation chatgpt claude-api gpt-4-api llm openai-api openai-functions

Last synced: 27 Mar 2025

https://github.com/thudm/cogvideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

cogvideox image-to-video llm sora text-to-video video-generation

Last synced: 16 May 2025

https://github.com/bentoml/openllm

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

bentoml fine-tuning llama llama2 llama3-1 llama3-2 llama3-2-vision llm llm-inference llm-ops llm-serving llmops mistral mlops model-inference open-source-llm openllm vicuna

Last synced: 23 Oct 2025

https://github.com/ZhuLinsen/daily_stock_analysis

LLM驱动的 A/H/美股智能分析器,多数据源行情 + 实时新闻 + Gemini 决策仪表盘 + 多渠道推送,零成本,纯白嫖,定时运行

agent ai aigc gemini llm quant quantitative-trading rag stock

Last synced: 16 Feb 2026

https://github.com/langfuse/langfuse

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

analytics autogen evaluation langchain large-language-models llama-index llm llm-evaluation llm-observability llmops monitoring observability open-source openai playground prompt-engineering prompt-management self-hosted ycombinator

Last synced: 13 Mar 2026

https://microsoft.github.io/agent-lightning/

The absolute trainer to light up AI agents.

agent agentic-ai llm mlops reinforcement-learning

Last synced: 22 Jan 2026

https://github.com/THUDM/CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

cogvideox image-to-video llm sora text-to-video video-generation

Last synced: 28 Mar 2025

https://github.com/e2b-dev/e2b

Open-source, secure environment with real-world tools for enterprise-grade agents.

agent ai ai-agent ai-agents code-interpreter copilot development devtools gpt gpt-4 javascript llm nextjs openai python react software typescript

Last synced: 27 May 2026

https://github.com/rockchinq/langbot

😎简单易用、🧩丰富生态 - 大模型原生即时通信机器人平台 | 适配 QQ / 微信(企业微信、个人微信)/ 飞书 / 钉钉 / Discord / Telegram / Slack 等平台 | 支持 ChatGPT、DeepSeek、Dify、Claude、Gemini、xAI、PPIO、Ollama、LM Studio、阿里云百炼、火山方舟、SiliconFlow、Qwen、Moonshot、ChatGLM、SillyTraven、MCP 等 LLM 的机器人 / Agent | LLM-based instant messaging bots platform, supports Discord, Telegram, WeChat, Lark, DingTalk, QQ, Slack

agent chatgpt deepseeek dify llm openai plugins qq telegram wechat

Last synced: 10 May 2025

https://github.com/RockChinQ/LangBot

😎简单易用、🧩丰富生态 - 大模型原生即时通信机器人平台 | 适配 QQ / 微信(企业微信、个人微信)/ 飞书 / 钉钉 / Discord / Telegram / Slack 等平台 | 支持 ChatGPT、DeepSeek、Dify、Claude、Gemini、xAI、PPIO、Ollama、LM Studio、阿里云百炼、火山方舟、SiliconFlow、Qwen、Moonshot、ChatGLM、SillyTraven、MCP 等 LLM 的机器人 / Agent | LLM-based instant messaging bots platform, supports Discord, Telegram, WeChat, Lark, DingTalk, QQ, Slack

agent chatgpt deepseeek dify llm openai plugins qq telegram wechat

Last synced: 11 May 2025

https://github.com/getumbrel/llama-gpt

A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!

ai chatgpt code-llama codellama gpt gpt-4 gpt4all llama llama-2 llama-cpp llama2 llamacpp llm localai openai self-hosted

Last synced: 13 May 2025

https://github.com/ther1d/shell_gpt

A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently.

chatgpt cheat-sheet cli commands gpt-3 gpt-4 linux llama llm ollama openai productivity python shell terminal

Last synced: 06 May 2026

https://github.com/huggingface/chat-ui

The open source codebase powering HuggingChat

chatgpt hacktoberfest huggingface llm svelte svelte-kit sveltekit tailwindcss typescript

Last synced: 11 May 2026

https://github.com/tensorzero/tensorzero

TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.

ai ai-engineering anthropic artificial-intelligence deep-learning genai generative-ai gpt large-language-models llama llm llmops llms machine-learning ml ml-engineering mlops openai python rust

Last synced: 16 Jan 2026

https://github.com/RockChinQ/QChatGPT

😎简单易用、🧩丰富生态 - 大模型原生即时通信机器人平台 | 适配 QQ / 微信(企业微信、个人微信)/ 飞书 / 钉钉 / Discord / Telegram / Slack 等平台 | 支持 ChatGPT、DeepSeek、Dify、Claude、Gemini、xAI Grok、Ollama、LM Studio、阿里云百炼、火山方舟、SiliconFlow、Qwen、Moonshot、ChatGLM、SillyTraven、MCP 等 LLM 的机器人 / Agent | LLM-based instant messaging bots platform, supports Discord, Telegram, WeChat, Lark, DingTalk, QQ, Slack

agent chatgpt deepseeek dify llm openai plugins qq telegram wechat

Last synced: 16 Apr 2025

https://github.com/explodinggradients/ragas

Supercharge Your LLM Application Evaluations 🚀

evaluation llm llmops

Last synced: 22 Aug 2025

https://github.com/microsoft/promptflow

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

ai ai-application-development ai-applications chatgpt gpt llm prompt prompt-engineering

Last synced: 12 May 2025

https://github.com/mistralai/mistral-inference

Official inference library for Mistral models

llm llm-inference mistralai

Last synced: 12 May 2025

https://github.com/yichuan-w/leann

[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

ai faiss gpt-oss langchain llama-index llm localstorage offline-first ollama privacy python rag retrieval-augmented-generation vector-database vector-search vectors

Last synced: 08 Mar 2026

https://github.com/mistralai/mistral-inference?tab=readme-ov-file

Official inference library for Mistral models

llm llm-inference mistralai

Last synced: 16 Mar 2025

https://github.com/Zackriya-Solutions/meetily

Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper live transcription, speaker diarization, and Ollama summarization built on Rust. 100% local processing. no cloud required. Meetily (Meetly Ai - https://meetily.ai) is the #1 Self-hosted, Open-source Ai meeting note taker for macOS & Windows.

ai ai-meeting-assistant llm local-ai mac meeting-minutes meeting-notes offline-first ollama parakeet privacy-focused privacy-tools rust self-hosted speech-to-text transcription whisper whisper-cpp windows

Last synced: 05 Mar 2026

https://github.com/bentoml/OpenLLM

Run any open-source LLMs, such as Llama 3.1, Gemma, as OpenAI compatible API endpoint in the cloud.

bentoml fine-tuning llama llama2 llama3-1 llama3-2 llama3-2-vision llm llm-inference llm-ops llm-serving llmops mistral mlops model-inference open-source-llm openllm vicuna

Last synced: 14 Mar 2025

https://github.com/GoogleCloudPlatform/generative-ai

Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI

gemini gemini-api generative-ai google google-cloud google-gemini langchain llm palm-api vertex-ai vertex-ai-gemini-api vertexai

Last synced: 17 Mar 2025

https://github.com/chainlit/chainlit

Build Conversational AI in minutes ⚡️

chatgpt langchain llm openai openai-chatgpt python ui

Last synced: 08 Apr 2026

https://github.com/TheR1D/shell_gpt

A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently.

chatgpt cheat-sheet cli commands gpt-3 gpt-4 linux llama llm ollama openai productivity python shell terminal

Last synced: 20 Mar 2025