Large Language Model | Ecosyste.ms: Awesome

https://github.com/TimDettmers/bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

llm machine-learning pytorch qlora quantization

Last synced: 24 Mar 2025

https://github.com/microsoft/ufo

A UI-Focused Agent for Windows OS Interaction.

agent automation copilot gui llm windows

Last synced: 23 Apr 2025

https://github.com/intel-analytics/BigDL

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, GraphRAG, DeepSpeed, Axolotl, etc

gpu llm pytorch transformers

Last synced: 14 Dec 2024

https://github.com/intel-analytics/ipex-llm

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, GraphRAG, DeepSpeed, Axolotl, etc

gpu llm pytorch transformers

Last synced: 20 Jan 2025

https://github.com/microsoft/UFO

A UI-Focused Agent for Windows OS Interaction.

agent automation copilot gui llm windows

Last synced: 25 Mar 2025

https://github.com/comet-ml/opik

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

langchain llama-index llm llm-evaluation llm-observability llmops open-source openai playground prompt-engineering

Last synced: 22 Apr 2025

https://github.com/soulter/astrbot

✨ 易上手的多平台 LLM 聊天机器人及开发框架 ✨ 平台支持 QQ、QQ频道、Telegram、微信、企微、飞书 | OpenAI、DeepSeek、Gemini、硅基流动、月之暗面、Ollama、OneAPI、Dify 等。附带 WebUI。

agent ai chatbot chatgpt docker function-calling gemini gpt llama llm ollama openai python qq qqbot qqchannel telegram

Last synced: 27 Mar 2025

https://github.com/sigoden/aichat

All-in-one LLM CLI tool featuring Shell Assistant, Chat-REPL, RAG, AI Tools & Agents, with access to OpenAI, Claude, Gemini, Ollama, Groq, and more.

ai ai-agents chatbot claude cli function-calling gemini llm ollama openai rag rust shell webui

Last synced: 22 Apr 2025

https://github.com/SillyTavern/SillyTavern

LLM Frontend for Power Users.

ai characters chat llm openai

Last synced: 23 Mar 2025

https://github.com/run-llama/rags

Build ChatGPT over your data, all with natural language

agent chatbot chatgpt gpts llamaindex llm openai rag streamlit

Last synced: 11 Apr 2025

https://github.com/yangjianxin1/firefly

Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

alpaca aquila baichuan chatglm gemma gpt internlm llama llama2 llama3 llm lora minicpm mistral mixtral peft qlora qwen qwen2 zephyr

Last synced: 24 Apr 2025

https://github.com/internlm/mindsearch

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

ai-search-engine gpt llm llms multi-agent-systems perplexity-ai search searchgpt transformer web-search

Last synced: 24 Apr 2025

https://github.com/yangjianxin1/Firefly

Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

alpaca aquila baichuan chatglm gemma gpt internlm llama llama2 llama3 llm lora minicpm mistral mixtral peft qlora qwen qwen2 zephyr

Last synced: 19 Mar 2025

https://github.com/postgresml/postgresml

Postgres with GPUs for ML/AI apps.

ai ann approximate-nearest-neighbor-search artificial-intelligence classification clustering embeddings forecasting knn llm machine-learning ml postgres rag regression sql vector-database

Last synced: 22 Apr 2025

https://github.com/taskingai/taskingai

The open source platform for AI-native application development.

agent ai ai-native function-call generative-ai gpt langchain llm rag retrieval-augmented-generation vector

Last synced: 22 Apr 2025

https://github.com/TaskingAI/TaskingAI

The open source platform for AI-native application development.

agent ai ai-native function-call generative-ai gpt langchain llm rag retrieval-augmented-generation vector

Last synced: 28 Mar 2025

https://github.com/flyteorg/flyte

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

data data-analysis data-science dataops declarative fine-tuning flyte golang grpc hacktoberfest kubernetes kubernetes-operator llm machine-learning mlops orchestration-engine production python scale workflow

Last synced: 22 Apr 2025

https://github.com/internlm/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

codellama cuda-kernels deepspeed fastertransformer internlm llama llama2 llama3 llm llm-inference turbomind

Last synced: 22 Apr 2025

https://llmware-ai.github.io/llmware/

Unified framework for building enterprise RAG pipelines with small, specialized models

agents generative-ai-tools llamacpp llm parsing retrieval-augmented-generation small-specialized-models vector-db

Last synced: 16 Jan 2025

https://github.com/haifengl/smile

Statistical Machine Intelligence & Learning Engine

classification clustering computer-algebra-system computer-vision data-science dataframe deep-learning genetic-algorithm interpolation linear-algebra llm machine-learning manifold-learning multidimensional-scaling nearest-neighbor-search nlp regression statistics visualization wavelet

Last synced: 22 Apr 2025

https://haifengl.github.io/smile

Statistical Machine Intelligence & Learning Engine

classification clustering computer-algebra-system computer-vision data-science dataframe deep-learning genetic-algorithm interpolation linear-algebra llm machine-learning manifold-learning multidimensional-scaling nearest-neighbor-search nlp regression statistics visualization wavelet

Last synced: 29 Jan 2025

https://github.com/rustformers/llm

[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models

ai ggml llm ml rust

Last synced: 25 Feb 2025

https://github.com/OpenGVLab/InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

gpt gpt-4o gpt-4v image-classification image-text-retrieval llm multi-modal semantic-segmentation video-classification vision-language-model vit-22b vit-6b

Last synced: 16 Mar 2025

https://github.com/evidentlyai/evidently

Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.

data-drift data-quality data-science data-validation generative-ai hacktoberfest html-report jupyter-notebook llm llmops machine-learning mlops model-monitoring pandas-dataframe

Last synced: 22 Apr 2025

https://github.com/modelscope/agentscope

Start building LLM-empowered multi-agent applications in an easier way.

agent chatbot distributed-agents drag-and-drop gpt-4 gpt-4o large-language-models llama3 llm llm-agent multi-agent multi-modal

Last synced: 22 Apr 2025

https://github.com/nilsherzig/llocalsearch

LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.

llm search-engine

Last synced: 09 Apr 2025

https://github.com/nilsherzig/LLocalSearch

LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.

llm search-engine

Last synced: 24 Mar 2025

https://github.com/InternLM/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

codellama cuda-kernels deepspeed fastertransformer internlm llama llama2 llama3 llm llm-inference turbomind

Last synced: 20 Mar 2025

https://github.com/lavague-ai/LaVague

Large Action Model framework to develop AI Web Agents

ai browser large-action-model llm oss rag

Last synced: 03 Apr 2025

https://github.com/lavague-ai/lavague

Large Action Model framework to develop AI Web Agents

ai browser large-action-model llm oss rag

Last synced: 22 Apr 2025

https://github.com/josstorer/rwkv-runner

A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.

api api-client chatgpt llm rwkv tool wails

Last synced: 12 Apr 2025

https://github.com/jlowin/fastmcp

🚀 The fast, Pythonic way to build MCP servers and clients

anthropic api claude fastmcp llm mcp mcp-client mcp-server model-context-protocol python server

Last synced: 22 Apr 2025

https://github.com/lyogavin/airllm

AirLLM 70B inference with single 4GB GPU

chinese-llm chinese-nlp finetune generative-ai instruct-gpt instruction-set llama llm lora open-models open-source open-source-models qlora

Last synced: 22 Apr 2025

https://github.com/promptfoo/promptfoo

Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.

ci ci-cd cicd evaluation evaluation-framework llm llm-eval llm-evaluation llm-evaluation-framework llmops pentesting prompt-engineering prompt-testing prompts rag red-teaming testing vulnerability-scanners

Last synced: 14 Mar 2025

https://github.com/tencentqqgylab/appagent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

agent chatgpt generative-ai gpt4 gpt4v llm

Last synced: 24 Apr 2025

https://github.com/traceloop/openllmetry

Open-source observability for your LLM application, based on OpenTelemetry

artifical-intelligence datascience generative-ai good-first-issue good-first-issues help-wanted llm llmops metrics ml model-monitoring monitoring observability open-source open-telemetry opentelemetry opentelemetry-python python

Last synced: 22 Apr 2025

https://github.com/microsoft/taskweaver

A code-first agent framework for seamlessly planning and executing data analytics tasks.

agent ai-agents code-interpreter copilot data-analysis llm openai

Last synced: 23 Apr 2025

https://github.com/prefecthq/marvin

✨ AI agents that spark joy

agents ai ai-functions ambient-ai chatbots gpt llm nli openai python

Last synced: 23 Apr 2025

https://github.com/automq/automq

AutoMQ is a stateless Kafka on S3. 10x Cost-Effective. No Cross-AZ Traffic Cost. Autoscale in seconds. Single-digit ms latency. Multi-AZ Availability.

ai apache-kafka aws azure cloud cloud-first cloud-native ebs gcp kafka llm messaging minio s3 serverless spot streaming

Last synced: 23 Apr 2025

https://github.com/modelscope/ms-swift

Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).

agent deepseek-r1 deploy distill embedding grpo internvl liger llama llama3-3 llm lora multimodal open-r1 peft qwen2-5 qwen2-vl rft sft

Last synced: 23 Apr 2025

https://github.com/PrefectHQ/marvin

✨ AI agents that spark joy

agents ai ai-functions ambient-ai chatbots gpt llm nli openai python

Last synced: 02 Apr 2025

https://github.com/arcee-ai/mergekit

Tools for merging pretrained large language models.

llama llm model-merging

Last synced: 24 Apr 2025

https://github.com/aiwaves-cn/agents

An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents

autonomous-agents language-model llm

Last synced: 08 Apr 2025

https://github.com/NirDiamant/GenAI_Agents

This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive AI systems.

agents ai genai langchain langgraph llm llms openai tutorials

Last synced: 21 Jan 2025

https://github.com/mufeedvh/code2prompt

A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, and token counting.

ai chatgpt claude cli command-line command-line-tool gpt llm prompt prompt-engineering prompt-generator prompt-toolkit rust

Last synced: 22 Apr 2025

https://github.com/ericlbuehler/mistral.rs

Blazingly fast LLM inference.

llm rust

Last synced: 22 Apr 2025

https://github.com/funaudiollm/sensevoice

Multilingual Voice Understanding Model

ai aigc asr audio-event-classification cross-lingual gpt-4o llm multilingual python pytorch speech-emotion-recognition speech-recognition speech-to-text

Last synced: 24 Apr 2025

https://github.com/ten-framework/ten-agent

TEN Agent is a conversational voice AI agent powered by TEN, integrating Deepseek, Gemini, OpenAI, RTC, and hardware like ESP32. It enables realtime AI capabilities like seeing, hearing, and speaking, and is fully compatible with platforms like Dify and Coze.

agent ai asr cpp gemini golang gpt-4 gpt-4o llm low-latency multimodal nextjs14 openai python rag real-time realtime tts vision voice-assistant

Last synced: 31 Mar 2025

https://github.com/TencentQQGYLab/AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

agent chatgpt generative-ai gpt4 gpt4v llm

Last synced: 24 Jan 2025

https://github.com/InternLM/InternLM

Official release of InternLM2 7B and 20B base and chat models. 200K context support

chatbot chinese fine-tuning-llm flash-attention gpt large-language-model llm long-context pretrained-models rlhf

Last synced: 16 Mar 2025

https://github.com/internlm/internlm

Official release of InternLM2 7B and 20B base and chat models. 200K context support

chatbot chinese fine-tuning-llm flash-attention gpt large-language-model llm long-context pretrained-models rlhf

Last synced: 09 Apr 2025

https://github.com/dsdanielpark/bard-api

The unofficial python package that returns response of Google Bard through cookie value.

ai-api api bard bard-api chatbot google google-bard google-bard-api google-bard-python google-maps-api googlebard llm nlp

Last synced: 17 Jan 2025

https://github.com/mnotgod96/appagent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

agent chatgpt generative-ai gpt4 gpt4v llm

Last synced: 14 Jan 2025

https://github.com/microsoft/TaskWeaver

A code-first agent framework for seamlessly planning and executing data analytics tasks.

agent ai-agents code-interpreter copilot data-analysis llm openai

Last synced: 25 Mar 2025

https://github.com/cvhub520/x-anylabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

annotation-tool classification clip deep-learning deeplearning depth-estimation grounding-dino image-segmentation labeling-tool llm matting object-detection onnx paddle pose-estimation pytorch resnet sam vlm yolo

Last synced: 23 Apr 2025

https://github.com/dsdanielpark/Bard-API

The unofficial python package that returns response of Google Bard through cookie value.

ai-api api bard bard-api chatbot google google-bard google-bard-api google-bard-python google-maps-api googlebard llm nlp

Last synced: 24 Mar 2025

https://github.com/e2b-dev/fragments

Open-source Next.js template for building apps that are fully generated by AI. By E2B.

ai ai-code-generation anthropic claude claude-ai code-interpreter e2b javascript llm nextjs react sandbox typescript

Last synced: 24 Apr 2025

https://github.com/princeton-nlp/tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

large-language-models llm prompting tree-of-thoughts tree-search

Last synced: 23 Apr 2025

https://github.com/homanp/superagent

🥷 Run AI-agents with an API

agent ai assistant generative-ai llm open-source python rag

Last synced: 28 Nov 2024

https://github.com/superagent-ai/superagent

🥷 Run AI-agents with an API

agent ai assistant generative-ai llm open-source python rag

Last synced: 09 Apr 2025

https://github.com/julep-ai/julep

Serverless AI Workflows for Data & ML Teams

agents ai ai-agents ai-agents-framework ai-memory ai-platform aiagents developer-tools devfest hacktoberfest hacktoberfest2024 llm llm-ops python

Last synced: 12 Apr 2025

https://github.com/gluonfield/enchanted

Enchanted is iOS and macOS app for chatting with private self hosted language models such as Llama2, Mistral or Vicuna using Ollama.

ios large-language-model llama llama2 llm mistral ollama ollama-app swift

Last synced: 11 Apr 2025

https://github.com/huggingface/alignment-handbook

Robust recipes to align language models with human and AI preferences

llm rlhf transformers

Last synced: 23 Apr 2025

https://github.com/openchatai/copilot

ai-copilot copilot llm sidekick

Last synced: 09 Apr 2025

https://github.com/open-compass/opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

benchmark chatgpt evaluation large-language-model llama2 llama3 llm openai

Last synced: 22 Apr 2025

https://github.com/openchatai/OpenCopilot

🤖 🔥 Language-to-actions engine

ai-copilot copilot llm sidekick

Last synced: 14 Feb 2025

https://github.com/salesforce/codegen

CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

codex generativemodel languagemodel llm programsynthesis tpu-acceleration

Last synced: 11 Apr 2025

https://github.com/InternLM/MindSearch

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

ai-search-engine gpt llm llms multi-agent-systems perplexity-ai search searchgpt transformer web-search

Last synced: 13 Nov 2024

https://github.com/getasterisk/deepclaude

A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.

ai anthropic anthropic-claude api chain-of-thought claude deepseek deepseek-r1 llm rust

Last synced: 10 Apr 2025

https://github.com/nirdiamant/prompt_engineering

This repository offers a comprehensive collection of tutorials and implementations for Prompt Engineering techniques, ranging from fundamental concepts to advanced strategies. It serves as an essential resource for mastering the art of effectively communicating with and leveraging large language models in AI applications.

ai genai llm llms opeani prompt-engineering python tutorials

Last synced: 13 Apr 2025

https://github.com/salesforce/CodeGen

CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

codex generativemodel languagemodel llm programsynthesis tpu-acceleration

Last synced: 28 Mar 2025

https://github.com/pbek/qownnotes

QOwnNotes is a plain-text file notepad and todo-list manager with Markdown support and Nextcloud / ownCloud integration.

bookmark c-plus-plus caldav chrome-extension dropbox firefox-extension llm local-first markdown nextcloud nextcloud-notes note-taking notebook notes owncloud pim pkm qownnotes qt second-brain

Last synced: 23 Apr 2025

https://github.com/TEN-framework/TEN-Agent

TEN Agent is a conversational voice AI agent powered by TEN, integrating Deepseek, Gemini, OpenAI, RTC, and hardware like ESP32. It enables realtime AI capabilities like seeing, hearing, and speaking, and is fully compatible with platforms like Dify and Coze.

agent ai asr cpp gemini golang gpt-4 gpt-4o llm low-latency multimodal nextjs14 openai python rag real-time realtime tts vision voice-assistant

Last synced: 08 Mar 2025

https://github.com/guardrails-ai/guardrails

Adding guardrails to large language models.

ai foundation-model gpt-3 llm openai

Last synced: 22 Apr 2025

https://github.com/scir-hi/huatuo-llama-med-chinese

Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草（原名：华驼）模型仓库，基于中文医学知识的大语言模型指令微调

aidoctor bloom chinese huozi llama llm medgpt medical medqa nlp

Last synced: 13 Apr 2025

https://github.com/langmanus/langmanus

A community-driven AI automation framework that builds upon the incredible work of the open source community. Our goal is to combine language models with specialized tools for tasks like web search, crawling, and Python code execution, while giving back to the community that made this possible.

agent agents agi ai automation deep-research deepseek deepseek-r1 langchain langgraph llm multi-agent multi-agent-systems qwen qwen-vl qwen2-vl

Last synced: 12 Apr 2025

https://github.com/timescale/pgai

A suite of tools to develop RAG, semantic search, and other AI applications more easily with PostgreSQL

ai llm postgresql rag

Last synced: 22 Apr 2025

https://github.com/mishushakov/llm-scraper

Turn any webpage into structured data using LLMs

ai artificial-intelligence browser browser-automation gpt gpt-4 langchain llama llm openai playwright puppeteer scraper

Last synced: 10 Apr 2025

https://github.com/QuivrHQ/MegaParse

File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.

docx llm parser pdf powerpoint

Last synced: 27 Dec 2024

https://github.com/josStorer/RWKV-Runner

A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.

api api-client chatgpt llm rwkv tool wails

Last synced: 08 Apr 2025

https://github.com/pyspur-dev/pyspur

A visual playground for agentic workflows: Iterate over your agents 10x faster

agent agents ai builder deepseek framework gemini graph human-in-the-loop llm llms loops multimodal ollama python rag reasoning tool trace workflow

Last synced: 23 Apr 2025

https://github.com/parisneo/lollms-webui

Lord of Large Language and Multi modal Systems Web User Interface

ai llm text-generation

Last synced: 24 Apr 2025

https://github.com/SCIR-HI/Huatuo-Llama-Med-Chinese

Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草（原名：华驼）模型仓库，基于中文医学知识的大语言模型指令微调

aidoctor bloom chinese huozi llama llm medgpt medical medqa nlp

Last synced: 19 Mar 2025

https://github.com/zenml-io/zenml

ZenML 🙏: The bridge between ML and Ops. https://zenml.io.

ai automl data-science deep-learning devops-tools hacktoberfest llm llmops machine-learning metadata-tracking ml mlops pipelines production-ready pytorch tensorflow workflow zenml

Last synced: 22 Apr 2025

https://github.com/internlm/xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

agent baichuan chatbot chatglm2 chatglm3 conversational-ai internlm large-language-models llama2 llama3 llava llm llm-training mixtral msagent peft phi3 qwen supervised-finetuning

Last synced: 22 Apr 2025

https://github.com/giskard-ai/giskard

🐢 Open-Source Evaluation & Testing for AI & LLM systems

agent-evaluation ai-red-team ai-security ai-testing fairness-ai llm llm-eval llm-evaluation llm-security llmops ml-testing ml-validation mlops rag-evaluation red-team-tools responsible-ai trustworthy-ai

Last synced: 22 Apr 2025

https://github.com/nexaai/nexa-sdk

Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.

asr audio edge-computing language-model llm on-device-ai on-device-ml sdk sdk-python stable-diffusion transformers tts vlm whisper

Last synced: 22 Apr 2025

https://github.com/katanaml/sparrow

Data processing with ML, LLM and Vision LLM

computer-vision gpt huggingface-transformers llm machinelearning nlp-machine-learning rag vllm

Last synced: 23 Apr 2025

https://github.com/EricLBuehler/mistral.rs

Blazingly fast LLM inference.

llm rust

Last synced: 05 Apr 2025

https://github.com/argilla-io/argilla

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

active-learning ai annotation-tool developer-tools gpt-4 human-in-the-loop langchain llm machine-learning mlops natural-language-processing nlp rlhf text-annotation text-labeling weak-supervision weakly-supervised-learning

Last synced: 22 Apr 2025

https://github.com/openbmb/agentverse

🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation

agent ai gpt gpt-4 llm

Last synced: 08 Apr 2025

https://github.com/Giskard-AI/giskard

🐢 Open-Source Evaluation & Testing for AI & LLM systems

agent-evaluation ai-red-team ai-security ai-testing fairness-ai llm llm-eval llm-evaluation llm-security llmops ml-testing ml-validation mlops rag-evaluation red-team-tools responsible-ai trustworthy-ai

Last synced: 15 Apr 2025

https://github.com/trigaten/learn_prompting

Prompt Engineering, Generative AI, and LLM Guide by Learn Prompting | Join our discord for the largest Prompt Engineering learning community

chatgpt chatgpt-api deep-learning gpt-3 gpt-4 gpt-4-api gpt3 large-language-models llm machine-learning nlp openai-api prompt-engineering prompt-toolkit prompt-tuning prompting transformers

Last synced: 23 Apr 2025

https://github.com/OpenBMB/AgentVerse

🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation

agent ai gpt gpt-4 llm

Last synced: 28 Mar 2025