Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with llama3

A curated list of projects in awesome lists tagged with llama3 .

https://github.com/ollama/ollama

Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.

gemma gemma2 go golang llama llama2 llama3 llava llm llms mistral ollama phi3

Last synced: 16 Dec 2024

https://github.com/langgenius/dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

agent ai anthropic backend-as-a-service chatbot gemini genai gpt gpt-4 llama3 llm llmops nextjs openai orchestration python rag workflow workflows

Last synced: 16 Dec 2024

https://github.com/go-skynet/LocalAI

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed inference

ai api audio-generation distributed gemma gpt4all image-generation kubernetes llama llama3 llm mamba mistral musicgen p2p rerank rwkv stable-diffusion text-generation tts

Last synced: 09 Nov 2024

https://github.com/mudler/localai

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed inference

ai api audio-generation distributed gemma gpt4all image-generation kubernetes llama llama3 llm mamba mistral musicgen p2p rerank rwkv stable-diffusion text-generation tts

Last synced: 16 Dec 2024

https://github.com/mudler/LocalAI

:robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.

ai api audio-generation distributed gemma gpt4all image-generation kubernetes llama llama3 llm mamba mistral musicgen p2p rerank rwkv stable-diffusion text-generation tts

Last synced: 25 Oct 2024

https://github.com/unslothai/unsloth

Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory

ai fine-tuning finetuning gemma gemma2 llama llama3 llm llms lora mistral phi3 qlora unsloth

Last synced: 16 Dec 2024

https://github.com/Mintplex-Labs/anything-llm

The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.

ai-agents chromadb crewai crewaiui desktop-app llama3 llamacpp llm llm-application llm-webui lmstudio local-llm localai ollama rag vector-database webui

Last synced: 27 Oct 2024

https://github.com/llamafamily/llama-chinese

Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

finetune-llm llama llama3 llm pretraining

Last synced: 16 Dec 2024

https://github.com/khoj-ai/khoj

Your AI second brain. Self-hostable. Get answers from the internet or your docs. Use any online or local LLM (e.g gpt, claude, gemini, llama, qwen, mistral). Build custom agents, personalized automations.

agent ai assistant chat chatgpt emacs image-generation llama3 llamacpp llm obsidian obsidian-md offline-llm productivity rag research self-hosted semantic-search stt whatsapp-ai

Last synced: 16 Dec 2024

https://github.com/LlamaFamily/Llama-Chinese

Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

finetune-llm llama llama3 llm pretraining

Last synced: 25 Oct 2024

https://github.com/datawhalechina/self-llm

《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合中国宝宝的部署教程

chatglm chatglm3 gemma-2b-it glm-4 internlm2 llama3 llm lora minicpm q-wen qwen qwen1-5 qwen2

Last synced: 16 Dec 2024

https://github.com/sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

cuda inference llama llama2 llama3 llama3-1 llava llm llm-serving moe pytorch transformer vlm

Last synced: 16 Dec 2024

https://github.com/yangjianxin1/firefly

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

alpaca aquila baichuan chatglm gemma gpt internlm llama llama2 llama3 llm lora minicpm mistral mixtral peft qlora qwen qwen2 zephyr

Last synced: 17 Dec 2024

https://github.com/yangjianxin1/Firefly

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

alpaca aquila baichuan chatglm gemma gpt internlm llama llama2 llama3 llm lora minicpm mistral mixtral peft qlora qwen qwen2 zephyr

Last synced: 27 Oct 2024

https://github.com/xorbitsai/inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

artificial-intelligence chatglm deployment flan-t5 gemma ggml glm4 inference llama llama3 llamacpp llm machine-learning mistral openai-api pytorch qwen vllm whisper wizardlm

Last synced: 16 Dec 2024

https://github.com/modelscope/agentscope

Start building LLM-empowered multi-agent applications in an easier way.

agent chatbot distributed-agents drag-and-drop gpt-4 gpt-4o large-language-models llama3 llm llm-agent multi-agent multi-modal

Last synced: 18 Dec 2024

https://github.com/internlm/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

codellama cuda-kernels deepspeed fastertransformer internlm llama llama2 llama3 llm llm-inference turbomind

Last synced: 16 Dec 2024

https://github.com/InternLM/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

codellama cuda-kernels deepspeed fastertransformer internlm llama llama2 llama3 llm llm-inference turbomind

Last synced: 28 Oct 2024

https://github.com/internlm/opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

benchmark chatgpt evaluation large-language-model llama2 llama3 llm openai

Last synced: 13 Dec 2024

https://github.com/open-compass/opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

benchmark chatgpt evaluation large-language-model llama2 llama3 llm openai

Last synced: 16 Dec 2024

https://github.com/open-compass/OpenCompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

benchmark chatgpt evaluation large-language-model llama2 llama3 llm openai

Last synced: 04 Dec 2024

https://github.com/crazyboym/llama3-chinese-chat

Llama3、Llama3.1 中文仓库(随书籍撰写中... 各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档)

llama llama2 llama3 llama3-chinese llama3-finetune

Last synced: 17 Dec 2024

https://github.com/modelscope/ms-swift

Use PEFT or Full-parameter to finetune 350+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)

agent deploy dpo internvl liger llama llama3 llava llm lora megatron minicpm-v modelscope multimodal peft pre-training qwen2 qwen2-vl reflection sft

Last synced: 17 Dec 2024

https://github.com/internlm/xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

agent baichuan chatbot chatglm2 chatglm3 conversational-ai internlm large-language-models llama2 llama3 llava llm llm-training mixtral msagent peft phi3 qwen supervised-finetuning

Last synced: 16 Dec 2024

https://github.com/CrazyBoyM/llama3-Chinese-chat

Llama3、Llama3.1 中文仓库(随书籍撰写中... 各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档)

llama llama2 llama3 llama3-chinese llama3-finetune

Last synced: 29 Oct 2024

https://github.com/InternLM/xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

agent baichuan chatbot chatglm2 chatglm3 conversational-ai internlm large-language-models llama2 llama3 llava llm llm-training mixtral msagent peft phi3 qwen supervised-finetuning

Last synced: 28 Oct 2024

https://github.com/linkedin/Liger-Kernel

Efficient Triton Kernels for LLM Training

finetuning gemma2 llama llama3 llm-training llms mistral phi3 triton triton-kernels

Last synced: 20 Dec 2024

https://github.com/linkedin/liger-kernel

Efficient Triton Kernels for LLM Training

finetuning gemma2 llama llama3 llm-training llms mistral phi3 triton triton-kernels

Last synced: 16 Dec 2024

https://github.com/brianpetro/obsidian-smart-connections

Chat with your notes & see links to related content with AI embeddings. Use local models or 100+ via APIs like Claude, Gemini, ChatGPT & Llama 3

chatgpt claude embeddings gemini llama3 obsidian obsidian-plugin

Last synced: 18 Dec 2024

https://github.com/scisharp/llamasharp

A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

chatbot gpt llama llama-cpp llama2 llama3 llamacpp llava llm multi-modal semantic-kernel

Last synced: 17 Dec 2024

https://github.com/SciSharp/LLamaSharp

A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

chatbot gpt llama llama-cpp llama2 llama3 llamacpp llava llm multi-modal semantic-kernel

Last synced: 28 Oct 2024

https://github.com/darrenburns/elia

A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.

ai chatgpt claude gemma gpt large-language-models llama llama3 llm mistral mistral-ai mixtral ollama ollama-client ollama-interface phi-3 python terminal tui

Last synced: 17 Dec 2024

https://github.com/qiuyannnn/local-file-organizer

An AI-powered file management tool that ensures privacy by organizing local texts, images. Using Llama3.2 3B and Llava v1.6 models with the Nexa SDK, it intuitively scans, restructures, and organizes files for quick, seamless access and easy retrieval.

file-organizer llama3 llm on-device-ai vlm

Last synced: 19 Dec 2024

https://github.com/ymcui/chinese-llama-alpaca-3

中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3

alpaca large-language-models llama llama-2 llama-3 llama3 llm nlp

Last synced: 19 Dec 2024

https://github.com/ymcui/Chinese-LLaMA-Alpaca-3

中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3

alpaca large-language-models llama llama-2 llama-3 llama3 llm nlp

Last synced: 14 Nov 2024

https://github.com/ozgrozer/ai-renamer

A Node.js CLI that uses Ollama and LM Studio models (Llava, Gemma, Llama etc.) to intelligently rename files by their contents

ai automation cli-tool file-management file-renamer files image-renamer llama3 lm-studio machine-learning ollama openai video-renamer

Last synced: 19 Dec 2024

https://github.com/b4rtaz/distributed-llama

Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.

distributed-computing distributed-llm llama2 llama3 llm llm-inference llms neural-network open-llm

Last synced: 19 Dec 2024

https://github.com/afc163/fanyi

A 🇨🇳 and 🇺🇸 translator in your command line

chinese command-line command-line-tools groq llama3 nodejs translation translator

Last synced: 17 Dec 2024

https://github.com/yusufcanb/tlm

Local CLI Copilot, powered by CodeLLaMa. 💻🦙

bash codellama llama3 llm powershell zsh

Last synced: 19 Dec 2024

https://github.com/mangiucugna/json_repair

A python module to repair invalid JSON, commonly used to parse the output of LLMs

deep-learning gpt-4 json llama3 llm machine-learning mistral openai-api parser repair

Last synced: 17 Dec 2024

https://github.com/bklieger/infinite-bookshelf

Infinite Bookshelf: Generate entire books in seconds using Groq and Llama3

ai groq groq-api llama3

Last synced: 19 Dec 2024

https://github.com/Bklieger/infinite-bookshelf

Infinite Bookshelf: Generate entire books in seconds using Groq and Llama3

ai groq groq-api llama3

Last synced: 14 Dec 2024

https://github.com/serverless/aws-ai-stack

AWS AI Stack – A ready-to-use, full-stack boilerplate project for building serverless AI applications on AWS

aws aws-bedrock aws-lambda claude-ai full-stack llama3 serverless serverless-framework

Last synced: 20 Dec 2024

https://github.com/horseee/llm-pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

baichuan bloom chatglm compression language-model llama llama-2 llama3 llm neurips-2023 pruning pruning-algorithms vicuna

Last synced: 20 Dec 2024

https://github.com/aws-samples/generative-ai-use-cases-jp

すぐに業務活用できるビジネスユースケース集付きの安全な生成AIアプリ実装

aws bedrock chatbot claude claude3 command-r generative-ai image-generation lambda llama2 llama3 llm mistral rag react sagemaker typescript

Last synced: 20 Dec 2024

https://github.com/szczyglis-dev/py-gpt

Desktop AI Assistant powered by o1, GPT-4, GPT-4 Vision, Gemini, Claude, Llama 3, Bielik, DALL-E, Langchain, Llama-index, chat, vision, voice control, image generation and analysis, agents, command execution, file upload/download, speech synthesis and recognition, access to Web, memory, presets, assistants, plugins, and more. Linux, Windows, Mac.

ai ai-assistant artificial-intelligence autonomous-agent bielik chatbot claude dalle-3 desktop-app gemini gpt-4 gpt-4-vision gpt4 langchain llama-index llama3 llm o1 ollama openai

Last synced: 19 Dec 2024

https://github.com/jianzhnie/llamatuner

Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.

chatgpt dpo llama llama3 mixtral ppo qlora qwen rlhf

Last synced: 21 Dec 2024

https://github.com/aidc-ai/ovis

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

chatbot llama3 multimodal multimodal-large-language-models multimodality qwen vision-language-learning vision-language-model

Last synced: 21 Dec 2024

https://github.com/jianzhnie/LLamaTuner

Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.

chatgpt dpo llama llama3 mixtral ppo qlora qwen rlhf

Last synced: 10 Nov 2024

https://github.com/nvlabs/eagle

EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

demo eagle gpt4 huggingface large-language-models llama llama3 llava llm lmm lvlm mllm nvdia

Last synced: 21 Dec 2024

https://github.com/magpie-align/magpie

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

alignment dataset gemma llama2 llama3 llm nlp paper phi3 qwen2 supervised-finetuning synthetic-data synthetic-dataset-generation

Last synced: 21 Dec 2024

https://github.com/bklieger/scribewizard

ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3

ai groq groq-api llama3 replit whisper

Last synced: 21 Dec 2024

https://github.com/NVlabs/EAGLE

EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

demo eagle gpt4 huggingface large-language-models llama llama3 llava llm lmm lvlm mllm nvdia

Last synced: 26 Sep 2024

https://github.com/Bklieger/ScribeWizard

ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3

ai groq groq-api llama3 replit whisper

Last synced: 22 Nov 2024

https://github.com/nrl-ai/llama-assistant

AI-powered assistant to help you with your daily tasks, powered by Llama 3.2. It can recognize your voice, process natural language, and perform various actions based on your commands: summarizing text, rephasing sentences, answering questions, writing emails, and more.

llama llama-3-2 llama3 llava moondream owen personal-assistant private-gpt

Last synced: 15 Dec 2024

https://github.com/rlhflow/online-rlhf

A recipe for online RLHF and online iterative DPO.

llama3 llm rlhf

Last synced: 14 Dec 2024

https://github.com/jakobdylanc/llmcord

A Discord LLM chat bot that supports any OpenAI compatible API (Ollama, LM Studio, vLLM, OpenRouter, xAI, Mistral, Groq and more)

bot chatbot chatgpt discord gpt gpt-4 gpt-4o grok groq llama llama3 llava llm lmstudio mistral ollama oobabooga openai vllm xai

Last synced: 15 Dec 2024

https://github.com/fatwang2/siri-ultra

The most intelligent Siri powered by LLMs

apple groq llama3 llms openai shortcuts siri

Last synced: 27 Sep 2024

https://github.com/vectorch-ai/ScaleLLM

A high-performance inference system for large language models, designed for production environments.

cuda efficiency gpu inference llama llama3 llm llm-inference model performance production serving speculative transformer

Last synced: 16 Nov 2024

https://github.com/vectorch-ai/scalellm

A high-performance inference system for large language models, designed for production environments.

cuda efficiency gpu inference llama llama3 llm llm-inference model performance production serving speculative transformer

Last synced: 20 Dec 2024

https://github.com/xinthedark/raycast-g4f

Raycast extension to use GPT-4, Llama-3, and more... all for FREE. No API Key required!

chatbot chatgpt claude free-gpt gemini gpt gpt-4 gpt-4o gpt4free llama3 raycast-extension

Last synced: 15 Dec 2024

https://github.com/turing-machines/mentals-ai

AI agents in Markdown syntax (loops, memory and tools included)

ai ai-agents artificial-intelligence cli gpt gpt-4o llama3 llm machine-learning openai terminal

Last synced: 15 Dec 2024

https://github.com/InternLM/InternEvo

InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.

910b deepspeed-ulysses flash-attention gemma internlm internlm2 llama3 llava llm-framework llm-training multi-modal pipeline-parallelism pytorch ring-attention sequence-parallelism tensor-parallelism transformers-models zero3

Last synced: 30 Oct 2024

https://github.com/internlm/internevo

InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.

910b deepspeed-ulysses flash-attention gemma internlm internlm2 llama3 llava llm-framework llm-training multi-modal pipeline-parallelism pytorch ring-attention sequence-parallelism tensor-parallelism transformers-models zero3

Last synced: 14 Dec 2024

https://github.com/vietanhdev/llama-assistant

AI-powered assistant to help you with your daily tasks, powered by Llama 3.2. It can recognize your voice, process natural language, and perform various actions based on your commands: summarizing text, rephasing sentences, answering questions, writing emails, and more.

llama llama-3-2 llama3 llava moondream owen personal-assistant private-gpt

Last synced: 14 Oct 2024

https://github.com/strvm/meta-ai-api

Llama 3 API 70B & 405B (MetaAI Reverse Engineered)

405b 70b ai api llama llama2 llama3 meta

Last synced: 21 Dec 2024

https://github.com/ai-commandos/llama2lang

Convenience scripts to finetune (chat-)LLaMa3 and other models for any language

ai genai huggingface llama2 llama3 llm mistral

Last synced: 15 Dec 2024

https://github.com/developersdigest/ai-devices

AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more

function-calling gpt-4-vision groq langchain langsmith llama3 llava llm openai serper tts whisper

Last synced: 16 Dec 2024

https://github.com/jakobdylanc/llmcord.py

A Discord LLM chat bot that supports any OpenAI compatible API. Run a local model with ollama, oobabooga, Jan and more

ai bot chatbot chatgpt discord gpt gpt-4 gpt-4o groq litellm llama llama3 llava llm llmcord lmstudio mistral ollama oobabooga openai

Last synced: 10 Oct 2024

https://github.com/nisaaragharia/advanced_rag

Advanced Retrieval-Augmented Generation (RAG) through practical notebooks, using the power of the Langchain, OpenAI GPTs ,META LLAMA3 ,Agents.

agent agents ai chatgpt genai langchain llama3 llm machine-learning nlp openai rag retrival-augmented vectordb

Last synced: 17 Dec 2024

https://github.com/ollama4j/ollama4j

A simple Java library for interacting with Ollama server.

gen-ai genai generative-ai gpt java language-model large-language-models llama llama2 llama3 llm meta-llama ollama

Last synced: 16 Dec 2024

https://github.com/zjhellofss/kuiperllama

校招、秋招、春招、实习好项目,带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。

cpp cuda inference-engine llama2 llama3 llm llm-inference qwen qwen2

Last synced: 16 Dec 2024

https://github.com/mbzuai-oryx/videogpt-plus

Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding

chatbot clip dual-encoder gpt4 gpt4o image-encoder llama3 llava multimodal phi-3-mini vicuna video-chatbot video-conversation video-encoder vision-language vision-language-pretraining

Last synced: 20 Dec 2024

https://github.com/picomlx/picomlxserver

The easiest way to run the fastest MLX-based LLMs locally

ai llama3 mlx ollama openai openai-api

Last synced: 20 Dec 2024

https://github.com/mbzuai-oryx/VideoGPT-plus

Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding

chatbot clip dual-encoder gpt4 gpt4o image-encoder llama3 llava multimodal phi-3-mini vicuna video-chatbot video-conversation video-encoder vision-language vision-language-pretraining

Last synced: 12 Dec 2024

https://github.com/jose-donato/ollama-reply

open-source browser extension that leverages the power of the AI to generate engaging replies for social media growth.

browser-extension llama3 ollama tailwindcss

Last synced: 17 Dec 2024

https://github.com/empower-ai/empower-functions

GPT-4 level function calling models for real-world tool using use cases

ai function-calling llama3 llm mixtral

Last synced: 15 Dec 2024

https://github.com/alexfazio/openplexity-pages

SearchGPT / Perplexity Pages clone, but personalised for you.

crewai groq llama3 search-engine streamlit

Last synced: 18 Dec 2024

https://github.com/aws-samples/foundation-model-benchmarking-tool

Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stack options.

bedrock benchmark benchmarking evaluation-metrics foundation-models g5 g6 g6e generative-ai inferentia llama2 llama3 p4d p5 sagemaker trainium

Last synced: 21 Dec 2024

https://github.com/PicoMLX/PicoMLXServer

The easiest way to run the fastest MLX-based LLMs locally

ai llama3 mlx ollama openai openai-api

Last synced: 11 Nov 2024

https://github.com/agents-flex/agents-flex

Agents-Flex is an elegant LLM Application Framework like LangChain with Java.

agent ai chatbot chatgpt gpt langchain4j llama3 llm ollama spring-ai

Last synced: 21 Dec 2024