Projects in Awesome Lists tagged with llama3

https://github.com/ollama/ollama

Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.

gemma gemma2 go golang llama llama2 llama3 llava llm llms mistral ollama phi3

Last synced: 16 Dec 2024

https://github.com/langgenius/dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

agent ai anthropic backend-as-a-service chatbot gemini genai gpt gpt-4 llama3 llm llmops nextjs openai orchestration python rag workflow workflows

Last synced: 16 Dec 2024

https://github.com/hiyouga/llama-factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

agent ai chatglm fine-tuning gpt instruction-tuning language-model large-language-models llama llama3 llm lora mistral moe peft qlora quantization qwen rlhf transformers

Last synced: 16 Dec 2024

https://github.com/hiyouga/LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

agent ai chatglm fine-tuning gpt instruction-tuning language-model large-language-models llama llama3 llm lora mistral moe peft qlora quantization qwen rlhf transformers

Last synced: 25 Oct 2024

https://github.com/hiyouga/LLaMA-Efficient-Tuning

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

agent ai chatglm fine-tuning gpt instruction-tuning language-model large-language-models llama llama3 llm lora mistral moe peft qlora quantization qwen rlhf transformers

Last synced: 05 Sep 2024

https://github.com/mintplex-labs/anything-llm

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.

agent-framework-javascript ai-agents crewai custom-ai-agents desktop-app llama3 llm llm-application llm-webui lmstudio local-llm localai multimodal nodejs ollama rag vector-database webui

Last synced: 16 Dec 2024

https://github.com/go-skynet/LocalAI

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed inference

ai api audio-generation distributed gemma gpt4all image-generation kubernetes llama llama3 llm mamba mistral musicgen p2p rerank rwkv stable-diffusion text-generation tts

Last synced: 09 Nov 2024

https://github.com/mudler/localai

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed inference

ai api audio-generation distributed gemma gpt4all image-generation kubernetes llama llama3 llm mamba mistral musicgen p2p rerank rwkv stable-diffusion text-generation tts

Last synced: 16 Dec 2024

https://github.com/mudler/LocalAI

:robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.

ai api audio-generation distributed gemma gpt4all image-generation kubernetes llama llama3 llm mamba mistral musicgen p2p rerank rwkv stable-diffusion text-generation tts

Last synced: 25 Oct 2024

https://github.com/unslothai/unsloth

Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory

ai fine-tuning finetuning gemma gemma2 llama llama3 llm llms lora mistral phi3 qlora unsloth

Last synced: 16 Dec 2024

https://github.com/Mintplex-Labs/anything-llm

The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.

ai-agents chromadb crewai crewaiui desktop-app llama3 llamacpp llm llm-application llm-webui lmstudio local-llm localai ollama rag vector-database webui

Last synced: 27 Oct 2024

https://github.com/scrapegraphai/scrapegraph-ai

Python scraper based on AI

ai automated-scraper gpt-3 gpt-4 llama3 llm machine-learning sc scraping scraping-python scrapingweb webscraping

Last synced: 16 Dec 2024

https://github.com/llamafamily/llama-chinese

Llama中文社区，Llama3在线体验和微调模型已开放，实时汇总最新Llama3学习资料，已将所有代码更新适配Llama3，构建最好的中文Llama大模型，完全开源可商用

finetune-llm llama llama3 llm pretraining

Last synced: 16 Dec 2024

https://github.com/ScrapeGraphAI/Scrapegraph-ai

Python scraper based on AI

automated-scraper gpt-3 gpt-4 llama3 llm machine-learning sc scraping scraping-python scrapingweb

Last synced: 05 Oct 2024

https://github.com/khoj-ai/khoj

Your AI second brain. Self-hostable. Get answers from the internet or your docs. Use any online or local LLM (e.g gpt, claude, gemini, llama, qwen, mistral). Build custom agents, personalized automations.

agent ai assistant chat chatgpt emacs image-generation llama3 llamacpp llm obsidian obsidian-md offline-llm productivity rag research self-hosted semantic-search stt whatsapp-ai

Last synced: 16 Dec 2024

https://github.com/LlamaFamily/Llama-Chinese

Llama中文社区，Llama3在线体验和微调模型已开放，实时汇总最新Llama3学习资料，已将所有代码更新适配Llama3，构建最好的中文Llama大模型，完全开源可商用

finetune-llm llama llama3 llm pretraining

Last synced: 25 Oct 2024

https://github.com/datawhalechina/self-llm

《开源大模型食用指南》基于Linux环境快速部署开源大模型，更适合中国宝宝的部署教程

chatglm chatglm3 gemma-2b-it glm-4 internlm2 llama3 llm lora minicpm q-wen qwen qwen1-5 qwen2

Last synced: 16 Dec 2024

https://github.com/sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

cuda inference llama llama2 llama3 llama3-1 llava llm llm-serving moe pytorch transformer vlm

Last synced: 16 Dec 2024

https://github.com/yangjianxin1/firefly

Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

alpaca aquila baichuan chatglm gemma gpt internlm llama llama2 llama3 llm lora minicpm mistral mixtral peft qlora qwen qwen2 zephyr

Last synced: 17 Dec 2024

https://github.com/yangjianxin1/Firefly

Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

alpaca aquila baichuan chatglm gemma gpt internlm llama llama2 llama3 llm lora minicpm mistral mixtral peft qlora qwen qwen2 zephyr

Last synced: 27 Oct 2024

https://github.com/xorbitsai/inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

artificial-intelligence chatglm deployment flan-t5 gemma ggml glm4 inference llama llama3 llamacpp llm machine-learning mistral openai-api pytorch qwen vllm whisper wizardlm

Last synced: 16 Dec 2024

https://github.com/modelscope/agentscope

Start building LLM-empowered multi-agent applications in an easier way.

agent chatbot distributed-agents drag-and-drop gpt-4 gpt-4o large-language-models llama3 llm llm-agent multi-agent multi-modal

Last synced: 18 Dec 2024

https://github.com/internlm/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

codellama cuda-kernels deepspeed fastertransformer internlm llama llama2 llama3 llm llm-inference turbomind

Last synced: 16 Dec 2024

https://github.com/InternLM/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

codellama cuda-kernels deepspeed fastertransformer internlm llama llama2 llama3 llm llm-inference turbomind

Last synced: 28 Oct 2024

https://github.com/internlm/opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

benchmark chatgpt evaluation large-language-model llama2 llama3 llm openai

Last synced: 13 Dec 2024

https://github.com/open-compass/opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

benchmark chatgpt evaluation large-language-model llama2 llama3 llm openai

Last synced: 16 Dec 2024

https://github.com/open-compass/OpenCompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

benchmark chatgpt evaluation large-language-model llama2 llama3 llm openai

Last synced: 04 Dec 2024

https://github.com/crazyboym/llama3-chinese-chat

Llama3、Llama3.1 中文仓库（随书籍撰写中... 各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档）

llama llama2 llama3 llama3-chinese llama3-finetune

Last synced: 17 Dec 2024

https://github.com/modelscope/ms-swift

Use PEFT or Full-parameter to finetune 350+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)

agent deploy dpo internvl liger llama llama3 llava llm lora megatron minicpm-v modelscope multimodal peft pre-training qwen2 qwen2-vl reflection sft

Last synced: 17 Dec 2024

https://github.com/internlm/xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

agent baichuan chatbot chatglm2 chatglm3 conversational-ai internlm large-language-models llama2 llama3 llava llm llm-training mixtral msagent peft phi3 qwen supervised-finetuning

Last synced: 16 Dec 2024

https://github.com/CrazyBoyM/llama3-Chinese-chat

Llama3、Llama3.1 中文仓库（随书籍撰写中... 各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档）

llama llama2 llama3 llama3-chinese llama3-finetune

Last synced: 29 Oct 2024

https://github.com/InternLM/xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

agent baichuan chatbot chatglm2 chatglm3 conversational-ai internlm large-language-models llama2 llama3 llava llm llm-training mixtral msagent peft phi3 qwen supervised-finetuning

Last synced: 28 Oct 2024

https://github.com/linkedin/Liger-Kernel

Efficient Triton Kernels for LLM Training

finetuning gemma2 llama llama3 llm-training llms mistral phi3 triton triton-kernels

Last synced: 20 Dec 2024

https://github.com/linkedin/liger-kernel

Efficient Triton Kernels for LLM Training

finetuning gemma2 llama llama3 llm-training llms mistral phi3 triton triton-kernels

Last synced: 16 Dec 2024

https://github.com/entropy-research/devon

Devon: An open-source pair programmer

agent agent-based-framework agent-based-model ai ai-developer ai-software ai-software-engineer code-assistant code-generation developer-tool developer-tools gpt-4 gpt-4o groq llama3 ollama vscode

Last synced: 17 Dec 2024

https://github.com/brianpetro/obsidian-smart-connections

Chat with your notes & see links to related content with AI embeddings. Use local models or 100+ via APIs like Claude, Gemini, ChatGPT & Llama 3

chatgpt claude embeddings gemini llama3 obsidian obsidian-plugin

Last synced: 18 Dec 2024

https://github.com/scisharp/llamasharp

A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

chatbot gpt llama llama-cpp llama2 llama3 llamacpp llava llm multi-modal semantic-kernel

Last synced: 17 Dec 2024

https://github.com/entropy-research/Devon

Devon: An open-source pair programmer

agent agent-based-framework agent-based-model ai ai-developer ai-software ai-software-engineer code-assistant code-generation developer-tool developer-tools gpt-4 gpt-4o groq llama3 ollama vscode

Last synced: 04 Nov 2024

https://github.com/SciSharp/LLamaSharp

A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

chatbot gpt llama llama-cpp llama2 llama3 llamacpp llava llm multi-modal semantic-kernel

Last synced: 28 Oct 2024

https://github.com/darrenburns/elia

A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.

ai chatgpt claude gemma gpt large-language-models llama llama3 llm mistral mistral-ai mixtral ollama ollama-client ollama-interface phi-3 python terminal tui

Last synced: 17 Dec 2024

https://github.com/run-llama/llamaindexts

Data framework for your LLM applications. Focus on server side solution

agent anthr chatbot claude claude-ai create-llama embedding firewo groq-ai javascript llama llama-index llama2 llama3 llm mistr nodejs openai react typescript

Last synced: 17 Dec 2024

https://github.com/qiuyannnn/local-file-organizer

An AI-powered file management tool that ensures privacy by organizing local texts, images. Using Llama3.2 3B and Llava v1.6 models with the Nexa SDK, it intuitively scans, restructures, and organizes files for quick, seamless access and easy retrieval.

file-organizer llama3 llm on-device-ai vlm

Last synced: 19 Dec 2024

https://github.com/ymcui/chinese-llama-alpaca-3

中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3

alpaca large-language-models llama llama-2 llama-3 llama3 llm nlp

Last synced: 19 Dec 2024

https://github.com/ymcui/Chinese-LLaMA-Alpaca-3

中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3

alpaca large-language-models llama llama-2 llama-3 llama3 llm nlp

Last synced: 14 Nov 2024

https://github.com/ozgrozer/ai-renamer

A Node.js CLI that uses Ollama and LM Studio models (Llava, Gemma, Llama etc.) to intelligently rename files by their contents

ai automation cli-tool file-management file-renamer files image-renamer llama3 lm-studio machine-learning ollama openai video-renamer

Last synced: 19 Dec 2024

https://github.com/b4rtaz/distributed-llama

Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.

distributed-computing distributed-llm llama2 llama3 llm llm-inference llms neural-network open-llm

Last synced: 19 Dec 2024

https://github.com/afc163/fanyi

A 🇨🇳 and 🇺🇸 translator in your command line

chinese command-line command-line-tools groq llama3 nodejs translation translator

Last synced: 17 Dec 2024

https://github.com/run-llama/LlamaIndexTS

LlamaIndex is a data framework for your LLM applications

agent anthr chatbot claude claude-ai create-llama embedding firewo groq-ai javascript llama llama-index llama2 llama3 llm mistr nodejs openai react typescript

Last synced: 28 Oct 2024

https://github.com/kevinhermawan/ollamac

Mac app for Ollama

ai llama3 llm llma llma2 llms macos mistral mixtral ollama

Last synced: 19 Dec 2024

https://github.com/kevinhermawan/Ollamac

Mac app for Ollama

ai llama3 llm llma llma2 llms macos mistral mixtral ollama

Last synced: 30 Oct 2024

https://github.com/yusufcanb/tlm

Local CLI Copilot, powered by CodeLLaMa. 💻🦙

bash codellama llama3 llm powershell zsh

Last synced: 19 Dec 2024

https://github.com/mangiucugna/json_repair

A python module to repair invalid JSON, commonly used to parse the output of LLMs

deep-learning gpt-4 json llama3 llm machine-learning mistral openai-api parser repair

Last synced: 17 Dec 2024

https://github.com/bklieger/infinite-bookshelf

Infinite Bookshelf: Generate entire books in seconds using Groq and Llama3

ai groq groq-api llama3

Last synced: 19 Dec 2024

https://github.com/Bklieger/infinite-bookshelf

Infinite Bookshelf: Generate entire books in seconds using Groq and Llama3

ai groq groq-api llama3

Last synced: 14 Dec 2024

https://github.com/serverless/aws-ai-stack

AWS AI Stack – A ready-to-use, full-stack boilerplate project for building serverless AI applications on AWS

aws aws-bedrock aws-lambda claude-ai full-stack llama3 serverless serverless-framework

Last synced: 20 Dec 2024

https://github.com/horseee/llm-pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

baichuan bloom chatglm compression language-model llama llama-2 llama3 llm neurips-2023 pruning pruning-algorithms vicuna

Last synced: 20 Dec 2024

https://github.com/mbzuai-oryx/LLaVA-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

conversation llama-3-llava llama-3-vision llama3 llama3-llava llama3-vision llava llava-llama3 llava-phi3 llm lmms phi-3-llava phi-3-vision phi3 phi3-llava phi3-vision vision-language

Last synced: 08 Nov 2024

https://github.com/mbzuai-oryx/llava-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

conversation llama-3-llava llama-3-vision llama3 llama3-llava llama3-vision llava llava-llama3 llava-phi3 llm lmms phi-3-llava phi-3-vision phi3 phi3-llava phi3-vision vision-language

Last synced: 20 Dec 2024

https://github.com/aws-samples/generative-ai-use-cases-jp

すぐに業務活用できるビジネスユースケース集付きの安全な生成AIアプリ実装

aws bedrock chatbot claude claude3 command-r generative-ai image-generation lambda llama2 llama3 llm mistral rag react sagemaker typescript

Last synced: 20 Dec 2024

https://github.com/mariocandela/beelzebub

A secure low code honeypot framework, leveraging AI for System Virtualization.

cloudnative cloudsecurity cybersecurity framework go golang honeypot kubernetes llama3 llm llm-honeypot llm-security low-code ollama openai research research-project security whitehat

Last synced: 20 Dec 2024

https://github.com/szczyglis-dev/py-gpt

Desktop AI Assistant powered by o1, GPT-4, GPT-4 Vision, Gemini, Claude, Llama 3, Bielik, DALL-E, Langchain, Llama-index, chat, vision, voice control, image generation and analysis, agents, command execution, file upload/download, speech synthesis and recognition, access to Web, memory, presets, assistants, plugins, and more. Linux, Windows, Mac.

ai ai-assistant artificial-intelligence autonomous-agent bielik chatbot claude dalle-3 desktop-app gemini gpt-4 gpt-4-vision gpt4 langchain llama-index llama3 llm o1 ollama openai

Last synced: 19 Dec 2024

https://github.com/mukel/llama3.java

Practical Llama 3 inference in Java

chatgpt genai gguf huggingface java llama llama3 llamacpp llm llm-inference llms openai simd transformers

Last synced: 21 Dec 2024

https://github.com/jianzhnie/llamatuner

Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.

chatgpt dpo llama llama3 mixtral ppo qlora qwen rlhf

Last synced: 21 Dec 2024

https://github.com/aidc-ai/ovis

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

chatbot llama3 multimodal multimodal-large-language-models multimodality qwen vision-language-learning vision-language-model

Last synced: 21 Dec 2024

https://github.com/jianzhnie/LLamaTuner

Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.

chatgpt dpo llama llama3 mixtral ppo qlora qwen rlhf

Last synced: 10 Nov 2024

https://github.com/nvlabs/eagle

EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

demo eagle gpt4 huggingface large-language-models llama llama3 llava llm lmm lvlm mllm nvdia

Last synced: 21 Dec 2024

https://github.com/magpie-align/magpie

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

alignment dataset gemma llama2 llama3 llm nlp paper phi3 qwen2 supervised-finetuning synthetic-data synthetic-dataset-generation

Last synced: 21 Dec 2024

https://github.com/bklieger/scribewizard

ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3

ai groq groq-api llama3 replit whisper

Last synced: 21 Dec 2024

https://github.com/NVlabs/EAGLE

EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

demo eagle gpt4 huggingface large-language-models llama llama3 llava llm lmm lvlm mllm nvdia

Last synced: 26 Sep 2024

https://github.com/Bklieger/ScribeWizard

ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3

ai groq groq-api llama3 replit whisper

Last synced: 22 Nov 2024

https://github.com/nrl-ai/llama-assistant

AI-powered assistant to help you with your daily tasks, powered by Llama 3.2. It can recognize your voice, process natural language, and perform various actions based on your commands: summarizing text, rephasing sentences, answering questions, writing emails, and more.

llama llama-3-2 llama3 llava moondream owen personal-assistant private-gpt

Last synced: 15 Dec 2024

https://github.com/rlhflow/online-rlhf

A recipe for online RLHF and online iterative DPO.

llama3 llm rlhf

Last synced: 14 Dec 2024

https://github.com/jakobdylanc/llmcord

A Discord LLM chat bot that supports any OpenAI compatible API (Ollama, LM Studio, vLLM, OpenRouter, xAI, Mistral, Groq and more)

bot chatbot chatgpt discord gpt gpt-4 gpt-4o grok groq llama llama3 llava llm lmstudio mistral ollama oobabooga openai vllm xai

Last synced: 15 Dec 2024

https://github.com/voxos-ai/bolna

End-to-end platform for building voice first multimodal agents

anyscale chatgpt-api claude-3-sonnet deepgram elevenlabs fastapi gpt-4o llama3 llm mistral openai perplexity-api polly telephony twilio voice-assistant websocket-chat websockets whisper xtts

Last synced: 21 Dec 2024

https://github.com/bolna-ai/bolna

End-to-end platform for building voice first multimodal agents

anyscale chatgpt-api claude-3-sonnet deepgram elevenlabs fastapi gpt-4o llama3 llm mistral openai perplexity-api polly telephony twilio voice-assistant websocket-chat websockets whisper xtts

Last synced: 20 Dec 2024

https://github.com/fatwang2/siri-ultra

The most intelligent Siri powered by LLMs

apple groq llama3 llms openai shortcuts siri

Last synced: 27 Sep 2024

https://github.com/vectorch-ai/ScaleLLM

A high-performance inference system for large language models, designed for production environments.

cuda efficiency gpu inference llama llama3 llm llm-inference model performance production serving speculative transformer

Last synced: 16 Nov 2024

https://github.com/vectorch-ai/scalellm

A high-performance inference system for large language models, designed for production environments.

cuda efficiency gpu inference llama llama3 llm llm-inference model performance production serving speculative transformer

Last synced: 20 Dec 2024

https://github.com/xinthedark/raycast-g4f

Raycast extension to use GPT-4, Llama-3, and more... all for FREE. No API Key required!

chatbot chatgpt claude free-gpt gemini gpt gpt-4 gpt-4o gpt4free llama3 raycast-extension

Last synced: 15 Dec 2024

https://github.com/turing-machines/mentals-ai

AI agents in Markdown syntax (loops, memory and tools included)

ai ai-agents artificial-intelligence cli gpt gpt-4o llama3 llm machine-learning openai terminal

Last synced: 15 Dec 2024

https://github.com/InternLM/InternEvo

InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.

910b deepspeed-ulysses flash-attention gemma internlm internlm2 llama3 llava llm-framework llm-training multi-modal pipeline-parallelism pytorch ring-attention sequence-parallelism tensor-parallelism transformers-models zero3

Last synced: 30 Oct 2024

https://github.com/internlm/internevo

InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.

910b deepspeed-ulysses flash-attention gemma internlm internlm2 llama3 llava llm-framework llm-training multi-modal pipeline-parallelism pytorch ring-attention sequence-parallelism tensor-parallelism transformers-models zero3

Last synced: 14 Dec 2024

https://github.com/vietanhdev/llama-assistant

AI-powered assistant to help you with your daily tasks, powered by Llama 3.2. It can recognize your voice, process natural language, and perform various actions based on your commands: summarizing text, rephasing sentences, answering questions, writing emails, and more.

llama llama-3-2 llama3 llava moondream owen personal-assistant private-gpt