Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with llama2

A curated list of projects in awesome lists tagged with llama2.

https://github.com/ollama/ollama

Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.

gemma gemma2 go golang llama llama2 llama3 llava llm llms mistral ollama phi3

Last synced: 16 Dec 2024

https://github.com/janhq/jan

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)

electron gpt llama2 llamacpp localai self-hosted

Last synced: 16 Dec 2024

https://github.com/haotian-liu/llava

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

chatbot chatgpt foundation-models gpt-4 instruction-tuning llama llama-2 llama2 llava multi-modality multimodal vision-language-model visual-language-learning

Last synced: 16 Dec 2024

https://github.com/haotian-liu/LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

chatbot chatgpt foundation-models gpt-4 instruction-tuning llama llama-2 llama2 llava multi-modality multimodal vision-language-model visual-language-learning

Last synced: 25 Oct 2024

https://github.com/meta-llama/llama-recipes

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.

ai finetuning langchain llama llama2 llm machine-learning python pytorch vllm

Last synced: 16 Dec 2024

https://github.com/h2oai/h2ogpt

Private chat with local GPT with documents, images, video, etc. 100% private, Apache 2.0. Supports Ollama, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/

ai chatgpt embeddings generative gpt gpt4all llama2 llm mixtral pdf private privategpt vectorstore

Last synced: 16 Dec 2024

https://github.com/getumbrel/llama-gpt

A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!

ai chatgpt code-llama codellama gpt gpt-4 gpt4all llama llama-2 llama-cpp llama2 llamacpp llm localai openai self-hosted

Last synced: 17 Dec 2024

https://github.com/bentoml/openllm

Run any open-source LLM, such as Llama 3.1 or Gemma, as an OpenAI-compatible API endpoint in the cloud.

bentoml fine-tuning llama llama2 llama3-1 llama3-2 llama3-2-vision llm llm-inference llm-ops llm-serving llmops mistral mlops model-inference open-source-llm openllm vicuna

Last synced: 16 Dec 2024

https://github.com/bentoml/OpenLLM

Run any open-source LLM, such as Llama 2 or Mistral, as an OpenAI-compatible API endpoint, locally and in the cloud.

ai bentoml falcon fine-tuning llama llama2 llm llm-inference llm-ops llm-serving llmops mistral ml mlops model-inference mpt open-source-llm openllm stablelm vicuna

Last synced: 26 Oct 2024

https://github.com/ymcui/chinese-llama-alpaca-2

Chinese LLaMA-2 & Alpaca-2 LLMs, phase two of the project, with 64K long context models

64k alpaca alpaca-2 alpaca2 flash-attention large-language-models llama llama-2 llama2 llm nlp rlhf yarn

Last synced: 17 Dec 2024

https://github.com/ymcui/Chinese-LLaMA-Alpaca-2

Chinese LLaMA-2 & Alpaca-2 LLMs, phase two of the project, with 64K long context models

64k alpaca alpaca-2 alpaca2 flash-attention large-language-models llama llama-2 llama2 llm nlp rlhf yarn

Last synced: 29 Oct 2024

https://github.com/sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

cuda inference llama llama2 llama3 llama3-1 llava llm llm-serving moe pytorch transformer vlm

Last synced: 16 Dec 2024

https://github.com/yangjianxin1/firefly

Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

alpaca aquila baichuan chatglm gemma gpt internlm llama llama2 llama3 llm lora minicpm mistral mixtral peft qlora qwen qwen2 zephyr

Last synced: 17 Dec 2024

https://github.com/yangjianxin1/Firefly

Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

alpaca aquila baichuan chatglm gemma gpt internlm llama llama2 llama3 llm lora minicpm mistral mixtral peft qlora qwen qwen2 zephyr

Last synced: 27 Oct 2024

https://github.com/internlm/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

codellama cuda-kernels deepspeed fastertransformer internlm llama llama2 llama3 llm llm-inference turbomind

Last synced: 16 Dec 2024

https://github.com/InternLM/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

codellama cuda-kernels deepspeed fastertransformer internlm llama llama2 llama3 llm llm-inference turbomind

Last synced: 28 Oct 2024

https://github.com/internlm/opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

benchmark chatgpt evaluation large-language-model llama2 llama3 llm openai

Last synced: 13 Dec 2024

https://github.com/open-compass/opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

benchmark chatgpt evaluation large-language-model llama2 llama3 llm openai

Last synced: 16 Dec 2024

https://github.com/open-compass/OpenCompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

benchmark chatgpt evaluation large-language-model llama2 llama3 llm openai

Last synced: 04 Dec 2024

https://github.com/baichuan-inc/baichuan2

A series of large language models developed by Baichuan Intelligent Technology

artificial-intelligence benchmark ceval chatgpt chinese gpt gpt-4 huggingface large-language-models llama2 mmlu natural-language-processing

Last synced: 17 Dec 2024

https://github.com/baichuan-inc/Baichuan2

A series of large language models developed by Baichuan Intelligent Technology

artificial-intelligence benchmark ceval chatgpt chinese gpt gpt-4 huggingface large-language-models llama2 mmlu natural-language-processing

Last synced: 02 Nov 2024

https://github.com/crazyboym/llama3-chinese-chat

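Llama3 and Llama3.1 Chinese repository (an accompanying book is being written... interesting weights from fine-tuned and heavily modified versions by community members and vendors, plus tutorial videos and documentation on training, inference, evaluation, and deployment)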

llama llama2 llama3 llama3-chinese llama3-finetune

Last synced: 17 Dec 2024

https://github.com/h2oai/h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/

ai chatbot chatgpt fine-tuning finetuning generative generative-ai gpt llama llama2 llm llm-training

Last synced: 17 Dec 2024

https://github.com/internlm/xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

agent baichuan chatbot chatglm2 chatglm3 conversational-ai internlm large-language-models llama2 llama3 llava llm llm-training mixtral msagent peft phi3 qwen supervised-finetuning

Last synced: 16 Dec 2024

https://github.com/CrazyBoyM/llama3-Chinese-chat

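Llama3 and Llama3.1 Chinese repository (an accompanying book is being written... interesting weights from fine-tuned and heavily modified versions by community members and vendors, plus tutorial videos and documentation on training, inference, evaluation, and deployment)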

llama llama2 llama3 llama3-chinese llama3-finetune

Last synced: 29 Oct 2024

https://github.com/augustdev/enchanted

Enchanted is an iOS and macOS app for chatting with private, self-hosted language models such as Llama2, Mistral, or Vicuna using Ollama.

ios large-language-model llama llama2 llm mistral ollama ollama-app swift

Last synced: 17 Dec 2024

https://github.com/InternLM/xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

agent baichuan chatbot chatglm2 chatglm3 conversational-ai internlm large-language-models llama2 llama3 llava llm llm-training mixtral msagent peft phi3 qwen supervised-finetuning

Last synced: 28 Oct 2024

https://github.com/AugustDev/enchanted

Enchanted is an iOS and macOS app for chatting with private, self-hosted language models such as Llama2, Mistral, or Vicuna using Ollama.

ios large-language-model llama llama2 llm mistral ollama ollama-app swift

Last synced: 05 Nov 2024

https://github.com/higgsfield-ai/higgsfield

Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters

cluster-management deep-learning distributed llama llama2 llm machine-learning mlops pytorch

Last synced: 17 Dec 2024

https://github.com/higgsfield/higgsfield

Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters

cluster-management deep-learning distributed llama llama2 llm machine-learning mlops pytorch

Last synced: 09 Nov 2024

https://github.com/wenge-research/yayi

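YaYi LLM: secure and reliable custom large models built for customers. A series of LlaMA 2 & BLOOM based models trained on large-scale Chinese and English multi-domain instruction data, developed by the Wenge (中科闻歌) algorithm team. (Repo for YaYi Chinese LLMs based on LlaMA2 & BLOOM)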

bloom chat chinese llama llama2 llm lora yayi

Last synced: 20 Dec 2024

https://github.com/wenge-research/YaYi

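YaYi LLM: secure and reliable custom large models built for customers. A series of LlaMA 2 & BLOOM based models trained on large-scale Chinese and English multi-domain instruction data, developed by the Wenge (中科闻歌) algorithm team. (Repo for YaYi Chinese LLMs based on LlaMA2 & BLOOM)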

bloom chat chinese llama llama2 llm lora yayi

Last synced: 02 Nov 2024

https://github.com/rjmacarthy/twinny

The most no-nonsense, locally or API-hosted AI code completion plugin for Visual Studio Code - like GitHub Copilot but completely free and 100% private.

artificial-intelligence code-chat code-completion code-generation codellama copilot free llama2 llamacpp ollama ollama-api ollama-chat private symmetry vscode-extension

Last synced: 12 Nov 2024

https://github.com/twinnydotdev/twinny

The most no-nonsense, locally or API-hosted AI code completion plugin for Visual Studio Code - like GitHub Copilot but completely free and 100% private.

artificial-intelligence code-chat code-completion code-generation codellama copilot free llama2 llamacpp ollama ollama-api ollama-chat private symmetry vscode-extension

Last synced: 18 Dec 2024

https://github.com/scisharp/llamasharp

A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

chatbot gpt llama llama-cpp llama2 llama3 llamacpp llava llm multi-modal semantic-kernel

Last synced: 17 Dec 2024

https://github.com/SciSharp/LLamaSharp

A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

chatbot gpt llama llama-cpp llama2 llama3 llamacpp llava llm multi-modal semantic-kernel

Last synced: 28 Oct 2024

https://github.com/xusenlinzy/api-for-open-llm

OpenAI-style API for open large language models: use LLMs just like ChatGPT! Supports LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3, etc. A unified backend interface for open-source large models.

baichuan chatglm code-llama docker internlm langchain llama llama2 llms nlp openai qwen sqlcoder xverse

Last synced: 18 Dec 2024

https://github.com/dicklesworthstone/llm_aided_ocr

Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.

ai-assist llama2 llm ocr ocr-correction tesseract

Last synced: 19 Dec 2024

https://github.com/tigerresearch/tigerbot

TigerBot: A multi-language multi-task LLM

chinese data llama2 llm nlp

Last synced: 19 Dec 2024

https://github.com/TigerResearch/TigerBot

TigerBot: A multi-language multi-task LLM

chinese data llama2 llm nlp

Last synced: 05 Nov 2024

https://github.com/LinkSoul-AI/Chinese-Llama-2-7b

The first downloadable and runnable Chinese LLaMA2 model in the open-source community!

deep-learning llama2 llama2-docker llm pytorch

Last synced: 25 Oct 2024

https://github.com/linksoul-ai/chinese-llama-2-7b

The first downloadable and runnable Chinese LLaMA2 model in the open-source community!

deep-learning llama2 llama2-docker llm pytorch

Last synced: 20 Dec 2024

https://github.com/semanser/codel

✨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.

agent ai autonomous-agents bot devin llama2 llms ollama openai

Last synced: 20 Dec 2024

https://github.com/semanser/codel?tab=readme-ov-file

✨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.

agent ai autonomous-agents bot devin llama2 llms ollama openai

Last synced: 12 Nov 2024

https://github.com/chenking2020/findthechatgpter

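ChatGPT's explosive popularity marked a key step toward AGI. This project aims to collect open-source alternatives to ChatGPT, including text LLMs and multimodal LLMs, as a convenience for everyone.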

agi alpaca autogpt baichuan belle ceval chatglm chatgpt codi guanaco learderboard linly llama llama2 llava lora minigpt4 self-instruct vicuna wizadlm

Last synced: 21 Dec 2024

https://github.com/chenking2020/FindTheChatGPTer

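ChatGPT's explosive popularity marked a key step toward AGI. This project aims to collect open-source alternatives to ChatGPT, including text LLMs and multimodal LLMs, as a convenience for everyone.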

agi alpaca autogpt baichuan belle ceval chatglm chatgpt codi guanaco learderboard linly llama llama2 llava lora minigpt4 self-instruct vicuna wizadlm

Last synced: 16 Nov 2024

https://github.com/liltom-eth/llama2-webui

Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend for Generative Agents/Apps.

llama-2 llama2 llm llm-inference

Last synced: 19 Dec 2024

https://github.com/ray-project/llm-applications

A comprehensive guide to building RAG-based LLM applications for production.

anyscale fine-tuning llama2 llms machine-learning openai ray serving

Last synced: 19 Dec 2024

https://github.com/melih-unsal/demogpt

🤖 Everything you need to create an LLM Agent—tools, prompts, frameworks, and models—all in one place.

agent agents ai artificial-intelligence autogpt autonomous-agents chatgpt chatgpt-api demo gpt-4 gpt3-turbo langchain langchain-app langchain-python llama2 llms openai python streamlit streamlit-application

Last synced: 17 Dec 2024

https://github.com/melih-unsal/DemoGPT

🤖 Everything you need to create an LLM Agent—tools, prompts, frameworks, and models—all in one place.

agent agents ai artificial-intelligence autogpt autonomous-agents chatgpt chatgpt-api demo gpt-4 gpt3-turbo langchain langchain-app langchain-python llama2 llms openai python streamlit streamlit-application

Last synced: 06 Nov 2024

https://github.com/Dicklesworthstone/llm_aided_ocr

Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.

ai-assist llama2 llm ocr ocr-correction tesseract

Last synced: 31 Aug 2024

https://github.com/smallcloudai/refact

WebUI for Fine-Tuning and Self-hosting of Open-Source Large Language Models for Coding

ai autocompletion chat developer-tools devtools fine-tuning llama2 llms refactoring self-hosted starchat starcoder wizardlm

Last synced: 18 Dec 2024

https://github.com/mobile-artificial-intelligence/maid

Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.

android android-ai chatbot chatgpt facebook flutter free-chatgpt gguf large-language-models llama llama-cpp llama2 llamacpp local-ai mistral mobile-ai mobile-artificial-intelligence ollama openai openorca

Last synced: 19 Dec 2024

https://github.com/b4rtaz/distributed-llama

Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.

distributed-computing distributed-llm llama2 llama3 llm llm-inference llms neural-network open-llm

Last synced: 19 Dec 2024

https://github.com/rahulschand/gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

ggml gpu huggingface language-model llama llama2 llamacpp llm pytorch quantization

Last synced: 20 Dec 2024

https://github.com/curiousily/get-things-done-with-prompt-engineering-and-langchain

LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading and indexing data, creating prompt templates, CSV agents, and using retrieval QA chains to query the custom data. Projects for using a private LLM (Llama 2) for chat with PDF files and tweet sentiment analysis.

artificial-intelligence chatgpt deep-learning gpt-4 gpt4 langchain language-models large-language-models llama2 openai prompt-engineering python

Last synced: 15 Dec 2024

https://github.com/RahulSChand/gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

ggml gpu huggingface language-model llama llama2 llamacpp llm pytorch quantization

Last synced: 08 Nov 2024

https://github.com/curiousily/Get-Things-Done-with-Prompt-Engineering-and-LangChain

LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading and indexing data, creating prompt templates, CSV agents, and using retrieval QA chains to query the custom data. Projects for using a private LLM (Llama 2) for chat with PDF files and tweet sentiment analysis.

artificial-intelligence chatgpt deep-learning gpt-4 gpt4 langchain language-models large-language-models llama2 openai prompt-engineering python

Last synced: 24 Oct 2024

https://github.com/brucemacd/chatd

Chat with your documents using local AI

chat desktop electron llama2 llm mistral mistral-7b ollama rag

Last synced: 20 Dec 2024

https://github.com/dicklesworthstone/swiss_army_llama

A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.

embedding-similarity embedding-vectors embeddings llama2 llamacpp semantic-search

Last synced: 20 Dec 2024

https://github.com/Dicklesworthstone/swiss_army_llama

A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.

embedding-similarity embedding-vectors embeddings llama2 llamacpp semantic-search

Last synced: 06 Nov 2024

https://github.com/BruceMacD/chatd

Chat with your documents using local AI

chat desktop electron llama2 llm mistral mistral-7b ollama rag

Last synced: 05 Nov 2024

https://github.com/win4r/aisuperdomain

Aila(AI超元域): The premier AI integration tool for Windows, macOS, and Android. Ask once, get answers from 10+ AIs like ChatGPT, Gemini, Claude3, Copilot, Poe, perplexity and more. Features customizable AI and prompts.

autogpt autogpt-no-paid-api chatgpt chatgpt-app chatgpt-bot chatgpt-plugins chatgpt4 claude claude3 freegpt gemini gpt-5 gpt5 llama2 openai openai-api stable stable-diffusion stable-diffusion-webui

Last synced: 20 Dec 2024

https://github.com/wangrongsheng/caregpt

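🌞 CareGPT (关怀GPT) is a medical large language model. It also brings together dozens of publicly available medical fine-tuning datasets and openly available medical LLMs, and covers LLM training, evaluation, and deployment to accelerate the development of medical LLMs. Medical LLM, Open Source Driven for a Healthy Future.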

baichuan gpt large-language-models llama llama2 medical-llm

Last synced: 18 Dec 2024

https://github.com/aws-samples/generative-ai-use-cases-jp

A secure generative AI application implementation that comes with a collection of business use cases ready for immediate use at work

aws bedrock chatbot claude claude3 command-r generative-ai image-generation lambda llama2 llama3 llm mistral rag react sagemaker typescript

Last synced: 20 Dec 2024

https://github.com/WangRongsheng/CareGPT

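🌞 CareGPT (关怀GPT) is a medical large language model. It also brings together dozens of publicly available medical fine-tuning datasets and openly available medical LLMs, and covers LLM training, evaluation, and deployment to accelerate the development of medical LLMs. Medical LLM, Open Source Driven for a Healthy Future.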

baichuan gpt large-language-models llama llama2 medical-llm

Last synced: 02 Nov 2024

https://github.com/Azure-Samples/miyagi

Sample to envision intelligent apps with Microsoft's Copilot stack for AI-infused product experiences.

agents aks assistants azure azure-openai azureai copilot gpt-4 guidance langchain llama-index llama2 openai phi-2 prompt-engineering promptflow semantic-kernel taskweaver typechat

Last synced: 04 Nov 2024

https://github.com/azure-samples/miyagi

Sample to envision intelligent apps with Microsoft's Copilot stack for AI-infused product experiences.

agents aks assistants azure azure-openai azureai copilot gpt-4 guidance langchain llama-index llama2 openai phi-2 prompt-engineering promptflow semantic-kernel taskweaver typechat

Last synced: 20 Dec 2024

https://github.com/efeslab/nanoflow

A throughput-oriented high-performance serving framework for LLMs

cuda inference llama2 llm llm-serving model-serving

Last synced: 20 Dec 2024

https://github.com/Mobile-Artificial-Intelligence/maid

Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.

android android-ai chatbot chatgpt facebook ffigen flutter gguf large-language-models llama llama-cpp llama2 llamacpp local-ai mistral mobile-ai mobile-artificial-intelligence ollama openai openorca

Last synced: 28 Oct 2024

https://github.com/xnul/code-llama-for-vscode

Use Code Llama with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.

assistant code code-llama codellama continue continuedev copilot llama llama2 llamacpp llm local meta ollama studio visual vscode

Last synced: 21 Dec 2024

https://github.com/foundationvision/groma

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

foundation-models grounding large-language-models llama llama2 llm mllm multimodal vision-language-model

Last synced: 21 Dec 2024

https://github.com/princeton-nlp/LLM-Shearing

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

efficiency llama llama2 llm nlp pre-training pruning

Last synced: 08 Nov 2024

https://github.com/princeton-nlp/llm-shearing

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

efficiency llama llama2 llm nlp pre-training pruning

Last synced: 20 Dec 2024

https://github.com/xNul/code-llama-for-vscode

Use Code Llama with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.

assistant code code-llama codellama continue continuedev copilot llama llama2 llamacpp llm local meta ollama studio visual vscode

Last synced: 06 Nov 2024

https://github.com/soulteary/docker-llama2-chat

Play LLaMA2 (official / Chinese version / INT4 / llama2.cpp) Together! ONLY 3 STEPS! (non-GPU / 5GB vRAM / 8~14GB vRAM)

llama llama2 llama2-docker llama2-playground llm

Last synced: 21 Dec 2024

https://github.com/magpie-align/magpie

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

alignment dataset gemma llama2 llama3 llm nlp paper phi3 qwen2 supervised-finetuning synthetic-data synthetic-dataset-generation

Last synced: 21 Dec 2024

https://github.com/owlaiproject/owl

A personal wearable AI that runs locally

ai ble bluetooth esp32 llama2 mistral nrf52840 ollama wearable whisper

Last synced: 21 Dec 2024

https://github.com/OwlAIProject/Owl

A personal wearable AI that runs locally

ai ble bluetooth esp32 llama2 mistral nrf52840 ollama wearable whisper

Last synced: 22 Nov 2024

https://github.com/evilpsycho/play-with-llms

Tutorial on training and evaluating LLMs, as well as using RAG, Agent, and Chain to build entertaining applications with LLMs.

agent baichuan2 chatgpt gpt large-language-models llama2 llms mistral rag retrieval-augmented-generation

Last synced: 21 Dec 2024