Projects in Awesome Lists tagged with peft

https://github.com/hiyouga/llama-factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

agent ai chatglm fine-tuning gpt instruction-tuning language-model large-language-models llama llama3 llm lora mistral moe peft qlora quantization qwen rlhf transformers

Last synced: 12 May 2025

https://github.com/hiyouga/LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

agent ai chatglm fine-tuning gpt instruction-tuning language-model large-language-models llama llama3 llm lora mistral moe peft qlora quantization qwen rlhf transformers

Last synced: 14 Mar 2025

https://github.com/modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, DeepSeek-VL2, Phi4, GOT-OCR2, ...).

deepseek-r1 deploy embedding grpo internvl liger llama llama4 llm lora megatron multimodal omni open-r1 peft qwen2-vl qwen3 qwen3-moe rft sft

Last synced: 12 May 2025

https://github.com/yangjianxin1/firefly

Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

alpaca aquila baichuan chatglm gemma gpt internlm llama llama2 llama3 llm lora minicpm mistral mixtral peft qlora qwen qwen2 zephyr

Last synced: 14 May 2025

https://github.com/yangjianxin1/Firefly

Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

alpaca aquila baichuan chatglm gemma gpt internlm llama llama2 llama3 llm lora minicpm mistral mixtral peft qlora qwen qwen2 zephyr

Last synced: 19 Mar 2025

https://github.com/internlm/xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

agent baichuan chatbot chatglm2 chatglm3 conversational-ai internlm large-language-models llama2 llama3 llava llm llm-training mixtral msagent peft phi3 qwen supervised-finetuning

Last synced: 11 May 2025

https://github.com/InternLM/xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

agent baichuan chatbot chatglm2 chatglm3 conversational-ai internlm large-language-models llama2 llama3 llava llm llm-training mixtral msagent peft phi3 qwen supervised-finetuning

Last synced: 20 Mar 2025

https://github.com/mymusise/chatglm-tuning

基于ChatGLM-6B + LoRA的Fintune方案

chatglm chatgpt lora peft

Last synced: 14 May 2025

https://github.com/mymusise/ChatGLM-Tuning

基于ChatGLM-6B + LoRA的Fintune方案

chatglm chatgpt lora peft

Last synced: 13 Mar 2025

https://github.com/hiyouga/chatglm-efficient-tuning

Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调

alpaca chatglm chatglm2 chatgpt fine-tuning huggingface language-model lora peft pytorch qlora rlhf transformers

Last synced: 19 Jan 2025

https://github.com/hiyouga/ChatGLM-Efficient-Tuning

Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调

alpaca chatglm chatglm2 chatgpt fine-tuning huggingface language-model lora peft pytorch qlora rlhf transformers

Last synced: 29 Mar 2025

https://github.com/zyds/transformers-code

手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube

huggingface peft transformers

Last synced: 15 May 2025

https://github.com/stochasticai/xturing

Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6

adapter alpaca deep-learning fine-tuning finetuning gen-ai generative-ai gpt-2 gpt-j language-model llama llm lora mistral mixed-precision peft quantization

Last synced: 15 May 2025

https://github.com/stochasticai/xTuring

Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6

adapter alpaca deep-learning fine-tuning finetuning gen-ai generative-ai gpt-2 gpt-j language-model llama llm lora mistral mixed-precision peft quantization

Last synced: 13 Mar 2025

https://github.com/ashishpatel26/llm-finetuning

LLM Finetuning with peft

falcon fine-tuning huggingface llama llama2 llm llms lora peft pytorch text-generation

Last synced: 14 May 2025

https://github.com/ashishpatel26/LLM-Finetuning

LLM Finetuning with peft

falcon fine-tuning huggingface llama llama2 llm llms lora peft pytorch text-generation

Last synced: 24 Mar 2025

https://github.com/lxe/simple-llm-finetuner

Simple UI for LLM Model Finetuning

ai gpt-2 gpt-3 huggingface huggingface-transformers llama llm peft pytorch

Last synced: 15 May 2025

https://github.com/x-lance/slam-llm

Speech, Language, Audio, Music Processing with Large Language Model

audio-processing large-language-model multimodal-large-language-models music-processing peft speech-processing

Last synced: 15 May 2025

https://github.com/X-LANCE/SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

audio-processing large-language-model multimodal-large-language-models music-processing peft speech-processing

Last synced: 06 Jan 2025

https://github.com/zetavg/llama-lora-tuner

UI tool for fine-tuning and testing your own LoRA models base on LLaMA, GPT-J and more. One-click run on Google Colab. + A Gradio ChatGPT-like Chat UI to demonstrate your language models.

ai alpaca alpaca-lora google-colab gpt gpt-j language-model llama lora machine-learning peft

Last synced: 05 Apr 2025

https://github.com/mindspore-courses/step_into_llm

MindSpore online courses: Step into LLM

bert chatglm chatglm2 chatgpt codegeex gpt gpt2 instruction-tuning large-language-models llama llama2 llm mindspore moe natural-language-processing nlp parallel-computing peft prompt-tuning rlhf

Last synced: 15 May 2025

https://github.com/zetavg/LLaMA-LoRA-Tuner

UI tool for fine-tuning and testing your own LoRA models base on LLaMA, GPT-J and more. One-click run on Google Colab. + A Gradio ChatGPT-like Chat UI to demonstrate your language models.

ai alpaca alpaca-lora google-colab gpt gpt-j language-model llama lora machine-learning peft

Last synced: 23 Apr 2025

https://github.com/Guitaricet/relora

Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates

deep-learning distributed-training llama nlp peft transformer

Last synced: 29 Nov 2024

https://github.com/tudb-labs/mlora

An Efficient "Factory" to Build Multiple LoRA Adapters

baichuan chatglm dpo finetune gpu llama llama2 llm lora mlora peft rlhf

Last synced: 15 May 2025

https://github.com/km1994/llms_paper

该仓库主要记录 LLMs 算法工程师相关的顶会论文研读笔记（多模态、PEFT、小样本QA问答、RAG、LMMs可解释性、Agents、CoT）

agent llms lora peft qa rag

Last synced: 03 Mar 2025

https://github.com/iamarunbrahma/finetuned-qlora-falcon7b-medical

Finetuning of Falcon-7B LLM using QLoRA on Mental Health Conversational Dataset

chatbot chatbots conversational-ai falcon falcon-7b fine-tuning healthcare llm lora mental-health peft qlora

Last synced: 09 Apr 2025

https://github.com/jackaduma/vicuna-lora-rlhf-pytorch

A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT but with Vicuna

chatgpt finetune gpt llama llm lora peft ppo pytorch reward-models rlhf vicuna vicuna-7b

Last synced: 13 Apr 2025

https://github.com/jianzhnie/open-chatgpt

The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. 从0开始实现一个ChatGPT.

chatgpt gpt llama llm lora peft ppo rlhf stanford-alpaca

Last synced: 24 Jan 2025

https://github.com/jackaduma/chatglm-lora-rlhf-pytorch

A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM

chatglm chatglm-6b chatgpt deepspeed finetune gpt llama llm lora peft ppo pytorch reward-models rlhf

Last synced: 27 Apr 2025

https://github.com/simplifine-llm/simplifine

🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud integration, and popular optimization frameworks. ✨

ai cloud fine-tuning fine-tuning-llm finetuning-llms gpt instruction-tuning large-language-models llama llama3 llm llm-training lora mistral moe open-source peft phi qwen

Last synced: 16 Feb 2025

https://github.com/simplifine-llm/Simplifine

🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud integration, and popular optimization frameworks. ✨

ai cloud fine-tuning fine-tuning-llm finetuning-llms gpt instruction-tuning large-language-models llama llama3 llm llm-training lora mistral moe open-source peft phi qwen

Last synced: 04 Dec 2024

https://github.com/borealisai/flora-opt

This is the official repository for the paper "Flora: Low-Rank Adapters Are Secretly Gradient Compressors" in ICML 2024.

deep-learning flax jax large-language-models lora memory-efficient-tuning optax peft random-projection transformers

Last synced: 29 Dec 2024

https://github.com/tudb-labs/moe-peft

An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT

mixlora mlora peft peft-fine-tuning-llm

Last synced: 05 Apr 2025

https://github.com/kamalkraj/e5-mistral-7b-instruct

Finetune mistral-7b-instruct for sentence embeddings

finetuning huggingface lora mistral-7b peft pytorch sentence-embeddings transformers

Last synced: 12 Apr 2025

https://github.com/nisaaragharia/indian-lawyergpt

Fine-Tuning Falcon-7B, LLAMA 2 with QLoRA to create an advanced AI model with a profound understanding of the Indian legal context.

falcon fine-tuning gpt huggingface-transformers large-language-models llama llama2 llms peft qlora

Last synced: 23 Nov 2024

https://github.com/ziplab/SPT

[ICCV 2023 oral] This is the official repository for our paper: ''Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning''.

adapter lora parameter-efficient-fine-tuning peft prompt-tuning transfer-learning

Last synced: 05 Apr 2025

https://github.com/jackaduma/alpaca-lora-rlhf-pytorch

A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT but with Alpaca

alpaca chatgpt deepspeed finetune gpt llama llm lora peft ppo pytorch reward-models rlhf

Last synced: 27 Apr 2025

https://github.com/baijiong-lin/lora-torch

PyTorch Reimplementation of LoRA (featuring with supporting nn.MultiheadAttention)

fine-tuning finetuning lora peft

Last synced: 20 Dec 2024

https://github.com/gunale0926/sorsa

SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models

deep-learning fine-tuning llama lora machine-learning nlp peft python pytorch rwkv sorsa svd transformer

Last synced: 17 Mar 2025

https://github.com/adithya-s-k/companionllm

CompanionLLM - A framework to finetune LLMs to be your own sentient conversational companion

fine-tuning finetuning hacktoberfest hacktoberfest-accepted hacktoberfest2023 huggingface llama llama2 llamacpp llm llm-inference llm-training lora mit-license open-source peft

Last synced: 03 Dec 2024

https://github.com/roim1998/apt

[ICML'24 Oral] APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference

bert efficient-deep-learning llama2 llm llm-finetuning peft peft-fine-tuning-llm pruning roberta t5

Last synced: 16 May 2025

https://github.com/daekeun-ml/genai-ko-llm

This hands-on lab walks you through a step-by-step approach to efficiently serving and fine-tuning large-scale Korean models on AWS infrastructure.

fine-tuning genai korean-llm peft sagemaker serving

Last synced: 29 Jan 2025

https://github.com/aisuko/notebooks

Implementation for the different ML tasks on Kaggle platform with GPUs.

accelerator computer-vision fine-tuning kaggle large-language-models multimodal natural-language-processing neural-network peft pytorch quantization renforcement-learning tensorboard transformers visulization wandb

Last synced: 19 Apr 2025

https://github.com/wiseodd/lapeft-bayesopt

Discrete Bayesian optimization with LLMs, PEFT finetuning methods, and the Laplace approximation.

bayesian-optimization laplace-approximation llm peft

Last synced: 06 Apr 2025

https://github.com/prithivsakthiur/gallo-3xl

High Quality Image Generation Model - Powered with NVIDIA A100

ai dall-e dalle2 dalle3 diffusers gradio huggingface image-generation peft peft-fine-tuning-llm text-to-image torch transformers

Last synced: 17 Dec 2024

https://github.com/llm-db/fineinfer

Deferred Continuous Batching in Resource-Efficient Large Language Model Serving (EuroMLSys 2024)

fine-tuning inference llm lora peft pytorch

Last synced: 21 Nov 2024

https://github.com/shaheennabi/production-ready-instruction-finetuning-of-meta-llama-3.2-3b-instruct-project

🎋🌿🌟 Instruction Fine-Tuning of Meta Llama 3.2-3B Instruct on Kannada Conversations 🌟🌿🎋 Tailoring the model to follow specific instructions in Kannada, enhancing its ability to generate relevant, context-aware responses based on conversational inputs. 🚀✨ Using the Kannada Instruct dataset for fine-tuning! Happy Finetuning🎇🎉

4bit-quantize 4bitprecision anthropic-hh-golden bitsandbytes deployed finetuning gguf gpu huggingface inference meta modular-code open-source peft production-ready qlora quantization training unified-language-model-aligning unsloth

Last synced: 31 Jan 2025

https://github.com/jordandeklerk/starcoder2-finetune-code-completion

Finetuning Starcoder2-3B for Code Completion on a single A100 GPU

artificial-intelligence code-llms finetuning-large-language-models llms lora machine-learning peft starcoder2

Last synced: 15 Apr 2025

https://github.com/neuralwork/instruct-finetune-mistral

Fine-tune Mistral 7B to generate fashion style suggestions

finetuning-llms huggingface llm llm-inference mistral peft quantization

Last synced: 12 May 2025

https://github.com/rezaakb/peft-vit

Parameter Efficient Fine-tuning of Self-supervised ViTs without Catastrophic Forgetting

catastrophic-forgetting peft self-supervised vision-transformer

Last synced: 05 Apr 2025

https://github.com/monk1337/nanopeft

The simplest repository & Neat implementation of different Lora methods for training/fine-tuning Transformer-based models (i.e., BERT, GPTs). [ Research purpose ]

huggingface llama llm lora low-rank-adaptation mistral peft qlora quantization

Last synced: 17 Mar 2025

https://github.com/daskol/lotr

Low Tensor Rank adaptation of large language models

fine-tuning llm lora lotr parameter-efficient-tuning peft

Last synced: 13 Feb 2025

https://github.com/dvgodoy/finetuningllms

Official repository of the book "A Hands-On Guide to Fine-Tuning LLMs" with PyTorch and Hugging Face

bitsandbytes fine-tuning finetuning finetuning-llms hugging-face huggingface large-language-models llamacpp lora ollama peft peft-fine-tuning-llm pytorch transformers

Last synced: 27 Jan 2025

https://github.com/avnlp/llm-finetuning

fine-tuning lora p-tuning peft qlora sft

Last synced: 13 Apr 2025

https://github.com/naveen-v-v/llm_fine_tune_lora

Fine tune a Large Language Model using LORA to perform Sentiment Analysis

fine-tune large-language-models notebook-jupyter peft python

Last synced: 18 Mar 2025

https://github.com/zeyadusf/topics-in-nlp-llm

In this repo I will share different topics on anything I want to know in nlp and llms

bpe llms lora nlp peft peft-fine-tuning-llm quantization

Last synced: 25 Feb 2025

https://github.com/lucatosc/generative-ai-1-

Generative AI nano degree program

chatbot genai generative-ai inpainting llms peft

Last synced: 12 Apr 2025

https://github.com/jordandeklerk/opencodeinterpreter-finetune-sql

Fine-tuning coding LLM OpenCodeInterpreter-DS-6.7B for Text-to-SQL Code Generation on a Single A100 GPU in PyTorch

artificial-intelligence code-llms finetuning-llms llms machine-learning open-code-interpreter peft qlora

Last synced: 20 Feb 2025

https://github.com/rahul-lashkari/llm-ecosystem-enhancement

Executed Fine-tuning & Benchmarking, optimizing 12+ LLMs (Gemma-family, Mistral, LLaMA, etc) across 6+ datasets (GSM8K, BoolQ, IMDB, Alpaca-GPT4 & more). Delivered a research-level contribution—model training, evaluation, insights, DeepMind benchmark comparisons & documentation. Also crafting a custom dataset from open-sourced V0 system prompts.🛰

analysis-reports anomaly-detection benchmarking custom-dataset fine-tuning kmeans-clustering llm-evaluation llms lora peft research t-sne-visualization umap-projection

Last synced: 15 May 2025

https://github.com/aman-17/medisoap

FineTuning LLMs on conversational medical dataset.

fine-tuning generative-ai llama llama-2 llm-training lora medical peft peft-fine-tuning-llm qlora summarization

Last synced: 20 Mar 2025

https://github.com/tongjilibo/llm_finetune

基于bert4torch的大模型微调代码，含chatglm+pv2, lora, plora等多种方式

chatglm chatglm2 llm lora peft

Last synced: 08 Apr 2025

https://github.com/fong0202/r1-v

Witness the aha moment of VLM with less than $3.

csm32rv20 deepseek-api deepseek-reasoner deepseek-v3 github-config internlm3 llama nextjs peft qitas r10k ruby typescript wch

Last synced: 05 Apr 2025

https://github.com/sanskaryo/llm-finetuning-projects

This repository contains various projects focused on fine-tuning Large Language Models (LLMs). i am currently working on

finetuning-llms huggingface llm lora nlp peft qlora transformer

Last synced: 31 Mar 2025

https://github.com/eliask93/instruction-fine-tuned-gemma-2-for-stance-detection

Example application for applying QLoRA-based Parameter-Efficient Fine-Tuning (PEFT) to a Stance Detection task using Gemma-2-9B-Instruct

argument-mining gemma2 lora nlp peft qlora quantization stance-detection

Last synced: 02 Mar 2025

https://github.com/justsomerandomdude264/homework_solver_llm

A fine-tuned LLM to solve homework questions ranging from maths to science and social science.

adapters large-language-models llama3 llm peft qlora question-answering text-generation unsloth

Last synced: 08 Apr 2025

https://github.com/amira921/ai-based-healthcare-monitoring-system-using-iot

Medical IOT System Consists of Smart Band, Medical generative QA model, mobile application which facilitate efficient healthcare monitoring and medical assistance for patients and doctors.

arduino artificial-intelligence biogpt cpp database-design embedded-systems iot java-android large-language-models mobile-development mysql peft pyhton pytorch srs-document system-design transformers ui-ux-design uml-diagrams

Last synced: 25 Feb 2025

https://github.com/d-kleine/generativeai

Generative AI nano degree program

chatbot genai generative-ai inpainting llms peft

Last synced: 17 Mar 2025

https://github.com/zeyadusf/summarization-by-finetuning-flant5-lora

fine-tuning llm lora nlp peft peft-fine-tuning-llm summarization

Last synced: 25 Feb 2025

https://github.com/md-emon-hasan/fine-tuning

End-to-end fine-tuning of Hugging Face models using LoRA, QLoRA, quantization, and PEFT techniques. Optimized for low-memory with efficient model deployment

bitsandbytes deep-learning fine-tuning fp16-training gpu-optimization gradient-checkpointing huggingface huggingface-datasets lora low-memory-training machine-learning model-training natural-language-processing nlp parameter-efficient-fine-tuning peft pytorch qlora quantization transformers

Last synced: 21 Feb 2025

https://github.com/cahlen/conversation-dataset-generator

Craft conversational datasets (JSONL format with rich metadata) using LLMs. Specify parameters manually or use a creative brief for LLM-generated arguments with automatic topic/scenario variation. Optional web search improves persona grounding. Ideal for LoRA tuning, persona training, and creative writing. Includes Hugging Face Hub upload.

dataset-generation dialogue-generation fine-tuning huggingface jsonl llm lora nlp peft persona python synthentic-data transformers

Last synced: 11 Apr 2025

https://github.com/salahu01/flutter-codegen-finetuner

🚀 Fine-tune LLMs to generate Flutter code in your personal style! 🎯 This toolkit provides step-by-step guides, scripts, and examples for creating custom code generation models. 💻 From data preparation to deployment, transform natural language into Flutter widgets matching your unique coding patterns. 🔧

ai-tools code-assistant code-generation dart developer-tools fine-tuning-llm-codellama flutter language-model lora machine-learning nlp peft transformer

Last synced: 21 Mar 2025

https://github.com/nimad70/mistral-qa-optimization

mistral-qa-optimization | project for the Natural Language Processing course (ComputerScience @ UniPd) | w/ Shakiba Farjood Fashalam - Marcos Tidball - Tobia Pavona - Giacomo Ferrante

deep-learning finetuning huggingface huggingface-transformers llms mistral-7b nlp peft python3 question-answering raft rag tensorflow

Last synced: 04 Apr 2025

https://github.com/rishabhmathur06/quantization-fundamentals

artificial-intelligence fine-tuning generative-ai large-language-models lora peft python qlora quantization

Last synced: 12 Mar 2025

https://github.com/sofiakhutsieva/llm_experiments

Эксперименты с LLM (инференс, rag, дообучение)

langchain llamacpp llm mistral peft rag trl

Last synced: 08 Apr 2025

https://github.com/eliask93/qlora-based-efficient-fine-tuning-for-sentence-pair-classification

Example application for applying QLoRA-based Parameter-Efficient Fine-Tuning (PEFT) to a sentence pair classification task using Mistral-7B and Llama3-8B

argument-mining llama3 lora mistral nlp peft qlora quantization sentence-pair-classification stance-detection

Last synced: 08 Apr 2025

https://github.com/venkata-naveen-varma/llm_fine_tune_lora

Fine tune a Large Language Model using LORA to perform Sentiment Analysis

fine-tune large-language-models notebook-jupyter peft python

Last synced: 24 Nov 2024

https://github.com/architj6/llama2-finetuning

🦙 Llama2-FineTuning: Fine-tune LLAMA 2 with Custom Datasets Using LoRA and QLoRA Techniques

bitsandbytes fine-tuning fine-tuning-llama2 fine-tuning-llm google-colab huggingface large-language-models llama2 lora low-rank-adaptation nlp peft pytorch qlora quantization supervised-fine-tuning text-generation transformer-reinforcement-learning transformers

Last synced: 19 Apr 2025

https://github.com/umutkavakli/molformer-regression

Fine-tuned chemical language model for predicting molecular lipophilicity in drug design. Explores parameter-efficient fine-tuning strategies (LoRA, BitFit, IA3), layer freezing techniques, and influence-based data selection. Balances accuracy and computational efficiency for molecular property prediction tasks.

bitfit ia3 llm lora mlm molformer peft pytorch regression

Last synced: 01 Apr 2025

https://github.com/reshalfahsi/qa-gpt2-lora

Question-Answering using GPT-2's PEFT with LoRA

gpt-2 huggingface lora low-rank-adaptation nlp peft question-answering squad-dataset

Last synced: 01 Apr 2025

https://github.com/afondiel/finetuning-llms-crash-course-dlai

Notes & Resources of LLMs Finetuning Crash Course from LAMINI.AI & DeepLearning.AI.

finetuning finetuning-llms llms lora peft peft-fine-tuning-llm

Last synced: 15 Mar 2025

https://github.com/hrolive/large-language-models-on-supercomputers

Comprehensive exploration of LLMs, including cutting-edge techniques and tools such as parameter-efficient fine-tuning (PEFT), quantization, zero redundancy optimizers (ZeRO), fully sharded data parallelism (FSDP), DeepSpeed, and Huggingface accelerate.

deepspeed evaluation-metrics fsdp high-performance-computing hpc huggingface huggingface-transformers jupyter llm llm-inference llm-training monitoring peft python quantization slurm tokenization transformer unsloth

Last synced: 23 Feb 2025

https://github.com/mafda/lightweight_fine_tuning_project

This repository provides a Jupyter notebook demonstrating parameter-efficient fine-tuning (PEFT) with LoRA on Hugging Face models.

huggingface huggingface-datasets huggingface-transformers lora peft peft-fine-tuning-llm pytorch

Last synced: 06 Mar 2025

https://github.com/andron00e/mf-p2eft

Parameter- and Energy-Efficient Fine-Tuning

fine-tuning matmul-free peft

Last synced: 22 Feb 2025

https://github.com/ruvenguna94/dialogue-summary-peft-fine-tuning

This notebook fine-tunes the FLAN-T5 model for dialogue summarization, comparing full fine-tuning with Parameter-Efficient Fine-Tuning (PEFT). It evaluates performance using ROUGE metrics, demonstrating PEFT's efficiency while achieving competitive results.

dialogue-summarization fine-tuning flan-t5 generative-ai hugging-face lora low-rank-adaptation natural-language-processing nlp parameter-efficient-fine-tuning peft pytorch rouge

Last synced: 06 Apr 2025

https://github.com/llm-db/tensor-program-optimization-with-auto-batching

Tensor Program Optimization with Auto-Batching (Master Thesis, ETH Zürich, 2025)

inference llm lora peft tvm

Last synced: 06 Apr 2025

https://github.com/flozi00/simplepeft

An simple trainer for efficient finetuning large models on different tasks

llm peft pytorch transformers

Last synced: 10 Feb 2025

https://github.com/rishabhmathur06/fine-tuning-llama2-for-text-generation-using-quantization-and-lora

fine-tuning-llm generative-ai large-language-models llama2 llama2-7b lora machine-learning natural-language-processing nlp peft python pytorch qlora textgeneration torch

Last synced: 12 Mar 2025

https://github.com/zeyadusf/finetune-llama2

Fine-Tune Your Own Llama 2 Model

fine-tuning llama2 llm lora peft text-generation

Last synced: 14 Mar 2025

https://github.com/aman-17/MediSOAP

FineTuning LLMs on conversational medical dataset.

fine-tuning generative-ai llama llama-2 llm-training lora medical peft peft-fine-tuning-llm qlora summarization

Last synced: 06 Jan 2025

https://github.com/vpgits/sdgp-ml

This repository contains notebooks and resources related to the Software Development Group Project (SDGP) machine learning component. Specifically, it includes two notebooks used for creating a dataset and fine-tuning a Mistral-7B-v0.1-Instruct model.

autoawq awq machine-learning peft pytorch qlora transformers

Last synced: 23 Feb 2025

https://github.com/smit-parekh/proactive-supply-chain-disruption-agent

An AI Agent leveraging fine-tuned LLMs (Mistral-7B w/ PEFT) and LangGraph to proactively identify, assess, and suggest mitigations for supply chain disruptions, tailored to specific client needs (e.g., Shell, Pfizer). Includes MLOps integration using Vertex AI and MLflow.

ai-agent api fastapi fine-tuning forecasting google-cloud langchain langgraph llm lora machine-learning mlflow mlops peft python risk-management supply-chain vertex-ai

Last synced: 11 Apr 2025