Projects in Awesome Lists tagged with peft
A curated list of projects in awesome lists tagged with peft.
https://github.com/hiyouga/llama-factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
agent ai chatglm fine-tuning gpt instruction-tuning language-model large-language-models llama llama3 llm lora mistral moe peft qlora quantization qwen rlhf transformers
Last synced: 12 May 2025
https://github.com/hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
agent ai chatglm fine-tuning gpt instruction-tuning language-model large-language-models llama llama3 llm lora mistral moe peft qlora quantization qwen rlhf transformers
Last synced: 14 Mar 2025
https://github.com/modelscope/ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, DeepSeek-VL2, Phi4, GOT-OCR2, ...).
deepseek-r1 deploy embedding grpo internvl liger llama llama4 llm lora megatron multimodal omni open-r1 peft qwen2-vl qwen3 qwen3-moe rft sft
Last synced: 12 May 2025
https://github.com/yangjianxin1/firefly
Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
alpaca aquila baichuan chatglm gemma gpt internlm llama llama2 llama3 llm lora minicpm mistral mixtral peft qlora qwen qwen2 zephyr
Last synced: 14 May 2025
https://github.com/yangjianxin1/Firefly
Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
alpaca aquila baichuan chatglm gemma gpt internlm llama llama2 llama3 llm lora minicpm mistral mixtral peft qlora qwen qwen2 zephyr
Last synced: 19 Mar 2025
https://github.com/internlm/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
agent baichuan chatbot chatglm2 chatglm3 conversational-ai internlm large-language-models llama2 llama3 llava llm llm-training mixtral msagent peft phi3 qwen supervised-finetuning
Last synced: 11 May 2025
https://github.com/InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
agent baichuan chatbot chatglm2 chatglm3 conversational-ai internlm large-language-models llama2 llama3 llava llm llm-training mixtral msagent peft phi3 qwen supervised-finetuning
Last synced: 20 Mar 2025
https://github.com/hiyouga/chatglm-efficient-tuning
Fine-tuning ChatGLM-6B with PEFT | Efficient ChatGLM fine-tuning based on PEFT
alpaca chatglm chatglm2 chatgpt fine-tuning huggingface language-model lora peft pytorch qlora rlhf transformers
Last synced: 19 Jan 2025
https://github.com/hiyouga/ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT | Efficient ChatGLM fine-tuning based on PEFT
alpaca chatglm chatglm2 chatgpt fine-tuning huggingface language-model lora peft pytorch qlora rlhf transformers
Last synced: 29 Mar 2025
https://github.com/zyds/transformers-code
A hands-on, step-by-step practical guide to Huggingface Transformers; course videos are updated in sync on Bilibili and YouTube
Last synced: 15 May 2025
https://github.com/stochasticai/xturing
Build, customize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6
adapter alpaca deep-learning fine-tuning finetuning gen-ai generative-ai gpt-2 gpt-j language-model llama llm lora mistral mixed-precision peft quantization
Last synced: 15 May 2025
https://github.com/stochasticai/xTuring
Build, customize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6
adapter alpaca deep-learning fine-tuning finetuning gen-ai generative-ai gpt-2 gpt-j language-model llama llm lora mistral mixed-precision peft quantization
Last synced: 13 Mar 2025
https://github.com/ashishpatel26/llm-finetuning
LLM Finetuning with peft
falcon fine-tuning huggingface llama llama2 llm llms lora peft pytorch text-generation
Last synced: 14 May 2025
https://github.com/ashishpatel26/LLM-Finetuning
LLM Finetuning with peft
falcon fine-tuning huggingface llama llama2 llm llms lora peft pytorch text-generation
Last synced: 24 Mar 2025
https://github.com/lxe/simple-llm-finetuner
Simple UI for LLM Model Finetuning
ai gpt-2 gpt-3 huggingface huggingface-transformers llama llm peft pytorch
Last synced: 15 May 2025
https://github.com/x-lance/slam-llm
Speech, Language, Audio, Music Processing with Large Language Model
audio-processing large-language-model multimodal-large-language-models music-processing peft speech-processing
Last synced: 15 May 2025
https://github.com/X-LANCE/SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
audio-processing large-language-model multimodal-large-language-models music-processing peft speech-processing
Last synced: 06 Jan 2025
https://github.com/zetavg/llama-lora-tuner
UI tool for fine-tuning and testing your own LoRA models based on LLaMA, GPT-J and more. One-click run on Google Colab. + A Gradio ChatGPT-like Chat UI to demonstrate your language models.
ai alpaca alpaca-lora google-colab gpt gpt-j language-model llama lora machine-learning peft
Last synced: 05 Apr 2025
https://github.com/mindspore-courses/step_into_llm
MindSpore online courses: Step into LLM
bert chatglm chatglm2 chatgpt codegeex gpt gpt2 instruction-tuning large-language-models llama llama2 llm mindspore moe natural-language-processing nlp parallel-computing peft prompt-tuning rlhf
Last synced: 15 May 2025
https://github.com/zetavg/LLaMA-LoRA-Tuner
UI tool for fine-tuning and testing your own LoRA models based on LLaMA, GPT-J and more. One-click run on Google Colab. + A Gradio ChatGPT-like Chat UI to demonstrate your language models.
ai alpaca alpaca-lora google-colab gpt gpt-j language-model llama lora machine-learning peft
Last synced: 23 Apr 2025
https://github.com/Guitaricet/relora
Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates
deep-learning distributed-training llama nlp peft transformer
Last synced: 29 Nov 2024
https://github.com/iamarunbrahma/finetuned-qlora-falcon7b-medical
Finetuning of Falcon-7B LLM using QLoRA on Mental Health Conversational Dataset
chatbot chatbots conversational-ai falcon falcon-7b fine-tuning healthcare llm lora mental-health peft qlora
Last synced: 09 Apr 2025
https://github.com/jackaduma/vicuna-lora-rlhf-pytorch
A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT but with Vicuna
chatgpt finetune gpt llama llm lora peft ppo pytorch reward-models rlhf vicuna vicuna-7b
Last synced: 13 Apr 2025
https://github.com/jianzhnie/open-chatgpt
The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. Implementing a ChatGPT from scratch.
chatgpt gpt llama llm lora peft ppo rlhf stanford-alpaca
Last synced: 24 Jan 2025
https://github.com/jackaduma/chatglm-lora-rlhf-pytorch
A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM
chatglm chatglm-6b chatgpt deepspeed finetune gpt llama llm lora peft ppo pytorch reward-models rlhf
Last synced: 27 Apr 2025
https://github.com/simplifine-llm/simplifine
🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud integration, and popular optimization frameworks. ✨
ai cloud fine-tuning fine-tuning-llm finetuning-llms gpt instruction-tuning large-language-models llama llama3 llm llm-training lora mistral moe open-source peft phi qwen
Last synced: 16 Feb 2025
https://github.com/simplifine-llm/Simplifine
🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud integration, and popular optimization frameworks. ✨
ai cloud fine-tuning fine-tuning-llm finetuning-llms gpt instruction-tuning large-language-models llama llama3 llm llm-training lora mistral moe open-source peft phi qwen
Last synced: 04 Dec 2024
https://github.com/borealisai/flora-opt
This is the official repository for the paper "Flora: Low-Rank Adapters Are Secretly Gradient Compressors" in ICML 2024.
deep-learning flax jax large-language-models lora memory-efficient-tuning optax peft random-projection transformers
Last synced: 29 Dec 2024
https://github.com/tudb-labs/moe-peft
An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT
mixlora mlora peft peft-fine-tuning-llm
Last synced: 05 Apr 2025
https://github.com/kamalkraj/e5-mistral-7b-instruct
Finetune mistral-7b-instruct for sentence embeddings
finetuning huggingface lora mistral-7b peft pytorch sentence-embeddings transformers
Last synced: 12 Apr 2025
https://github.com/nisaaragharia/indian-lawyergpt
Fine-Tuning Falcon-7B, LLAMA 2 with QLoRA to create an advanced AI model with a profound understanding of the Indian legal context.
falcon fine-tuning gpt huggingface-transformers large-language-models llama llama2 llms peft qlora
Last synced: 23 Nov 2024
https://github.com/ziplab/SPT
[ICCV 2023 oral] This is the official repository for our paper: ''Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning''.
adapter lora parameter-efficient-fine-tuning peft prompt-tuning transfer-learning
Last synced: 05 Apr 2025
https://github.com/jackaduma/alpaca-lora-rlhf-pytorch
A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT but with Alpaca
alpaca chatgpt deepspeed finetune gpt llama llm lora peft ppo pytorch reward-models rlhf
Last synced: 27 Apr 2025
https://github.com/baijiong-lin/lora-torch
PyTorch Reimplementation of LoRA (featuring support for nn.MultiheadAttention)
fine-tuning finetuning lora peft
Last synced: 20 Dec 2024
https://github.com/gunale0926/sorsa
SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models
deep-learning fine-tuning llama lora machine-learning nlp peft python pytorch rwkv sorsa svd transformer
Last synced: 17 Mar 2025
https://github.com/adithya-s-k/companionllm
CompanionLLM - A framework to finetune LLMs to be your own sentient conversational companion
fine-tuning finetuning hacktoberfest hacktoberfest-accepted hacktoberfest2023 huggingface llama llama2 llamacpp llm llm-inference llm-training lora mit-license open-source peft
Last synced: 03 Dec 2024
https://github.com/roim1998/apt
[ICML'24 Oral] APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference
bert efficient-deep-learning llama2 llm llm-finetuning peft peft-fine-tuning-llm pruning roberta t5
Last synced: 16 May 2025
https://github.com/daekeun-ml/genai-ko-llm
This hands-on lab walks you through a step-by-step approach to efficiently serving and fine-tuning large-scale Korean models on AWS infrastructure.
fine-tuning genai korean-llm peft sagemaker serving
Last synced: 29 Jan 2025
https://github.com/aisuko/notebooks
Implementation for the different ML tasks on Kaggle platform with GPUs.
accelerator computer-vision fine-tuning kaggle large-language-models multimodal natural-language-processing neural-network peft pytorch quantization renforcement-learning tensorboard transformers visulization wandb
Last synced: 19 Apr 2025
https://github.com/wiseodd/lapeft-bayesopt
Discrete Bayesian optimization with LLMs, PEFT finetuning methods, and the Laplace approximation.
bayesian-optimization laplace-approximation llm peft
Last synced: 06 Apr 2025
https://github.com/prithivsakthiur/gallo-3xl
High Quality Image Generation Model - Powered with NVIDIA A100
ai dall-e dalle2 dalle3 diffusers gradio huggingface image-generation peft peft-fine-tuning-llm text-to-image torch transformers
Last synced: 17 Dec 2024
https://github.com/llm-db/fineinfer
Deferred Continuous Batching in Resource-Efficient Large Language Model Serving (EuroMLSys 2024)
fine-tuning inference llm lora peft pytorch
Last synced: 21 Nov 2024
https://github.com/shaheennabi/production-ready-instruction-finetuning-of-meta-llama-3.2-3b-instruct-project
🎋🌿🌟 Instruction Fine-Tuning of Meta Llama 3.2-3B Instruct on Kannada Conversations 🌟🌿🎋 Tailoring the model to follow specific instructions in Kannada, enhancing its ability to generate relevant, context-aware responses based on conversational inputs. 🚀✨ Using the Kannada Instruct dataset for fine-tuning! Happy Finetuning🎇🎉
4bit-quantize 4bitprecision anthropic-hh-golden bitsandbytes deployed finetuning gguf gpu huggingface inference meta modular-code open-source peft production-ready qlora quantization training unified-language-model-aligning unsloth
Last synced: 31 Jan 2025
https://github.com/jordandeklerk/starcoder2-finetune-code-completion
Finetuning Starcoder2-3B for Code Completion on a single A100 GPU
artificial-intelligence code-llms finetuning-large-language-models llms lora machine-learning peft starcoder2
Last synced: 15 Apr 2025
https://github.com/neuralwork/instruct-finetune-mistral
Fine-tune Mistral 7B to generate fashion style suggestions
finetuning-llms huggingface llm llm-inference mistral peft quantization
Last synced: 12 May 2025
https://github.com/rezaakb/peft-vit
Parameter Efficient Fine-tuning of Self-supervised ViTs without Catastrophic Forgetting
catastrophic-forgetting peft self-supervised vision-transformer
Last synced: 05 Apr 2025
https://github.com/monk1337/nanopeft
The simplest repository & Neat implementation of different Lora methods for training/fine-tuning Transformer-based models (i.e., BERT, GPTs). [ Research purpose ]
huggingface llama llm lora low-rank-adaptation mistral peft qlora quantization
Last synced: 17 Mar 2025
https://github.com/daskol/lotr
Low Tensor Rank adaptation of large language models
fine-tuning llm lora lotr parameter-efficient-tuning peft
Last synced: 13 Feb 2025
https://github.com/dvgodoy/finetuningllms
Official repository of the book "A Hands-On Guide to Fine-Tuning LLMs" with PyTorch and Hugging Face
bitsandbytes fine-tuning finetuning finetuning-llms hugging-face huggingface large-language-models llamacpp lora ollama peft peft-fine-tuning-llm pytorch transformers
Last synced: 27 Jan 2025
https://github.com/avnlp/llm-finetuning
fine-tuning lora p-tuning peft qlora sft
Last synced: 13 Apr 2025
https://github.com/naveen-v-v/llm_fine_tune_lora
Fine tune a Large Language Model using LORA to perform Sentiment Analysis
fine-tune large-language-models notebook-jupyter peft python
Last synced: 18 Mar 2025
https://github.com/zeyadusf/topics-in-nlp-llm
In this repo I will share different topics on anything I want to know in nlp and llms
bpe llms lora nlp peft peft-fine-tuning-llm quantization
Last synced: 25 Feb 2025
https://github.com/lucatosc/generative-ai-1-
Generative AI nano degree program
chatbot genai generative-ai inpainting llms peft
Last synced: 12 Apr 2025
https://github.com/jordandeklerk/opencodeinterpreter-finetune-sql
Fine-tuning coding LLM OpenCodeInterpreter-DS-6.7B for Text-to-SQL Code Generation on a Single A100 GPU in PyTorch
artificial-intelligence code-llms finetuning-llms llms machine-learning open-code-interpreter peft qlora
Last synced: 20 Feb 2025
https://github.com/rahul-lashkari/llm-ecosystem-enhancement
Executed Fine-tuning & Benchmarking, optimizing 12+ LLMs (Gemma-family, Mistral, LLaMA, etc) across 6+ datasets (GSM8K, BoolQ, IMDB, Alpaca-GPT4 & more). Delivered a research-level contribution—model training, evaluation, insights, DeepMind benchmark comparisons & documentation. Also crafting a custom dataset from open-sourced V0 system prompts.🛰
analysis-reports anomaly-detection benchmarking custom-dataset fine-tuning kmeans-clustering llm-evaluation llms lora peft research t-sne-visualization umap-projection
Last synced: 15 May 2025
https://github.com/aman-17/medisoap
FineTuning LLMs on conversational medical dataset.
fine-tuning generative-ai llama llama-2 llm-training lora medical peft peft-fine-tuning-llm qlora summarization
Last synced: 20 Mar 2025
https://github.com/fong0202/r1-v
Witness the aha moment of VLM with less than $3.
csm32rv20 deepseek-api deepseek-reasoner deepseek-v3 github-config internlm3 llama nextjs peft qitas r10k ruby typescript wch
Last synced: 05 Apr 2025
https://github.com/sanskaryo/llm-finetuning-projects
This repository contains various projects focused on fine-tuning Large Language Models (LLMs). I am currently working on
finetuning-llms huggingface llm lora nlp peft qlora transformer
Last synced: 31 Mar 2025
https://github.com/eliask93/instruction-fine-tuned-gemma-2-for-stance-detection
Example application for applying QLoRA-based Parameter-Efficient Fine-Tuning (PEFT) to a Stance Detection task using Gemma-2-9B-Instruct
argument-mining gemma2 lora nlp peft qlora quantization stance-detection
Last synced: 02 Mar 2025
https://github.com/justsomerandomdude264/homework_solver_llm
A fine-tuned LLM to solve homework questions ranging from maths to science and social science.
adapters large-language-models llama3 llm peft qlora question-answering text-generation unsloth
Last synced: 08 Apr 2025
https://github.com/amira921/ai-based-healthcare-monitoring-system-using-iot
Medical IoT system consisting of a smart band, a medical generative QA model, and a mobile application, which together facilitate efficient healthcare monitoring and medical assistance for patients and doctors.
arduino artificial-intelligence biogpt cpp database-design embedded-systems iot java-android large-language-models mobile-development mysql peft pyhton pytorch srs-document system-design transformers ui-ux-design uml-diagrams
Last synced: 25 Feb 2025
https://github.com/d-kleine/generativeai
Generative AI nano degree program
chatbot genai generative-ai inpainting llms peft
Last synced: 17 Mar 2025
https://github.com/zeyadusf/summarization-by-finetuning-flant5-lora
fine-tuning llm lora nlp peft peft-fine-tuning-llm summarization
Last synced: 25 Feb 2025
https://github.com/md-emon-hasan/fine-tuning
End-to-end fine-tuning of Hugging Face models using LoRA, QLoRA, quantization, and PEFT techniques. Optimized for low-memory with efficient model deployment
bitsandbytes deep-learning fine-tuning fp16-training gpu-optimization gradient-checkpointing huggingface huggingface-datasets lora low-memory-training machine-learning model-training natural-language-processing nlp parameter-efficient-fine-tuning peft pytorch qlora quantization transformers
Last synced: 21 Feb 2025
https://github.com/cahlen/conversation-dataset-generator
Craft conversational datasets (JSONL format with rich metadata) using LLMs. Specify parameters manually or use a creative brief for LLM-generated arguments with automatic topic/scenario variation. Optional web search improves persona grounding. Ideal for LoRA tuning, persona training, and creative writing. Includes Hugging Face Hub upload.
dataset-generation dialogue-generation fine-tuning huggingface jsonl llm lora nlp peft persona python synthentic-data transformers
Last synced: 11 Apr 2025
https://github.com/salahu01/flutter-codegen-finetuner
🚀 Fine-tune LLMs to generate Flutter code in your personal style! 🎯 This toolkit provides step-by-step guides, scripts, and examples for creating custom code generation models. 💻 From data preparation to deployment, transform natural language into Flutter widgets matching your unique coding patterns. 🔧
ai-tools code-assistant code-generation dart developer-tools fine-tuning-llm-codellama flutter language-model lora machine-learning nlp peft transformer
Last synced: 21 Mar 2025
https://github.com/nimad70/mistral-qa-optimization
mistral-qa-optimization | project for the Natural Language Processing course (ComputerScience @ UniPd) | w/ Shakiba Farjood Fashalam - Marcos Tidball - Tobia Pavona - Giacomo Ferrante
deep-learning finetuning huggingface huggingface-transformers llms mistral-7b nlp peft python3 question-answering raft rag tensorflow
Last synced: 04 Apr 2025
https://github.com/eliask93/qlora-based-efficient-fine-tuning-for-sentence-pair-classification
Example application for applying QLoRA-based Parameter-Efficient Fine-Tuning (PEFT) to a sentence pair classification task using Mistral-7B and Llama3-8B
argument-mining llama3 lora mistral nlp peft qlora quantization sentence-pair-classification stance-detection
Last synced: 08 Apr 2025
https://github.com/venkata-naveen-varma/llm_fine_tune_lora
Fine tune a Large Language Model using LORA to perform Sentiment Analysis
fine-tune large-language-models notebook-jupyter peft python
Last synced: 24 Nov 2024
https://github.com/architj6/llama2-finetuning
🦙 Llama2-FineTuning: Fine-tune LLAMA 2 with Custom Datasets Using LoRA and QLoRA Techniques
bitsandbytes fine-tuning fine-tuning-llama2 fine-tuning-llm google-colab huggingface large-language-models llama2 lora low-rank-adaptation nlp peft pytorch qlora quantization supervised-fine-tuning text-generation transformer-reinforcement-learning transformers
Last synced: 19 Apr 2025
https://github.com/umutkavakli/molformer-regression
Fine-tuned chemical language model for predicting molecular lipophilicity in drug design. Explores parameter-efficient fine-tuning strategies (LoRA, BitFit, IA3), layer freezing techniques, and influence-based data selection. Balances accuracy and computational efficiency for molecular property prediction tasks.
bitfit ia3 llm lora mlm molformer peft pytorch regression
Last synced: 01 Apr 2025
https://github.com/reshalfahsi/qa-gpt2-lora
Question-Answering using GPT-2's PEFT with LoRA
gpt-2 huggingface lora low-rank-adaptation nlp peft question-answering squad-dataset
Last synced: 01 Apr 2025
https://github.com/afondiel/finetuning-llms-crash-course-dlai
Notes & Resources of LLMs Finetuning Crash Course from LAMINI.AI & DeepLearning.AI.
finetuning finetuning-llms llms lora peft peft-fine-tuning-llm
Last synced: 15 Mar 2025
https://github.com/hrolive/large-language-models-on-supercomputers
Comprehensive exploration of LLMs, including cutting-edge techniques and tools such as parameter-efficient fine-tuning (PEFT), quantization, zero redundancy optimizers (ZeRO), fully sharded data parallelism (FSDP), DeepSpeed, and Huggingface accelerate.
deepspeed evaluation-metrics fsdp high-performance-computing hpc huggingface huggingface-transformers jupyter llm llm-inference llm-training monitoring peft python quantization slurm tokenization transformer unsloth
Last synced: 23 Feb 2025
https://github.com/mafda/lightweight_fine_tuning_project
This repository provides a Jupyter notebook demonstrating parameter-efficient fine-tuning (PEFT) with LoRA on Hugging Face models.
huggingface huggingface-datasets huggingface-transformers lora peft peft-fine-tuning-llm pytorch
Last synced: 06 Mar 2025
https://github.com/andron00e/mf-p2eft
Parameter- and Energy-Efficient Fine-Tuning
Last synced: 22 Feb 2025
https://github.com/ruvenguna94/dialogue-summary-peft-fine-tuning
This notebook fine-tunes the FLAN-T5 model for dialogue summarization, comparing full fine-tuning with Parameter-Efficient Fine-Tuning (PEFT). It evaluates performance using ROUGE metrics, demonstrating PEFT's efficiency while achieving competitive results.
dialogue-summarization fine-tuning flan-t5 generative-ai hugging-face lora low-rank-adaptation natural-language-processing nlp parameter-efficient-fine-tuning peft pytorch rouge
Last synced: 06 Apr 2025
https://github.com/flozi00/simplepeft
A simple trainer for efficiently fine-tuning large models on different tasks
Last synced: 10 Feb 2025
https://github.com/zeyadusf/finetune-llama2
Fine-Tune Your Own Llama 2 Model
fine-tuning llama2 llm lora peft text-generation
Last synced: 14 Mar 2025
https://github.com/aman-17/MediSOAP
FineTuning LLMs on conversational medical dataset.
fine-tuning generative-ai llama llama-2 llm-training lora medical peft peft-fine-tuning-llm qlora summarization
Last synced: 06 Jan 2025
https://github.com/vpgits/sdgp-ml
This repository contains notebooks and resources related to the Software Development Group Project (SDGP) machine learning component. Specifically, it includes two notebooks used for creating a dataset and fine-tuning a Mistral-7B-v0.1-Instruct model.
autoawq awq machine-learning peft pytorch qlora transformers
Last synced: 23 Feb 2025
https://github.com/smit-parekh/proactive-supply-chain-disruption-agent
An AI Agent leveraging fine-tuned LLMs (Mistral-7B w/ PEFT) and LangGraph to proactively identify, assess, and suggest mitigations for supply chain disruptions, tailored to specific client needs (e.g., Shell, Pfizer). Includes MLOps integration using Vertex AI and MLflow.
ai-agent api fastapi fine-tuning forecasting google-cloud langchain langgraph llm lora machine-learning mlflow mlops peft python risk-management supply-chain vertex-ai
Last synced: 11 Apr 2025