An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with finetuning

A curated list of projects in awesome lists tagged with finetuning .

https://github.com/unslothai/unsloth

Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! ๐Ÿฆฅ

deepseek deepseek-r1 fine-tuning finetuning gemma gemma3 llama llama-4 llama3 llama4 llm llms lora mistral qlora qwen qwen3 text-to-speech tts unsloth

Last synced: 01 Apr 2026

https://github.com/meta-llama/llama-cookbook

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services

ai finetuning langchain llama llama2 llm machine-learning python pytorch vllm

Last synced: 12 Feb 2026

https://github.com/linkedin/liger-kernel

Efficient Triton Kernels for LLM Training

finetuning gemma2 llama llama3 llm-training llms mistral phi3 triton triton-kernels

Last synced: 13 May 2025

https://github.com/h2oai/h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/

ai chatbot chatgpt fedramp fine-tuning finetuning generative generative-ai gpt llama llama2 llm llm-training

Last synced: 07 Apr 2026

https://github.com/linkedin/Liger-Kernel

Efficient Triton Kernels for LLM Training

finetuning gemma2 llama llama3 llm-training llms mistral phi3 triton triton-kernels

Last synced: 21 Aug 2025

https://github.com/dataherald/dataherald

Interact with your SQL database, Natural Language to SQL using LLMs

ai database finetuning llm nl-to-sql rag sql text-to-sql

Last synced: 14 May 2025

https://github.com/Dataherald/dataherald

Interact with your SQL database, Natural Language to SQL using LLMs

ai database finetuning llm nl-to-sql rag sql text-to-sql

Last synced: 03 Apr 2025

https://github.com/stochasticai/xturing

Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6

adapter alpaca deep-learning fine-tuning finetuning gen-ai generative-ai gpt-2 gpt-j language-model llama llm lora mistral mixed-precision peft quantization

Last synced: 15 May 2025

https://github.com/stochasticai/xTuring

Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6

adapter alpaca deep-learning fine-tuning finetuning gen-ai generative-ai gpt-2 gpt-j language-model llama llm lora mistral mixed-precision peft quantization

Last synced: 13 Mar 2025

https://github.com/LazyAGI/LazyLLM

Easiest and laziest way for building multi-agent LLMs applications.

agents ai-agent data deep-learning documentation-tool finetuning framework knowlege-graph langchain lazyllm llamaindex llm llms rag

Last synced: 06 May 2025

https://github.com/lazyagi/lazyllm

Easiest and laziest way for building multi-agent LLMs applications.

agents ai-agent data deep-learning documentation-tool finetuning framework knowlege-graph langchain lazyllm llamaindex llm llms rag

Last synced: 26 Jan 2026

https://github.com/socialai-tianji/tianji

ๅˆถไฝœๆ‡‚ไบบๆƒ…ไธ–ๆ•…็š„ๅคง่ฏญ่จ€ๆจกๅž‹ | ๆถต็›–ๆ็คบ่ฏๅทฅ็จ‹ใ€RAGใ€Agentใ€LLMๅพฎ่ฐƒๆ•™็จ‹

finetuning gpt llm prompt qwen rag

Last synced: 14 May 2025

https://github.com/SocialAI-tianji/Tianji

ๅˆถไฝœๆ‡‚ไบบๆƒ…ไธ–ๆ•…็š„ๅคง่ฏญ่จ€ๆจกๅž‹ | ๆถต็›–ๆ็คบ่ฏๅทฅ็จ‹ใ€RAGใ€Agentใ€LLMๅพฎ่ฐƒๆ•™็จ‹

finetuning gpt llm prompt qwen rag

Last synced: 23 Oct 2025

https://github.com/daswer123/xtts-webui

Webui for using XTTS and for finetuning it

cocqui finetuning tts xtts xttsv2

Last synced: 15 May 2025

https://github.com/minosvasilias/godot-dodo

Finetuning large language models for GDScript generation.

ai finetuning gdscript godot llama

Last synced: 10 Oct 2025

https://github.com/Microsoft/AzureML-BERT

End-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service

azure-machine-learning azureml-bert bert bert-model finetuning language-model nlp pretrained-models pretraining pytorch tuning

Last synced: 02 Apr 2025

https://github.com/microsoft/azureml-bert

End-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service

azure-machine-learning azureml-bert bert bert-model finetuning language-model nlp pretrained-models pretraining pytorch tuning

Last synced: 05 Apr 2025

https://github.com/microsoft/AzureML-BERT

End-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service

azure-machine-learning azureml-bert bert bert-model finetuning language-model nlp pretrained-models pretraining pytorch tuning

Last synced: 19 Jul 2025

https://github.com/servicenow/tapeagents

TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle

agentic agents ai-agents finetuning llm-agent multi-agent multi-agent-simulation prompt-tuning

Last synced: 09 Oct 2025

https://github.com/josefalbers/phi-3-vision-mlx

Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon

agent api bgpt fine-tuning finetuning llm lora lstm mac macos metal mlx multi-agent-systems multimodal phi-3 phi-3-5 phi-3-mini phi-3-vision retnet vlm

Last synced: 05 Apr 2025

https://github.com/babycommando/neuralgraffiti

Live-bending a foundation modelโ€™s output at neural network level.

finetuning liquid-neural-networks llm neural-network pytorch self-attention transformers

Last synced: 19 Jan 2026

https://github.com/rasbt/dora-from-scratch

LoRA and DoRA from Scratch Implementations

finetuning llm pytorch

Last synced: 17 Mar 2025

https://github.com/LHRLAB/ChatKBQA

ChatKBQA: A Generate-then-Retrieve Framework for Knowledge Base Question Answering with Fine-tuned Large Language Models

finetuning graph-database knowledge-graph large-language-models semantic-parsing sparql-query

Last synced: 09 May 2025

https://github.com/zjysteven/lmms-finetune

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, qwen-vl, qwen2-vl, phi3-v etc.

finetuning foundation-models instruction-tuning large-language-model large-multimodal-models llava llava-next multimodal multimodal-large-language-models qwen-vl vision-language visual-instruction-tuning

Last synced: 04 Apr 2025

https://github.com/ssbuild/chatglm2_finetuning

chatglm2 6b finetuning and alpaca finetuning

chatglm chatglm2 chatglm2-6b deep-training finetuning ia3 lora qlora

Last synced: 21 Aug 2025

https://github.com/Trainy-ai/llm-atc

Fine-tuning and serving LLMs on any cloud

finetuning llama2 llms vllm

Last synced: 06 May 2025

https://github.com/kamalkraj/e5-mistral-7b-instruct

Finetune mistral-7b-instruct for sentence embeddings

finetuning huggingface lora mistral-7b peft pytorch sentence-embeddings transformers

Last synced: 12 Apr 2025

https://github.com/baijiong-lin/lora-torch

PyTorch Reimplementation of LoRA (featuring with supporting nn.MultiheadAttention in OpenCLIP)

fine-tuning finetuning lora peft

Last synced: 13 Oct 2025

https://github.com/speediedan/finetuning-scheduler

A PyTorch Lightning extension that accelerates and enhances foundation model experimentation with flexible fine-tuning schedules.

artificial-intelligence fine-tuning finetuning machine-learning neural-networks pytorch pytorch-lightning superglue transfer-learning

Last synced: 01 Sep 2025

https://github.com/makazhanalpamys/soup

Soup turns the pain of LLM fine-tuning into a simple workflow. One config, one command, done.

artificial-intelligence cli dpo fine-tuning finetuning gguf huggingface llm llmops local-llm lora machine-learning model-finetuning ollama peft python pytorch qlora sft transformers

Last synced: 01 Jun 2026

https://github.com/hmunachi/super-lazy-autograd

Hand-derived memory-efficient super lazy PyTorch VJPs for training LLMs on laptop, all using one op (bundled scaled matmuls).

artificial-intelligence fine-tuning finetuning huggingface llm llms pytorch qwen2-5 transformer

Last synced: 13 Jun 2025

https://github.com/unit-mesh/unit-gen

UnitGen ๆ˜ฏไธ€ไธช็”จไบŽ็”Ÿๆˆๅพฎ่ฐƒไปฃ็ ็š„ๆ•ฐๆฎๆก†ๆžถ โ€”โ€” ็›ดๆŽฅไปŽไฝ ็š„ไปฃ็ ๅบ“ไธญ็”Ÿๆˆๅพฎ่ฐƒๆ•ฐๆฎ๏ผšไปฃ็ ่กฅๅ…จใ€ๆต‹่ฏ•็”Ÿๆˆใ€ๆ–‡ๆกฃ็”Ÿๆˆ็ญ‰ใ€‚UnitGen is a code fine-tuning data framework that generates data from your existing codebase.

data-engineering evaluating finetuning llm

Last synced: 16 Oct 2025

https://github.com/zou-group/sirius

SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning

finetuning llm multiagent reasoning self-improving

Last synced: 08 Mar 2026

https://github.com/deshwalmahesh/phudge

Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute, relative and much more. It contains a list of all the available tool, methods, repo, code etc to detect hallucination, LLM evaluation, grading and much more.

ai custom-dataset evaluation feedback-collection finetuning hallucination hallucination-detection judge llm llm-evaluation ml nlp phi-3 pytorch sota

Last synced: 11 Jun 2025

https://github.com/conneroisu/Text-Dataset-Aid-Plugin

This is a obsidian plugin to help with the creation of personal jsonl datasets for text generation models.

fine-tuning finetuning language-model obsidian obsidian-md obsidian-plugin plugin

Last synced: 18 Jul 2025

https://github.com/conneroisu/text-dataset-aid-plugin

This is a obsidian plugin to help with the creation of personal jsonl datasets for text generation models.

fine-tuning finetuning language-model obsidian obsidian-md obsidian-plugin plugin

Last synced: 10 Jul 2025

https://github.com/chuangchuangtan/llava-next-image-llama3-lora

LLaVA-NeXT-Image-Llama3-Lora, Modified from https://github.com/arielnlee/LLaVA-1.6-ft

finetuning llama3 llava-next lora

Last synced: 25 Oct 2025

https://github.com/poloclub/fine-tuning-llms

Finetune Llama 2 on Colab for free on your own data: step-by-step tutorial

colab finetuning llm tutorial

Last synced: 17 Jun 2025

https://github.com/kyegomez/lets-verify-step-by-step

"Improving Mathematical Reasoning with Process Supervision" by OPENAI

artificial-intelligence finetuning gpt4 gpt4-api gpt4vision llama machine-learning

Last synced: 28 Jul 2025

https://github.com/dvgodoy/finetuningllms

Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face"

bitsandbytes fine-tuning finetuning finetuning-llms hugging-face huggingface large-language-models llamacpp lora ollama peft peft-fine-tuning-llm pytorch transformers

Last synced: 09 Oct 2025

https://github.com/paulocoutinhox/mini-llm

Simple and lightweight tool to fine-tune GPT models (like GPT-2 and GPT-Neo) using your own data โ€” built with Python and Transformers. Adapt powerful language models to your domain with ease.

ai artificial-intelligence fine-tuning finetuning gpt llm python slm training transformer

Last synced: 18 Jan 2026

https://github.com/ssbuild/chatglm_rlhf

chatglm_rlhf_finetuning

chat chatglm finetuning lora qlora reward rlhf

Last synced: 17 Oct 2025

https://github.com/maxidonkey/delphigemini

The Gemini API wrapper for Delphi utilizes advanced models developed by Google to provide robust capabilities, including interactive chat, text embeddings, code generation, image and video prompting, audio analysis and transcription, fine-tuning, caching, and integration with Google Search.

agents api-wrapper audio-transcription delphi fine-tuning finetuning gemini gemini-ai gemini-api gemini-flash gemini-pro-vision google-search gpt image-prompting video-prompting vision

Last synced: 05 Apr 2025

https://github.com/machinelearningnuremberg/QuickTune

[ICLR2024] Quick-Tune: Quickly Learning Which Pretrained Model to Finetune and How

deep-learning finetuning hyperparameter-tuning model-hub optimization pretrained-models

Last synced: 02 Mar 2025

https://github.com/cre4t3tiv3/unsloth-llama3-alpaca-lora

Advanced 4-bit QLoRA fine-tuning pipeline for LLaMA 3 8B with production-grade optimization. Memory-efficient training on consumer GPUs for instruction-following specialization. Demonstrates cutting-edge parameter-efficient fine-tuning with Unsloth integration.

4bit alpaca colab finetuning gradio huggingface instruction-tuning llama3 llm lora open-source peft qlora transformers unsloth

Last synced: 13 Oct 2025

https://github.com/shaheennabi/production-ready-instruction-finetuning-of-meta-llama-3.2-3b-instruct-project

Instruction Fine-Tuning of Meta Llama 3.2-3B Instruct on Kannada Conversations. Tailoring the model to follow specific instructions in Kannada, enhancing its ability to generate relevant, context-aware responses based on conversational inputs. Using the Kannada Instruct dataset for fine-tuning! Happy Finetuning ๐ŸŽ‹

4bit-quantize 4bitprecision anthropic-hh-golden bitsandbytes deployed finetuning gguf gpu huggingface inference meta modular-code open-source peft production-ready qlora quantization training unified-language-model-aligning unsloth

Last synced: 26 Oct 2025

https://github.com/HomoScriptor-Project/HomoScriptor

Fuel innovation and advance language models with HomoScriptor: A vibrant, community-driven dataset for fine-tuning large language models.

dataset datasets fine-tuning finetuning llm

Last synced: 22 Jul 2025

https://github.com/thudm/efficient-head-finetuning

Source code for EMNLP2022 long paper: Parameter-Efficient Tuning Makes a Good Classification Head

finetuning language-model

Last synced: 28 Oct 2025

https://github.com/gallen881/physics_master

Physics Master is a model fine-tuned from llama3-8B-Instruct. It can answer your physics question!

ai fine-tuning finetune finetuning llama3 llm physics

Last synced: 25 Oct 2025

https://github.com/jyhong836/llm-dp-finetune

End-to-end codebase for finetuning LLMs (LLaMA 2, 3, etc.) with or without DP

dp finetuning llm

Last synced: 19 Apr 2025

https://github.com/adithya-s-k/indic-llm

A open-source framework designed to adapt pre-trained Language Models (LLMs), such as Llama, Mistral, and Mixtral, to a wide array of domains and languages.

continual-pre-training dpo finetuning finetuning-llms llm lora

Last synced: 03 Aug 2025

https://github.com/thunlp-mt/trice

Code for our paper "Transfer Learning for Sequence Generation: from Single-source to Multi-source" in ACL 2021.

automatic-post-editing finetuning machine-translation multi-source-translation natural-language-processing

Last synced: 14 Apr 2025

https://github.com/bhattbhavesh91/google-gemma-finetuning-n2sql

Finetuning Google's Gemma Model for Translating Natural Language into SQL

fine-tuning finetuning finetuning-llms gemma google lora natural-language-to-sql supervised-finetuning

Last synced: 06 Jul 2025

https://github.com/lennartpurucker/finetune_tabpfn_v2

Code for finetuning TabPFN on one downstream tabular dataset.

finetuning tabpfn tabular-data

Last synced: 05 Mar 2025

https://github.com/rs-py/howtofinetunellama3.1

Quick tutorial showing how to fine-tune Llama3.1 with nothing but free tools and text data. All code included in ipynb. For a step by step walkthrough take a look at the tutorial below on medium.

fine-tuning finetuning huggingface llama3 llm llm-training

Last synced: 24 Apr 2025

https://github.com/dartvauder/neurotrainerwebui

(Windows/Linux) Local WebUI for finetuning, evaluation and generation of neural network models (LLM and StableDiffusion) on python (In Gradio interface). Translated on 3 languages

conversion datasets-preparation diffusers evaluation finetuning generation gradio neural-networks python quantization safetensors transformers

Last synced: 25 May 2026

https://github.com/maxidonkey/delphimistralai

The MistralAI API wrapper for Delphi utilizes the various advanced models developed by Mistral to provide robust capabilities for chat interactions, string embeddings, and precise code generation with Codestral.

agents api-wrapper chat-bot chatgpt codestral delphi fine-tune fine-tuning fine-tuning-llm finetune finetuning gpt mistral mistral-7b mistral-api mistral-embed mistral-small mistralai mixtral-8x22b mixtral-8x7b

Last synced: 18 Aug 2025

https://github.com/gmongaras/llama-2_huggingface_4bit_qlora

A working example of a 4bit QLoRA Falcon model using huggingface

falcon finetuning huggingface huggingface-transformers llm lora qlora

Last synced: 13 Oct 2025

https://github.com/shashankgupta10/flipdrip-ai

FlipDrip AI, your personalized fashion destination! Discover the latest trends tailored just for you.

finetuning flipkart-grid stable-diffusion vector-database

Last synced: 07 May 2025

https://github.com/samadpls/sentimentfinetuning

Efficient fine-tuned large language model (LLM) for the task of sentiment analysis using the IMDB dataset.

finetuning huggingface llm low-rank-adaptation opensource sentiment-analysis transformer

Last synced: 06 Apr 2025

https://github.com/gmongaras/wizard_qlora_finetuning

Finetuning Some Wizard Models With QLoRA

finetuning llama llm llm-finetuning lora qlora wizard

Last synced: 12 Apr 2025

https://github.com/shaheennabi/generative-ai-practices-and-mini-projects

๐ŸŽ‡๐ŸŽ† Generative AI Projects ๐ŸŽ†๐ŸŽ‡ A hands-on repository for Generative AI projects! ๐Ÿค–โœจ Explore model building, fine-tuning, and RAG techniques. Includes experiments with open-source models like LLaMA and Gemma, plus deployments using OpenAI and Google Gemini APIs. ๐Ÿš€

aws finetuning gcp gemini genai groq langchain llama-index llama2 llms nim nvidia ollama rag

Last synced: 30 Sep 2025

https://github.com/murapadev/phinetuning

A repository dedicated to finetuning phi2 models using advanced machine learning techniques. This includes training scripts, model evaluation methods, and data processing tools.

deep-learning finetuning machine-learning model-training models natural-language-processing nlp phi2 python pytorch transformers

Last synced: 26 Dec 2025

https://github.com/sukanyabag/finetuning-qwen2-7b-vqa-on-radiology-scans

This repository is doing the finetuning of the Qwen2 7B VLM for performing VQA (Visual Question Answering) on various kinds of patient radiologies or medical scans.

adapter-tuning deep-learning finetuning generative-ai healthcare lora quantization-aware-training vision-language-models visual-question-answering

Last synced: 27 Apr 2026

https://github.com/fareedkhan-dev/improve-weak-llm-using-spin-technique

After RLHF and SFT show promising results, a new technique named SPIN is invented for 2024

finetuning gemini large-language-models llm rlhf

Last synced: 25 Aug 2025

https://github.com/strickvl/isafpr_finetune

Finetuning an LLM for structured data extraction from press releases

fine-tuning finetuning llm llms

Last synced: 10 Jul 2025

https://github.com/Rs-py/HowToFineTuneLlama3.1

Quick tutorial showing how to fine-tune Llama3.1 with nothing but free tools and text data. All code included in ipynb. For a step by step walkthrough take a look at the tutorial below on medium.

fine-tuning finetuning huggingface llama3 llm llm-training

Last synced: 11 Sep 2025

https://github.com/hyeonsangjeon/pdf2llm-tuning-studio

PDF ๋ฌธ์„œ์—์„œ GPU ๊ฐ€์† ์ฒ˜๋ฆฌ๋กœ ๊ณ ํ’ˆ์งˆ ์งˆ์˜์‘๋‹ต(QA) ๋ฐ์ดํ„ฐ๋ฅผ ์ž๋™ ์ƒ์„ฑํ•˜๊ณ  LLM์„ ํšจ์œจ์ ์œผ๋กœ ํŒŒ์ธํŠœ๋‹ํ•˜๋Š” ์†”๋ฃจ์…˜์ž…๋‹ˆ๋‹ค. Unstructured ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ์™€ AWS Bedrock Claude๋กœ ๋„๋ฉ”์ธ ํŠนํ™” QA ์Œ์„ ์ƒ์„ฑํ•˜๊ณ , LoRA ๊ธฐ๋ฒ•์œผ๋กœ ๊ฒฝ๋Ÿ‰ ๋ชจ๋ธ์„ ํ›ˆ๋ จํ•ฉ๋‹ˆ๋‹ค.

aws bedrock claude cuda data-argumantation data-extraction distillation docker finetuning gpu llm pdf-generation pdf-text-extraction processing processing-job sagemaker text-disti unsloth unstructured

Last synced: 15 Jun 2025

https://github.com/ayeshaaaaaaaaa/car-plate-detection-and-ocr-recognition-with-yolov8

This project utilizes YOLOv8 for real-time car plate detection and OCR (Optical Character Recognition) to extract plate numbers from detected regions. The system leverages advanced computer vision techniques to streamline the recognition process and provide precise results.

computer-vision easy-ocr finetuning licenceplate-recognition tesseract-ocr yolov8

Last synced: 02 Jul 2025

https://github.com/itspranavajay/merge-diffusion-tool

Merge Diffusion Tool is an open-source solution for merging LoRA models, integrating LoRA into checkpoints, and blending Flux And Stable Diffusion models (SD1.5, SD2, SD3, SDXL). Optimize your AI workflows with ease.

ai checkpoint deep-learning dreambooth finetuning flux fluxai lora stable-diffusion

Last synced: 29 Oct 2025

https://github.com/balnarendrasapa/faq-llm

This is course project for DSCI 6004 deals with fine-tuning a pretrained model llm with a custom data

chatbot falcon-7b fine-tuning finetuning huggingface jupyter-notebook kaggle kaggle-notebook large-language-models llm lora nlp peft streamlit transformers

Last synced: 21 Apr 2026

https://github.com/arawxx/torch-utils

A library containing useful and frequently used PyTorch functions and classes.

deep-learning deeplearning fine-tuning finetuning library llm llms pytorch scheduler schedulers torch torch-utils utils utils-library

Last synced: 02 Apr 2026

https://github.com/muneeb1030/finetune-tiny-llama

Fine-tuning the Tiny Llama model to mimic my professor's writing style using the Llama Factory. The project involves data collection, preprocessing, preparation, fine-tuning, and evaluation.

data data-preparation data-preprocessing finetuning llama-factory llm pymupdf selenium-python spacy tinyllama webscraping

Last synced: 08 Apr 2026

https://github.com/ntphuc149/vieqa

ViEQA is an innovative project aimed at advancing Extractive Question Answering capabilities for the Vietnamese language. By fine-tuning state-of-the-art language models on Vietnamese datasets, we strive to bridge the gap in natural language understanding for Vietnamese text.

extractive-question-answering finetuning language-model machine-reading-comprehension natural-language-processing question-answering

Last synced: 11 Jan 2026