Projects in Awesome Lists tagged with sft
A curated list of projects in awesome lists tagged with sft .
https://github.com/dataelement/bisheng
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.
agent ai chatbot enterprise finetune genai gpt langchian llama llm llmdevops llmops ocr openai orchestration python rag react sft workflow
Last synced: 14 May 2025
https://github.com/modelscope/ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, DeepSeek-VL2, Phi4, GOT-OCR2, ...).
deepseek-r1 deploy embedding grpo internvl liger llama llama4 llm lora megatron multimodal omni open-r1 peft qwen2-vl qwen3 qwen3-moe rft sft
Last synced: 12 May 2025
https://github.com/ai-hypercomputer/maxtext
A simple, performant and scalable Jax LLM!
deepseek fine-tuning gemma2 gemma3 gpt jax large-language-models llama2 llama3 llama4 llm mistral mixtral sft
Last synced: 14 May 2025
https://github.com/ssbuild/chatglm_finetuning
chatglm 6b finetuning and alpaca finetuning
adalora chatglm deep-learning freeze ia3 lora p-tuning-v2 pytorch qlora sft
Last synced: 14 May 2025
https://github.com/jerry1993-tech/Cornucopia-LLaMA-Fin-Chinese
聚宝盆(Cornucopia): 中文金融系列开源可商用大模型,并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)
chinese finance large-language-models llama nlp qa rlhf sft text-generation transformers
Last synced: 01 Apr 2025
https://github.com/ukairia777/tensorflow-nlp-tutorial
tensorflow를 사용하여 텍스트 전처리부터, Topic Models, BERT, GPT, LLM과 같은 최신 모델의 다운스트림 태스크들을 정리한 Deep Learning NLP 저장소입니다.
bert bert-ner dpo huggingface keras-tutorial llama llm lora named-entity-recognition natural-language-processing nlp nlp-tutorial question-answering sft tensorflow trainer transformers
Last synced: 21 Apr 2025
https://github.com/choosewhatulike/trainable-agents
Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"
agent character language-model large-language-models llm natural-language-processing roleplay sft
Last synced: 15 May 2025
https://github.com/0xsequence/erc-1155
Ethereum Semi Fungible Standard (ERC-1155)
erc1155 ethereum nft semi-fungible sft token-contract
Last synced: 27 Nov 2024
https://github.com/awesome-rag/awesome-rag
Awesome-RAG: Collect typical RAG papers and systems.
agent ai awesome awesome-list graphrag llm mm opensource paper rag sft
Last synced: 25 Jan 2025
https://github.com/NiuTrans/Vision-LLM-Alignment
This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.
alignment dpo llama3-vision llava llm mllm multi-model ppo reward rlhf sft vision
Last synced: 07 May 2025
https://github.com/niutrans/vision-llm-alignment
This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.
alignment dpo llama3-vision llava llm mllm multi-model ppo reward rlhf sft vision
Last synced: 06 Apr 2025
https://github.com/opensparsellms/llama-moe-v2
🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
attention fine-tuning instruction-tuning llama llama3 mixture-of-experts moe sft sparsity
Last synced: 07 Apr 2025
https://github.com/ElvenTools/elven-tools-cli
Elven Tools CLI - command line tool for launching NFTs collections on the MultiversX blockchain (Plus other tools).
blockchain cli elrond javascript multiversx nft nodejs sft
Last synced: 14 Mar 2025
https://github.com/kennethanceyer/diy-generative-ai-lm
Make your Generative AI LM model from the scratch (Including pretraining / SFT with LoRA)
colab genai generativeai llm lm lora nlp pretrain sft torch transformer
Last synced: 19 Apr 2025
https://github.com/dvgodoy/llm-visuals
Over 60 figures and diagrams of LLMs, quantization, low-rank adapters (LoRA), and chat templates FREE TO USE in your blog posts, slides, presentations, or papers.
bf16 chat-template data-types fine-tuning fine-tuning-llm hugging-face llm llms lora low-rank-adaptation quantization sft supervised-learning
Last synced: 07 May 2025
https://github.com/DaehanKim/EasyRLHF
EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets
dpo instruction-tuning ipo language-model rlhf rrhf sft
Last synced: 29 Mar 2025
https://github.com/thu-keg/dice
DICE: Detecting In-distribution Data Contamination with LLM's Internal State
benchmark data-contamination fine-tuning-llm gsm8k llm sft
Last synced: 13 May 2025
https://github.com/ElvenTools/elven-tools-sft-minter-sc
Elven Tools SFT Minter Smart Contract - launching SFTs collections on the MultiversX blockchain
blockchain multiversx rust sft smart-contracts
Last synced: 14 Mar 2025
https://github.com/km1994/awesomemultimodel
【AIGC 实战入门笔记 —— AIGC 摩天大楼】分享 大语言模型(LLMs),大模型高效微调(SFT),检索增强生成(RAG),智能体(Agent),PPT自动生成, 角色扮演,文生图(Stable Diffusion) ,图像文字识别(OCR),语音识别(ASR),语音合成(TTS),人像分割(SA),多模态(VLM),Ai 换脸(Face Swapping), 文生视频(VD),图生视频(SVD),Ai 动作迁移,Ai 虚拟试衣,数字人,全模态理解(Omni),Ai音乐生成 干货学习 等 实战与经验。
agent animate asr face-recognition llm llms mllm ocr omni peft-fine-tuning-llm ppt rag sft stable-diffusion svd text-to-music text-to-sql video-diffusion-model virtual-try-on vlm
Last synced: 14 May 2025
https://github.com/avnlp/llm-finetuning
fine-tuning lora p-tuning peft qlora sft
Last synced: 13 Apr 2025
https://github.com/aws-samples/sample-for-multi-modal-document-to-json-with-sagemaker-ai
This open-source project delivers a complete pipeline for converting multi-page documents (PDFs/images) into structured JSON using Vision LLMs on Amazon SageMaker. The solution leverages the SWIFT Framework to fine-tune models specifically for document understanding tasks.
aws document-processing fine-tuning huggingface idp llama multimodal qwen2-vl sagemaker sft swift
Last synced: 23 Mar 2025
https://github.com/imadsaddik/bodmaghdataset
BoDmagh dataset is a Supervised Fine-Tuning (SFT) dataset for the Darija language
arabic-llm arabic-nlp darija-llm darija-nlp data dataset fine-tuning llm nlp sft
Last synced: 03 Apr 2025
https://github.com/tonyskapunk/sft-aur
Scripts to keep up with latest scaleft packages to build them for AUR
arch aur hacktoberfest linux sft
Last synced: 18 Apr 2025
https://github.com/jmaczan/c-137
🦙 Llama 2 7B fine-tuned to revive Rick
apple-m2 c-137 deep-learning fine-tuning finetuning google-colab llama-2 llama2 llama2-7b llm machine-learning nlp rick-and-morty rick-sanchez rickandmorty sft supervised-finetuning
Last synced: 18 Feb 2025
https://github.com/francescodisalesgithub/few-shots-importer
sft training by using only command instruction on a ollama modelfile
ai hack modelfile ollama sft supervised-learning training
Last synced: 19 Feb 2025
https://github.com/data-dream-gdsp/hello-happy-world
AI-powered automatic dataset creation from the web, Support for LoRA and SFT question generation!
ai data-science llama llm lora sft
Last synced: 16 Feb 2025