An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with sft

A curated list of projects in awesome lists tagged with sft .

https://github.com/dataelement/bisheng

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.

agent ai chatbot enterprise finetune genai gpt langchian llama llm llmdevops llmops ocr openai orchestration python rag react sft workflow

Last synced: 14 May 2025

https://github.com/modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, DeepSeek-VL2, Phi4, GOT-OCR2, ...).

deepseek-r1 deploy embedding grpo internvl liger llama llama4 llm lora megatron multimodal omni open-r1 peft qwen2-vl qwen3 qwen3-moe rft sft

Last synced: 12 May 2025

https://github.com/ssbuild/chatglm_finetuning

chatglm 6b finetuning and alpaca finetuning

adalora chatglm deep-learning freeze ia3 lora p-tuning-v2 pytorch qlora sft

Last synced: 14 May 2025

https://github.com/jerry1993-tech/Cornucopia-LLaMA-Fin-Chinese

聚宝盆(Cornucopia): 中文金融系列开源可商用大模型,并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)

chinese finance large-language-models llama nlp qa rlhf sft text-generation transformers

Last synced: 01 Apr 2025

https://github.com/ukairia777/tensorflow-nlp-tutorial

tensorflow를 사용하여 텍스트 전처리부터, Topic Models, BERT, GPT, LLM과 같은 최신 모델의 다운스트림 태스크들을 정리한 Deep Learning NLP 저장소입니다.

bert bert-ner dpo huggingface keras-tutorial llama llm lora named-entity-recognition natural-language-processing nlp nlp-tutorial question-answering sft tensorflow trainer transformers

Last synced: 21 Apr 2025

https://github.com/choosewhatulike/trainable-agents

Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

agent character language-model large-language-models llm natural-language-processing roleplay sft

Last synced: 15 May 2025

https://github.com/0xsequence/erc-1155

Ethereum Semi Fungible Standard (ERC-1155)

erc1155 ethereum nft semi-fungible sft token-contract

Last synced: 27 Nov 2024

https://github.com/awesome-rag/awesome-rag

Awesome-RAG: Collect typical RAG papers and systems.

agent ai awesome awesome-list graphrag llm mm opensource paper rag sft

Last synced: 25 Jan 2025

https://github.com/solv-finance/erc-3525

ERC-3525 Reference Implementation

erc-3525 erc3525 sft solv

Last synced: 07 Apr 2025

https://github.com/NiuTrans/Vision-LLM-Alignment

This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.

alignment dpo llama3-vision llava llm mllm multi-model ppo reward rlhf sft vision

Last synced: 07 May 2025

https://github.com/niutrans/vision-llm-alignment

This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.

alignment dpo llama3-vision llava llm mllm multi-model ppo reward rlhf sft vision

Last synced: 06 Apr 2025

https://github.com/opensparsellms/llama-moe-v2

🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training

attention fine-tuning instruction-tuning llama llama3 mixture-of-experts moe sft sparsity

Last synced: 07 Apr 2025

https://github.com/ElvenTools/elven-tools-cli

Elven Tools CLI - command line tool for launching NFTs collections on the MultiversX blockchain (Plus other tools).

blockchain cli elrond javascript multiversx nft nodejs sft

Last synced: 14 Mar 2025

https://github.com/wangclnlp/deepspeed-chat-extension

This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).

deepspeed llama llm rlhf sft

Last synced: 26 Apr 2025

https://github.com/kennethanceyer/diy-generative-ai-lm

Make your Generative AI LM model from the scratch (Including pretraining / SFT with LoRA)

colab genai generativeai llm lm lora nlp pretrain sft torch transformer

Last synced: 19 Apr 2025

https://github.com/dvgodoy/llm-visuals

Over 60 figures and diagrams of LLMs, quantization, low-rank adapters (LoRA), and chat templates FREE TO USE in your blog posts, slides, presentations, or papers.

bf16 chat-template data-types fine-tuning fine-tuning-llm hugging-face llm llms lora low-rank-adaptation quantization sft supervised-learning

Last synced: 07 May 2025

https://github.com/DaehanKim/EasyRLHF

EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets

dpo instruction-tuning ipo language-model rlhf rrhf sft

Last synced: 29 Mar 2025

https://github.com/thu-keg/dice

DICE: Detecting In-distribution Data Contamination with LLM's Internal State

benchmark data-contamination fine-tuning-llm gsm8k llm sft

Last synced: 13 May 2025

https://github.com/ElvenTools/elven-tools-sft-minter-sc

Elven Tools SFT Minter Smart Contract - launching SFTs collections on the MultiversX blockchain

blockchain multiversx rust sft smart-contracts

Last synced: 14 Mar 2025

https://github.com/km1994/awesomemultimodel

【AIGC 实战入门笔记 —— AIGC 摩天大楼】分享 大语言模型(LLMs),大模型高效微调(SFT),检索增强生成(RAG),智能体(Agent),PPT自动生成, 角色扮演,文生图(Stable Diffusion) ,图像文字识别(OCR),语音识别(ASR),语音合成(TTS),人像分割(SA),多模态(VLM),Ai 换脸(Face Swapping), 文生视频(VD),图生视频(SVD),Ai 动作迁移,Ai 虚拟试衣,数字人,全模态理解(Omni),Ai音乐生成 干货学习 等 实战与经验。

agent animate asr face-recognition llm llms mllm ocr omni peft-fine-tuning-llm ppt rag sft stable-diffusion svd text-to-music text-to-sql video-diffusion-model virtual-try-on vlm

Last synced: 14 May 2025

https://github.com/aws-samples/sample-for-multi-modal-document-to-json-with-sagemaker-ai

This open-source project delivers a complete pipeline for converting multi-page documents (PDFs/images) into structured JSON using Vision LLMs on Amazon SageMaker. The solution leverages the SWIFT Framework to fine-tune models specifically for document understanding tasks.

aws document-processing fine-tuning huggingface idp llama multimodal qwen2-vl sagemaker sft swift

Last synced: 23 Mar 2025

https://github.com/imadsaddik/bodmaghdataset

BoDmagh dataset is a Supervised Fine-Tuning (SFT) dataset for the Darija language

arabic-llm arabic-nlp darija-llm darija-nlp data dataset fine-tuning llm nlp sft

Last synced: 03 Apr 2025

https://github.com/tonyskapunk/sft-aur

Scripts to keep up with latest scaleft packages to build them for AUR

arch aur hacktoberfest linux sft

Last synced: 18 Apr 2025

https://github.com/francescodisalesgithub/few-shots-importer

sft training by using only command instruction on a ollama modelfile

ai hack modelfile ollama sft supervised-learning training

Last synced: 19 Feb 2025

https://github.com/philipmay/llm-data

LLM Training Data

llm sft

Last synced: 22 Feb 2025

https://github.com/data-dream-gdsp/hello-happy-world

AI-powered automatic dataset creation from the web, Support for LoRA and SFT question generation!

ai data-science llama llm lora sft

Last synced: 16 Feb 2025