https://github.com/study8677/post-training

本仓库聚焦于大模型后训练（Post-Training），内容包括但不限于：核心方法：SFT、RLHF、RL、DPO、RAG 其他方向：模型蒸馏、对齐方法、红队测试、安全性调优等目标：记录和沉淀相关知识、方法与实践经验，旨在从原型 Demo 推动到实际落地应用。
https://github.com/study8677/post-training

Last synced: 4 months ago
JSON representation

本仓库聚焦于大模型后训练（Post-Training），内容包括但不限于：核心方法：SFT、RLHF、RL、DPO、RAG 其他方向：模型蒸馏、对齐方法、红队测试、安全性调优等目标：记录和沉淀相关知识、方法与实践经验，旨在从原型 Demo 推动到实际落地应用。

Host: GitHub
URL: https://github.com/study8677/post-training
Owner: study8677
License: mit
Created: 2025-08-29T01:53:00.000Z (10 months ago)
Default Branch: main
Last Pushed: 2025-09-09T13:45:41.000Z (10 months ago)
Last Synced: 2025-09-09T16:40:38.250Z (10 months ago)
Language: Jupyter Notebook
Size: 168 KB
Stars: 0
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Roadmap: docs/roadmap.md

Awesome Lists containing this project

README

# post-training

本仓库聚焦于大模型后训练（Post-Training），涵盖 SFT、RLHF、DPO、RAG 等方法以及模型蒸馏、对齐与安全等方向。目标是记录与沉淀相关知识、方法与实践经验。

## Quick Start

```bash
make init
make test
```

## Directory

- `configs/` 项目统一配置
- `scripts/` 一键运行脚本
- `src/post_training/` 可复用的库代码
- `examples/` 最小可运行示例
- `tests/` 单元与冒烟测试

更多内容见各子目录文档。

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/study8677/post-training

Awesome Lists containing this project

README