An open API service indexing awesome lists of open source software.

Projects in Awesome Lists by PKU-Alignment

A curated list of projects in awesome lists by PKU-Alignment.

https://github.com/pku-alignment/align-anything

Align Anything: Training All-modality Model with Feedback

chameleon dpo large-language-models multimodal rlhf vision-language-model

Last synced: 14 May 2025

https://github.com/pku-alignment/safe-policy-optimization

NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms

benchmarks constrained-reinforcement-learning reinforcement-learning-algorithms safe safe-reinforcement-learning

Last synced: 07 May 2025

https://github.com/pku-alignment/aligner

[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct

aisafety aligner alignment interpretability llm mecinterp rlhf weak-to-strong

Last synced: 07 May 2025

https://github.com/pku-alignment/beavertails

BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).

ai-safety beaver datasets gpt human-feedback human-feedback-data language-model large-language-model llama llm llms rlhf safe-rlhf safety

Last synced: 09 Aug 2025

https://github.com/pku-alignment/proagent

AAAI24(Oral) ProAgent: Building Proactive Cooperative Agents with Large Language Models

cooperative cooperative-ai human-ai human-ai-interaction language-model llm-agent overcooked

Last synced: 07 May 2025

https://github.com/pku-alignment/safe-sora

SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enhance the helpfulness and harmlessness of Large Vision Models (LVMs).

alignment human-preferences large-vision-models text-to-video-generation

Last synced: 07 May 2025

https://github.com/pku-alignment/progressgym

Alignment with a millennium of moral progress. Spotlight@NeurIPS 2024 Track on Datasets and Benchmarks.

Last synced: 07 May 2025

https://github.com/pku-alignment/redman

ReDMan is an open-source simulation platform that provides a standardized implementation of safe RL algorithms for Reliable Dexterous Manipulation.

Last synced: 07 May 2025

https://github.com/pku-alignment/sae-v

[ICML 2025 Poster] SAE-V: Interpreting Multimodal Models for Enhanced Alignment

Last synced: 19 Feb 2026

https://github.com/pku-alignment/llms-resist-alignment

Repo for paper "Language Models Resist Alignment"

ai-safety alignment alpaca llama llama2 llama3 llm llms rlhf safe safe-rlhf vicuna

Last synced: 01 Oct 2025