Alignment
- alignment-research-dataset (opens in a new tab)
- A dataset of alignment research and code to reproduce it
- arxiv Researching Alignment Research: Unsupervised Analysis (opens in a new tab)
Less wrong. Alignment Forum
Hugging Face H4
-
HuggingFaceH4 (opens in a new tab)
- Hugging Face H4 team, focused on aligning language models to be helpful, honest, harmless, and huggy 🤗.
-
alignment-handbook (opens in a new tab)
- Robust recipes to align language models with human and AI preferences
-
ZEPHYR: DIRECT DISTILLATION OF LM ALIGNMENT (opens in a new tab)
- paper pdf
- models & datasets (opens in a new tab)