Constitutional AI: Harmlessness from AI Feedback

Bai et al., Anthropic  •  Mar 22, 2026  •  31 views

This paper introduces Constitutional AI, a method for training harmless AI assistants using AI-generated feedback rather than human labels.

Read Paper