RT Joe Carlsmith
.@AmandaAskell and I are recording an audio version of Claude’s Constitution, and we’re planning to include an additional section where we answer some questions about the document. If you have questions you’re especially curious about, feel free to drop them in the replies.
View on X →
RT Chris Olah
I'm increasingly taking pretty strong versions of this view seriously.
Anthropic: AI assistants like Claude can seem shockingly human—expressing joy or distress, and using anthropomorphic language to describe themselves. Why?
In a new post we describe a theory that explains why AIs act like humans: the persona selection model.
https://www.anthropic.com/research/persona-selection-model
View on X →