Inside the new biology of vast artificial intelligence language models

Researchers at OpenAI, Anthropic, and Google DeepMind are dissecting large language models with techniques borrowed from biology and neuroscience to understand their strange inner workings and risks. Their early findings reveal city-size systems with fragmented “personalities,” emergent misbehavior, and new ways to monitor and constrain what these models do.